1 code implementation • 19 Sep 2023 • Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon, Don Metzler
We propose the first multilingual dataset of scientific documents, Open-access Multilingual Scientific Documents (OpenMSD), which contains 74M papers in 103 languages and 778M citation pairs.
no code implementations • EACL 2021 • Ji Ma, Ivan Korotkov, Yinfei Yang, Keith Hall, Ryan McDonald
The question generation system is trained on general-domain data but applied to documents in the target domain.