no code implementations • LREC 2016 • Xuansong Li, Martha Palmer, Nianwen Xue, Lance Ramshaw, Mohamed Maamouri, Ann Bies, Kathryn Conger, Stephen Grimes, Stephanie Strassel
High accuracy for automated translation and information retrieval calls for linguistic annotations at various language levels.
no code implementations • LREC 2016 • Xuansong Li, Jennifer Tracey, Stephen Grimes, Stephanie Strassel
Morphologically-rich languages pose problems for machine translation (MT) systems, including word-alignment errors, data sparsity and multiple affixes.
no code implementations • WS 2014 • Ann Bies, Zhiyi Song, Mohamed Maamouri, Stephen Grimes, Haejoong Lee, Jonathan Wright, Stephanie Strassel, Nizar Habash, Ramy Eskander, Owen Rambow
no code implementations • LREC 2012 • Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Mohamed Maamouri, Ann Bies, Nianwen Xue
Parallel aligned treebanks (PAT) are linguistic corpora annotated with morphological and syntactic structures that are aligned at sentence as well as sub-sentence levels.
no code implementations • LREC 2012 • Stephen Grimes, Katherine Peterson, Xuansong Li
We have been creating large-scale manual word alignment corpora for Arabic-English and Chinese-English language pairs in genres such as newswire, broadcast news and conversation, and web blogs.
no code implementations • LREC 2012 • Zhiyi Song, Safa Ismael, Stephen Grimes, David Doermann, Stephanie Strassel
LDC has developed a stable pipeline and infrastructure for collecting and annotating handwriting linguistic resources to support the evaluation of MADCAT and OpenHaRT.