1 code implementation • NAACL 2022 • Xun Yuan, Derek Pham, Sam Davidson, Zhou Yu
Currently available grammatical error correction (GEC) datasets are compiled using essays or other long-form text written by language learners, limiting the applicability of these datasets to other domains such as informal writing and conversational dialog.
no code implementations • EMNLP (ALW) 2020 • Sam Davidson, Qiusi Sun, Magdalena Wojcieszak
The classifier is trained on a dataset of Reddit posts, which are annotated for incivility, and further expanded using a combination of labeled data from Reddit and Twitter.
no code implementations • WNUT (ACL) 2021 • Sam Davidson, Jordan Hosier, Yu Zhou, Vijay Gurbani
We explore the application of state-of-the-art NER algorithms to ASR-generated call center transcripts.
no code implementations • 23 Sep 2023 • Sam Davidson, Salvatore Romeo, Raphael Shu, James Gung, Arshit Gupta, Saab Mansour, Yi Zhang
One of the major impediments to the development of new task-oriented dialogue (TOD) systems is the need for human evaluation at multiple stages and iterations of the development process.
1 code implementation • 23 May 2023 • Narutatsu Ri, Bill Sun, Sam Davidson, Zhou Yu
Although significant progress has been made in developing methods for Grammatical Error Correction (GEC), addressing word choice improvements has been notably lacking and enhancing sentence expressivity by replacing phrases with advanced expressions is an understudied aspect.
no code implementations • 31 Jul 2022 • Yu Li, Chun-Yen Chen, Dian Yu, Sam Davidson, Ryan Hou, Xun Yuan, Yinghua Tan, Derek Pham, Zhou Yu
This paper reports on progress towards building an online language learning tool to provide learners with conversational experience by using dialog systems as conversation practice partners.
1 code implementation • 15 Dec 2021 • Xun Yuan, Derek Pham, Sam Davidson, Zhou Yu
Currently available grammatical error correction (GEC) datasets are compiled using well-formed written text, limiting the applicability of these datasets to other domains such as informal writing and dialog.
no code implementations • 17 Nov 2020 • Kaihui Liang, Austin Chau, Yu Li, Xueyuan Lu, Dian Yu, Mingyang Zhou, Ishan Jain, Sam Davidson, Josh Arnold, Minh Nguyen, Zhou Yu
Gunrock 2. 0 is built on top of Gunrock with an emphasis on user adaptation.
no code implementations • WS 2020 • Alessio Miaschi, Sam Davidson, Dominique Brunato, Felice Dell{'}Orletta, Kenji Sagae, Claudia Helena Sanchez-Gutierrez, Giulia Venturi
In this paper we present an NLP-based approach for tracking the evolution of written language competence in L2 Spanish learners using a wide range of linguistic features automatically extracted from students{'} written productions.
no code implementations • LREC 2020 • Sam Davidson, Aaron Yamada, Fern, Paloma ez Mira, Car, Agustina o, Claudia H. Sanchez Gutierrez, Kenji Sagae
While annotated learner corpora of English are widely available, large learner corpora of Spanish are less common.
no code implementations • IJCNLP 2019 • Dian Yu, Michelle Cohn, Yi Mang Yang, Chun-Yen Chen, Weiming Wen, Jiaping Zhang, Mingyang Zhou, Kevin Jesse, Austin Chau, Antara Bhowmick, Shreenath Iyer, Giritheja Sreenivasulu, Sam Davidson, Ashwin Bhandare, Zhou Yu
Gunrock is the winner of the 2018 Amazon Alexa Prize, as evaluated by coherence and engagement from both real users and Amazon-selected expert conversationalists.
no code implementations • IJCNLP 2019 • Sam Davidson, Dian Yu, Zhou Yu
Dependency parsing of conversational input can play an important role in language understanding for dialog systems by identifying the relationships between entities extracted from user utterances.