no code implementations • LREC 2022 • Anisia Katinskaia, Maria Lebedeva, Jue Hou, Roman Yangarber
We present ReLCo— the Revita Learner Corpus—a new semi-automatically annotated learner corpus for Russian.
no code implementations • games (LREC) 2022 • Jue Hou, Ilmari Kylliäinen, Anisia Katinskaia, Giacomo Furlan, Roman Yangarber
Our goal is to keep the learner engaged in long practice sessions over many months—rather than for the short-term.
no code implementations • WS (NoDaLiDa) 2019 • Jue Hou, Maximilian Koppatz, José María Hoya Quecedo, Roman Yangarber
Named entity recognition (NER) is a well-researched task in the field of NLP, which typically requires large annotated corpora for training usable models.
1 code implementation • 22 Apr 2024 • Jue Hou, Anisia Katinskaia, Lari Kotilainen, Sathianpong Trangcasanchai, Anh-Duc Vu, Roman Yangarber
This paper investigates what insights about linguistic features and what knowledge about the structure of natural language can be obtained from the encodings in transformer language models. In particular, we explore how BERT encodes the government relation between constituents in a sentence.
1 code implementation • 19 May 2023 • Jun Wen, Jue Hou, Clara-Lea Bonzel, Yihan Zhao, Victor M. Castro, Vivian S. Gainer, Dana Weisenfeld, Tianrun Cai, Yuk-Lam Ho, Vidul A. Panickan, Lauren Costa, Chuan Hong, J. Michael Gaziano, Katherine P. Liao, Junwei Lu, Kelly Cho, Tianxi Cai
We propose a LAbel-efficienT incidenT phEnotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data.
no code implementations • 9 May 2023 • Jue Hou, Anisia Katinskaia, Anh-Duc Vu, Roman Yangarber
Lastly, we show 4. that LMs of smaller size using morphological segmentation can perform comparably to models of larger size trained with BPE -- both in terms of (1) perplexity and (3) scores on downstream tasks.
no code implementations • 3 Dec 2022 • Anisia Katinskaia, Jue Hou, Anh-Duc Vu, Roman Yangarber
This paper presents the development of an AI-based language learning platform Revita.
no code implementations • 4 Nov 2022 • Yucong Lin, Jinhua Su, Yuhang Li, Yuhao Wei, Hanchao Yan, Saining Zhang, Jiaan Luo, Danni Ai, Hong Song, Jingfan Fan, Tianyu Fu, Deqiang Xiao, Feifei Wang, Jue Hou, Jian Yang
Deep learning methods have contributed substantially to the rapid advancement of medical image segmentation, the quality of which relies on the suitable design of loss functions.
no code implementations • 4 May 2021 • Jue Hou, Zijian Guo, Tianxi Cai
Risk modeling with EHR data is challenging due to a lack of direct observations on the disease outcome, and the high dimensionality of the candidate predictors.
no code implementations • WS 2019 • Jue Hou, Koppatz Maximilian, Jos{\'e} Mar{\'\i}a Hoya Quecedo, Nataliya Stoyanova, Roman Yangarber
This application of Elo provides ratings for learners and concepts which correlate well with subjective proficiency levels of the learners and difficulty levels of the concepts.
no code implementations • 29 Jun 2019 • Jue Hou, Jelena Bradic, Ronghui Xu
Estimating causal effects for survival outcomes in the high-dimensional setting is an extremely important topic for many biomedical applications as well as areas of social sciences.
no code implementations • 29 Jul 2017 • Jue Hou, Jelena Bradic, Ronghui Xu
The purpose of this paper is to construct confidence intervals for the regression coefficients in the Fine-Gray model for competing risks data with random censoring, where the number of covariates can be larger than the sample size.