no code implementations • WNUT (ACL) 2021 • Sangah Lee, Hyopil Shin
User-generated texts include various types of stylistic properties, or noises.
no code implementations • 1 Apr 2024 • Kyuhee Kim, Surin Lee, Sangah Lee
In this paper, we present KoCoNovel, a novel character coreference dataset derived from Korean literary texts, complete with detailed annotation guidelines.
1 code implementation • 21 Mar 2024 • Kyuhee Kim, Surin Lee, Sangah Lee
In many literary texts, emotions are indirectly conveyed through descriptions of actions, facial expressions, and appearances, necessitating emotion inference for narrative understanding.
no code implementations • 29 Nov 2023 • Jean Seo, Sungjoo Byun, Minha Kang, Sangah Lee
The Manchu language, with its roots in the historical Manchurian region of Northeast China, is now facing a critical threat of extinction, as there are very few speakers left.
no code implementations • 23 Nov 2023 • Dongjun Jang, Sangah Lee, Sungjoo Byun, Jinwoong Kim, Jean Seo, Minseok Kim, Soyeon Kim, Chaeyoung Oh, Jaeyoon Kim, Hyemi Jo, Hyopil Shin
This paper presents the DaG LLM (David and Goliath Large Language Model), a language model specialized for Korean and fine-tuned through Instruction Tuning across 41 tasks within 13 distinct categories.
1 code implementation • 10 Aug 2020 • Sangah Lee, Hansol Jang, Yunmee Baik, Suzi Park, Hyopil Shin
Since the appearance of BERT, recent works including XLNet and RoBERTa utilize sentence embedding models pre-trained by large corpora and a large number of parameters.