Search Results for author: Jinho D. Choi

Found 83 papers, 28 papers with code

UMR-Writer: A Web Application for Annotating Uniform Meaning Representations

no code implementations EMNLP (ACL) 2021 Jin Zhao, Nianwen Xue, Jens Van Gysel, Jinho D. Choi

We present UMR-Writer, a web-based application for annotating Uniform Meaning Representations (UMR), a graph-based, cross-linguistically applicable semantic representation developed recently to support the development of interpretable natural language applications that require deep semantic analysis of texts.

Automatic Enrichment of Abstract Meaning Representations

1 code implementation LREC (LAW) 2022 Yuxin Ji, Gregor Williamson, Jinho D. Choi

All code for this paper, including our automatic annotation tool, is made publicly available.

Automatic Generation of Large-scale Multi-turn Dialogues from Reddit

no code implementations COLING 2022 Daniil Huryn, William M. Hutsell, Jinho D. Choi

Our best method is applied to posts from those 10 subreddits for the creation of a corpus comprising 10, 098 dialogues (3. 3M tokens), 570 of which are compared against dialogues in three other datasets, Blended Skill Talk, Daily Dialogue, and Topical Chat.

Competence-Level Prediction and Resume \& Job Description Matching Using Context-Aware Transformer Models

no code implementations EMNLP 2020 Changmao Li, Elaine Fisher, Rebecca Thomas, Steve Pittard, Vicki Hertzberg, Jinho D. Choi

Given this dataset, novel transformer-based classification models are developed for two tasks: the first task takes a resume and classifies it to a CRC level (T1), and the second task takes both a resume and a job description to apply and predicts if the application is suited to the job (T2).

FantasyCoref: Coreference Resolution on Fantasy Literature Through Omniscient Writer’s Point of View

no code implementations CRAC (ACL) 2021 Sooyoun Han, Sumin Seo, Minji Kang, Jongin Kim, Nayoung Choi, Min Song, Jinho D. Choi

This paper presents a new corpus and annotation guideline for a novel coreference resolution task on fictional texts, and analyzes its unique characteristics.

coreference-resolution

Analysis of Zero-Shot Crosslingual Learning between English and Korean for Named Entity Recognition

1 code implementation EMNLP (MRL) 2021 Jongin Kim, Nayoung Choi, Seunghyun Lim, Jungwhan Kim, Soojin Chung, Hyunsoo Woo, Min Song, Jinho D. Choi

This paper presents a English-Korean parallel dataset that collects 381K news articles where 1, 400 of them, comprising 10K sentences, are manually labeled for crosslingual named entity recognition (NER).

named-entity-recognition Named Entity Recognition +1

ConvoSense: Overcoming Monotonous Commonsense Inferences for Conversational AI

1 code implementation27 Jan 2024 Sarah E. Finch, Jinho D. Choi

Mastering commonsense understanding and reasoning is a pivotal skill essential for conducting engaging conversations.

Aligning Speakers: Evaluating and Visualizing Text-based Diarization Using Efficient Multiple Sequence Alignment (Extended Version)

no code implementations14 Sep 2023 Chen Gong, Peilin Wu, Jinho D. Choi

This paper presents a novel evaluation approach to text-based speaker diarization (SD), tackling the limitations of traditional metrics that do not account for any contextual information in text.

Multiple Sequence Alignment speaker-diarization +1

Exploring the Impact of Human Evaluator Group on Chat-Oriented Dialogue Evaluation

no code implementations14 Sep 2023 Sarah E. Finch, James D. Finch, Jinho D. Choi

Human evaluation has been widely accepted as the standard for evaluating chat-oriented dialogue systems.

Chatbot Dialogue Evaluation

Leveraging Large Language Models for Automated Dialogue Analysis

1 code implementation12 Sep 2023 Sarah E. Finch, Ellie S. Paek, Jinho D. Choi

Developing high-performing dialogue systems benefits from the automatic identification of undesirable behaviors in system responses.

General Knowledge Language Modelling +1

Widely Interpretable Semantic Representation: Frameless Meaning Representation for Broader Applicability

no code implementations12 Sep 2023 Lydia Feng, Gregor Williamson, Han He, Jinho D. Choi

Despite its strengths, AMR is not easily applied to languages or domains without predefined semantic frames, and its use of numbered arguments results in semantic role labels, which are not directly interpretable and are semantically overloaded for parsers.

Towards Open-World Product Attribute Mining: A Lightly-Supervised Approach

1 code implementation26 May 2023 Liyan Xu, Chenwei Zhang, Xian Li, Jingbo Shang, Jinho D. Choi

We present a new task setting for attribute mining on e-commerce products, serving as a practical solution to extract open-world attributes without extensive human intervention.

Attribute

InterviewBot: Real-Time End-to-End Dialogue System to Interview Students for College Admission

no code implementations27 Mar 2023 ZiHao Wang, Nathan Keyes, Terry Crawford, Jinho D. Choi

We present the InterviewBot that dynamically integrates conversation history and customized topics into a coherent embedding space to conduct 10 mins hybrid-domain (open and closed) conversations with foreign students applying to U. S. colleges for assessing their academic and cultural readiness.

Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems

3 code implementations18 Dec 2022 Sarah E. Finch, James D. Finch, Jinho D. Choi

Our method is used to evaluate four state-of-the-art open-domain dialogue systems and compared with existing approaches.

Chatbot Dialogue Evaluation

Online Coreference Resolution for Dialogue Processing: Improving Mention-Linking on Real-Time Conversations

no code implementations *SEM (NAACL) 2022 Liyan Xu, Jinho D. Choi

This paper suggests a direction of coreference resolution for online decoding on actively generated input such as dialogue, where the model accepts an utterance and its past context, then finds mentions in the current utterance as well as their referents, upon each dialogue turn.

coreference-resolution

Modeling Task Interactions in Document-Level Joint Entity and Relation Extraction

no code implementations NAACL 2022 Liyan Xu, Jinho D. Choi

We target on the document-level relation extraction in an end-to-end setting, where the model needs to jointly perform mention extraction, coreference resolution (COREF) and relation extraction (RE) at once, and gets evaluated in an entity-centric way.

coreference-resolution Document-level Relation Extraction +2

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

2 code implementations6 Dec 2021 Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, Samuel Cahyawijaya, Emile Chapuis, Wanxiang Che, Mukund Choudhary, Christian Clauss, Pierre Colombo, Filip Cornell, Gautier Dagan, Mayukh Das, Tanay Dixit, Thomas Dopierre, Paul-Alexis Dray, Suchitra Dubey, Tatiana Ekeinhor, Marco Di Giovanni, Tanya Goyal, Rishabh Gupta, Louanes Hamla, Sang Han, Fabrice Harel-Canada, Antoine Honore, Ishan Jindal, Przemyslaw K. Joniak, Denis Kleyko, Venelin Kovatchev, Kalpesh Krishna, Ashutosh Kumar, Stefan Langer, Seungjae Ryan Lee, Corey James Levinson, Hualou Liang, Kaizhao Liang, Zhexiong Liu, Andrey Lukyanenko, Vukosi Marivate, Gerard de Melo, Simon Meoni, Maxime Meyer, Afnan Mir, Nafise Sadat Moosavi, Niklas Muennighoff, Timothy Sum Hon Mun, Kenton Murray, Marcin Namysl, Maria Obedkova, Priti Oli, Nivranshu Pasricha, Jan Pfister, Richard Plant, Vinay Prabhu, Vasile Pais, Libo Qin, Shahab Raji, Pawan Kumar Rajpoot, Vikas Raunak, Roy Rinberg, Nicolas Roberts, Juan Diego Rodriguez, Claude Roux, Vasconcellos P. H. S., Ananya B. Sai, Robin M. Schmidt, Thomas Scialom, Tshephisho Sefara, Saqib N. Shamsi, Xudong Shen, Haoyue Shi, Yiwen Shi, Anna Shvets, Nick Siegel, Damien Sileo, Jamie Simon, Chandan Singh, Roman Sitelew, Priyank Soni, Taylor Sorensen, William Soto, Aman Srivastava, KV Aditya Srivatsa, Tony Sun, Mukund Varma T, A Tabassum, Fiona Anting Tan, Ryan Teehan, Mo Tiwari, Marie Tolkiehn, Athena Wang, Zijian Wang, Gloria Wang, Zijie J. Wang, Fuxuan Wei, Bryan Wilie, Genta Indra Winata, Xinyi Wu, Witold Wydmański, Tianbao Xie, Usama Yaseen, Michael A. Yee, Jing Zhang, Yue Zhang

Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on.

Data Augmentation

Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-sentence Dependency Graph

1 code implementation1 Dec 2021 Liyan Xu, Xuchao Zhang, Bo Zong, Yanchi Liu, Wei Cheng, Jingchao Ni, Haifeng Chen, Liang Zhao, Jinho D. Choi

We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence.

Machine Reading Comprehension Sentence

What Went Wrong? Explaining Overall Dialogue Quality through Utterance-Level Impacts

no code implementations EMNLP (NLP4ConvAI) 2021 James D. Finch, Sarah E. Finch, Jinho D. Choi

Improving user experience of a dialogue system often requires intensive developer effort to read conversation logs, run statistical analyses, and intuit the relative importance of system shortcomings.

Chatbot Weakly-supervised Learning

Intensionalizing Abstract Meaning Representations: Non-Veridicality and Scope

1 code implementation EMNLP (LAW, DMR) 2021 Gregor Williamson, Patrick Elliott, Yuxin Ji, Jinho D. Choi

We adopt a scope node from the literature and provide an explicit multidimensional semantics utilizing Cooper storage which allows us to derive the de re and de dicto scope readings as well as intermediate scope readings which prove difficult for accounts without a scope node.

Translation

StreamSide: A Fully-Customizable Open-Source Toolkit for Efficient Annotation of Meaning Representations

1 code implementation20 Sep 2021 Jinho D. Choi, Gregor Williamson

This demonstration paper presents StreamSide, an open-source toolkit for annotating multiple kinds of meaning representations.

Sentence

The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders

1 code implementation EMNLP 2021 Han He, Jinho D. Choi

Multi-task learning with transformer encoders (MTL) has emerged as a powerful technique to improve performance on closely-related tasks for both accuracy and efficiency while a question still remains whether or not it would perform as well on tasks that are distinct in nature.

Multi-Task Learning NER +1

ELIT: Emory Language and Information Toolkit

1 code implementation8 Sep 2021 Han He, Liyan Xu, Jinho D. Choi

We introduce ELIT, the Emory Language and Information Toolkit, which is a comprehensive NLP framework providing transformer-based end-to-end models for core tasks with a special focus on memory efficiency while maintaining state-of-the-art accuracy and speed.

AMR Parsing Constituency Parsing +9

Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation

1 code implementation EMNLP 2021 Liyan Xu, Xuchao Zhang, Xujiang Zhao, Haifeng Chen, Feng Chen, Jinho D. Choi

Recent multilingual pre-trained language models have achieved remarkable zero-shot performance, where the model is only finetuned on one source language and directly evaluated on target languages.

Cross-Lingual Transfer named-entity-recognition +4

Adapted End-to-End Coreference Resolution System for Anaphoric Identities in Dialogues

no code implementations ACL (CODI, CRAC) 2021 Liyan Xu, Jinho D. Choi

We present an effective system adapted from the end-to-end neural coreference resolution model, targeting on the task of anaphora resolution in dialogues.

coreference-resolution Transfer Learning

Levi Graph AMR Parser using Heterogeneous Attention

1 code implementation ACL (IWPT) 2021 Han He, Jinho D. Choi

Coupled with biaffine decoders, transformers have been effectively adapted to text-to-graph transduction and achieved state-of-the-art performance on AMR parsing.

AMR Parsing

View Distillation with Unlabeled Data for Extracting Adverse Drug Effects from User-Generated Data

no code implementations NAACL (SMM4H) 2021 Payam Karisani, Jinho D. Choi, Li Xiong

Then a classifier is trained on each view to label a set of unlabeled documents to be used as an initializer for a new classifier in the other view.

Word Embeddings

Competence-Level Prediction and Resume & Job Description Matching Using Context-Aware Transformer Models

no code implementations5 Nov 2020 Changmao Li, Elaine Fisher, Rebecca Thomas, Steve Pittard, Vicki Hertzberg, Jinho D. Choi

This paper presents a comprehensive study on resume classification to reduce the time and labor needed to screen an overwhelming number of applications significantly, while improving the selection of suitable candidates.

Revealing the Myth of Higher-Order Inference in Coreference Resolution

1 code implementation EMNLP 2020 Liyan Xu, Jinho D. Choi

We find that given a high-performing encoder such as SpanBERT, the impact of HOI is negative to marginal, providing a new perspective of HOI to this task.

Avg Clustering +2

Emora: An Inquisitive Social Chatbot Who Cares For You

no code implementations10 Sep 2020 Sarah E. Finch, James D. Finch, Ali Ahmadvand, Ingyu, Choi, Xiangjue Dong, Ruixiang Qi, Harshita Sahijwani, Sergey Volokhin, Zihan Wang, ZiHao Wang, Jinho D. Choi

Inspired by studies on the overwhelming presence of experience-sharing in human-human conversations, Emora, the social chatbot developed by Emory University, aims to bring such experience-focused interaction to the current field of conversational AI.

Chatbot intent-classification +1

XD at SemEval-2020 Task 12: Ensemble Approach to Offensive Language Identification in Social Media Using Transformer Encoders

no code implementations SEMEVAL 2020 Xiangjue Dong, Jinho D. Choi

This paper presents six document classification models using the latest transformer encoders and a high-performing ensemble model for a task of offensive language identification in social media.

Document Classification Language Identification

Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal Dependency Parsing

no code implementations WS 2020 Han He, Jinho D. Choi

Our results show that models using the multilingual encoder outperform ones using the language specific encoders for most languages.

Dependency Parsing

Emora STDM: A Versatile Framework for Innovative Dialogue System Development

1 code implementation SIGDIAL (ACL) 2020 James D. Finch, Jinho D. Choi

This demo paper presents Emora STDM (State Transition Dialogue Manager), a dialogue system development framework that provides novel workflows for rapid prototyping of chat-based dialogue managers as well as collaborative development of complex interactions.

Dialogue Management Management

Towards Unified Dialogue System Evaluation: A Comprehensive Analysis of Current Evaluation Protocols

no code implementations SIGDIAL (ACL) 2020 Sarah E. Finch, Jinho D. Choi

As conversational AI-based dialogue management has increasingly become a trending topic, the need for a standardized and reliable evaluation procedure grows even more pressing.

Dialogue Management Management

Analysis of the Penn Korean Universal Dependency Treebank (PKT-UD): Manual Revision to Build Robust Parsing Model in Korean

no code implementations WS 2020 Tae Hwan Oh, Ji Yoon Han, Hyonsu Choe, Seokwon Park, Han He, Jinho D. Choi, Na-Rae Han, Jena D. Hwang, Hansaem Kim

In this paper, we first open on important issues regarding the Penn Korean Universal Treebank (PKT-UD) and address these issues by revising the entire corpus manually with the aim of producing cleaner UD annotations that are more faithful to Korean grammar.

Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media

no code implementations WS 2020 Xiangjue Dong, Changmao Li, Jinho D. Choi

We present a transformer-based sarcasm detection model that accounts for the context from the entire conversation thread for more robust predictions.

Sarcasm Detection

Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning

no code implementations WS 2020 Liyan Xu, Julien Hogan, Rachel E. Patzer, Jinho D. Choi

This paper presents a reinforcement learning approach to extract noise in long clinical documents for the task of readmission prediction after kidney transplant.

Document Classification General Classification +3

Incremental Sense Weight Training for the Interpretation of Contextualized Word Embeddings

no code implementations5 Nov 2019 Xinyi Jiang, Zhengzhe Yang, Jinho D. Choi

We hypothesize that not all dimensions are equally important for downstream tasks so that our algorithm can detect unessential dimensions and discard them without hurting the performance.

Word Embeddings Word Sense Disambiguation

Design and Challenges of Cloze-Style Reading Comprehension Tasks on Multiparty Dialogue

no code implementations2 Nov 2019 Changmao Li, Tianhao Liu, Jinho D. Choi

According to our analysis, replacing the random data split with a chronological data split reduces test accuracy on previous single-variable passage completion task from 72\% to 34\%, that leaves much more room to improve.

Reading Comprehension

FriendsQA: Open-Domain Question Answering on TV Show Transcripts

no code implementations WS 2019 Zhengzhe Yang, Jinho D. Choi

This paper presents FriendsQA, a challenging question answering dataset that contains 1, 222 dialogues and 10, 610 open-domain questions, to tackle machine comprehension on everyday conversations.

Open-Domain Question Answering Reading Comprehension

Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT

1 code implementation14 Aug 2019 Han He, Jinho D. Choi

This paper presents new state-of-the-art models for three tasks, part-of-speech tagging, syntactic parsing, and semantic parsing, using the cutting-edge contextualized embedding framework known as BERT.

Part-Of-Speech Tagging Semantic Parsing

Meta-Semantic Representation for Early Detection of Alzheimer's Disease

no code implementations WS 2019 Jinho D. Choi, Mengmei Li, Felicia Goldstein, Ihab Hajjar

This paper presents a new task-oriented meaning representation called meta-semantics, that is designed to detect patients with early symptoms of Alzheimer{'}s disease by analyzing their language beyond a syntactic or semantic level.

Knowledge Graphs

Multimodal Ensemble Approach to Incorporate Various Types of Clinical Notes for Predicting Readmission

no code implementations31 May 2019 Bonggun Shin, Julien Hogan, Andrew B. Adams, Raymond J. Lynch, Rachel E. Patzer, Jinho D. Choi

One of the modalities in EHRs, clinical notes, has not been fully explored for these tasks due to its unstructured and inexplicable nature.

Coordinate Structures in Universal Dependencies for Head-final Languages

no code implementations WS 2018 Hiroshi Kanayama, Na-Rae Han, Masayuki Asahara, Jena D. Hwang, Yusuke Miyao, Jinho D. Choi, Yuji Matsumoto

This paper discusses the representation of coordinate structures in the Universal Dependencies framework for two head-final languages, Japanese and Korean.

They Exist! Introducing Plural Mentions to Coreference Resolution and Entity Linking

2 code implementations COLING 2018 Ethan Zhou, Jinho D. Choi

To the best of our knowledge, this is the first time that plural mentions are thoroughly analyzed for these two resolution tasks.

coreference-resolution Entity Linking +3

SemEval 2018 Task 4: Character Identification on Multiparty Dialogues

no code implementations SEMEVAL 2018 Jinho D. Choi, Henry Y. Chen

Character identification is a task of entity linking that finds the global entity of each personal mention in multiparty dialogue.

Entity Linking Machine Translation +1

Challenging Reading Comprehension on Daily Conversation: Passage Completion on Multiparty Dialog

no code implementations NAACL 2018 Kaixin Ma, Tomasz Jurczyk, Jinho D. Choi

This paper presents a new corpus and a robust deep learning architecture for a task in reading comprehension, passage completion, on multiparty dialog.

Question Answering Reading Comprehension

Analysis of Wikipedia-based Corpora for Question Answering

no code implementations6 Jan 2018 Tomasz Jurczyk, Amit Deshmane, Jinho D. Choi

This paper gives comprehensive analyses of corpora based on Wikipedia for several tasks in question answering.

Question Answering Retrieval

Improving Document Clustering by Removing Unnatural Language

no code implementations WS 2017 Myungha Jang, Jinho D. Choi, James Allan

We view this problem as an information extraction task and build a multiclass classification model identifying unnatural language components into four categories.

Clustering Document Layout Analysis +1

Classification of Radiology Reports Using Neural Attention Models

no code implementations22 Aug 2017 Bonggun Shin, Falgun H. Chokshi, Timothy Lee, Jinho D. Choi

The electronic health record (EHR) contains a large amount of multi-dimensional and unstructured clinical data of significant operational and research value.

Classification General Classification

Emotion Detection on TV Show Transcripts with Sequence-based Convolutional Neural Networks

1 code implementation14 Aug 2017 Sayyed M. Zahiri, Jinho D. Choi

While there have been significant advances in detecting emotions from speech and image recognition, emotion detection on text is still under-explored and remained as an active research field.

Document Classification General Classification

ISS-MULT: Intelligent Sample Selection for Multi-Task Learning in Question Answering

no code implementations7 Aug 2017 Ali Ahmadvand, Jinho D. Choi

In addition, using ISS-MULT could finely improve the MULT method for question answering tasks, and these improvements prove more significant in the answer triggering task.

Answer Selection Multi-Task Learning

Robust Coreference Resolution and Entity Linking on Dialogues: Character Identification on TV Show Transcripts

no code implementations CONLL 2017 Henry Y. Chen, Ethan Zhou, Jinho D. Choi

This paper presents a novel approach to character identification, that is an entity linking task that maps mentions to characters in dialogues from TV show transcripts.

coreference-resolution Entity Linking +2

Cross-genre Document Retrieval: Matching between Conversational and Formal Writings

no code implementations WS 2017 Tomasz Jurczyk, Jinho D. Choi

This paper challenges a cross-genre document retrieval task, where the queries are in formal writing and the target documents are in conversational writing.

Retrieval Sentence

Improving Document Clustering by Eliminating Unnatural Language

no code implementations16 Mar 2017 Myungha Jang, Jinho D. Choi, James Allan

We view this problem as an information extraction task and build a multiclass classification model identifying unnatural language components into four categories.

Clustering

Lexicon Integrated CNN Models with Attention for Sentiment Analysis

no code implementations WS 2017 Bonggun Shin, Timothy Lee, Jinho D. Choi

With the advent of word embeddings, lexicons are no longer fully utilized for sentiment analysis although they still provide important features in the traditional setting.

Sentiment Analysis Word Embeddings

Multi-Field Structural Decomposition for Question Answering

no code implementations4 Apr 2016 Tomasz Jurczyk, Jinho D. Choi

This paper presents a precursory yet novel approach to the question answering task using structural decomposition.

Question Answering

Targetable Named Entity Recognition in Social Media

no code implementations4 Aug 2014 Sandeep Ashwini, Jinho D. Choi

We present a novel approach for recognizing what we call targetable named entities; that is, named entities in a targeted set (e. g, movies, books, TV shows).

named-entity-recognition Named Entity Recognition +1

Preparing Korean Data for the Shared Task on Parsing Morphologically Rich Languages

no code implementations6 Sep 2013 Jinho D. Choi

This document gives a brief description of Korean data prepared for the SPMRL 2013 shared task.

Morphological Analysis

Empty Argument Insertion in the Hindi PropBank

no code implementations LREC 2012 Ashwini Vaidya, Jinho D. Choi, Martha Palmer, Bhuvana Narasimhan

This paper examines both linguistic behavior and practical implication of empty argument insertion in the Hindi PropBank.

Semantic Role Labeling

Cannot find the paper you are looking for? You can Submit a new open access paper.