Search Results for author: Yunyao Li

Found 49 papers, 14 papers with code

Learning Explainable Linguistic Expressions with Neural Inductive Logic Programming for Sentence Classification

no code implementations • EMNLP 2020 • Prithviraj Sen, Marina Danilevsky, Yunyao Li, Siddhartha Brahma, Matthias Boehm, Laura Chiticariu, Rajasekar Krishnamurthy

Our user studies confirm that the learned LEs are explainable and capture domain semantics.

General Classification Inductive logic programming +4

Domain-Aware Dependency Parsing for Questions

no code implementations • Findings (ACL) 2021 • Aparna Garimella, Laura Chiticariu, Yunyao Li

Dependency Parsing

Label Definitions Improve Semantic Role Labeling

1 code implementation • NAACL 2022 • Li Zhang, Ishan Jindal, Yunyao Li

Given a sentence and the predicate, a semantic role label is assigned to each argument of the predicate.

Semantic Role Labeling Sentence

Improving Cross-lingual Text Classification with Zero-shot Instance-Weighting

no code implementations • ACL (RepL4NLP) 2021 • Irene Li, Prithviraj Sen, Huaiyu Zhu, Yunyao Li, Dragomir Radev

In this paper, we propose zero-shot instance-weighting, a general model-agnostic zero-shot learning framework for improving CLTC by leveraging source instance weighting.

text-classification Text Classification +1

Universal Proposition Bank 2.0

no code implementations • LREC 2022 • Ishan Jindal, Alexandre Rademaker, Michał Ulewicz, Ha Linh, Huyen Nguyen, Khoi-Nguyen Tran, Huaiyu Zhu, Yunyao Li

Semantic role labeling (SRL) represents the meaning of a sentence in the form of predicate-argument structures.

Semantic Role Labeling Sentence

AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

no code implementations • 23 May 2024 • Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar

Ranking is a fundamental and popular problem in search.

Open-Domain Question Answering Retrieval +1

Entity Disambiguation via Fusion Entity Decoding

no code implementations • 2 Apr 2024 • Junxiong Wang, Ali Mousavi, Omar Attia, Ronak Pradeep, Saloni Potdar, Alexander M. Rush, Umar Farooq Minhas, Yunyao Li

Existing generative approaches demonstrate improved accuracy compared to classification approaches under the standardized ZELDA benchmark.

Ranked #1 on Entity Linking on KORE50

Decoder Entity Disambiguation +2

Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs

1 code implementation • 27 Nov 2023 • Simone Conia, Min Li, Daniel Lee, Umar Farooq Minhas, Ihab Ilyas, Yunyao Li

Recent work in Natural Language Processing and Computer Vision has been using textual information -- e.g., entity names and descriptions -- available in knowledge graphs to ground neural models to high-quality structured data.

Entity Linking Machine Translation +1

FairytaleCQA: Integrating a Commonsense Knowledge Graph into Children's Storybook Narratives

no code implementations • 16 Nov 2023 • Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun

AI models (including LLMs) often rely on narrative question-answering (QA) datasets to provide customized QA functionalities to support downstream children's education applications; however, existing datasets only include QA pairs that are grounded within the given storybook content, but children can learn more when teachers refer the storybook content to real-world knowledge (e.g., commonsense knowledge).

Question Answering World Knowledge

FLEEK: Factual Error Detection and Correction with Evidence Retrieved from External Knowledge

no code implementations • 26 Oct 2023 • Farima Fatahi Bayat, Kun Qian, Benjamin Han, Yisi Sang, Anton Belyi, Samira Khorshidi, Fei Wu, Ihab F. Ilyas, Yunyao Li

Detecting factual errors in textual information, whether generated by large language models (LLMs) or curated by humans, is crucial for making informed decisions.

Attribute

Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation

no code implementations • 20 Sep 2023 • Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theo Rekatsinas, Benjamin Han, Yunyao Li, Jeff Pound, Josh Susskind, Natalie Schluter, Ihab Ilyas, Navdeep Jaitly

Guided by these observations, we construct a new, improved dataset called LAGRANGE using heuristics meant to improve equivalence between KG and text and show the impact of each of the heuristics on cyclic evaluation.

Hallucination Knowledge Graphs

Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture

1 code implementation • 22 May 2023 • Bingsheng Yao, Ishan Jindal, Lucian Popa, Yannis Katsis, Sayan Ghosh, Lihong He, Yuxuan Lu, Shashank Srivastava, Yunyao Li, James Hendler, Dakuo Wang

Our AL architecture leverages an explanation-generation model to produce explanations guided by human explanations, a prediction model that utilizes generated explanations toward prediction faithfully, and a novel data diversity-based AL sampling strategy that benefits from the explanation annotations.

Active Learning Decision Making +2

Growing and Serving Large Open-domain Knowledge Graphs

no code implementations • 16 May 2023 • Ihab F. Ilyas, JP Lacerda, Yunyao Li, Umar Farooq Minhas, Ali Mousavi, Jeffrey Pound, Theodoros Rekatsinas, Chiraag Sumanth

We then describe how our platform, including graph embeddings, can be leveraged to create a Semantic Annotation service that links unstructured Web documents to entities in our KG.

Entity Linking Fact Verification +2

When to Use What: An In-Depth Comparative Empirical Analysis of OpenIE Systems for Downstream Applications

no code implementations • 15 Nov 2022 • Kevin Pei, Ishan Jindal, Kevin Chen-Chuan Chang, ChengXiang Zhai, Yunyao Li

Open Information Extraction (OpenIE) has been used in the pipelines of various NLP tasks.

Open Information Extraction

PriMeSRL-Eval: A Practical Quality Metric for Semantic Role Labeling Systems Evaluation

1 code implementation • 12 Oct 2022 • Ishan Jindal, Alexandre Rademaker, Khoi-Nguyen Tran, Huaiyu Zhu, Hiroshi Kanayama, Marina Danilevsky, Yunyao Li

In this paper, we address key practical issues with existing evaluation scripts and propose a stricter SRL evaluation metric, PriMeSRL.

Semantic Role Labeling Sentence

Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours

1 code implementation • 2 Aug 2022 • Eyal Shnarch, Alon Halfon, Ariel Gera, Marina Danilevsky, Yannis Katsis, Leshem Choshen, Martin Santillan Cooper, Dina Epelboim, Zheng Zhang, Dakuo Wang, Lucy Yip, Liat Ein-Dor, Lena Dankin, Ilya Shnayderman, Ranit Aharonov, Yunyao Li, Naftali Liberman, Philip Levin Slesarev, Gwilym Newton, Shila Ofek-Koifman, Noam Slonim, Yoav Katz

Text classification can be useful in many real-world scenarios, saving a lot of time for end users.

Text Classification

Domain Representative Keywords Selection: A Probabilistic Approach

1 code implementation • Findings (ACL) 2022 • Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang, Yunyao Li, Lucian Popa, ChengXiang Zhai

We propose a probabilistic approach to select a subset of target domain representative keywords from a candidate set, contrasting with a context domain.

LNN-EL: A Neuro-Symbolic Approach to Short-text Entity Linking

1 code implementation • ACL 2021 • Hang Jiang, Sairam Gurajada, Qiuhao Lu, Sumit Neelam, Lucian Popa, Prithviraj Sen, Yunyao Li, Alexander Gray

Entity linking (EL), the task of disambiguating mentions in text by linking them to entities in a knowledge graph, is crucial for text understanding, question answering or conversational systems.

Entity Linking Inductive Bias +2

Development of an Enterprise-Grade Contract Understanding System

no code implementations • NAACL 2021 • Arvind Agarwal, Laura Chiticariu, Poornima Chozhiyath Raman, Marina Danilevsky, Diman Ghazi, Ankush Gupta, Shanmukha Guttula, Yannis Katsis, Rajasekar Krishnamurthy, Yunyao Li, Shubham Mudgal, Vitobha Munigala, Nicholas Phan, Dhaval Sonawane, Sneha Srinivasan, Sudarshan R. Thitte, Mitesh Vasa, Ramiya Venkatachalam, Vinitha Yaski, Huaiyu Zhu

Contracts are arguably the most important type of business documents.

Deep Learning on Graphs for Natural Language Processing

no code implementations • NAACL 2021 • Lingfei Wu, Yu Chen, Heng Ji, Yunyao Li

Due to its great power in modeling non-Euclidean data like graphs or manifolds, deep learning on graph techniques (i.e., Graph Neural Networks (GNNs)) have opened a new door to solving challenging graph-related NLP problems.

graph construction Graph Representation Learning +10

TableLab: An Interactive Table Extraction System with Adaptive Deep Learning

no code implementations • 16 Feb 2021 • Nancy Xin Ru Wang, Douglas Burdick, Yunyao Li

Perfect extraction quality is difficult to achieve with one single out-of-box model due to (1) the wide variety of table styles, (2) the lack of training data representing this variety and (3) the inherent ambiguity and subjectivity of table definitions between end-users.

Table Extraction

Leveraging Abstract Meaning Representation for Knowledge Base Question Answering

1 code implementation • Findings (ACL) 2021 • Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Reddy, Ryan Riegel, Gaetano Rossiello, Udit Sharma, G P Shrivatsa Bhargav, Mo Yu

Knowledge base question answering (KBQA) is an important task in Natural Language Processing.

Entity Linking Knowledge Base Question Answering +1

Exploiting Node Content for Multiview Graph Convolutional Network and Adversarial Regularization

1 code implementation • COLING 2020 • Qiuhao Lu, Nisansa de Silva, Dejing Dou, Thien Huu Nguyen, Prithviraj Sen, Berthold Reinwald, Yunyao Li

Network representation learning (NRL) is crucial in the area of graph learning.

Graph Learning Link Prediction +3

Improved Semantic Role Labeling using Parameterized Neighborhood Memory Adaptation

1 code implementation • 29 Nov 2020 • Ishan Jindal, Ranit Aharonov, Siddhartha Brahma, Huaiyu Zhu, Yunyao Li

Deep neural models achieve some of the best results for semantic role labeling.

Semantic Parsing Semantic Role Labeling +1

CLAR: A Cross-Lingual Argument Regularizer for Semantic Role Labeling

no code implementations • Findings of the Association for Computational Linguistics 2020 • Ishan Jindal, Yunyao Li, Siddhartha Brahma, Huaiyu Zhu

Although different languages have different argument annotations, polyglot training, the idea of training one model on multiple languages, has previously been shown to outperform monolingual baselines, especially for low resource languages.

Semantic Role Labeling Sentence

A Novel Workflow for Accurately and Efficiently Crowdsourcing Predicate Senses and Argument Labels

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Youxuan Jiang, Huaiyu Zhu, Jonathan K. Kummerfeld, Yunyao Li, Walter Lasecki

Resources for Semantic Role Labeling (SRL) are typically annotated by experts at great expense.

Semantic Role Labeling

Learning Structured Representations of Entity Names using Active Learning and Weak Supervision

1 code implementation • EMNLP 2020 • Kun Qian, Poornima Chozhiyath Raman, Yunyao Li, Lucian Popa

Structured representations of entity names are useful for many entity-related tasks such as entity normalization and variant generation.

Active Learning

Small but Mighty: New Benchmarks for Split and Rephrase

no code implementations • EMNLP 2020 • Li Zhang, Huaiyu Zhu, Siddhartha Brahma, Yunyao Li

Split and Rephrase is a text simplification task of rewriting a complex sentence into simpler ones.

Sentence Split and Rephrase +1

Jennifer for COVID-19: An NLP-Powered Chatbot Built for the People and by the People to Combat Misinformation

no code implementations • ACL 2020 • Yunyao Li, Tyrone Grandison, Patricia Silveyra, Ali Douraghy, Xinyu Guan, Thomas Kieselbach, Chengkai Li, Haiqi Zhang

Just as SARS-CoV-2, a new form of coronavirus, continues to infect a growing number of people around the world, harmful misinformation about the outbreak also continues to spread.

Chatbot Misinformation

Answering Complex Questions by Combining Information from Curated and Extracted Knowledge Bases

no code implementations • WS 2020 • Nikita Bhutani, Xinyi Zheng, Kun Qian, Yunyao Li, H. Jagadish

Knowledge-based question answering (KB_QA) has long focused on simple questions that can be answered from a single knowledge source, a manually curated or an automatically extracted KB.

Question Answering

CORD-19: The COVID-19 Open Research Dataset

4 code implementations • ACL 2020 • Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Doug Burdick, Darrin Eide, Kathryn Funk, Yannis Katsis, Rodney Kinney, Yunyao Li, Ziyang Liu, William Merrill, Paul Mooney, Dewey Murdick, Devvret Rishi, Jerry Sheehan, Zhihong Shen, Brandon Stilson, Alex Wade, Kuansan Wang, Nancy Xin Ru Wang, Chris Wilhelm, Boya Xie, Douglas Raymond, Daniel S. Weld, Oren Etzioni, Sebastian Kohlmeier

The COVID-19 Open Research Dataset (CORD-19) is a growing resource of scientific papers on COVID-19 and related historical coronavirus research.

Information Retrieval Management +1

Towards Universal Semantic Representation

no code implementations • WS 2019 • Huaiyu Zhu, Yunyao Li, Laura Chiticariu

Natural language understanding at the semantic level and independent of language variations is of great practical value.

Natural Language Understanding Semantic Role Labeling

HEIDL: Learning Linguistic Expressions with Deep Learning and Human-in-the-Loop

no code implementations • ACL 2019 • Yiwei Yang, Eser Kandogan, Yunyao Li, Walter S. Lasecki, Prithviraj Sen

While the role of humans is increasingly recognized in the machine learning community, representation of and interaction with models in current human-in-the-loop machine learning (HITL-ML) approaches are too low-level and far-removed from humans' conceptual models.

BIG-bench Machine Learning

Low-resource Deep Entity Resolution with Transfer and Active Learning

no code implementations • ACL 2019 • Jungo Kasai, Kun Qian, Sairam Gurajada, Yunyao Li, Lucian Popa

Recent adaptation of deep learning methods for ER mitigates the need for dataset-specific feature engineering by constructing distributed representations of entity records.

Active Learning Entity Resolution +2

DIMSIM: An Accurate Chinese Phonetic Similarity Algorithm Based on Learned High Dimensional Encoding

1 code implementation • CONLL 2018 • Min Li, Marina Danilevsky, Sara Noeman, Yunyao Li

Phonetic similarity algorithms identify words and phrases with similar pronunciation which are used in many natural language processing tasks.

Spelling Correction

Exploiting Structure in Representation of Named Entities using Active Learning

no code implementations • COLING 2018 • Nikita Bhutani, Kun Qian, Yunyao Li, H. V. Jagadish, Mauricio Hernández, Mitesh Vasa

We show that programs for mapping entity mentions to their structures can be automatically generated using human-comprehensible labels.

Active Learning Entity Linking +4

SystemT: Declarative Text Understanding for Enterprise

no code implementations • NAACL 2018 • Laura Chiticariu, Marina Danilevsky, Yunyao Li, Frederick Reiss, Huaiyu Zhu

The rise of enterprise applications over unstructured and semi-structured documents poses new challenges to text understanding systems across multiple dimensions.

Document Classification Entity Extraction using GAN +3

CROWD-IN-THE-LOOP: A Hybrid Approach for Annotating Semantic Roles

no code implementations • EMNLP 2017 • Chenguang Wang, Alan Akbik, Laura Chiticariu, Yunyao Li, Fei Xia, Anbang Xu

Crowdsourcing has proven to be an effective method for generating labeled data for a range of NLP tasks.

Machine Translation Question Answering +1

Multilingual Information Extraction with PolyglotIE

no code implementations • COLING 2016 • Alan Akbik, Laura Chiticariu, Marina Danilevsky, Yonas Kbrom, Yunyao Li, Huaiyu Zhu

We present PolyglotIE, a web-based tool for developing extractors that perform Information Extraction (IE) over multilingual data.

Semantic Parsing

Multilingual Aliasing for Auto-Generating Proposition Banks

no code implementations • COLING 2016 • Alan Akbik, Xinyu Guan, Yunyao Li

To address these issues, we propose to manually alias TL verbs to existing English frames.

Machine Translation Question Answering +1

K-SRL: Instance-based Learning for Semantic Role Labeling

no code implementations • COLING 2016 • Alan Akbik, Yunyao Li

To overcome this challenge, we propose the use of instance-based learning that performs no explicit generalization, but rather extrapolates predictions from the most similar instances in the training data.

Machine Translation Question Answering +1