Search Results for author: Samuel Carton

Found 9 papers, 3 papers with code

Human-Centered Evaluation of Explanations

no code implementations NAACL (ACL) 2022 Jordan Boyd-Graber, Samuel Carton, Shi Feng, Q. Vera Liao, Tania Lombrozo, Alison Smith-Renner, Chenhao Tan

The NLP community are increasingly interested in providing explanations for NLP models to help people make sense of model behavior and potentially improve human interaction with models.

Learning to Ignore Adversarial Attacks

no code implementations23 May 2022 Yiming Zhang, Yangqiaoyu Zhou, Samuel Carton, Chenhao Tan

Despite the strong performance of current NLP models, they can be brittle against adversarial attacks.

Data Augmentation

Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation

no code implementations25 Apr 2022 Vivian Lai, Samuel Carton, Rajat Bhatnagar, Q. Vera Liao, Yunfeng Zhang, Chenhao Tan

Despite impressive performance in many benchmark datasets, AI models can still make mistakes, especially among out-of-distribution examples.

Open-Ended Question Answering

What to Learn, and How: Toward Effective Learning from Rationales

1 code implementation Findings (ACL) 2022 Samuel Carton, Surya Kanoria, Chenhao Tan

Learning from rationales seeks to augment model prediction accuracy using human-annotated rationales (i. e. subsets of input tokens) that justify their chosen labels, often in the form of intermediate or multitask supervision.

Evaluating and Characterizing Human Rationales

1 code implementation EMNLP 2020 Samuel Carton, Anirudh Rathore, Chenhao Tan

Two main approaches for evaluating the quality of machine-generated rationales are: 1) using human rationales as a gold standard; and 2) automated metrics based on how rationales affect model behavior.

Open-Ended Question Answering

Harnessing Explanations to Bridge AI and Humans

no code implementations16 Mar 2020 Vivian Lai, Samuel Carton, Chenhao Tan

Machine learning models are increasingly integrated into societally critical applications such as recidivism prediction and medical diagnosis, thanks to their superior predictive power.

Decision Making Medical Diagnosis

Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation

1 code implementation IJCNLP 2019 Cristina Garbacea, Samuel Carton, Shiyan Yan, Qiaozhu Mei

We conduct a large-scale, systematic study to evaluate the existing evaluation methods for natural language generation in the context of generating online product reviews.

Review Generation Text Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.