Search Results for author: Guangxuan Xu

Found 10 papers, 3 papers with code

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

no code implementations 4 Feb 2024 Gaurav Pandey, Yatin Nandwani, Tahira Naseem, Mayank Mishra, Guangxuan Xu, Dinesh Raghu, Sachindra Joshi, Asim Munawar, Ramón Fernandez Astudillo

Following the success of Proximal Policy Optimization (PPO) for Reinforcement Learning from Human Feedback (RLHF), new techniques such as Sequence Likelihood Calibration (SLiC) and Direct Preference Optimization (DPO) have been proposed that are offline in nature and use rewards in an indirect manner.

Text Generation

Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Children's Fairy Tales

no code implementations 26 May 2023 Paulina Toro Isaza, Guangxuan Xu, Akintoye Oloko, Yufang Hou, Nanyun Peng, Dakuo Wang

Social biases and stereotypes are embedded in our culture in part through their presence in our stories, as evidenced by the rich history of humanities and social science literature analyzing such biases in children's stories.

EnDex: Evaluation of Dialogue Engagingness at Scale

1 code implementation 22 Oct 2022 Guangxuan Xu, Ruibo Liu, Fabrice Harel-Canada, Nischal Reddy Chandra, Nanyun Peng

We propose EnDex, the first human-reaction based model to evaluate dialogue engagingness.

NECE: Narrative Event Chain Extraction Toolkit

no code implementations 17 Aug 2022 Guangxuan Xu, Paulina Toro Isaza, Moshi Li, Akintoye Oloko, Bingsheng Yao, Cassia Sanctos, Aminat Adebiyi, Yufang Hou, Nanyun Peng, Dakuo Wang

To understand a narrative, it is essential to comprehend the temporal event flows, especially those associated with main characters; however, this can be challenging with lengthy and unstructured narrative texts.

Question Answering

Non-Parallel Text Style Transfer with Self-Parallel Supervision

1 code implementation ICLR 2022 Ruibo Liu, Chongyang Gao, Chenyan Jia, Guangxuan Xu, Soroush Vosoughi

The performance of existing text style transfer models is severely limited by the non-parallel datasets on which the models are trained.

Imitation Learning, Style Transfer

Can Model Compression Improve NLP Fairness

no code implementations 21 Jan 2022 Guangxuan Xu, Qingyuan Hu

Model compression techniques are receiving increasing attention; however, the effect of compression on model fairness is still underexplored.

Fairness, Knowledge Distillation

On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark

1 code implementation Findings (ACL) 2022 Hao Sun, Guangxuan Xu, Jiawen Deng, Jiale Cheng, Chujie Zheng, Hao Zhou, Nanyun Peng, Xiaoyan Zhu, Minlie Huang

We propose a taxonomy for dialogue safety specifically designed to capture unsafe behaviors in human-bot dialogue settings, with a focus on context-sensitive unsafety, which is underexplored in prior work.

Mitigating Political Bias in Language Models Through Reinforced Calibration

no code implementations 30 Apr 2021 Ruibo Liu, Chenyan Jia, Jason Wei, Guangxuan Xu, Lili Wang, Soroush Vosoughi

Current large-scale language models can be politically biased as a result of the data they are trained on, potentially causing serious problems when they are deployed in real-world settings.

Reinforcement Learning (RL)

Enhanced Offensive Language Detection Through Data Augmentation

no code implementations 5 Dec 2020 Ruibo Liu, Guangxuan Xu, Soroush Vosoughi

In this work, we present Dager (Data Augmenter), a generation-based data augmentation method that improves classification performance on imbalanced and low-resource data such as the offensive language dataset.

Data Augmentation, Task 2
