Search Results for author: Jiawen Deng

Found 11 papers, 5 papers with code

SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVM

no code implementations2 Jan 2024 Weijin Cheng, Jianzhi Liu, Jiawen Deng, Fuji Ren

Consequently, we propose a simple and safe prompt engineering method (SSP) to improve image generation quality by providing optimal camera descriptions.

Image Generation Instruction Following +1

COKE: A Cognitive Knowledge Graph for Machine Theory of Mind

no code implementations9 May 2023 Jincenzi Wu, Zhuang Chen, Jiawen Deng, Sahand Sabour, Minlie Huang

To empower AI systems with the ToM ability and narrow the gap between them and humans, in this paper, we propose COKE: the first cognitive knowledge graph for machine theory of mind.

Automated Paper Screening for Clinical Reviews Using Large Language Models

no code implementations1 May 2023 Eddie Guo, Mehul Gupta, Jiawen Deng, Ye-Jean Park, Mike Paget, Christopher Naugler

Objective: To assess the performance of the OpenAI GPT API in accurately and efficiently identifying relevant titles and abstracts from real-world clinical review datasets and compare its performance against ground truth labelling by two independent human reviewers.

Safety Assessment of Chinese Large Language Models

2 code implementations20 Apr 2023 Hao Sun, Zhexin Zhang, Jiawen Deng, Jiale Cheng, Minlie Huang

To further promote the safe deployment of LLMs, we develop a Chinese LLM safety assessment benchmark.

Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements

no code implementations18 Feb 2023 Jiawen Deng, Jiale Cheng, Hao Sun, Zhexin Zhang, Minlie Huang

This survey presents a framework for safety research pertaining to large models, delineating the landscape of safety risks as well as safety evaluation and improvement methods.

Adversarial Attack Ethics

Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation

1 code implementation4 Dec 2022 Zhexin Zhang, Jiale Cheng, Hao Sun, Jiawen Deng, Fei Mi, Yasheng Wang, Lifeng Shang, Minlie Huang

In order to detect such toxic generations, existing methods rely on templates, real-world data extraction, crowdsourcing workers, or automatic generation to construct adversarial contexts that are likely to induce toxic generations.

Response Generation

Perplexity from PLM Is Unreliable for Evaluating Text Quality

no code implementations12 Oct 2022 Yequan Wang, Jiawen Deng, Aixin Sun, Xuying Meng

Recently, amounts of works utilize perplexity~(PPL) to evaluate the quality of the generated text.

Common Sense Reasoning

Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks

1 code implementation16 Feb 2022 Jingyan Zhou, Jiawen Deng, Fei Mi, Yitong Li, Yasheng Wang, Minlie Huang, Xin Jiang, Qun Liu, Helen Meng

The research of open-domain dialog systems has been greatly prospered by neural models trained on large-scale corpora, however, such corpora often introduce various safety problems (e. g., offensive languages, biases, and toxic behaviors) that significantly hinder the deployment of dialog systems in practice.

Bias Detection Open-Domain Dialog

COLD: A Benchmark for Chinese Offensive Language Detection

1 code implementation16 Jan 2022 Jiawen Deng, Jingyan Zhou, Hao Sun, Chujie Zheng, Fei Mi, Helen Meng, Minlie Huang

To this end, we propose a benchmark --COLD for Chinese offensive language analysis, including a Chinese Offensive Language Dataset --COLDATASET and a baseline detector --COLDETECTOR which is trained on the dataset.

On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark

1 code implementation Findings (ACL) 2022 Hao Sun, Guangxuan Xu, Jiawen Deng, Jiale Cheng, Chujie Zheng, Hao Zhou, Nanyun Peng, Xiaoyan Zhu, Minlie Huang

We propose a taxonomy for dialogue safety specifically designed to capture unsafe behaviors in human-bot dialogue settings, with focuses on context-sensitive unsafety, which is under-explored in prior works.

Cannot find the paper you are looking for? You can Submit a new open access paper.