Search Results for author: Abhinav Rao

Found 4 papers, 2 papers with code

NormAd: A Benchmark for Measuring the Cultural Adaptability of Large Language Models

no code implementations | 18 Apr 2024 | Abhinav Rao, Akhila Yerukola, Vishwa Shah, Katharina Reinecke, Maarten Sap

We introduce NormAd, a novel dataset, which includes 2.6k stories that represent social and cultural norms from 75 countries, to assess the ability of LLMs to adapt to different granular levels of socio-cultural contexts such as the country of origin, its associated cultural values, and prevalent social norms.

Navigate

Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs

no code implementations | 11 Oct 2023 | Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury

In this position paper, we argue that instead of morally aligning LLMs to a specific set of ethical principles, we should infuse generic ethical reasoning capabilities into them so that they can handle value pluralism at a global scale.

Ethics, Position

Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks

1 code implementation | 24 May 2023 | Abhinav Rao, Sachin Vashistha, Atharva Naik, Somak Aditya, Monojit Choudhury

Recent explorations with commercial Large Language Models (LLMs) have shown that non-expert users can jailbreak LLMs by simply manipulating their prompts, resulting in degenerate output behavior, privacy and security breaches, offensive outputs, and violations of content regulator policies.

Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin

1 code implementation | 10 Dec 2022 | Abhinav Rao, Ho Thi-Nga, Chng Eng-Siong

The focus languages are English, Mandarin, and Malay, which are three of the most popular languages in Singapore.

Punctuation Restoration, slot-filling, +2
