1 code implementation • 15 Mar 2024 • Tianjun Zhang, Shishir G. Patil, Naman Jain, Sheng Shen, Matei Zaharia, Ion Stoica, Joseph E. Gonzalez
In this paper, we present Retrieval Augmented FineTuning (RAFT), a training recipe that improves the model's ability to answer questions in an "open-book", in-domain setting.
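A minimal sketch of how a RAFT-style fine-tuning example might be assembled: a question is paired with the relevant ("oracle") document plus sampled distractor documents, so the model learns to answer from the right context. The field names, the distractor count, and the prompt layout here are illustrative assumptions, not the paper's actual data format.

```python
# Hypothetical RAFT-style data construction (assumed field names/layout).
import random

def make_raft_example(question, answer, oracle_doc, corpus,
                      num_distractors=3, rng=random):
    """Build one fine-tuning example mixing oracle and distractor contexts."""
    # Sample distractors from the corpus, excluding the oracle document.
    distractors = rng.sample(
        [d for d in corpus if d != oracle_doc], num_distractors
    )
    # Shuffle so the oracle's position carries no signal.
    context = distractors + [oracle_doc]
    rng.shuffle(context)
    return {
        "prompt": "\n\n".join(context) + "\n\nQuestion: " + question,
        "completion": answer,
    }
```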
no code implementations • 12 Mar 2024 • Naman Jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica
Large Language Models (LLMs) applied to code-related applications have emerged as a prominent field, attracting significant interest from both academia and industry.
no code implementations • 29 Feb 2024 • Alex Gu, Wen-Ding Li, Naman Jain, Theo X. Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama
In this work, we focus on these counterfeit samples: programs sampled from a language model that 1) have a high enough log-probability to be generated at a moderate temperature and 2) pass weak correctness checks.
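The two criteria above can be sketched as a simple filter over sampled programs: keep those whose average token log-probability is high enough and that pass a small set of weak input/output checks. All names, thresholds, and the sample structure here are hypothetical, chosen only to illustrate the definition.

```python
# Illustrative filter for "counterfeit" samples: plausible under the
# model (high log-probability) and passing weak correctness checks.

def mean_logprob(token_logprobs):
    """Average per-token log-probability of a sampled program."""
    return sum(token_logprobs) / len(token_logprobs)

def passes_weak_checks(program_fn, weak_tests):
    """Run a small set of input/output checks (a weak correctness proxy)."""
    try:
        return all(program_fn(x) == y for x, y in weak_tests)
    except Exception:
        return False

def find_counterfeits(samples, weak_tests, logprob_threshold=-1.5):
    """Keep samples that are both 'likely' and pass the weak checks."""
    return [
        s for s in samples
        if mean_logprob(s["token_logprobs"]) >= logprob_threshold
        and passes_weak_checks(s["fn"], weak_tests)
    ]
```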
no code implementations • 29 Feb 2024 • Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, Evgenii Zheltonozhskii, Nii Osae Osae Dade, Wenhao Yu, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Manan Dey, Edoardo Abati, Yekun Chai, Niklas Muennighoff, Xiangru Tang, Muhtasham Oblokulov, Christopher Akiki, Marc Marone, Chenghao Mou, Mayank Mishra, Alex Gu, Binyuan Hui, Tri Dao, Armel Zebaze, Olivier Dehaene, Nicolas Patry, Canwen Xu, Julian McAuley, Han Hu, Torsten Scholak, Sebastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, Mostofa Patwary, Nima Tajbakhsh, Yacine Jernite, Carlos Muñoz Ferrandis, Lingming Zhang, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries
Our large model, StarCoder2-15B, significantly outperforms other models of comparable size.
Ranked #25 on Code Generation on MBPP
no code implementations • 25 Nov 2023 • Naman Jain, Tianjun Zhang, Wei-Lin Chiang, Joseph E. Gonzalez, Koushik Sen, Ion Stoica
In this work, we investigate data quality for code and find that making the code more structured and readable leads to improved code generation performance of the system.
no code implementations • 7 Aug 2023 • Aditya G. Parameswaran, Shreya Shankar, Parth Asawa, Naman Jain, Yujie Wang
Large language models (LLMs) are incredibly powerful at comprehending and generating data in the form of text, but are brittle and error-prone.
no code implementations • 26 May 2021 • Akhilesh Ravi, Amit Yadav, Jainish Chauhan, Jatin Dholakia, Naman Jain, Mayank Singh
The increasing use of dialogue agents makes it extremely desirable for them to understand and acknowledge the implied emotions to respond like humans with empathy.
3 code implementations • 15 Feb 2021 • Ajaykrishna Karthikeyan, Naman Jain, Nagarajan Natarajan, Prateek Jain
Decision trees provide a rich family of highly non-linear yet efficient models, making them the go-to family of predictive models for practitioners across domains.
1 code implementation • 25 Jul 2020 • Naman Jain, Ankush Chauhan, Atharva Chewale, Ojas Mithbavkar, Ujjaval Shah, Mayank Singh
Song lyrics convey a meaningful story in a creative manner with complex rhythmic patterns.
no code implementations • WS 2020 • Sriram Balasubramanian, Naman Jain, Gaurav Jindal, Abhijeet Awasthi, Sunita Sarawagi
We evaluate named entity representations of BERT-based NLP models by investigating their robustness to replacements from the same typed class in the input.
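The robustness probe described above can be sketched as follows: swap an entity in the input for another entity of the same typed class and check whether the model's prediction stays stable. The substitution pools, function names, and trial count are illustrative assumptions, not the paper's code.

```python
# Illustrative typed-class entity substitution probe (assumed names).
import random

# Hypothetical pools of entities grouped by typed class.
TYPED_POOLS = {
    "PERSON": ["Alice Johnson", "Rahul Mehta", "Mei Lin"],
    "ORG": ["Acme Corp", "Globex", "Initech"],
}

def perturb(sentence, entity, entity_type, rng=random):
    """Replace `entity` with a different entity of the same typed class."""
    pool = [e for e in TYPED_POOLS[entity_type] if e != entity]
    return sentence.replace(entity, rng.choice(pool))

def is_robust(predict, sentence, entity, entity_type, n_trials=5):
    """The model is robust if its output is unchanged under substitutions."""
    original = predict(sentence)
    return all(
        predict(perturb(sentence, entity, entity_type)) == original
        for _ in range(n_trials)
    )
```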
no code implementations • LREC 2020 • Arjit Srivastava, Avijit Vajpayee, Syed Sarfaraz Akhtar, Naman Jain, Vinay Singh, Manish Shrivastava
The advent of social media has immensely proliferated the amount of opinions and arguments voiced on the internet.
2 code implementations • 23 Nov 2019 • Davinder Singh, Naman Jain, Pranjali Jain, Pratik Kayal, Sudhakar Kumawat, Nipun Batra
Early detection of plant diseases remains difficult due to the lack of lab infrastructure and expertise.
no code implementations • 16 Oct 2019 • Monarch Parmar, Naman Jain, Pranjali Jain, P Jayakrishna Sahit, Soham Pachpande, Shruti Singh, Mayank Singh
It also provides temporal statistics such as the year-wise popularity of topics, datasets, and seminal papers.
no code implementations • 18 Aug 2019 • Sahil Shah, Naman Jain, Abhishek Sharma, Arjun Jain
This paper provides a comprehensive study of adversarial attacks on human pose estimation models and an evaluation of their robustness.
no code implementations • COLING 2016 • Riyaz A. Bhat, Irshad A. Bhat, Naman Jain, Dipti Misra Sharma
With respect to text processing, addressing the differences between the Hindi and Urdu texts would be beneficial in the following ways: (a) instead of training separate models, their individual resources can be augmented to train single, unified models for better generalization, and (b) their individual text processing applications can be used interchangeably under varied resource conditions.