Search Results for author: Karthik Raman

Found 20 papers, 1 papers with code

Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning

no code implementations16 Nov 2023 Kazuma Hashimoto, Karthik Raman, Michael Bendersky

Unlike the previous work, we introduce a novel labeling method, incremental utility, which estimates how much incremental knowledge is brought into the LLMs by a demonstration.

In-Context Learning Multi-class Classification +1

It's All Relative! -- A Synthetic Query Generation Approach for Improving Zero-Shot Relevance Prediction

no code implementations14 Nov 2023 Aditi Chaudhary, Karthik Raman, Michael Bendersky

Recent developments in large language models (LLMs) have shown promise in their ability to generate synthetic query-document pairs by prompting with as few as 8 demonstrations.

Ambiguity-Aware In-Context Learning with Large Language Models

no code implementations14 Sep 2023 Lingyu Gao, Aditi Chaudhary, Krishna Srinivasan, Kazuma Hashimoto, Karthik Raman, Michael Bendersky

In-context learning (ICL) i. e. showing LLMs only a few task-specific demonstrations has led to downstream gains with no task-specific fine-tuning required.

In-Context Learning Semantic Similarity +3

Exploring the Viability of Synthetic Query Generation for Relevance Prediction

no code implementations19 May 2023 Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Kazuma Hashimoto, Mike Bendersky, Marc Najork

While our experiments demonstrate that these modifications help improve performance of QGen techniques, we also find that QGen approaches struggle to capture the full nuance of the relevance label space and as a result the generated queries are not faithful to the desired relevance label.

Information Retrieval Question Answering +2

QUILL: Query Intent with Large Language Models using Retrieval Augmentation and Multi-stage Distillation

no code implementations27 Oct 2022 Krishna Srinivasan, Karthik Raman, Anupam Samanta, Lingrui Liao, Luca Bertelli, Mike Bendersky

Thus, in this paper we make the following contributions: (1) We demonstrate that Retrieval Augmentation of queries provides LLMs with valuable additional context enabling improved understanding.

Feature Engineering Knowledge Distillation +1

GROOT: Corrective Reward Optimization for Generative Sequential Labeling

no code implementations29 Sep 2022 Kazuma Hashimoto, Karthik Raman

GROOT works by training a generative sequential labeling model to match the decoder output distribution with that of the (black-box) reward function.

FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation

no code implementations28 Sep 2022 Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani

Retrieval-augmented generation models offer many benefits over standalone language models: besides a textual answer to a given query they provide provenance items retrieved from an updateable knowledge base.

Open-Domain Question Answering Re-Ranking +2

Transforming Sequence Tagging Into A Seq2Seq Task

no code implementations16 Mar 2022 Karthik Raman, Iftekhar Naim, Jiecao Chen, Kazuma Hashimoto, Kiran Yalasangi, Krishna Srinivasan

Pretrained, large, generative language models (LMs) have had great success in a wide range of sequence tagging and structured prediction tasks.

Hallucination Structured Prediction +1

Effect of Dormant Spare Capacity on the Attack Tolerance of Complex Networks

no code implementations25 Sep 2021 Sai Saranga Das M, Karthik Raman

The vulnerability of networks to targeted attacks is an issue of widespread interest for policymakers, military strategists, network engineers and systems biologists alike.

The art of molecular computing: whence and whither

no code implementations12 Feb 2021 Sahana Gangadharan, Karthik Raman

An astonishingly diverse biomolecular circuitry orchestrates the functioning machinery underlying every living cell.

Goal directed molecule generation using Monte Carlo Tree Search

no code implementations30 Oct 2020 Anand A. Rajasekar, Karthik Raman, Balaraman Ravindran

One challenging and essential task in biochemistry is the generation of novel molecules with desired properties.

Navigate

DICT-MLM: Improved Multilingual Pre-Training using Bilingual Dictionaries

no code implementations23 Oct 2020 Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Jiecao Chen

In particular, by requiring the model to predict the language-specific token, the MLM objective disincentivizes learning a language-agnostic representation -- which is a key goal of multilingual pre-training.

Language Modelling Masked Language Modeling +1

Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation

no code implementations1 Sep 2019 Aditya Siddhant, Melvin Johnson, Henry Tsai, Naveen Arivazhagan, Jason Riesa, Ankur Bapna, Orhan Firat, Karthik Raman

The recently proposed massively multilingual neural machine translation (NMT) system has been shown to be capable of translating over 100 languages to and from English within a single model.

Cross-Lingual Transfer Machine Translation +3

Learning Multilingual Word Embeddings Using Image-Text Data

no code implementations WS 2019 Karan Singhal, Karthik Raman, Balder ten Cate

There has been significant interest recently in learning multilingual word embeddings -- in which semantically similar words across languages have similar embeddings.

Multilingual Word Embeddings Semantic Similarity +1

Methods for Ordinal Peer Grading

no code implementations14 Apr 2014 Karthik Raman, Thorsten Joachims

Thus, in this paper we study the problem of automatically inferring student grades from ordinal peer feedback, as opposed to existing methods that require cardinal peer feedback.

Cannot find the paper you are looking for? You can Submit a new open access paper.