no code implementations • 16 Nov 2023 • Kazuma Hashimoto, Karthik Raman, Michael Bendersky
Unlike previous work, we introduce a novel labeling method, incremental utility, which estimates how much incremental knowledge a demonstration brings to the LLM.
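To make the idea concrete, here is a minimal sketch of how an incremental-utility score could be computed, assuming a black-box `logprob(prompt, output)` scorer (a placeholder interface, not the paper's API): a demonstration's utility is the gain in the model's log-probability of the gold output once the demonstration is added to the prompt.

```python
from typing import Callable

def incremental_utility(
    logprob: Callable[[str, str], float],  # assumed scorer: (prompt, output) -> log-prob
    demo: str,
    query: str,
    gold: str,
) -> float:
    """Utility of `demo`: gain in log P(gold | prompt) when demo is prepended."""
    base = logprob(query, gold)                       # zero-shot score
    with_demo = logprob(demo + "\n\n" + query, gold)  # one-shot score
    return with_demo - base                           # positive => demo adds knowledge
```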
no code implementations • 14 Nov 2023 • Aditi Chaudhary, Karthik Raman, Michael Bendersky
Recent developments in large language models (LLMs) have shown promise in their ability to generate synthetic query-document pairs by prompting with as few as 8 demonstrations.
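As a hedged illustration of that few-shot setup (the template below is an assumption, not the paper's actual prompt), a query-generation prompt can be assembled from up to 8 (document, query) demonstrations:

```python
def build_qgen_prompt(demos, document):
    """demos: list of (document, query) pairs; at most 8 are used as demonstrations."""
    parts = [f"Document: {doc}\nQuery: {query}" for doc, query in demos[:8]]
    parts.append(f"Document: {document}\nQuery:")  # the model completes the query
    return "\n\n".join(parts)
```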
no code implementations • 14 Sep 2023 • Lingyu Gao, Aditi Chaudhary, Krishna Srinivasan, Kazuma Hashimoto, Karthik Raman, Michael Bendersky
In-context learning (ICL), i.e., showing LLMs only a few task-specific demonstrations, has led to downstream gains with no task-specific fine-tuning required.
no code implementations • 19 May 2023 • Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Kazuma Hashimoto, Mike Bendersky, Marc Najork
While our experiments demonstrate that these modifications help improve the performance of QGen techniques, we also find that QGen approaches struggle to capture the full nuance of the relevance label space, and as a result the generated queries are not faithful to the desired relevance label.
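For context, label-conditioned query generation of the kind critiqued here can be prompted roughly as follows; the label set and wording are illustrative assumptions, not the paper's scheme. The finding above is that the generated query often drifts from the requested `label`.

```python
RELEVANCE_LABELS = ("irrelevant", "partially relevant", "highly relevant")

def build_labeled_qgen_prompt(document: str, label: str) -> str:
    """Ask the model for a query at a specific relevance level."""
    assert label in RELEVANCE_LABELS
    return (
        f"Document: {document}\n"
        f"Write a search query that is {label} to the document above.\n"
        "Query:"
    )
```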
no code implementations • 21 Dec 2022 • Kazuma Hashimoto, Iftekhar Naim, Karthik Raman
Sequence labeling is a core task in text understanding for IE/IR systems.
no code implementations • 27 Oct 2022 • Krishna Srinivasan, Karthik Raman, Anupam Samanta, Lingrui Liao, Luca Bertelli, Mike Bendersky
Thus, in this paper we make the following contributions: (1) We demonstrate that Retrieval Augmentation of queries provides LLMs with valuable additional context, enabling improved understanding.
no code implementations • 29 Sep 2022 • Kazuma Hashimoto, Karthik Raman
GROOT works by training a generative sequential labeling model to match the decoder output distribution with that of the (black-box) reward function.
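A rough sketch of that distribution-matching idea, written against abstract callables since GROOT's exact objective is not reproduced here (this is a generic reward-weighted cross-entropy, an assumption in the same spirit): sampled candidates are re-weighted by a softmax over their black-box rewards, and the decoder is pulled toward that target distribution.

```python
import math

def reward_matching_loss(samples, logprob, reward):
    """samples: candidate label sequences drawn from the decoder for one input;
    logprob(seq) -> model log P(seq | input); reward(seq) -> black-box score."""
    rewards = [reward(s) for s in samples]
    m = max(rewards)                              # subtract max for numerical stability
    weights = [math.exp(r - m) for r in rewards]  # softmax over rewards...
    z = sum(weights)
    weights = [w / z for w in weights]            # ...defines the target distribution
    # Cross-entropy from the reward-weighted target to the model's distribution.
    return -sum(w * logprob(s) for w, s in zip(weights, samples))
```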
no code implementations • 28 Sep 2022 • Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani
Retrieval-augmented generation models offer many benefits over standalone language models: besides a textual answer to a given query, they provide provenance items retrieved from an updateable knowledge base.
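The interface this buys is easy to sketch; `retrieve` and `generate` below are stand-in functions, not a real API.

```python
def answer_with_provenance(query, retrieve, generate, k=5):
    """Return a textual answer plus the retrieved items that ground it."""
    passages = retrieve(query, k)                       # items from the knowledge base
    context = "\n\n".join(p["text"] for p in passages)  # assumes each item has a 'text' field
    answer = generate(f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")
    return answer, passages                             # the passages serve as provenance
```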
no code implementations • 7 Jul 2022 • Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani
This paper studies multi-task training of retrieval-augmented generation models for knowledge-intensive tasks.
no code implementations • 16 Mar 2022 • Karthik Raman, Iftekhar Naim, Jiecao Chen, Kazuma Hashimoto, Kiran Yalasangi, Krishna Srinivasan
Large pretrained generative language models (LMs) have had great success in a wide range of sequence tagging and structured prediction tasks.
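One common way to cast tagging as generation, shown here as an assumption rather than this paper's exact target format, is to serialize token-tag pairs into the output string:

```python
def tags_to_target(tokens, tags):
    """['Karthik', 'works'] + ['B-PER', 'O'] -> 'Karthik|B-PER works|O'."""
    return " ".join(f"{tok}|{tag}" for tok, tag in zip(tokens, tags))

def target_to_tags(target):
    """Invert the serialization back into token and tag lists."""
    pairs = [piece.rsplit("|", 1) for piece in target.split()]
    return [tok for tok, _ in pairs], [tag for _, tag in pairs]
```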
no code implementations • 5 Nov 2021 • Debomita Chakraborty, Raghunathan Rengaswamy, Karthik Raman
Genetic circuit design is a well-studied problem in synthetic biology.
no code implementations • 25 Sep 2021 • Sai Saranga Das M, Karthik Raman
The vulnerability of networks to targeted attacks is an issue of widespread interest for policymakers, military strategists, network engineers and systems biologists alike.
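A standard way to probe this empirically (hedged: the paper may study different attack strategies and robustness measures) is to delete hubs one at a time and track how fast the giant component collapses:

```python
import networkx as nx

def targeted_attack_curve(G: nx.Graph, steps: int):
    """Remove the highest-degree node `steps` times; return giant-component sizes."""
    G = G.copy()
    sizes = []
    for _ in range(steps):
        if G.number_of_nodes() == 0:
            break
        hub = max(G.degree, key=lambda kv: kv[1])[0]  # current highest-degree node
        G.remove_node(hub)
        if G.number_of_nodes() == 0:
            sizes.append(0)
        else:
            sizes.append(len(max(nx.connected_components(G), key=len)))
    return sizes
```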
3 code implementations • 2 Mar 2021 • Krishna Srinivasan, Karthik Raman, Jiecao Chen, Michael Bendersky, Marc Najork
First, WIT is the largest multimodal dataset by number of image-text examples, exceeding the next largest by 3x (at the time of writing).
Ranked #1 on Image Retrieval on WIT
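WIT is released as TSV shards (see the google-research-datasets/wit repository); a minimal reader might look like this, though the field names below should be checked against the schema of the download in hand.

```python
import csv

def read_wit_rows(path, limit=5):
    """Yield (image_url, reference caption) pairs from one WIT TSV shard."""
    with open(path, newline="", encoding="utf-8") as f:
        for i, row in enumerate(csv.DictReader(f, delimiter="\t")):
            if i >= limit:
                break
            yield row["image_url"], row.get("caption_reference_description", "")
```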
no code implementations • 12 Feb 2021 • Sahana Gangadharan, Karthik Raman
An astonishingly diverse biomolecular circuitry orchestrates the functioning machinery underlying every living cell.
no code implementations • 30 Oct 2020 • Anand A. Rajasekar, Karthik Raman, Balaraman Ravindran
One challenging and essential task in biochemistry is the generation of novel molecules with desired properties.
no code implementations • 23 Oct 2020 • Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Jiecao Chen
In particular, by requiring the model to predict the language-specific token, the MLM objective disincentivizes learning a language-agnostic representation -- which is a key goal of multilingual pre-training.
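One natural relaxation in the spirit of this critique, sketched here as an assumption rather than the paper's method, is to accept any bilingual-dictionary translation of the masked word as a valid MLM target, so the objective no longer rewards encoding the language identity:

```python
def mlm_targets(masked_word, bilingual_dict):
    """Valid prediction targets: the word itself plus its dictionary translations.

    bilingual_dict maps a word to a set of translations, e.g. {'dog': {'Hund', 'chien'}}.
    """
    return {masked_word} | bilingual_dict.get(masked_word, set())
```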
no code implementations • Findings of the Association for Computational Linguistics 2020 • Jiecao Chen, Liu Yang, Karthik Raman, Michael Bendersky, Jung-Jung Yeh, Yun Zhou, Marc Najork, Danyang Cai, Ehsan Emadzadeh
Pre-trained models like BERT (Devlin et al., 2018) have dominated NLP/IR applications such as single-sentence classification, text-pair classification, and question answering.
no code implementations • 1 Sep 2019 • Aditya Siddhant, Melvin Johnson, Henry Tsai, Naveen Arivazhagan, Jason Riesa, Ankur Bapna, Orhan Firat, Karthik Raman
The recently proposed massively multilingual neural machine translation (NMT) system has been shown to be capable of translating over 100 languages to and from English within a single model.
no code implementations • WS 2019 • Karan Singhal, Karthik Raman, Balder ten Cate
There has been significant interest recently in learning multilingual word embeddings -- in which semantically similar words across languages have similar embeddings.
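The property in question is easy to state in code; the vectors below are toy values for illustration only.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Toy shared-space embeddings (made-up numbers): a translation pair should score near 1.
emb = {
    ("en", "dog"): np.array([0.90, 0.10, 0.00]),
    ("de", "Hund"): np.array([0.88, 0.12, 0.05]),
}
print(cosine(emb[("en", "dog")], emb[("de", "Hund")]))
```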
no code implementations • 14 Apr 2014 • Karthik Raman, Thorsten Joachims
Thus, in this paper we study the problem of automatically inferring student grades from ordinal peer feedback, as opposed to existing methods that require cardinal peer feedback.
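One standard baseline for turning ordinal (pairwise) feedback into cardinal scores, shown here as a hedged reference point rather than the paper's model, is a Bradley-Terry fit by gradient ascent:

```python
import math

def bradley_terry(n_items, comparisons, iters=500, lr=0.1):
    """comparisons: list of (winner, loser) index pairs; returns latent scores."""
    s = [0.0] * n_items
    for _ in range(iters):
        grad = [0.0] * n_items
        for w, l in comparisons:
            p = 1.0 / (1.0 + math.exp(s[l] - s[w]))  # P(w beats l) under current scores
            grad[w] += 1.0 - p                        # gradient of the log-likelihood
            grad[l] -= 1.0 - p
        s = [si + lr * g for si, g in zip(s, grad)]
    return s
```

Grades can then be read off from the fitted scores, e.g. by rescaling them onto the grading range.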