Search Results for author: Alexander Wettig

Found 9 papers, 8 papers with code

QuRating: Selecting High-Quality Data for Training Language Models

1 code implementation15 Feb 2024 Alexander Wettig, Aatmik Gupta, Saumya Malik, Danqi Chen

Selecting high-quality pre-training data is important for creating capable language models, but existing methods rely on simple heuristics.

In-Context Learning

Poisoning Retrieval Corpora by Injecting Adversarial Passages

1 code implementation29 Oct 2023 Zexuan Zhong, Ziqing Huang, Alexander Wettig, Danqi Chen

Dense retrievers have achieved state-of-the-art performance in various information retrieval tasks, but to what extent can they be safely deployed in real-world applications?

Information Retrieval Natural Questions +1

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

no code implementations10 Oct 2023 Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan

We find real-world software engineering to be a rich, sustainable, and challenging testbed for evaluating the next generation of language models.

Bug fixing Code Generation +1

Adapting Language Models to Compress Contexts

1 code implementation24 May 2023 Alexis Chevalier, Alexander Wettig, Anirudh Ajith, Danqi Chen

Transformer-based language models (LMs) are powerful and widely-applicable tools, but their usefulness is constrained by a finite context window and the expensive computational cost of processing long text documents.

In-Context Learning Language Modelling +3

Finding Dataset Shortcuts with Grammar Induction

1 code implementation20 Oct 2022 Dan Friedman, Alexander Wettig, Danqi Chen

Many NLP datasets have been found to contain shortcuts: simple decision rules that achieve surprisingly high accuracy.

Sentence Sentence Classification

A Kernel-Based View of Language Model Fine-Tuning

1 code implementation11 Oct 2022 Sadhika Malladi, Alexander Wettig, Dingli Yu, Danqi Chen, Sanjeev Arora

It has become standard to solve NLP tasks by fine-tuning pre-trained language models (LMs), especially in low-data settings.

Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.