Search Results for author: Jon Saad-Falcon

Found 12 papers, 8 papers with code

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

no code implementations • 12 Feb 2024 • Jon Saad-Falcon, Daniel Y. Fu, Simran Arora, Neel Guha, Christopher Ré

Retrieval pipelines-an integral component of many machine learning systems-perform poorly in domains where documents are long (e. g., 10K tokens or more) and where identifying the relevant document requires synthesizing information across the entire text.

Benchmarking Chunking +2

Paper
Add Code

ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems

1 code implementation • 16 Nov 2023 • Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia

Evaluating retrieval-augmented generation (RAG) systems traditionally relies on hand annotations for input queries, passages to retrieve, and responses to generate.

Retrieval

263

Paper
Code

PDFTriage: Question Answering over Long, Structured Documents

no code implementations • 16 Sep 2023 • Jon Saad-Falcon, Joe Barrow, Alexa Siu, Ani Nenkova, David Seunghyun Yoon, Ryan A. Rossi, Franck Dernoncourt

Representing such structured documents as plain text is incongruous with the user's mental model of these documents with rich structure.

Question Answering Retrieval

Paper
Add Code

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

1 code implementation • 1 Mar 2023 • Jon Saad-Falcon, Omar Khattab, Keshav Santhanam, Radu Florian, Martin Franz, Salim Roukos, Avirup Sil, Md Arafat Sultan, Christopher Potts

Many information retrieval tasks require large labeled datasets for fine-tuning.

Information Retrieval Retrieval +1

697

Paper
Code

Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

no code implementations • 2 Dec 2022 • Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts

Neural information retrieval (IR) systems have progressed rapidly in recent years, in large part due to the release of publicly available benchmarking tasks.

Benchmarking Information Retrieval +1

Paper
Add Code

Embedding Recycling for Language Models

1 code implementation • 11 Jul 2022 • Jon Saad-Falcon, Amanpreet Singh, Luca Soldaini, Mike D'Arcy, Arman Cohan, Doug Downey

Real-world applications of neural language models often involve running many different models over the same corpus.

Question Answering Text Classification

Paper
Code

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction

3 code implementations • NAACL 2022 • Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia

Neural information retrieval (IR) has greatly advanced search and other knowledge-intensive language tasks.

Ranked #6 on Zero-shot Text Search on BEIR

Information Retrieval Open-Domain Question Answering +2

2,437

Paper
Code

Quantifying the Impact of Human Capital, Job History, and Language Factors on Job Seniority with a Large-scale Analysis of Resumes

no code implementations • 15 Jun 2021 • Austin P Wright, Caleb Ziems, Haekyu Park, Jon Saad-Falcon, Duen Horng Chau, Diyi Yang, Maria Tomprou

As job markets worldwide have become more competitive and applicant selection criteria have become more opaque, and different (and sometimes contradictory) information and advice is available for job seekers wishing to progress in their careers, it has never been more difficult to determine which factors in a r\'esum\'e most effectively help career progression.

Paper
Add Code

EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML Models

2 code implementations • 30 Mar 2021 • Omar Shaikh, Jon Saad-Falcon, Austin P Wright, Nilaksh Das, Scott Freitas, Omar Isaac Asensio, Duen Horng Chau

The advent of larger machine learning (ML) models have improved state-of-the-art (SOTA) performance in various modeling tasks, ranging from computer vision to natural language.

175

Paper
Code

Examining the Ordering of Rhetorical Strategies in Persuasive Requests

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Omar Shaikh, Jiaao Chen, Jon Saad-Falcon, Duen Horng Chau, Diyi Yang

We find that specific (orderings of) strategies interact uniquely with a request's content to impact success rate, and thus the persuasiveness of a request.

Persuasiveness

Paper
Code

Mapping Researchers with PeopleMap

1 code implementation • 31 Aug 2020 • Jon Saad-Falcon, Omar Shaikh, Zijie J. Wang, Austin P. Wright, Sasha Richardson, Duen Horng Chau

Discovering research expertise at universities can be a difficult task.

Paper
Code

PeopleMap: Visualization Tool for Mapping Out Researchers using Natural Language Processing

1 code implementation • 10 Jun 2020 • Jon Saad-Falcon, Omar Shaikh, Zijie J. Wang, Austin P. Wright, Sasha Richardson, Duen Horng Chau

Discovering research expertise at institutions can be a difficult task.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.