no code implementations • 1 Apr 2024 • Wilson Wu, John X. Morris, Lionel Levine
Do transformers "think ahead" during inference at a given position?
1 code implementation • 2 Feb 2024 • Zach Nussbaum, John X. Morris, Brandon Duderstadt, Andriy Mulyar
This technical report describes the training of nomic-embed-text-v1, the first fully reproducible, open-source, open-weights, open-data, 8192 context length English text embedding model that outperforms both OpenAI Ada-002 and OpenAI text-embedding-3-small on short and long-context tasks.
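A minimal usage sketch, assuming the model is published on the Hugging Face Hub as `nomic-ai/nomic-embed-text-v1` and loadable through sentence-transformers; the task-prefix convention shown (e.g. `search_document: `) follows the model card, which should be treated as the authoritative reference.

```python
# Sketch: embedding text with nomic-embed-text-v1 via sentence-transformers.
# Assumes the Hub ID "nomic-ai/nomic-embed-text-v1" and that inputs take a
# task-instruction prefix such as "search_document: " (check the model card).
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("nomic-ai/nomic-embed-text-v1", trust_remote_code=True)

docs = [
    "search_document: Nomic Embed is a long-context text embedding model.",
    "search_document: It supports sequences up to 8192 tokens.",
]
embeddings = model.encode(docs)  # shape: (2, embedding_dim)
print(embeddings.shape)
```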
2 code implementations • 22 Nov 2023 • John X. Morris, Wenting Zhao, Justin T. Chiu, Vitaly Shmatikov, Alexander M. Rush
We consider the problem of language model inversion and show that next-token probabilities contain a surprising amount of information about the preceding text.
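To make the object of study concrete, here is a short sketch of the signal that language model inversion operates on: the full next-token probability distribution at a position. GPT-2 stands in for the target model; the inversion step itself (training a model to map such vectors back to the prompt) is not shown.

```python
# Extract a next-token probability vector, the input to LM inversion.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "The patient's diagnosis was"
inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits  # (1, seq_len, vocab_size)

# A single ~50k-dimensional probability vector: the claim is that vectors
# like this carry a surprising amount of information about the prompt.
next_token_probs = torch.softmax(logits[0, -1], dim=-1)
print(next_token_probs.shape)  # torch.Size([50257]) for GPT-2
```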
2 code implementations • 21 Oct 2023 • John X. Morris, Chandan Singh, Alexander M. Rush, Jianfeng Gao, Yuntian Deng
Prompting language models (LMs) is the main interface for applying them to new tasks.
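A minimal illustration of prompting as a task interface: the "program" is just a string template filled with the input. GPT-2 is used here only as a convenient stand-in for a stronger instruction-following model.

```python
# Zero-shot sentiment classification expressed purely as a prompt.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

template = "Review: {text}\nSentiment (positive or negative):"
prompt = template.format(text="The movie was a complete waste of time.")
print(generator(prompt, max_new_tokens=3)[0]["generated_text"])
```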
1 code implementation • 10 Oct 2023 • John X. Morris, Volodymyr Kuleshov, Vitaly Shmatikov, Alexander M. Rush
How much private information do text embeddings reveal about the original text?
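The released code for this work is the `vec2text` package. A minimal sketch of the inversion threat model follows; the entry-point names (`load_pretrained_corrector`, `invert_embeddings`) are taken from the project README and may drift between versions, so treat the exact API as an assumption and check the repository.

```python
# Sketch of embedding inversion with vec2text: a pretrained "corrector"
# iteratively refines hypothesis text until its embedding matches the target.
# API names are assumptions from the README; see github.com/jxmorris12/vec2text.
import torch
import vec2text

corrector = vec2text.load_pretrained_corrector("text-embedding-ada-002")

# target_embeddings would come from an embedding API the attacker observed;
# here it is a placeholder tensor with Ada-002's dimensionality (1536).
target_embeddings = torch.randn(1, 1536)

recovered = vec2text.invert_embeddings(
    embeddings=target_embeddings,
    corrector=corrector,
    num_steps=20,  # rounds of iterative refinement
)
print(recovered)  # best-guess reconstruction of the original text
```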
1 code implementation • 20 Oct 2022 • John X. Morris, Justin T. Chiu, Ramin Zabih, Alexander M. Rush
We propose an unsupervised deidentification method that masks words that leak personally identifying information.
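A toy sketch of the masking idea, not the paper's exact algorithm: greedily mask the word whose removal most reduces a reidentification model's confidence that the text matches a profile. Here `reid_prob` is a hypothetical stand-in for the learned text-profile matching model the paper uses.

```python
# Greedy masking-based deidentification (illustrative only).

def reid_prob(text: str, profile: str) -> float:
    """Hypothetical reidentification score: here, fraction of profile
    words that leak into the text (a real system uses a learned model)."""
    words = set(text.lower().split())
    leaked = [w for w in profile.lower().split() if w in words]
    return len(leaked) / max(len(profile.split()), 1)

def deidentify(text: str, profile: str, budget: int = 3) -> str:
    tokens = text.split()
    for _ in range(budget):
        scores = [
            (reid_prob(" ".join(tokens[:i] + ["[MASK]"] + tokens[i + 1:]), profile), i)
            for i, tok in enumerate(tokens) if tok != "[MASK]"
        ]
        if not scores:
            break
        _, best_i = min(scores)  # mask the word whose removal helps most
        tokens[best_i] = "[MASK]"
    return " ".join(tokens)

print(deidentify("Alice Smith teaches chemistry in Ithaca", "Alice Smith Ithaca"))
```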
2 code implementations • 4 Oct 2022 • Chandan Singh, John X. Morris, Jyoti Aneja, Alexander M. Rush, Jianfeng Gao
Large language models (LLMs) have displayed an impressive ability to harness natural language to perform complex tasks.
1 code implementation • 5 Oct 2020 • John X. Morris
In such attack methods, a valid adversarial example must both fool the model being attacked and be judged semantically or syntactically valid by a second model.
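A sketch of that two-model setup: a candidate counts as a valid adversarial example only if it flips the victim classifier and a second model, here a sentence encoder, judges it close enough to the original. The model names and the 0.9 threshold are illustrative assumptions.

```python
# Validity check for a candidate adversarial example using a second model.
from sentence_transformers import SentenceTransformer, util
from transformers import pipeline

victim = pipeline("sentiment-analysis")            # model under attack
encoder = SentenceTransformer("all-MiniLM-L6-v2")  # second "validity" model

def is_valid_adversarial(original: str, candidate: str, threshold: float = 0.9) -> bool:
    flipped = victim(original)[0]["label"] != victim(candidate)[0]["label"]
    sim = util.cos_sim(encoder.encode(original), encoder.encode(candidate)).item()
    return flipped and sim >= threshold
```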
2 code implementations • EMNLP (BlackboxNLP) 2020 • Jin Yong Yoo, John X. Morris, Eli Lifland, Yanjun Qi
We study the behavior of several black-box search algorithms used for generating adversarial examples for natural language processing (NLP) tasks.
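For context, the simplest member of the search-algorithm family benchmarked here is a greedy word-substitution search under a query budget. The sketch below is generic: `predict` and `synonyms` are hypothetical stand-ins for the black-box victim model and the word transformation; real attacks add semantic constraints and stronger search (beam, genetic, particle swarm).

```python
# Minimal greedy black-box word-swap search (illustrative).

def greedy_word_swap(text, predict, synonyms, max_queries=200):
    """Try synonym swaps left-to-right until the victim's label flips."""
    tokens = text.split()
    original_label = predict(text)
    queries = 1
    for i, word in enumerate(tokens):
        for candidate in synonyms(word):
            if queries >= max_queries:
                return None  # query budget exhausted
            trial = " ".join(tokens[:i] + [candidate] + tokens[i + 1:])
            queries += 1
            if predict(trial) != original_label:
                return trial  # adversarial example found
    return None  # search failed
```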
2 code implementations • EMNLP 2020 • John X. Morris, Eli Lifland, Jin Yong Yoo, Jake Grigsby, Di Jin, Yanjun Qi
TextAttack also includes data augmentation and adversarial training modules for using components of adversarial attacks to improve model accuracy and robustness.
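A short example of the augmentation module, which reuses attack transformations to generate training data. `EmbeddingAugmenter` swaps words for nearest neighbors in embedding space; the parameter names below follow the TextAttack docs but are worth checking against the current release.

```python
# Data augmentation with TextAttack's attack transformations.
from textattack.augmentation import EmbeddingAugmenter

augmenter = EmbeddingAugmenter(
    pct_words_to_swap=0.1,          # fraction of words to perturb
    transformations_per_example=4,  # augmented copies per input
)
print(augmenter.augment("TextAttack makes adversarial NLP research easier."))
```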
2 code implementations • Findings of the Association for Computational Linguistics 2020 • John X. Morris, Eli Lifland, Jack Lanchantin, Yangfeng Ji, Yanjun Qi
State-of-the-art attacks on NLP models lack a shared definition of what constitutes a successful attack.
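One way to make "successful attack" precise, in the spirit of this paper: success is defined relative to an explicit set of constraints on the perturbation, not just a flipped label. The constraint functions below are illustrative placeholders for, e.g., sentence-encoder similarity or a grammar checker.

```python
# A constraint-relative definition of attack success (illustrative).

def attack_succeeds(original, perturbed, predict, constraints):
    """An attack succeeds iff the label flips AND every constraint holds."""
    label_flipped = predict(original) != predict(perturbed)
    return label_flipped and all(c(original, perturbed) for c in constraints)

# Example placeholder constraints:
same_length = lambda a, b: len(a.split()) == len(b.split())
max_two_words_changed = lambda a, b: sum(x != y for x, y in zip(a.split(), b.split())) <= 2

# attack_succeeds(orig, adv, model_predict, [same_length, max_two_words_changed])
```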