Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
4 code implementations • 3 Apr 2023 • Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal
How do large language models (LLMs) develop and evolve over the course of training?
Ranked #4 on Language Modelling on LAMBADA (Perplexity metric)
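Pythia's selling point is that every model in the suite ships with intermediate training checkpoints, so training dynamics can be studied directly. A minimal sketch of loading one such checkpoint, assuming the suite is hosted on the Hugging Face Hub under `EleutherAI/pythia-*` with per-step revision tags (the model ID and step tag below are illustrative):

```python
# Load an intermediate Pythia checkpoint via a Hub revision tag.
# Assumptions: model is hosted as EleutherAI/pythia-70m and the tag
# "step3000" names an early training checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "EleutherAI/pythia-70m"  # smallest model in the suite (assumed ID)
revision = "step3000"                 # assumed tag for a mid-training checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_name, revision=revision)
model = AutoModelForCausalLM.from_pretrained(model_name, revision=revision)

# Sample a continuation from the partially trained model.
inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=5)
print(tokenizer.decode(outputs[0]))
```

Repeating this over a range of step tags gives snapshots of the same model at different points in training, which is the kind of analysis the paper's question targets.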
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance
1 code implementation • 18 Apr 2022 • Katherine Crowson, Stella Biderman, Daniel Kornis, Dashiell Stander, Eric Hallahan, Louis Castricato, Edward Raff
Generating and editing images from open domain text prompts is a challenging task that heretofore has required expensive and specially trained models.
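The core idea behind VQGAN-CLIP is to optimize an image representation so that CLIP scores it as similar to a text prompt. A simplified sketch follows; for brevity it optimizes raw pixels rather than VQGAN latent codes (the method the paper actually uses), and the prompt and hyperparameters are illustrative:

```python
# CLIP-guided image optimization: ascend the cosine similarity between
# the image embedding and a text-prompt embedding. The pixel tensor
# below is a stand-in for the VQGAN latent used in the real method.
import torch
import clip  # pip install git+https://github.com/openai/CLIP.git

device = "cuda" if torch.cuda.is_available() else "cpu"
model, _ = clip.load("ViT-B/32", device=device)
model = model.float()  # avoid fp16/fp32 dtype mismatches on GPU

# Encode the target text prompt once.
tokens = clip.tokenize(["a watercolor painting of a lighthouse"]).to(device)
with torch.no_grad():
    text_feat = model.encode_text(tokens)
    text_feat = text_feat / text_feat.norm(dim=-1, keepdim=True)

# Directly optimized pixel tensor (stand-in for VQGAN latents).
image = torch.rand(1, 3, 224, 224, device=device, requires_grad=True)
optimizer = torch.optim.Adam([image], lr=0.05)

for step in range(200):
    img_feat = model.encode_image(image.clamp(0, 1))
    img_feat = img_feat / img_feat.norm(dim=-1, keepdim=True)
    loss = -(img_feat * text_feat).sum()  # maximize cosine similarity
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

Optimizing VQGAN latents instead of raw pixels, as the paper does, constrains the search to the manifold of plausible images and is what makes the outputs coherent rather than adversarial noise.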
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
5 code implementations • BigScience (ACL) 2022 • Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle McDonell, Jason Phang, Michael Pieler, USVSN Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license.
Ranked #86 on Multi-task Language Understanding on MMLU
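Since the weights are openly released, a minimal sketch of loading them, assuming they are hosted on the Hugging Face Hub as `EleutherAI/gpt-neox-20b` (the prompt below is illustrative; at 20B parameters the model needs roughly 40 GB of memory in fp16):

```python
# Load the released GPT-NeoX-20B weights and sample a continuation.
# device_map="auto" shards the model across available devices and
# requires the `accelerate` package.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-neox-20b",
    torch_dtype=torch.float16,  # halve memory relative to fp32
    device_map="auto",
)

inputs = tokenizer("GPT-NeoX-20B is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0]))
```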