Search Results for author: Sudharsan Sundar

Found 1 paper, 0 papers with code

Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data

No code implementations | 24 Jun 2023 | Alycia Lee, Brando Miranda, Sudharsan Sundar, Sanmi Koyejo

Current trends in pre-training capable Large Language Models (LLMs) mostly focus on scaling model and dataset size.
