no code implementations • 8 Apr 2024 • Rohan Deepak Ajwani, Zining Zhu, Jonathan Rose, Frank Rudzicz
Transformer-based Large Language Models (LLMs) have shown exceptional language generation capabilities in response to text-based prompts.
no code implementations • 1 Feb 2024 • Andrew Brown, Jiading Zhu, Mohamed Abdelwahab, Alec Dong, Cindy Wang, Jonathan Rose
Many will be motivated to distill specific capabilities of foundational models into smaller models that can be owned and controlled.