no code implementations • 15 Mar 2024 • Diganta Misra, Jay Gala, Antonio Orvieto
The strength of modern large-scale neural networks lies in their ability to efficiently adapt to new tasks with few examples.
1 code implementation • 26 Jan 2024 • Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M. Khapra, Raj Dabre, Rudra Murthy, Anoop Kunchukuttan
We announce the initial release of "Airavata," an instruction-tuned LLM for Hindi.
no code implementations • 25 Jan 2024 • Jaavid Aktar Husain, Raj Dabre, Aswanth Kumar, Jay Gala, Thanmay Jayakumar, Ratish Puduppully, Anoop Kunchukuttan
This study addresses the challenge of extending Large Language Models (LLMs) to non-English languages using non-Roman scripts.
no code implementations • 22 Jan 2024 • Pranjal A. Chitale, Jay Gala, Raj Dabre
While we establish the significance of the quality of the target distribution over the source distribution of demonstrations, we further observe that perturbations sometimes act as regularizers, resulting in performance improvements.
no code implementations • 9 Nov 2023 • Jay Gala, Sauradip Nag, Huichou Huang, Ruirui Liu, Xiatian Zhu
Cloud analysis is a critical component of weather and climate science, impacting various sectors like disaster management.
2 code implementations • 25 May 2023 • Jay Gala, Pranjal A. Chitale, Raghavan AK, Varun Gumma, Sumanth Doddapaneni, Aswanth Kumar, Janki Nawale, Anupama Sujatha, Ratish Puduppully, Vivek Raghavan, Pratyush Kumar, Mitesh M. Khapra, Raj Dabre, Anoop Kunchukuttan
Prior to this work, there was (i) no parallel training data spanning all 22 languages, (ii) no robust benchmarks covering all these languages and containing content relevant to India, and (iii) no existing translation models which support all the 22 scheduled languages of India.
1 code implementation • 18 Feb 2023 • Jay Gala, Deep Gandhi, Jash Mehta, Zeerak Talat
Hate speech detection has been the subject of high research attention, due to the scale of content created on social media.
no code implementations • 1 Dec 2021 • Jay Gala, Pengtao Xie
In this work, we aim to investigate how effectively we can leverage this exceptional learning ability to improve machine learning models.