no code implementations • 16 Feb 2024 • Arindam Mitra, Hamed Khanpour, Corby Rosset, Ahmed Awadallah
Ensembling provides a substantial boost in accuracy but at a significant cost increase with multiple calls to the model (e. g., Phi-GSM uses top-48 to boost the performance from 68. 2 to 81. 5).
Ranked #36 on Arithmetic Reasoning on GSM8K
no code implementations • 18 Nov 2023 • Arindam Mitra, Luciano del Corro, Shweti Mahajan, Andres Codas, Clarisse Simoes, Sahaj Agarwal, Xuxi Chen, Anastasia Razdaibiedina, Erik Jones, Kriti Aggarwal, Hamid Palangi, Guoqing Zheng, Corby Rosset, Hamed Khanpour, Ahmed Awadallah
Research on training small LMs has often relied on imitation learning to replicate the output of more capable models.
Ranked #1 on Crass AI on BIG-bench
no code implementations • 20 Jul 2023 • Somayeh Ghanbarzadeh, Yan Huang, Hamid Palangi, Radames Cruz Moreno, Hamed Khanpour
Recent studies have revealed that the widely-used Pre-trained Language Models (PLMs) propagate societal biases from the large unmoderated pre-training corpora.
no code implementations • 19 Jul 2023 • Somayeh Ghanbarzadeh, Hamid Palangi, Yan Huang, Radames Cruz Moreno, Hamed Khanpour
The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their generalization problem, where their performance drastically decreases when evaluated on examples that differ from the training dataset, known as Out-of-Distribution (OOD)/unseen examples.
no code implementations • 20 May 2022 • Simin Chen, Hamed Khanpour, Cong Liu, Wei Yang
With the privatization deployment of DNNs on edge devices, the security of on-device DNNs has raised significant concern.
no code implementations • EMNLP 2018 • Hamed Khanpour, Cornelia Caragea
Detecting fine-grained emotions in online health communities provides insightful information about patients{'} emotional states.
no code implementations • IJCNLP 2017 • Hamed Khanpour, Cornelia Caragea, Prakhar Biyani
Empathy captures one{'}s ability to correlate with and understand others{'} emotional states and experiences.
no code implementations • COLING 2016 • Hamed Khanpour, Guntak, Nishitha la, Rodney Nielsen
In this study, we applied a deep LSTM structure to classify dialogue acts (DAs) in open-domain conversations.
Automatic Speech Recognition (ASR) Dialogue Act Classification +5