1 code implementation • NAACL 2022 • Ramit Sawhney, Ritesh Soun, Shrey Pandit, Megh Thakkar, Sarvagya Malaviya, Yuval Pinter
CIAug achieves state-of-the-art results over existing interpolative augmentation methods on 10 benchmark datasets across 4 languages in text classification and named-entity recognition tasks.
no code implementations • EMNLP (MRL) 2021 • Ramit Sawhney, Megh Thakkar, Shrey Pandit, Debdoot Mukherjee, Lucie Flek
Interpolation-based regularisation methods have proven to be effective for various tasks and modalities.
no code implementations • EMNLP (MRL) 2021 • Megh Thakkar, Vishwa Shah, Ramit Sawhney, Debdoot Mukherjee
Cross-lingual transfer learning has been explored for a variety of tasks.
1 code implementation • EMNLP 2021 • Ramit Sawhney, Megh Thakkar, Shivam Agarwal, Di Jin, Diyi Yang, Lucie Flek
Interpolation-based regularisation methods for data augmentation have proven to be effective for various tasks and modalities.
1 code implementation • ACL 2022 • Ramit Sawhney, Megh Thakkar, Shrey Pandit, Ritesh Soun, Di Jin, Diyi Yang, Lucie Flek
Interpolation-based regularisation methods such as Mixup, which generate virtual training samples, have proven to be effective for various tasks and modalities. We extend Mixup and propose DMix, an adaptive distance-aware interpolative Mixup that selects samples based on their diversity in the embedding space.
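To make the idea concrete, below is a minimal sketch of interpolation-based augmentation in the spirit of Mixup, with a distance-aware partner selection step loosely illustrating the diversity-based selection described for DMix. This is not the authors' implementation; the selection rule and the helper names (select_diverse_partner, mixup_interpolate) are assumptions for illustration only.

```python
# Hypothetical sketch, not the DMix reference code: pairs each sample with its
# least similar neighbour in the embedding space, then mixes hidden states and
# labels with a Beta-sampled coefficient as in standard Mixup.
import torch
import torch.nn.functional as F


def select_diverse_partner(embeddings: torch.Tensor) -> torch.Tensor:
    """Pick, for each sample, the most distant sample in the embedding space
    (one simple notion of 'diversity'; the paper's actual criterion may differ)."""
    normed = F.normalize(embeddings, dim=-1)   # (batch, dim)
    sim = normed @ normed.t()                  # cosine similarity matrix
    sim.fill_diagonal_(float("inf"))           # never pair a sample with itself
    return sim.argmin(dim=-1)                  # index of the least similar sample


def mixup_interpolate(hidden: torch.Tensor, labels: torch.Tensor, alpha: float = 0.2):
    """Interpolate each hidden state (and its soft label) with its selected partner."""
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    partner = select_diverse_partner(hidden)
    mixed_hidden = lam * hidden + (1.0 - lam) * hidden[partner]
    mixed_labels = lam * labels + (1.0 - lam) * labels[partner]
    return mixed_hidden, mixed_labels
```

In this sketch the interpolation is applied to hidden representations rather than raw inputs, which is one common way interpolative regularisers are adapted to text; the actual layer choice and sampling schedule in DMix may differ.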
2 code implementations • 12 Mar 2024 • Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H. Laradji, Manuel Del Verme, Tom Marty, Léo Boisvert, Megh Thakkar, Quentin Cappart, David Vazquez, Nicolas Chapados, Alexandre Lacoste
We study the use of large language model-based agents for interacting with software via web browsers.
no code implementations • 2 Nov 2023 • Megh Thakkar, Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, Sarath Chandar, Partha Talukdar
Once the pre-training corpus has been assembled, all data samples in the corpus are treated with equal importance during LM pre-training.
1 code implementation • 11 May 2023 • Han Cheol Moon, Shafiq Joty, Ruochen Zhao, Megh Thakkar, Xu Chi
Large-scale pre-trained language models have shown outstanding performance in a variety of NLP tasks.
1 code implementation • 16 Nov 2022 • Linlin Liu, Xingxuan Li, Megh Thakkar, Xin Li, Shafiq Joty, Luo Si, Lidong Bing
Due to their huge number of parameters, fine-tuned pretrained language models (PLMs) are prone to overfitting in low-resource scenarios.
2 code implementations • ACL 2022 • Shankar Kantharaj, Rixie Tiffany Ko Leong, Xiang Lin, Ahmed Masry, Megh Thakkar, Enamul Hoque, Shafiq Joty
We also introduce several state-of-the-art neural baselines that use image captioning and data-to-text generation techniques to tackle two problem variations: one assumes the underlying data table of the chart is available, while the other must extract data directly from chart images.