Search Results for author: Anuj Gupta

Found 7 papers, 1 papers with code

Noisy Text Data: Achilles’ Heel of BERT

no code implementations EMNLP (WNUT) 2020 Ankit Kumar, Piyush Makhija, Anuj Gupta

Owing to the phenomenal success of BERT on various NLP tasks and benchmark datasets, industry practitioners are actively experimenting with fine-tuning BERT to build NLP applications for solving industry use cases.

Sentiment Analysis SST-2 +2

Root Causing Prediction Anomalies Using Explainable AI

no code implementations4 Mar 2024 Ramanathan Vishnampet, Rajesh Shenoy, Jianhui Chen, Anuj Gupta

We have found this technique to be a model-agnostic, cheap and effective way to monitor complex data pipelines in production and have deployed a system for continuously analyzing the global feature importance distribution of continuously trained models.

Feature Correlation Feature Importance

Assistant, Parrot, or Colonizing Loudspeaker? ChatGPT Metaphors for Developing Critical AI Literacies

no code implementations15 Jan 2024 Anuj Gupta, Yasser Atef, Anna Mills, Maha Bali

This study explores how discussing metaphors for AI can help build awareness of the frames that shape our understanding of AI systems, particularly large language models (LLMs) like ChatGPT.

Ethics

Noisy Text Data: Achilles' Heel of popular transformer based NLP models

no code implementations7 Oct 2021 Kartikay Bagla, Ankit Kumar, Shivam Gupta, Anuj Gupta

However, for most datasets that are used by practitioners to build industrial NLP applications, it is hard to guarantee the presence of any noise in the data.

NER Open-Ended Question Answering +3

hinglishNorm - A Corpus of Hindi-English Code Mixed Sentences for Text Normalization

no code implementations COLING 2020 Piyush Makhija, Ankit Kumar, Anuj Gupta

We present hinglishNorm - a human annotated corpus of Hindi-English code-mixed sentences for text normalization task.

Sentence Translation

hinglishNorm -- A Corpus of Hindi-English Code Mixed Sentences for Text Normalization

2 code implementations18 Oct 2020 Piyush Makhija, Ankit Kumar, Anuj Gupta

We present hinglishNorm -- a human annotated corpus of Hindi-English code-mixed sentences for text normalization task.

Sentence Translation

Noisy Text Data: Achilles' Heel of BERT

no code implementations29 Mar 2020 Ankit Kumar, Piyush Makhija, Anuj Gupta

Owing to the phenomenal success of BERT on various NLP tasks and benchmark datasets, industry practitioners are actively experimenting with fine-tuning BERT to build NLP applications for solving industry use cases.

Sentiment Analysis SST-2 +2

Cannot find the paper you are looking for? You can Submit a new open access paper.