CalBERT - Code-mixed Adaptive Language representations using BERT

A code-mixed language is one that combines two or more language varieties in its script or speech. Analysis of code-mixed text is difficult because the language used is not consistent and does not work with existing monolingual approaches. We propose a novel approach to improve Transformer performance by introducing an additional step called "Siamese Pre-Training", which allows pre-trained monolingual Transformers to adapt their language representations to code-mixed languages with only a few examples of code-mixed data. The proposed architectures beat the state-of-the-art F1-score on the Sentiment Analysis for Indian Languages (SAIL) dataset, with the largest improvement being 5.1 points, and also achieve state-of-the-art accuracy on the IndicGLUE Product Reviews dataset, beating the benchmark by 0.4 points.
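The sketch below illustrates what a "Siamese Pre-Training" step could look like under simple assumptions: a single shared pre-trained encoder embeds a code-mixed sentence and its monolingual counterpart, and a cosine-distance objective pulls the two representations together. The model name, the example sentence pairs, the mean-pooling scheme, and the loss choice are illustrative assumptions, not the authors' exact configuration.

import torch
from torch import nn
from transformers import AutoTokenizer, AutoModel

# Assumption: any pre-trained (multilingual) BERT checkpoint can serve as the base encoder
model_name = "bert-base-multilingual-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
encoder = AutoModel.from_pretrained(model_name)
optimizer = torch.optim.AdamW(encoder.parameters(), lr=2e-5)

# Hypothetical (code-mixed sentence, monolingual translation) pairs
pairs = [
    ("movie bahut accha tha", "the movie was very good"),
    ("khana bilkul pasand nahi aaya", "I did not like the food at all"),
]

def embed(sentences):
    """Mean-pool the encoder's last hidden states into one vector per sentence."""
    batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
    hidden = encoder(**batch).last_hidden_state           # (batch, seq_len, dim)
    mask = batch["attention_mask"].unsqueeze(-1).float()  # ignore padding tokens
    return (hidden * mask).sum(1) / mask.sum(1)

encoder.train()
for code_mixed, monolingual in pairs:
    emb_cm = embed([code_mixed])      # same encoder for both inputs (shared weights)
    emb_mono = embed([monolingual])
    # Pull the two representations together: 1 - cosine similarity as the loss
    loss = 1.0 - nn.functional.cosine_similarity(emb_cm, emb_mono).mean()
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

After this additional pre-training step, the adapted encoder would be fine-tuned on the downstream task (e.g., sentiment classification) in the usual way.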

Task                 Dataset                          Model     Metric Name   Metric Value   Global Rank
Sentiment Analysis   IITP Product Reviews Sentiment   CalBERT   Accuracy      79.4           # 1
Sentiment Analysis   SAIL 2017                        CalBERT   F1            62             # 1
Sentiment Analysis   SAIL 2017                        CalBERT   Precision     61.8           # 1
Sentiment Analysis   SAIL 2017                        CalBERT   Recall        61.8           # 1

Methods