TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Arabic Text Diacritization	Tashkeela	CBHG model	Diacritic Error Rate	0.0113	# 1
Arabic Text Diacritization	Tashkeela	CBHG model	Word Error Rate (WER)	0.0443	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/effective-deep-learning-models-for-automatic/arabic-text-diacritization-on-tashkeela-1)](https://paperswithcode.com/sota/arabic-text-diacritization-on-tashkeela-1?p=effective-deep-learning-models-for-automatic)`

Effective Deep Learning Models for Automatic Diacritization of Arabic Text

1 Nov 2020 · Mokthar Ali Hasan Madhfar, Ali Mustafa Qamar ·

While building a text-to-speech system for the Arabic language, we found that the system synthesized speeches with many pronunciation errors. The primary source of these errors is the lack of diacritics in modern standard Arabic writing. These diacritics are small strokes that appear above or below each letter to provide pronunciation and grammatical information. We propose three deep learning models to recover Arabic text diacritics based on our work in a text-to-speech synthesis system using deep learning. The first model is a baseline model used to test how a simple deep learning model performs on the corpora. The second model is based on an encoder-decoder architecture, which resembles our text-to-speech synthesis model with many modifications to suit this problem. The last model is based on the encoder part of the text-to-speech model, which achieves state-of-the-art performances in both word error rate and diacritic error rate metrics. These models will benefit a wide range of natural language processing applications such as text-to-speech, part-of-speech tagging, and machine translation.

PDF

Code

Add Remove Mark official

almodhfer/Arabic_Diacritization official

Tasks

Add Remove

Arabic Text Diacritization

Decoder

Machine Translation

Part-Of-Speech Tagging

Speech Synthesis

Text-To-Speech Synthesis

Translation

Datasets

Arabic Text Diacritization

Results from the Paper

Add Remove

Ranked #1 on Arabic Text Diacritization on Tashkeela

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Arabic Text Diacritization	Tashkeela	CBHG model	Diacritic Error Rate	0.0113	# 1	Compare
Arabic Text Diacritization	Tashkeela	CBHG model	Word Error Rate (WER)	0.0443	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Effective Deep Learning Models for Automatic Diacritization of Arabic Text

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove