Search Results for author: Wazir Ali

Found 6 papers, 0 papers with code

Creating and Evaluating Resources for Sentiment Analysis in the Low-resource Language: Sindhi

no code implementations EACL (WASSA) 2021 Wazir Ali, Naveed Ali, Yong Dai, Jay Kumar, Saifullah Tumrani, Zenglin Xu

In this paper, we develop a Sindhi subjective lexicon using a merger of existing English resources: the NRC lexicon, a list of opinion words, SentiWordNet, a Sindhi-English bilingual dictionary, and a collection of Sindhi modifiers.

Sentiment Analysis Subjectivity Analysis +1

SiPOS: A Benchmark Dataset for Sindhi Part-of-Speech Tagging

no code implementations RANLP 2021 Wazir Ali, Zenglin Xu, Jay Kumar

In this paper, we introduce the SiPOS dataset for part-of-speech tagging in the low-resource Sindhi language with quality baselines.

Part-Of-Speech Tagging text annotation

An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering

no code implementations ACL 2020 Jay Kumar, Junming Shao, Salah Uddin, Wazir Ali

Clustering short text streams is a challenging task due to its unique properties: infinite length, sparse data representation and cluster evolution.

Clustering Short Text Clustering

SiNER: A Large Dataset for Sindhi Named Entity Recognition

no code implementations LREC 2020 Wazir Ali, Junyu Lu, Zenglin Xu

We introduce SiNER, a named entity recognition (NER) dataset for the low-resourced Sindhi language, with quality baselines.

Named Entity Recognition +1

Word Embedding based New Corpus for Low-resourced Language: Sindhi

no code implementations 28 Nov 2019 Wazir Ali, Jay Kumar, Junyu Lu, Zenglin Xu

Our intrinsic evaluation results demonstrate the high quality of our generated Sindhi word embeddings using SG, CBoW, and GloVe as compared to SdfastText word representations.

Word Embeddings
