Search Results for author: Sanath Jayasena

Found 8 papers, 1 papers with code

BERTifying Sinhala - A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification

1 code implementation • LREC 2022 • Vinura Dhananjaya, Piyumal Demotte, Surangika Ranathunga, Sanath Jayasena

We test on a set of different Sinhala text classification tasks and our analysis shows that out of the pre-trained multilingual models that include Sinhala (XLM-R, LaBSE, and LASER), XLM-R is the best model by far for Sinhala text classification.

text-classification Text Classification +1

Paper
Code

Dialog policy optimization for low resource setting using Self-play and Reward based Sampling

no code implementations • PACLIC 2020 • Tharindu Madusanka, Durashi Langappuli, Thisara Welmilla, Uthayasanker Thayasivam, Sanath Jayasena

Paper
Add Code

BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification

no code implementations • 16 Aug 2022 • Vinura Dhananjaya, Piyumal Demotte, Surangika Ranathunga, Sanath Jayasena

text-classification Text Classification +1

Paper
Add Code

Effect of Pressure for Compositionality on Language Emergence

no code implementations • 29 Sep 2021 • Mihira Kasun Vithanage, Rukshan Darshana Wijesinghe, Alex Xavier, Dumindu Tissera, Sanath Jayasena, Subha Fernando

In this paper, we present a learning environment where agents are pressured to make their emerging languages compositional by incorporating a metric of topological similarity into the loss function.

Paper
Add Code

Neural Mixture Models with Expectation-Maximization for End-to-end Deep Clustering

no code implementations • 6 Jul 2021 • Dumindu Tissera, Kasun Vithanage, Rukshan Wijesinghe, Alex Xavier, Sanath Jayasena, Subha Fernando, Ranga Rodrigo

The network parameters pose as the parameters of those distributions.

Clustering Deep Clustering +2

Paper
Add Code

Improving domain-specific SMT for low-resourced languages using data from different domains

no code implementations • LREC 2018 • Fathima Farhath, Pranavan Theivendiram, Surangika Ranathunga, Sanath Jayasena, Gihan Dias

Domain Adaptation Language Modelling +1

Paper
Add Code

Comprehensive Part-Of-Speech Tag Set and SVM based POS Tagger for Sinhala

no code implementations • WS 2016 • Fern, S o, areka, Surangika Ranathunga, Sanath Jayasena, Gihan Dias

This paper presents a new comprehensive multi-level Part-Of-Speech tag set and a Support Vector Machine based Part-Of-Speech tagger for the Sinhala language.

POS TAG

Paper
Add Code

Automatic Creation of a Sentence Aligned Sinhala-Tamil Parallel Corpus

no code implementations • WS 2016 • Riyafa Abdul Hameed, Nadeeshani Pathirennehelage, Anusha Ihalapathirana, Maryam Ziyad Mohamed, Surangika Ranathunga, Sanath Jayasena, Gihan Dias, Fern, S o, areka

A sentence aligned parallel corpus is an important prerequisite in statistical machine translation.

Machine Translation Sentence +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.