Search Results for author: Mark Lee

Found 28 papers, 5 papers with code

Kawarith: an Arabic Twitter Corpus for Crisis Events

2 code implementations EACL (WANLP) 2021 Alaa Alharbi, Mark Lee

Exploration of this content revealed the most discussed topics and information types, and the paper presents a labelled dataset from seven emergency events that serves as a gold standard for several tasks in crisis informatics research.

GUSUM: Graph-based Unsupervised Summarization Using Sentence Features Scoring and Sentence-BERT

1 code implementation COLING (TextGraphs) 2022 Tuba Gokhan, Phillip Smith, Mark Lee

In this paper, we develop a Graph-Based Unsupervised Summarization(GUSUM) method for extractive text summarization based on the principle of including the most important sentences while excluding sentences with similar meanings in the summary.

Document Summarization Extractive Document Summarization +5

Classifying Arabic Crisis Tweets using Data Selection and Pre-trained Language Models

no code implementations OSACT (LREC) 2022 Alaa Alharbi, Mark Lee

User-generated Social Media (SM) content has been explored as a valuable and accessible source of data about crises to enhance situational awareness and support humanitarian response efforts.

Humanitarian

Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text

1 code implementation7 Mar 2024 Frances A. Laureano De Leon, Harish Tayyar Madabushi, Mark Lee

Code-switching is a prevalent linguistic phenomenon in which multilingual individuals seamlessly alternate between languages.

Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children's mindreading ability

no code implementations ACL 2021 Venelin Kovatchev, Phillip Smith, Mark Lee, Rory Devine

To determine the capabilities of automatic systems to generalize to unseen data, we create UK-MIND-20 - a new corpus of children's performance on tests of mindreading, consisting of 10, 320 question-answer pairs.

Data Augmentation

Augmenting Neural Metaphor Detection with Concreteness

no code implementations WS 2020 Ghadi Alnafesah, Harish Tayyar Madabushi, Mark Lee

The idea that a shift in concreteness within a sentence indicates the presence of a metaphor has been around for a while.

Sentence

Combining Character and Word Embeddings for the Detection of Offensive Language in Arabic

no code implementations LREC 2020 Abdullah I. Alharbi, Mark Lee

A key challenge was the uniqueness of the language used on social media, prompting the out-of-vocabulary (OOV) problem.

Word Embeddings

On Physical Adversarial Patches for Object Detection

1 code implementation20 Jun 2019 Mark Lee, Zico Kolter

In this paper, we demonstrate a physical adversarial patch attack against object detectors, notably the YOLOv3 detector.

Object object-detection +1

Integrating Question Classification and Deep Learning for improved Answer Selection

no code implementations COLING 2018 Harish Tayyar Madabushi, Mark Lee, John Barnden

We present a system for Answer Selection that integrates fine-grained Question Classification with a Deep Learning model designed for Answer Selection.

Answer Selection Classification +1

High Accuracy Rule-based Question Classification using Question Syntax and Semantics

no code implementations COLING 2016 Harish Tayyar Madabushi, Mark Lee

We present in this paper a purely rule-based system for Question Classification which we divide into two parts: The first is the extraction of relevant words from a question by use of its structure, and the second is the classification of questions based on rules that associate these words to Concepts.

BIG-bench Machine Learning General Classification +3

Building Text-to-Speech Systems for Resource Poor Languages

no code implementations LREC 2012 Nur-Hana Samsudin, Mark Lee

This paper describes research on building text-to-speech synthesis systems (TTS) for resource poor languages using available resources from other languages and describes our general approach to building cross-linguistic polyglot TTS.

Clustering Speech Synthesis +1

Cannot find the paper you are looking for? You can Submit a new open access paper.