Search Results for author: Mark Lee

Found 28 papers, 5 papers with code

Multi-task Learning Using a Combination of Contextualised and Static Word Embeddings for Arabic Sarcasm Detection and Sentiment Analysis

no code implementations • EACL (WANLP) 2021 • Abdullah I. Alharbi, Mark Lee

Sarcasm detection and sentiment analysis are important tasks in Natural Language Understanding.

Multi-Task Learning Natural Language Understanding +3

Paper
Add Code

Kawarith: an Arabic Twitter Corpus for Crisis Events

2 code implementations • EACL (WANLP) 2021 • Alaa Alharbi, Mark Lee

Exploration of this content revealed the most discussed topics and information types, and the paper presents a labelled dataset from seven emergency events that serves as a gold standard for several tasks in crisis informatics research.

Paper
Code

UoB at ProfNER 2021: Data Augmentation for Classification Using Machine Translation

no code implementations • NAACL (SMM4H) 2021 • Frances Adriana Laureano De Leon, Harish Tayyar Madabushi, Mark Lee

This paper describes the participation of the UoB-NLP team in the ProfNER-ST shared subtask 7a.

Data Augmentation Machine Translation +1

Paper
Add Code

Extractive Financial Narrative Summarisation using SentenceBERT Based Clustering

no code implementations • FNP 2021 • Tuba Gokhan, Phillip Smith, Mark Lee

Clustering

Paper
Add Code

GUSUM: Graph-based Unsupervised Summarization Using Sentence Features Scoring and Sentence-BERT

1 code implementation • COLING (TextGraphs) 2022 • Tuba Gokhan, Phillip Smith, Mark Lee

In this paper, we develop a Graph-Based Unsupervised Summarization(GUSUM) method for extractive text summarization based on the principle of including the most important sentences while excluding sentences with similar meanings in the summary.

Document Summarization Extractive Document Summarization +5

Paper
Code

Classifying Arabic Crisis Tweets using Data Selection and Pre-trained Language Models

no code implementations • OSACT (LREC) 2022 • Alaa Alharbi, Mark Lee

User-generated Social Media (SM) content has been explored as a valuable and accessible source of data about crises to enhance situational awareness and support humanitarian response efforts.

Humanitarian

Paper
Add Code

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

no code implementations • 14 Mar 2024 • Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, BoWen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Guoli Yin, Mark Lee, ZiRui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial design lessons.

Ranked #18 on Visual Question Answering on MM-Vet

In-Context Learning Visual Question Answering

Paper
Add Code

Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text

1 code implementation • 7 Mar 2024 • Frances A. Laureano De Leon, Harish Tayyar Madabushi, Mark Lee

Code-switching is a prevalent linguistic phenomenon in which multilingual individuals seamlessly alternate between languages.

Paper
Code

The self-supervised spectral-spatial attention-based transformer network for automated, accurate prediction of crop nitrogen status from UAV imagery

no code implementations • 12 Nov 2021 • Xin Zhang, Liangxiu Han, Tam Sobeih, Lewis Lappin, Mark Lee, Andew Howard, Aron Kisdi

In this work, we propose a novel deep learning framework: a self-supervised spectral-spatial attention-based vision transformer (SSVT).

Self-Supervised Learning

Paper
Add Code

UoB\_UK at SemEval 2021 Task 2: Zero-Shot and Few-Shot Learning for Multi-lingual and Cross-lingual Word Sense Disambiguation.

no code implementations • SEMEVAL 2021 • Wei Li, Harish Tayyar Madabushi, Mark Lee

This paper describes our submission to SemEval 2021 Task 2.

Few-Shot Learning Task 2 +2

Paper
Add Code

Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children's mindreading ability

no code implementations • ACL 2021 • Venelin Kovatchev, Phillip Smith, Mark Lee, Rory Devine

To determine the capabilities of automatic systems to generalize to unseen data, we create UK-MIND-20 - a new corpus of children's performance on tests of mindreading, consisting of 10, 320 question-answer pairs.

Data Augmentation

Paper
Add Code

``What is on your mind?'' Automated Scoring of Mindreading in Childhood and Early Adolescence

no code implementations • COLING 2020 • Venelin Kovatchev, Phillip Smith, Mark Lee, Imogen Grumley Traynor, Irene Luque Aguilera, Rory Devine

In this paper we present the first work on the automated scoring of mindreading ability in middle childhood and early adolescence.

Paper
Add Code

BhamNLP at SemEval-2020 Task 12: An Ensemble of Different Word Embeddings and Emotion Transfer Learning for Arabic Offensive Language Identification in Social Media

no code implementations • SEMEVAL 2020 • Abdullah I. Alharbi, Mark Lee

Social media platforms such as Twitter offer people an opportunity to publish short posts in which they can share their opinions and perspectives.

Language Identification Transfer Learning +1

Paper
Add Code

"What is on your mind?" Automated Scoring of Mindreading in Childhood and Early Adolescence

1 code implementation • 16 Nov 2020 • Venelin Kovatchev, Phillip Smith, Mark Lee, Imogen Grumley Traynor, Irene Luque Aguilera, Rory T. Devine

In this paper we present the first work on the automated scoring of mindreading ability in middle childhood and early adolescence.

BIG-bench Machine Learning

Paper
Code

Augmenting Neural Metaphor Detection with Concreteness

no code implementations • WS 2020 • Ghadi Alnafesah, Harish Tayyar Madabushi, Mark Lee

The idea that a shift in concreteness within a sentence indicates the presence of a metaphor has been around for a while.

Sentence

Paper
Add Code

Combining Character and Word Embeddings for the Detection of Offensive Language in Arabic

no code implementations • LREC 2020 • Abdullah I. Alharbi, Mark Lee

A key challenge was the uniqueness of the language used on social media, prompting the out-of-vocabulary (OOV) problem.

Word Embeddings

Paper
Add Code

Crisis Detection from Arabic Tweets

no code implementations • WS 2019 • Alaa Alharbi, Mark Lee

Paper
Add Code

On Physical Adversarial Patches for Object Detection

1 code implementation • 20 Jun 2019 • Mark Lee, Zico Kolter

In this paper, we demonstrate a physical adversarial patch attack against object detectors, notably the YOLOv3 detector.

Object object-detection +1

Paper
Code

Integrating Question Classification and Deep Learning for improved Answer Selection

no code implementations • COLING 2018 • Harish Tayyar Madabushi, Mark Lee, John Barnden

We present a system for Answer Selection that integrates fine-grained Question Classification with a Deep Learning model designed for Answer Selection.

Answer Selection Classification +1

Paper
Add Code

High Accuracy Rule-based Question Classification using Question Syntax and Semantics

no code implementations • COLING 2016 • Harish Tayyar Madabushi, Mark Lee

We present in this paper a purely rule-based system for Question Classification which we divide into two parts: The first is the extraction of relevant words from a question by use of its structure, and the second is the classification of questions based on rules that associate these words to Concepts.

Ranked #1 on Text Classification on TREC-50

BIG-bench Machine Learning General Classification +3

Paper
Add Code

UoB-UK at SemEval-2016 Task 1: A Flexible and Extendable System for Semantic Text Similarity using Types, Surprise and Phrase Linking

no code implementations • SEMEVAL 2016 • Harish Tayyar Madabushi, Mark Buhagiar, Mark Lee

Machine Translation text similarity

Paper
Add Code

Sentiment Classification via a Response Recalibration Framework

no code implementations • WS 2015 • Phillip Smith, Mark Lee

Classification General Classification +2

Paper
Add Code

A Hybrid Approach to Features Representation for Fine-grained Arabic Named Entity Recognition

no code implementations • COLING 2014 • Fahd Alotaibi, Mark Lee

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

Automatically Developing a Fine-grained Arabic Named Entity Corpus and Gazetteer by utilizing Wikipedia

no code implementations • IJCNLP 2013 • Fahd Alotaibi, Mark Lee

Question Answering Transliteration

Paper
Add Code

Mapping Arabic Wikipedia into the Named Entities Taxonomy

no code implementations • COLING 2012 • Fahd Alotaibi, Mark Lee

Document Classification

Paper
Add Code

A CCG-based Approach to Fine-Grained Sentiment Analysis

no code implementations • WS 2012 • Phillip Smith, Mark Lee

Emotion Classification Sentiment Analysis

Paper
Add Code

Cross-discourse Development of Supervised Sentiment Analysis in the Clinical Domain

no code implementations • WS 2012 • Phillip Smith, Mark Lee

Sentiment Analysis

Paper
Add Code

Building Text-to-Speech Systems for Resource Poor Languages

no code implementations • LREC 2012 • Nur-Hana Samsudin, Mark Lee

This paper describes research on building text-to-speech synthesis systems (TTS) for resource poor languages using available resources from other languages and describes our general approach to building cross-linguistic polyglot TTS.

Clustering Speech Synthesis +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.