Search Results for author: Simon Baker

Found 18 papers, 5 papers with code

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Crosslingual Lexical Semantic Similarity

no code implementations • CL (ACL) 2020 • Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering data sets for 12 typologically diverse languages, including major languages (e. g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e. g., Welsh, Kiswahili).

Representation Learning Semantic Similarity +2

Paper
Add Code

Keep the Primary, Rewrite the Secondary: A Two-Stage Approach for Paraphrase Generation

no code implementations • Findings (ACL) 2021 • Yixuan Su, David Vandyke, Simon Baker, Yan Wang, Nigel Collier

Paraphrase Generation

Paper
Add Code

Few-Shot Table-to-Text Generation with Prototype Memory

1 code implementation • Findings (EMNLP) 2021 • Yixuan Su, Zaiqiao Meng, Simon Baker, Nigel Collier

Neural table-to-text generation models have achieved remarkable progress on an array of tasks.

Table-to-Text Generation

Paper
Code

Non-Autoregressive Text Generation with Pre-trained Language Models

1 code implementation • EACL 2021 • Yixuan Su, Deng Cai, Yan Wang, David Vandyke, Simon Baker, Piji Li, Nigel Collier

In this work, we show that BERT can be employed as the backbone of a NAG model to greatly improve performance.

Machine Translation Sentence +3

Paper
Code

New dimension bounds for $αβ$ sets

no code implementations • 11 Feb 2021 • Simon Baker

In this paper we obtain new lower bounds for the upper box dimension of $\alpha\beta$ sets.

Dynamical Systems Metric Geometry Number Theory

Paper
Add Code

Dialogue Response Selection with Hierarchical Curriculum Learning

1 code implementation • ACL 2021 • Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming Shi, Nigel Collier, Yan Wang

As for IC, it progressively strengthens the model's ability in identifying the mismatching information between the dialogue context and a response candidate.

Ranked #3 on Conversational Response Selection on RRS

Conversational Response Selection

Paper
Code

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory

no code implementations • 5 Apr 2020 • Yixuan Su, Yan Wang, Simon Baker, Deng Cai, Xiaojiang Liu, Anna Korhonen, Nigel Collier

A stylistic response generator then takes the prototype and the desired language style as model input to obtain a high-quality and stylistic response.

Dialogue Generation Information Retrieval +1

Paper
Add Code

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

no code implementations • 5 Apr 2020 • Yixuan Su, Deng Cai, Yan Wang, Simon Baker, Anna Korhonen, Nigel Collier, Xiaojiang Liu

To enable better balance between the content quality and the style, we introduce a new training strategy, know as Information-Guided Reinforcement Learning (IG-RL).

Dialogue Generation reinforcement-learning +2

Paper
Add Code

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity

no code implementations • 10 Mar 2020 • Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse languages, including major languages (e. g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e. g., Welsh, Kiswahili).

Cross-Lingual Word Embeddings Representation Learning +3

Paper
Add Code

Equidistribution results for self-similar measures

no code implementations • 26 Feb 2020 • Simon Baker

A well known theorem due to Koksma states that for Lebesgue almost every $x>1$ the sequence $(x^n)_{n=1}^{\infty}$ is uniformly distributed modulo one.

Dynamical Systems Classical Analysis and ODEs Number Theory

Paper
Add Code

Enhancing biomedical word embeddings by retrofitting to verb clusters

1 code implementation • WS 2019 • Billy Chiu, Simon Baker, Martha Palmer, Anna Korhonen

Verbs play a fundamental role in many biomed-ical tasks and applications such as relation and event extraction.

Event Extraction Relation +4

Paper
Code

Variable Typing: Assigning Meaning to Variables in Mathematical Text

no code implementations • NAACL 2018 • Yiannos Stathopoulos, Simon Baker, Marek Rei, Simone Teufel

Our results show that the best performing MIR models make use of our typed index, compared to a formula index only containing raw symbols, thereby demonstrating the usefulness of variable typing.

Information Retrieval Retrieval +1

Paper
Add Code

Initializing neural networks for hierarchical multi-label text classification

no code implementations • WS 2017 • Simon Baker, Anna Korhonen

Many tasks in the biomedical domain require the assignment of one or more predefined labels to input text, where the labels are a part of a hierarchical structure (such as a taxonomy).

General Classification Multi-Label Classification +4

Paper
Add Code

Robust Text Classification for Sparsely Labelled Data Using Multi-level Embeddings

no code implementations • COLING 2016 • Simon Baker, Douwe Kiela, Anna Korhonen

The conventional solution for handling sparsely labelled data is extensive feature engineering.

Feature Engineering General Classification +4

Paper
Add Code

Cancer Hallmark Text Classification Using Convolutional Neural Networks

no code implementations • WS 2016 • Simon Baker, Anna Korhonen, Sampo Pyysalo

Methods based on deep learning approaches have recently achieved state-of-the-art performance in a range of machine learning tasks and are increasingly applied to natural language processing (NLP).

General Classification text-classification +1