Search Results for author: Simon Baker

Found 18 papers, 5 papers with code

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Crosslingual Lexical Semantic Similarity

no code implementations CL (ACL) 2020 Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering data sets for 12 typologically diverse languages, including major languages (e. g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e. g., Welsh, Kiswahili).

Representation Learning Semantic Similarity +2

New dimension bounds for $αβ$ sets

no code implementations11 Feb 2021 Simon Baker

In this paper we obtain new lower bounds for the upper box dimension of $\alpha\beta$ sets.

Dynamical Systems Metric Geometry Number Theory

Dialogue Response Selection with Hierarchical Curriculum Learning

1 code implementation ACL 2021 Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming Shi, Nigel Collier, Yan Wang

As for IC, it progressively strengthens the model's ability in identifying the mismatching information between the dialogue context and a response candidate.

Conversational Response Selection

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory

no code implementations5 Apr 2020 Yixuan Su, Yan Wang, Simon Baker, Deng Cai, Xiaojiang Liu, Anna Korhonen, Nigel Collier

A stylistic response generator then takes the prototype and the desired language style as model input to obtain a high-quality and stylistic response.

Dialogue Generation Information Retrieval +1

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

no code implementations5 Apr 2020 Yixuan Su, Deng Cai, Yan Wang, Simon Baker, Anna Korhonen, Nigel Collier, Xiaojiang Liu

To enable better balance between the content quality and the style, we introduce a new training strategy, know as Information-Guided Reinforcement Learning (IG-RL).

Dialogue Generation reinforcement-learning +2

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity

no code implementations10 Mar 2020 Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse languages, including major languages (e. g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e. g., Welsh, Kiswahili).

Cross-Lingual Word Embeddings Representation Learning +3

Equidistribution results for self-similar measures

no code implementations26 Feb 2020 Simon Baker

A well known theorem due to Koksma states that for Lebesgue almost every $x>1$ the sequence $(x^n)_{n=1}^{\infty}$ is uniformly distributed modulo one.

Dynamical Systems Classical Analysis and ODEs Number Theory

Variable Typing: Assigning Meaning to Variables in Mathematical Text

no code implementations NAACL 2018 Yiannos Stathopoulos, Simon Baker, Marek Rei, Simone Teufel

Our results show that the best performing MIR models make use of our typed index, compared to a formula index only containing raw symbols, thereby demonstrating the usefulness of variable typing.

Information Retrieval Retrieval +1

Initializing neural networks for hierarchical multi-label text classification

no code implementations WS 2017 Simon Baker, Anna Korhonen

Many tasks in the biomedical domain require the assignment of one or more predefined labels to input text, where the labels are a part of a hierarchical structure (such as a taxonomy).

General Classification Multi-Label Classification +4

Cancer Hallmark Text Classification Using Convolutional Neural Networks

no code implementations WS 2016 Simon Baker, Anna Korhonen, Sampo Pyysalo

Methods based on deep learning approaches have recently achieved state-of-the-art performance in a range of machine learning tasks and are increasingly applied to natural language processing (NLP).

General Classification text-classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.