Search Results for author: Alexandre Salle

Found 7 papers, 5 papers with code

Native Language Identification with Large Language Models

no code implementations • 13 Dec 2023 • Wei zhang, Alexandre Salle

We present the first experiments on Native Language Identification (NLI) using LLMs such as GPT-4.

Paper
Add Code

Why So Down? The Role of Negative (and Positive) Pointwise Mutual Information in Distributional Semantics

1 code implementation • 19 Aug 2019 • Alexandre Salle, Aline Villavicencio

In distributional semantics, the pointwise mutual information ($\mathit{PMI}$) weighting of the cooccurrence matrix performs far better than raw counts.

780

Paper
Code

Think Again Networks and the Delta Loss

1 code implementation • 26 Apr 2019 • Alexandre Salle, Marcelo Prates

This short paper introduces an abstraction called Think Again Networks (ThinkNet) which can be applied to any state-dependent function (such as a recurrent neural network).

Language Modelling

Paper
Code

Incorporating Subword Information into Matrix Factorization Word Embeddings

1 code implementation • WS 2018 • Alexandre Salle, Aline Villavicencio

The positive effect of adding subword information to word embeddings has been demonstrated for predictive models.

Word Embeddings

780

Paper
Code

Restricted Recurrent Neural Tensor Networks: Exploiting Word Frequency and Compositionality

no code implementations • ACL 2018 • Alexandre Salle, Aline Villavicencio

Increasing the capacity of recurrent neural networks (RNN) usually involves augmenting the size of the hidden layer, with significant increase of computational cost.

Language Modelling Tensor Networks

Paper
Add Code

Enhancing the LexVec Distributed Word Representation Model Using Positional Contexts and External Memory

1 code implementation • 3 Jun 2016 • Alexandre Salle, Marco Idiart, Aline Villavicencio

The effectiveness of both modifications is shown using word similarity and analogy tasks.

Word Similarity

780

Paper
Code

Matrix Factorization using Window Sampling and Negative Sampling for Improved Word Representations

1 code implementation • ACL 2016 • Alexandre Salle, Marco Idiart, Aline Villavicencio

In this paper, we propose LexVec, a new method for generating distributed word representations that uses low-rank, weighted factorization of the Positive Point-wise Mutual Information matrix via stochastic gradient descent, employing a weighting scheme that assigns heavier penalties for errors on frequent co-occurrences while still accounting for negative co-occurrence.

Word Similarity

780

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.