Search Results for author: Telmo Pessoa Pires

Found 4 papers, 0 papers with code

SaulLM-7B: A pioneering Large Language Model for Law

no code implementations • 6 Mar 2024 • Pierre Colombo, Telmo Pessoa Pires, Malik Boudiaf, Dominic Culver, Rui Melo, Caio Corro, Andre F. T. Martins, Fabrizio Esposito, Vera Lúcia Raposo, Sofia Morgado, Michael Desa

In this paper, we introduce SaulLM-7B, a large language model (LLM) tailored for the legal domain.

Language Modelling Large Language Model +1

Paper
Add Code

One Wide Feedforward is All You Need

no code implementations • 4 Sep 2023 • Telmo Pessoa Pires, António V. Lopes, Yannick Assogba, Hendra Setiawan

The Transformer architecture has two main non-embedding components: Attention and the Feed Forward Network (FFN).

Position

Paper
Add Code

Learning Language-Specific Layers for Multilingual Machine Translation

no code implementations • 4 May 2023 • Telmo Pessoa Pires, Robin M. Schmidt, Yi-Hsiu Liao, Stephan Peitz

Multilingual Machine Translation promises to improve translation quality between non-English languages.

Machine Translation Neural Architecture Search +1

Paper
Add Code

State Spaces Aren't Enough: Machine Translation Needs Attention

no code implementations • 25 Apr 2023 • Ali Vardasbi, Telmo Pessoa Pires, Robin M. Schmidt, Stephan Peitz

Structured State Spaces for Sequences (S4) is a recently proposed sequence model with successful applications in various tasks, e. g. vision, language modeling, and audio.

Language Modelling Machine Translation +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.