Search Results for author: Matheus Pereira

Found 2 papers, 2 papers with code

Joint Prompt Optimization of Stacked LLMs using Variational Inference

1 code implementation • NeurIPS 2023 • Alessandro Sordoni, Xingdi Yuan, Marc-Alexandre Côté, Matheus Pereira, Adam Trischler, Ziang Xiao, Arian Hosseini, Friederike Niedtner, Nicolas Le Roux

Large language models (LLMs) can be seen as stochastic language layers in a language network, where the learnable parameters are the natural language prompts at each layer.

Natural Language Understanding • Variational Inference

Multi-Head Adapter Routing for Cross-Task Generalization

1 code implementation • NeurIPS 2023 • Lucas Caccia, Edoardo Ponti, Zhan Su, Matheus Pereira, Nicolas Le Roux, Alessandro Sordoni

We find that routing is most beneficial during multi-task pre-training rather than during few-shot adaptation and propose $\texttt{MHR}$-$\mu$, which discards routing and fine-tunes the average of the pre-trained adapters on each downstream task.
