Search Results for author: João G. M. Araújo

Found 5 papers, 1 paper with code

Categorical Deep Learning: An Algebraic Theory of Architectures

no code implementations • 23 Feb 2024 • Bruno Gavranović, Paul Lessard, Andrew Dudzik, Tamara von Glehn, João G. M. Araújo, Petar Veličković

We present our position on the elusive quest for a general-purpose framework for specifying and studying deep learning architectures.

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

1 code implementation • 5 Feb 2024 • Shengyi Huang, Quentin Gallouédec, Florian Felten, Antonin Raffin, Rousslan Fernand Julien Dossa, Yanxiao Zhao, Ryan Sullivan, Viktor Makoviychuk, Denys Makoviichuk, Mohamad H. Danesh, Cyril Roumégous, Jiayi Weng, Chufan Chen, Md Masudur Rahman, João G. M. Araújo, Guorui Quan, Daniel Tan, Timo Klein, Rujikorn Charakorn, Mark Towers, Yann Berthelot, Kinal Mehta, Dipam Chakraborty, Arjun KG, Valentin Charraut, Chang Ye, Zichen Liu, Lucas N. Alegre, Alexander Nikulin, Xiao Hu, Tianlin Liu, Jongwook Choi, Brent Yi

It is usually necessary to reproduce RL experiments from scratch, which can be time-consuming and error-prone.

Reinforcement Learning (RL)

Scalable Training of Language Models using JAX pjit and TPUv4

no code implementations • 13 Apr 2022 • Joanna Yoo, Kuba Perlin, Siddhartha Rao Kamalakara, João G. M. Araújo

Modern large language models require distributed training strategies due to their size.

No News is Good News: A Critique of the One Billion Word Benchmark

no code implementations • 25 Oct 2021 • Helen Ngo, João G. M. Araújo, Jeffrey Hui, Nicholas Frosst

The One Billion Word Benchmark is a dataset derived from the WMT 2011 News Crawl, commonly used to measure language modeling ability in natural language processing.

Language Modelling

Mitigating harm in language models with conditional-likelihood filtration

no code implementations • 4 Aug 2021 • Helen Ngo, Cooper Raterink, João G. M. Araújo, Ivan Zhang, Carol Chen, Adrien Morisot, Nicholas Frosst

Language models trained on large-scale unfiltered datasets curated from the open web acquire systemic biases, prejudices, and harmful views from their training data.

Language Modelling

