Search Results for author: Alex Trott

Found 4 papers, 1 paper with code

MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

1 code implementation • NeurIPS 2023 • Jacob Portes, Alex Trott, Sam Havens, Daniel King, Abhinav Venigalla, Moin Nadeem, Nikhil Sardana, Daya Khudia, Jonathan Frankle

Here, we introduce MosaicBERT, a BERT-style encoder architecture and training recipe that is empirically optimized for fast pretraining.

Language Modelling, Masked Language Modeling

LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms

no code implementations • 22 Nov 2023 • Aditi Jha, Sam Havens, Jeremy Dohmann, Alex Trott, Jacob Portes

We find that subsets of 1k-6k instruction finetuning samples are sufficient to achieve good performance on both (1) traditional NLP benchmarks and (2) model-based evaluation.

Instruction Following

Learning World Graph Decompositions To Accelerate Reinforcement Learning

no code implementations • 25 Sep 2019 • Wenling Shang, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher

Efficiently learning to solve tasks in complex environments is a key challenge for reinforcement learning (RL) agents.

reinforcement-learning, Reinforcement Learning (RL)

Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

no code implementations • 1 Jul 2019 • Wenling Shang, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher

We perform a thorough ablation study to evaluate our approach on a suite of challenging maze tasks, demonstrating significant advantages from the proposed framework over baselines that lack world graph knowledge in terms of performance and efficiency.

Hierarchical Reinforcement Learning, reinforcement-learning, +1
