Search Results for author: Alessandro Cappelli

Found 8 papers, 2 papers with code

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

1 code implementation1 Jun 2023 Guilherme Penedo, Quentin Malartic, Daniel Hesslow, Ruxandra Cojocaru, Alessandro Cappelli, Hamza Alobeidli, Baptiste Pannier, Ebtesam Almazrouei, Julien Launay

Large language models are commonly trained on a mixture of filtered web data and curated high-quality corpora, such as social media conversations, books, or technical papers.

Zero-shot Generalization

Scaling Laws Beyond Backpropagation

no code implementations26 Oct 2022 Matthew J. Filipovich, Alessandro Cappelli, Daniel Hesslow, Julien Launay

Alternatives to backpropagation have long been studied to better understand how biological brains may learn.

Photonic Differential Privacy with Direct Feedback Alignment

no code implementations NeurIPS 2021 Ruben Ohana, Hamlet J. Medina Ruiz, Julien Launay, Alessandro Cappelli, Iacopo Poli, Liva Ralaivola, Alain Rakotomamonjy

Optical Processing Units (OPUs) -- low-power photonic chips dedicated to large scale random projections -- have been used in previous work to train deep neural networks using Direct Feedback Alignment (DFA), an effective alternative to backpropagation.

Cannot find the paper you are looking for? You can Submit a new open access paper.