no code implementations • 20 Jun 2022 • Sheheryar Zaidi, Tudor Berariu, Hyunjik Kim, Jörg Bornschein, Claudia Clopath, Yee Whye Teh, Razvan Pascanu
However, when deployed alongside other carefully tuned regularization techniques, re-initialization methods offer little to no added generalization benefit, although they do make optimal generalization performance less sensitive to the choice of learning-rate and weight-decay hyperparameters.
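The re-initialization methods studied here typically reset the parameters of later layers partway through training while keeping earlier layers. A minimal sketch of that idea, assuming a network represented as a list of dense weight matrices (the function names and the He-style initializer are illustrative, not the paper's exact protocol):

```python
import numpy as np

def init_layer(rng, fan_in, fan_out):
    """He-style random initialization for one dense layer."""
    return rng.normal(0.0, np.sqrt(2.0 / fan_in), size=(fan_in, fan_out))

def reinitialize_last_k(weights, k, rng):
    """Re-initialize the last k layers, leaving earlier layers untouched.

    `weights` is a list of (fan_in, fan_out) matrices. Partial
    re-initialization schemes apply this periodically during training;
    this is a sketch of the mechanism, not a full training loop.
    """
    new_weights = list(weights)
    for i in range(len(weights) - k, len(weights)):
        fan_in, fan_out = weights[i].shape
        new_weights[i] = init_layer(rng, fan_in, fan_out)
    return new_weights
```

For example, `reinitialize_last_k(weights, 1, rng)` resets only the output layer while preserving the learned representation in the layers below it.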
1 code implementation • 31 May 2022 • Sheheryar Zaidi, Michael Schaarschmidt, James Martens, Hyunjik Kim, Yee Whye Teh, Alvaro Sanchez-Gonzalez, Peter Battaglia, Razvan Pascanu, Jonathan Godwin
Many important problems involving molecular property prediction from 3D structures have limited data, posing a generalization challenge for neural networks.
no code implementations • 20 Feb 2021 • Bryn Elesedy, Sheheryar Zaidi
It is widely believed that engineering a model to be invariant/equivariant improves generalisation.
1 code implementation • 20 Dec 2020 • Michael Hutchinson, Charline Le Lan, Sheheryar Zaidi, Emilien Dupont, Yee Whye Teh, Hyunjik Kim
Group equivariant neural networks are used as building blocks of group invariant neural networks, which have been shown to improve generalisation performance and data efficiency through principled parameter sharing.
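The route from equivariance to invariance can be illustrated by symmetrization over a finite group: averaging a function's outputs over all group transformations of the input yields a group-invariant function. Equivariant architectures achieve the same effect more efficiently through parameter sharing; the toy feature map below is a hypothetical stand-in for a network, and C4 (90-degree image rotations) is chosen only because it is small:

```python
import numpy as np

def features(x):
    # Toy stand-in for a network: maps an image to a 2-dim feature vector.
    # Deliberately NOT rotation-invariant (it reads a specific corner).
    return np.array([x[0, 0], x.sum()])

def c4_invariant(f, x):
    """Make f invariant to the group C4 of 90-degree rotations by
    averaging f over all four rotated copies of the input."""
    return np.mean([f(np.rot90(x, k)) for k in range(4)], axis=0)
```

Because rotating the input merely permutes the four terms being averaged, `c4_invariant(features, x)` returns the same value for `x` and any rotation of `x`.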
1 code implementation • NeurIPS 2021 • Sheheryar Zaidi, Arber Zela, Thomas Elsken, Chris Holmes, Frank Hutter, Yee Whye Teh
On a variety of classification tasks and modern architecture search spaces, we show that the resulting ensembles outperform deep ensembles not only in terms of accuracy but also uncertainty calibration and robustness to dataset shift.
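Both deep ensembles and the architecture-varying ensembles described here combine members by averaging their predicted class probabilities. A minimal sketch of that combination rule (the member logits would come from independently trained networks; in the architecture-search setting the members additionally differ in architecture):

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over the last axis."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def ensemble_predict(logits_per_member):
    """Average predicted class probabilities across ensemble members.

    `logits_per_member` is a list of (batch, classes) logit arrays, one
    per trained network. Averaging probabilities (not logits) is the
    standard deep-ensemble combination rule.
    """
    probs = np.stack([softmax(l) for l in logits_per_member])
    return probs.mean(axis=0)
```

The averaged output is again a valid probability distribution per example, and its spread across members is what drives the improved uncertainty calibration that ensembles are known for.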