Efficient Neural Architecture Search via Parameters Sharing
We propose Efficient Neural Architecture Search (ENAS), a fast and inexpensive approach to automatic model design. ENAS constructs a single large computational graph in which every subgraph represents a neural network architecture, thereby forcing all architectures to share their parameters. A controller is trained with policy gradient to search for a subgraph that maximizes the expected reward on a validation set, while the model corresponding to the selected subgraph is trained to minimize a canonical cross-entropy loss. Sharing parameters among child models allows ENAS to deliver strong empirical performance while using far fewer GPU hours than existing automatic model design approaches; notably, it is 1000x less expensive than standard Neural Architecture Search. On Penn Treebank, ENAS discovers a novel architecture that achieves a test perplexity of 56.3, on par with the existing state of the art among all methods without post-training processing. On CIFAR-10, ENAS finds a novel architecture that achieves 2.89% test error, on par with the 2.65% test error of NASNet (Zoph et al., 2018).
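The abstract describes an alternating optimization: shared weights are trained by cross-entropy on sampled subgraphs, and a controller is trained by policy gradient (REINFORCE) with validation accuracy as the reward. Below is a minimal sketch of that loop, assuming PyTorch; the names (`SharedSupergraph`, `Controller`), the toy data, and the specific hyperparameters are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of ENAS-style weight sharing + REINFORCE (illustrative only).
import torch
import torch.nn as nn
import torch.nn.functional as F

NUM_LAYERS, NUM_OPS, HIDDEN = 2, 3, 32

class SharedSupergraph(nn.Module):
    """One set of shared weights; each sampled architecture picks one op per layer."""
    def __init__(self, in_dim=16, num_classes=4):
        super().__init__()
        self.inp = nn.Linear(in_dim, HIDDEN)
        # Candidate ops per layer; their parameters are shared across all subgraphs.
        self.ops = nn.ModuleList([
            nn.ModuleList([nn.Linear(HIDDEN, HIDDEN) for _ in range(NUM_OPS)])
            for _ in range(NUM_LAYERS)
        ])
        self.out = nn.Linear(HIDDEN, num_classes)

    def forward(self, x, arch):
        h = torch.relu(self.inp(x))
        for layer_ops, op_idx in zip(self.ops, arch):
            h = torch.relu(layer_ops[op_idx](h))
        return self.out(h)

class Controller(nn.Module):
    """Samples an architecture (one op index per layer) from learned logits."""
    def __init__(self):
        super().__init__()
        self.logits = nn.Parameter(torch.zeros(NUM_LAYERS, NUM_OPS))

    def sample(self):
        dist = torch.distributions.Categorical(logits=self.logits)
        arch = dist.sample()
        return arch.tolist(), dist.log_prob(arch).sum()

# Toy data standing in for the training and validation splits.
x_tr, y_tr = torch.randn(256, 16), torch.randint(0, 4, (256,))
x_va, y_va = torch.randn(128, 16), torch.randint(0, 4, (128,))

shared, ctrl = SharedSupergraph(), Controller()
opt_w = torch.optim.Adam(shared.parameters(), lr=1e-3)
opt_c = torch.optim.Adam(ctrl.parameters(), lr=1e-2)
baseline = 0.0  # moving-average baseline to reduce REINFORCE variance

for step in range(200):
    # Phase 1: train the shared weights on a sampled subgraph (cross-entropy).
    arch, _ = ctrl.sample()
    loss = F.cross_entropy(shared(x_tr, arch), y_tr)
    opt_w.zero_grad(); loss.backward(); opt_w.step()

    # Phase 2: update the controller with policy gradient;
    # the reward is the sampled subgraph's validation accuracy.
    arch, log_prob = ctrl.sample()
    with torch.no_grad():
        reward = (shared(x_va, arch).argmax(1) == y_va).float().mean().item()
    baseline = 0.95 * baseline + 0.05 * reward
    ctrl_loss = -(reward - baseline) * log_prob
    opt_c.zero_grad(); ctrl_loss.backward(); opt_c.step()
```

Because every subgraph reuses the same weight tensors, no candidate architecture is trained from scratch, which is the source of the claimed 1000x cost reduction over standard NAS.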
Results from the Paper
| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Neural Architecture Search | NAS-Bench-201, CIFAR-10 | ENAS | Accuracy (Test) | 54.3 | #32 |
| Neural Architecture Search | NAS-Bench-201, CIFAR-10 | ENAS | Accuracy (Val) | 39.77 | #29 |
| Neural Architecture Search | NAS-Bench-201, CIFAR-10 | ENAS | Search time (s) | 13315 | #12 |
| Neural Architecture Search | NAS-Bench-201, CIFAR-100 | ENAS | Accuracy (Test) | 15.61 | #33 |
| Neural Architecture Search | NAS-Bench-201, CIFAR-100 | ENAS | Accuracy (Val) | 15.03 | #31 |
| Neural Architecture Search | NAS-Bench-201, CIFAR-100 | ENAS | Search time (s) | 13315 | #10 |
| Neural Architecture Search | NAS-Bench-201, ImageNet-16-120 | ENAS | Accuracy (Test) | 16.43 | #40 |
| Neural Architecture Search | NAS-Bench-201, ImageNet-16-120 | ENAS | Search time (s) | 13315 | #13 |