MixNet

Last updated on Feb 14, 2021

mixnet_l

Parameters 7 Million
FLOPs 739 Million
File Size 28.24 MB
Training Data ImageNet
Training Resources
Training Time

Architecture Batch Normalization, Dense Connections, Dropout, Global Average Pooling, Grouped Convolution, MixConv, Squeeze-and-Excitation Block, Swish
ID mixnet_l
Crop Pct 0.875
Image Size 224
Interpolation bicubic
SHOW MORE
SHOW LESS
mixnet_m

Parameters 5 Million
FLOPs 455 Million
File Size 19.36 MB
Training Data ImageNet
Training Resources
Training Time

Architecture Batch Normalization, Dense Connections, Dropout, Global Average Pooling, Grouped Convolution, MixConv, Squeeze-and-Excitation Block, Swish
ID mixnet_m
Crop Pct 0.875
Image Size 224
Interpolation bicubic
SHOW MORE
SHOW LESS
mixnet_s

Parameters 4 Million
FLOPs 321 Million
File Size 15.95 MB
Training Data ImageNet
Training Resources
Training Time

Architecture Batch Normalization, Dense Connections, Dropout, Global Average Pooling, Grouped Convolution, MixConv, Squeeze-and-Excitation Block, Swish
ID mixnet_s
Crop Pct 0.875
Image Size 224
Interpolation bicubic
SHOW MORE
SHOW LESS
mixnet_xl

Parameters 12 Million
FLOPs 1 Billion
File Size 45.78 MB
Training Data ImageNet
Training Resources
Training Time

Architecture Batch Normalization, Dense Connections, Dropout, Global Average Pooling, Grouped Convolution, MixConv, Squeeze-and-Excitation Block, Swish
ID mixnet_xl
Crop Pct 0.875
Image Size 224
Interpolation bicubic
SHOW MORE
SHOW LESS
README.md

Summary

MixNet is a type of convolutional neural network discovered via AutoML that utilises MixConvs instead of regular depthwise convolutions.

How do I load this model?

To load a pretrained model:

import timm
m = timm.create_model('mixnet_s', pretrained=True)
m.eval()

Replace the model name with the variant you want to use, e.g. mixnet_s. You can find the IDs in the model summaries at the top of this page.

How do I train this model?

You can follow the timm recipe scripts for training a new model afresh.

Citation

@misc{tan2019mixconv,
      title={MixConv: Mixed Depthwise Convolutional Kernels}, 
      author={Mingxing Tan and Quoc V. Le},
      year={2019},
      eprint={1907.09595},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Results

Image Classification on ImageNet

Image Classification on ImageNet
MODEL TOP 1 ACCURACY TOP 5 ACCURACY
mixnet_xl 80.47% 94.93%
mixnet_l 78.98% 94.18%
mixnet_m 77.27% 93.42%
mixnet_s 75.99% 92.79%