Fine-Grained Image Classification

173 papers with code • 35 benchmarks • 36 datasets

Fine-Grained Image Classification is a task in computer vision where the goal is to classify images into subcategories within a larger category. For example, classifying different species of birds or different types of flowers. This task is considered to be fine-grained because it requires the model to distinguish between subtle differences in visual appearance and patterns, making it more challenging than regular image classification tasks.

( Image credit: Looking for the Devil in the Details )

Benchmarks

Add a Result

These leaderboards are used to track progress in Fine-Grained Image Classification

Dataset	Best Model	Compare
Stanford Cars	CMAL-Net	See all
CUB-200-2011	HERBS	See all
FGVC Aircraft	SR-GNN	See all
Oxford 102 Flowers	VIT-L/16 (Background)	See all
CUB-200-2011	HERBS	See all
NABirds	MetaFormer (MetaFormer-2,384)	See all
Stanford Dogs	SR-GNN	See all
Oxford-IIIT Pet Dataset	OmniVec	See all
Food-101	CAP	See all
Caltech-101	VIT-L/16	See all
Oxford-IIIT Pets	EffNet-L2 (SAM)	See all
CompCars	ResNet101-swp	See all
Birdsnap	EffNet-L2 (SAM)	See all
Bird-225	WideResNet-101 (Spinal FC)	See all
SUN397	µ2Net (ViT-L/16)	See all
10 Monkey Species	Inception-v3 (Spinal FC)	See all
Fruits-360	ResNeXt-101	See all
FoodX-251	CSWin-L	See all
Imbalanced CUB-200-2011	PC-Softmax	See all
SOP	Assemble-ResNet-FGVC-50	See all
Con-Text	PHOC descriptor + Fisher Vector Encoding	See all
Bottles	PHOC descriptor + Fisher Vector Encoding	See all
MNIST	Vanilla FC layer only	See all
EMNIST-Digits	VGG-5	See all
EMNIST-Letters	VGG-5	See all
QMNIST	VGG-5	See all
Kuzushiji-MNIST	VGG-5	See all
STL-10	Pre trained wide-resnet-101	See all
BoxCars116K	ResNet152 + COOC	See all
CarFlag-1532	ResNet101-swp	See all
CarFlag-563	ResNet101-swp	See all
iNaturalist	TASN	See all
FGVC-Aircraft	EnGraf-Net101 (G=4, H=1)	See all
Herbarium 2021 Half–Earth	Conviformer-B	See all
Herbarium 2022	Conviformer-B	See all

Show all 35 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Fine-Grained Image Classification models and implementations

rwightman/pytorch-image-models

7 papers

29,863

open-mmlab/mmclassification

4 papers

3,173

osmr/imgclsmob

4 papers

2,919

Westlake-AI/openmixup

4 papers

574

See all 25 libraries.

Datasets

Subtasks

Displaced People Recognition

Latest papers

Most implemented Social Latest No code

DINOv2: Learning Robust Visual Features without Supervision

huggingface/transformers • • 14 Apr 2023

The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision.

125,478

14 Apr 2023

Paper
Code

Your Diffusion Model is Secretly a Zero-Shot Classifier

diffusion-classifier/diffusion-classifier • • ICCV 2023

Our generative approach to classification, which we call Diffusion Classifier, attains strong results on a variety of benchmarks and outperforms alternative methods of extracting knowledge from diffusion models.

350

28 Mar 2023

Paper
Code

Take 5: Interpretable Image Classification with a Handful of Features

thomasnorr/q-senn • • 23 Mar 2023

We argue that a human can only understand the decision of a machine learning model, if the features are interpretable and only very few of them are used for a single decision.

23 Mar 2023

Paper
Code

Learn from Each Other to Classify Better: Cross-layer Mutual Attention Learning for Fine-grained Visual Classification

Dichao-Liu/CMAL • • Pattern Recognition 2023

Specifically, this work views the shallow to deep layers of CNNs as “experts” knowledgeable about different perspectives.

22 Mar 2023

Paper
Code

Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashing

kaiba007/fg-cnet • • 20 Mar 2023

To improve the retrieval accuracy of fine-grained hashing, we propose a cascaded network to learn compact and highly semantic hash codes, and introduce an attention-guided data augmentation method.

20 Mar 2023

Paper
Code

Fine-grained Visual Classification with High-temperature Refinement and Background Suppression

chou141253/FGVC-HERBS • • 11 Mar 2023

The high-temperature refinement module allows the model to learn the appropriate feature scales by refining the features map at different scales and improving the learning of diverse features.

11 Mar 2023

Paper
Code

Fine-Grained Visual Classification via Internal Ensemble Learning Transformer

mobulan/ielt • • IEEE Transactions on Multimedia 2023

The proposed IELT involves three main modules: multi-head voting (MHV) module, cross-layer refinement (CLR) module, and dynamic selection (DS) module.

13 Feb 2023

Paper
Code

LiT Tuned Models for Efficient Species Detection

NYU-DICE-Lab/open_clip • • 12 Feb 2023

Recent advances in training vision-language models have demonstrated unprecedented robustness and transfer learning effectiveness; however, standard computer vision datasets are image-only, and therefore not well adapted to such training methods.

12 Feb 2023

Paper
Code

The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation

cropandweed/cropandweed-dataset • Winter Conference on Applications of Computer Vision (WACV) 2023

Precision Agriculture and especially the application of automated weed intervention represents an increasingly essential research area, as sustainability and efficiency considerations are becoming more and more relevant.

06 Jan 2023

Paper
Code

Multi-View Active Fine-Grained Visual Recognition

pris-cv/mafr • • ICCV 2023

Despite the remarkable progress of Fine-grained visual classification (FGVC) with years of history, it is still limited to recognizing 2 images.

01 Jan 2023

Paper
Code

Fine-Grained Image Classification

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result