Fine-Grained Image Classification

172 papers with code • 35 benchmarks • 36 datasets

Fine-Grained Image Classification is a task in computer vision where the goal is to classify images into subcategories within a larger category. For example, classifying different species of birds or different types of flowers. This task is considered to be fine-grained because it requires the model to distinguish between subtle differences in visual appearance and patterns, making it more challenging than regular image classification tasks.

( Image credit: Looking for the Devil in the Details )

Benchmarks

Add a Result

These leaderboards are used to track progress in Fine-Grained Image Classification

Dataset	Best Model	Compare
Stanford Cars	CMAL-Net	See all
CUB-200-2011	HERBS	See all
FGVC Aircraft	SR-GNN	See all
Oxford 102 Flowers	VIT-L/16 (Background)	See all
CUB-200-2011	HERBS	See all
NABirds	MetaFormer (MetaFormer-2,384)	See all
Oxford-IIIT Pet Dataset	OmniVec	See all
Stanford Dogs	SR-GNN	See all
Food-101	CAP	See all
Caltech-101	VIT-L/16	See all
Oxford-IIIT Pets	EffNet-L2 (SAM)	See all
CompCars	ResNet101-swp	See all
Birdsnap	EffNet-L2 (SAM)	See all
Bird-225	WideResNet-101 (Spinal FC)	See all
SUN397	µ2Net (ViT-L/16)	See all
10 Monkey Species	Inception-v3 (Spinal FC)	See all
Fruits-360	ResNeXt-101	See all
FoodX-251	CSWin-L	See all
Imbalanced CUB-200-2011	PC-Softmax	See all
SOP	Assemble-ResNet-FGVC-50	See all
Con-Text	PHOC descriptor + Fisher Vector Encoding	See all
Bottles	PHOC descriptor + Fisher Vector Encoding	See all
MNIST	Vanilla FC layer only	See all
EMNIST-Digits	VGG-5	See all
EMNIST-Letters	VGG-5	See all
QMNIST	VGG-5	See all
Kuzushiji-MNIST	VGG-5	See all
STL-10	Pre trained wide-resnet-101	See all
BoxCars116K	ResNet152 + COOC	See all
CarFlag-1532	ResNet101-swp	See all
CarFlag-563	ResNet101-swp	See all
iNaturalist	TASN	See all
FGVC-Aircraft	EnGraf-Net101 (G=4, H=1)	See all
Herbarium 2021 Half–Earth	Conviformer-B	See all
Herbarium 2022	Conviformer-B	See all

Show all 35 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Fine-Grained Image Classification models and implementations

rwightman/pytorch-image-models

7 papers

29,735

open-mmlab/mmclassification

4 papers

3,153

osmr/imgclsmob

4 papers

2,917

Westlake-AI/openmixup

4 papers

568

See all 25 libraries.

Datasets

Subtasks

Displaced People Recognition

Latest papers

Most implemented Social Latest No code

Parameter-Efficient Long-Tailed Recognition

shijxcs/pel • • 18 Sep 2023

In this paper, we propose PEL, a fine-tuning method that can effectively adapt pre-trained models to long-tailed recognition tasks in fewer than 20 epochs without the need for extra data.

18 Sep 2023

Paper
Code

Masking Strategies for Background Bias Removal in Computer Vision Models

ananthu-aniraj/masking_strategies_bias_removal • • 23 Aug 2023

Models for fine-grained image classification tasks, where the difference between some classes can be extremely subtle and the number of samples per class tends to be low, are particularly prone to picking up background-related biases and demand robust methods to handle potential examples with out-of-distribution (OOD) backgrounds.

23 Aug 2023

Paper
Code

Multiscale patch-based feature graphs for image classification

mvtodescato/MultiscaleGraphFeatures • • Expert Systems with Applications 2023

We compared our approach with two conventional approaches for dealing with image classification.

08 Aug 2023

Paper
Code

Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification

leesb7426/cvpr2022-task-discrepancy-maximization-for-fine-grained-few-shot-classification • • 28 Jul 2023

While TDM influences high-level feature maps by task-adaptive calibration of channel-wise importance, we further introduce Instance Attention Module (IAM) operating in intermediate layers of feature extractors to instance-wisely highlight object-relevant channels, by extending QAM.

28 Jul 2023

Paper
Code

GIST: Generating Image-Specific Text for Fine-grained Object Classification

emu1729/gist • • 21 Jul 2023

We demonstrate the utility of GIST by fine-tuning vision-language models on the image-and-generated-text pairs to learn an aligned vision-language representation space for improved classification.

21 Jul 2023

Paper
Code

Diffusion Models Beat GANs on Image Classification

soumik-kanad/diffssl • • 17 Jul 2023

We explore optimal methods for extracting and using these embeddings for classification tasks, demonstrating promising results on the ImageNet classification task.

17 Jul 2023

Paper
Code

TOAST: Transfer Learning via Attention Steering

bfshi/toast • • 24 May 2023

We introduce Top-Down Attention Steering (TOAST), a novel transfer learning algorithm that keeps the pre-trained backbone frozen, selects task-relevant features in the output, and feeds those features back to the model to steer the attention to the task-specific features.

181

24 May 2023

Paper
Code

Salient Mask-Guided Vision Transformer for Fine-Grained Classification

demidovd98/sm-vit • • 11 May 2023

Fine-grained visual classification (FGVC) is a challenging computer vision problem, where the task is to automatically recognise objects from subordinate categories.

11 May 2023

Paper
Code

Reduction of Class Activation Uncertainty with Background Information

dipuk0506/SpinalNet • • 5 May 2023

Through the class activation mappings (CAMs) of the trained models, we observed the tendency towards looking at a bigger picture with the proposed model training methodology.

166

05 May 2023

Paper
Code

Learning Partial Correlation based Deep Visual Representation for Image Classification

csiro-robotics/isice • • CVPR 2023

Our work obtains a partial correlation based deep visual representation and mitigates the small sample problem often encountered by covariance matrix estimation in CNN.

23 Apr 2023

Paper
Code

Fine-Grained Image Classification

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result