Fine-Grained Visual Categorization
26 papers with code • 0 benchmarks • 5 datasets
Benchmarks
These leaderboards are used to track progress in Fine-Grained Visual Categorization
Most implemented papers
On the Eigenvalues of Global Covariance Pooling for Fine-grained Visual Recognition
Inspired by this observation, we propose a network branch dedicated to magnifying the importance of small eigenvalues.
ViT-NeT: Interpretable Vision Transformers with Neural Tree Decoder
How- ever, the complexity of the model makes it difficult to interpret the decision-making process, and the ambiguity of the attention maps can cause incorrect correlations between image patches.
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
We thoroughly benchmark audiovisual classification performance and modality fusion experiments through the use of state-of-the-art transformer methods.
SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization
To address the above limitations, we propose the Structure Information Modeling Transformer (SIM-Trans) to incorporate object structure information into transformer for enhancing discriminative representation learning to contain both the appearance information and structure information.
Coping with Change: Learning Invariant and Minimum Sufficient Representations for Fine-Grained Visual Categorization
Fine-grained visual categorization (FGVC) is a challenging task due to similar visual appearances between various species.
Data-free Knowledge Distillation for Fine-grained Visual Categorization
Our approach utilizes an adversarial distillation framework with attention generator, mixed high-order attention distillation, and semantic feature contrast learning.