Image Classification
3788 papers with code • 142 benchmarks • 240 datasets
Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.
Libraries
Use these libraries to find Image Classification models and implementationsDatasets
Subtasks
- Out of Distribution (OOD) Detection
- Few-Shot Image Classification
- Fine-Grained Image Classification
- Semi-Supervised Image Classification
- Semi-Supervised Image Classification
- Learning with noisy labels
- Hyperspectral Image Classification
- Self-Supervised Image Classification
- Small Data Image Classification
- Multi-Label Image Classification
- Genre classification
- Sequential Image Classification
- Unsupervised Image Classification
- Efficient ViTs
- Document Image Classification
- Satellite Image Classification
- Sparse Representation-based Classification
- Photo geolocation estimation
- Image Classification with Differential Privacy
- Token Reduction
- Superpixel Image Classification
- Classification Consistency
- Gallbladder Cancer Detection
- Artistic style classification
- Artist classification
- Temporal Metadata Manipulation Detection
- Misclassification Rate - Natural Adversarial Samples
- Scale Generalisation
Latest papers with no code
EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder
Our experimental results demonstrate that EncodeNet improves the accuracy of VGG16 from 92. 64% to 94. 05% on CIFAR-10 and RestNet20 from 74. 56% to 76. 04% on CIFAR-100.
I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning
Online task-free continual learning (OTFCL) is a more challenging variant of continual learning which emphasizes the gradual shift of task boundaries and learns in an online mode.
Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing
ViT divides an image into several local patches, known as "visual sentences".
Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment
This study proposes a multi-modal fusion framework Multitrans based on the Transformer architecture and self-attention mechanism.
A Hybrid Generative and Discriminative PointNet on Unordered Point Sets
This paper proposes GDPNet, the first hybrid Generative and Discriminative PointNet that extends JEM for point cloud classification and generation.
On-board classification of underwater images using hybrid classical-quantum CNN based method
In the current work, we use quantum-classical hybrid machine learning methods for real-time under-water object recognition on-board an AUV for the first time.
Concept Induction using LLMs: a user experiment for assessment
To evaluate the output, we compare the concepts generated by the LLM with two other methods: concepts generated by humans and the ECII heuristic concept induction system.
A Perspective on Deep Vision Performance with Standard Image and Video Codecs
The use of standardized codecs, such as JPEG or H. 264, is prevalent and required to ensure interoperability.
Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured
Based on various types of non-learnable operators, including gradient, sort, local binary pattern, maximum, etc., this paper designs a set of new convolution operations that are natually invariant to arbitrary rotations.
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Pre-trained vision-language (V-L) models such as CLIP have shown excellent performance in many downstream cross-modal tasks.