Image Classification

3788 papers with code • 142 benchmarks • 240 datasets

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Libraries

Use these libraries to find Image Classification models and implementations

Latest papers with no code

EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder

no code yet • 21 Apr 2024

Our experimental results demonstrate that EncodeNet improves the accuracy of VGG16 from 92. 64% to 94. 05% on CIFAR-10 and RestNet20 from 74. 56% to 76. 04% on CIFAR-100.

I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning

no code yet • 21 Apr 2024

Online task-free continual learning (OTFCL) is a more challenging variant of continual learning which emphasizes the gradual shift of task boundaries and learns in an online mode.

Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing

no code yet • 20 Apr 2024

ViT divides an image into several local patches, known as "visual sentences".

Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment

no code yet • 19 Apr 2024

This study proposes a multi-modal fusion framework Multitrans based on the Transformer architecture and self-attention mechanism.

A Hybrid Generative and Discriminative PointNet on Unordered Point Sets

no code yet • 19 Apr 2024

This paper proposes GDPNet, the first hybrid Generative and Discriminative PointNet that extends JEM for point cloud classification and generation.

On-board classification of underwater images using hybrid classical-quantum CNN based method

no code yet • 19 Apr 2024

In the current work, we use quantum-classical hybrid machine learning methods for real-time under-water object recognition on-board an AUV for the first time.

Concept Induction using LLMs: a user experiment for assessment

no code yet • 18 Apr 2024

To evaluate the output, we compare the concepts generated by the LLM with two other methods: concepts generated by humans and the ECII heuristic concept induction system.

A Perspective on Deep Vision Performance with Standard Image and Video Codecs

no code yet • 18 Apr 2024

The use of standardized codecs, such as JPEG or H. 264, is prevalent and required to ensure interoperability.

Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured

no code yet • 17 Apr 2024

Based on various types of non-learnable operators, including gradient, sort, local binary pattern, maximum, etc., this paper designs a set of new convolution operations that are natually invariant to arbitrary rotations.

A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene

no code yet • 17 Apr 2024

Pre-trained vision-language (V-L) models such as CLIP have shown excellent performance in many downstream cross-modal tasks.