Scene Classification

121 papers with code • 2 benchmarks • 21 datasets

Scene Classification is a task in which scenes from photographs are categorically classified. Unlike object classification, which focuses on classifying prominent objects in the foreground, Scene Classification uses the layout of objects within the scene, in addition to the ambient context, for classification.

Source: Scene classification with Convolutional Neural Networks

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Classification

Trend	Dataset	Best Model	Paper	Code	Compare
	UC Merced Land Use Dataset	µ2Net+ (ViT-L/16)			See all
	Places365-Standard	WaveMix			See all

Datasets

Most implemented papers

Most implemented Social Latest No code

A Remote Sensing Image Dataset for Cloud Removal

BUPTLdy/RICE_DATASET • 3 Jan 2019

Removing clouds is an indispensable pre-processing step in remote sensing image analysis.

Paper
Code

SEN12MS -- A Curated Dataset of Georeferenced Multi-Spectral Sentinel-1/2 Imagery for Deep Learning and Data Fusion

chrieke/awesome-satellite-imagery-datasets • • 18 Jun 2019

The availability of curated large-scale training data is a crucial factor for the development of well-generalizing deep learning methods for the extraction of geoinformation from multi-sensor remote sensing imagery.

Paper
Code

Receptive-field-regularized CNN variants for acoustic scene classification

kkoutini/cpjku_dcase19 • • 5 Sep 2019

One side effect of restricting the RF of CNNs is that more frequency information is lost.

Paper
Code

Emergent Properties of Foveated Perceptual Systems

ArturoDeza/EmergentProperties • • 14 Jun 2020

The primary model has a foveated-textural input stage, which we compare to a model with foveated-blurred input and a model with spatially-uniform blurred input (both matched for perceptual compression), and a final reference model with minimal input-based compression.

Paper
Code

Understanding the Role of Individual Units in a Deep Neural Network

davidbau/dissect • • 10 Sep 2020

Second, we use a similar analytic method to analyze a generative adversarial network (GAN) model trained to generate scenes.

Paper
Code

A system of vision sensor based deep neural networks for complex driving scene analysis in support of crash risk assessment and prevention

monjurulkarim/Crash_road_function_dataset • 18 Jun 2021

The paper further evaluates the performance of the Multi-Net and the efficiency of the developed system.

Paper
Code

Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments

tui-nicr/emsanet • • 10 Jul 2022

In order to evaluate our multi-task approach, we extend the annotations of the common RGB-D indoor datasets NYUv2 and SUNRGB-D for instance segmentation and orientation estimation.

Paper
Code

Vision-Language Models in Remote Sensing: Current Progress and Future Trends

lzw-lzw/awesome-remote-sensing-vision-language-models • • 9 May 2023

Existing AI-related research in remote sensing primarily focuses on visual understanding tasks while neglecting the semantic understanding of the objects and their relationships.

Paper
Code

Efficient Multi-Task Scene Analysis with RGB-D Transformers

tui-nicr/nicr-scene-analysis-datasets • • 8 Jun 2023

However, we show that the dual CNN-based encoder of EMSANet can be replaced with a single Transformer-based encoder.

Paper
Code

DeCUR: decoupling common & unique representations for multimodal self-supervision

zhu-xlab/decur • • 11 Sep 2023

We propose Decoupling Common and Unique Representations (DeCUR), a simple yet effective method for multimodal self-supervised learning.

Paper
Code

Scene Classification

Benchmarks Add a Result

Datasets

Most implemented papers

Content

Benchmarks

Add a Result