Scene Recognition

64 papers with code • 8 benchmarks • 15 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Recognition

Dataset	Best Model	Compare
YUP++	DEEP-HAL with ODF+SDF (I3D)	See all
MIT Indoor Scenes	FOSNet	See all
AID	AGOS	See all
SUN-RGBD	OMNIVORE (Swin-B)	See all
Places365	FOSNet	See all
SUN397	FOSNet	See all
ScanNet	SSMA	See all
ADE20K	Semantic-Aware Scene Recogniton (ResNet-18)	See all

Datasets

Latest papers

Most implemented Social Latest No code

MovieCLIP: Visual Scene Recognition in Movies

usc-sail/mica-MovieCLIP • • 20 Oct 2022

Longform media such as movies have complex narrative structures, with events spanning a rich variety of ambient visual scenes.

20 Oct 2022

Paper
Code

Capsule Networks as Generative Models

exilefaker/capsnet-experiments • • 6 Sep 2022

Capsule networks are a neural network architecture specialized for visual scene recognition.

06 Sep 2022

Paper
Code

All Grains, One Scheme (AGOS): Learning Multi-grain Instance Representation for Aerial Scene Classification

biqiwhu/agos • • IEEE Transactions on Geoscience and Remote Sensing 2022

Finally, our SSF allows our framework to learn the same scene scheme from multi-grain instance representations and fuses them, so that the entire framework is optimized as a whole.

06 May 2022

Paper
Code

Where in the World is this Image? Transformer-based Geo-localization in the Wild

shramanpramanick/transformer_based_geo-localization • • 29 Apr 2022

Predicting the geographic location (geo-localization) from a single ground-level RGB image taken anywhere in the world is a very challenging problem.

29 Apr 2022

Paper
Code

An Empirical Study of Remote Sensing Pretraining

vitae-transformer/vitae-transformer-remote-sensing • 6 Apr 2022

To this end, we train different networks from scratch with the help of the largest RS scene recognition dataset up to now -- MillionAID, to obtain a series of RS pretrained backbones, including both convolutional neural networks (CNN) and vision transformers such as Swin and ViTAE, which have shown promising performance on computer vision tasks.

413

06 Apr 2022

Paper
Code

Omnivore: A Single Model for Many Visual Modalities

towhee-io/towhee • • CVPR 2022

Prior work has studied different visual modalities in isolation and developed separate architectures for recognition of images, videos, and 3D data.

2,996

20 Jan 2022

Paper
Code

InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition

andreea-glavan/multimodal-audiovisual-scene-recognition • • 23 Dec 2021

Furthermore, we highlight the potential of our approach by benchmarking on a YouTube-8M subset of indoor scenes as well, where it achieves 74% accuracy and 0. 74 F1-Score.

23 Dec 2021

Paper
Code

An embarrassingly simple comparison of machine learning algorithms for indoor scene classification

bhanukaManesha/embarrassingly-simple-classifier-comparison • • 25 Sep 2021

With the emergence of autonomous indoor robots, the computer vision task of indoor scene recognition has gained the spotlight.

25 Sep 2021

Paper
Code

Object-to-Scene: Learning to Transfer Object Knowledge to Indoor Scene Recognition

FreeformRobotics/OTS • • 1 Aug 2021

The final results in this work show that OTS successfully extracts object features and learns object relations from the segmentation network.

01 Aug 2021

Paper
Code

BORM: Bayesian Object Relation Model for Indoor Scene Recognition

FreeformRobotics/BORM • • 1 Aug 2021

First, we utilize an improved object model (IOM) as a baseline that enriches the object knowledge by introducing a scene parsing algorithm pretrained on the ADE20K dataset with rich object categories related to the indoor scene.

01 Aug 2021

Paper
Code

Scene Recognition

Benchmarks Add a Result

Datasets

Latest papers

Content

Benchmarks

Add a Result