Scene Understanding

516 papers with code • 3 benchmarks • 43 datasets

Scene Understanding is something that to understand a scene. For instance, iPhone has function that help eye disabled person to take a photo by discribing what the camera sees. This is an example of Scene Understanding.

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Understanding

Dataset	Best Model	Compare
ADE20K val	CPN(ResNet-101)	See all
Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)	ACRV Baseline	See all
Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)	ACRV Baseline	See all

Libraries

Use these libraries to find Scene Understanding models and implementations

osmr/imgclsmob

4 papers

2,918

Pointcept/Pointcept

4 papers

1,139

PaddlePaddle/PaddleDetection

2 papers

12,088

open-mmlab/mmdetection3d

2 papers

4,828

See all 5 libraries.

Datasets

Subtasks

road scene understanding

Monocular Cross-View Road Scene Parsing(Road)

Outdoor Light Source Estimation

Most implemented papers

Most implemented Social Latest No code

FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding

tashchowdhury/FloodNet-Supervised_v1.0 • 5 Dec 2020

This dataset demonstrates the post flooded damages of the affected areas.

Paper
Code

Boundary-Seeking Generative Adversarial Networks

eriklindernoren/PyTorch-GAN • • 27 Feb 2017

We introduce a method for training GANs with discrete data that uses the estimated difference measure from the discriminator to compute importance weights for generated samples, thus providing a policy gradient for training the generator.

Paper
Code

Multi-Task Learning as Multi-Objective Optimization

IntelVCL/MultiObjectiveOptimization • • NeurIPS 2018

These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks.

Paper
Code

ShelfNet for Fast Semantic Segmentation

juntang-zhuang/ShelfNet • • 27 Nov 2018

Compared with real-time segmentation models such as BiSeNet, our model achieves higher accuracy at comparable speed on the Cityscapes Dataset, enabling the application in speed-demanding tasks such as street-scene understanding for autonomous driving.

Paper
Code

From Points to Parts: 3D Object Detection from Point Cloud with Part-aware and Part-aggregation Network

sshaoshuai/PointCloudDet3D • • 8 Jul 2019

3D object detection from LiDAR point cloud is a challenging problem in 3D scene understanding and has many practical applications.

Paper
Code

Adapting Deep Network Features to Capture Psychological Representations

kbraunlich/contort_DNN • 6 Aug 2016

To remedy this, we develop a method for adapting deep features to align with human similarity judgments, resulting in image representations that can potentially be used to extend the scope of psychological experiments.

Paper
Code

SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences

PRBonn/semantic-kitti-api • ICCV 2019

Despite the relevance of semantic scene understanding for this application, there is a lack of a large dataset for this task which is based on an automotive LiDAR.

Paper
Code

Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN

vinthony/ghost-free-shadow-removal • • 20 Nov 2019

With the help of novel masks or scenes, we enhance the current datasets using synthesized shadow images.

Paper
Code

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

xiaofengShi/CHINESE-OCR • • 26 Jan 2016

The goal of COCO-Text is to advance state-of-the-art in text detection and recognition in natural images.

Paper
Code

Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions

facebookresearch/TensorComprehensions • • 13 Feb 2018

Deep learning models with convolutional and recurrent networks are now ubiquitous and analyze massive amounts of audio, image, video, text and graph data, with applications in automatic translation, speech-to-text, scene understanding, ranking user preferences, ad placement, etc.

Paper
Code

Scene Understanding

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result