Scene Recognition

64 papers with code • 8 benchmarks • 15 datasets

This task has no description! Would you like to contribute one?

NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations

turingmotors/nuscenes-mqa 11 Dec 2023

Visual Question Answering (VQA) is one of the most important tasks in autonomous driving, which requires accurate recognition and complex situation evaluations.

14
11 Dec 2023

On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving

pjlab-adg/gpt4v-ad-exploration 9 Nov 2023

This has been a significant bottleneck, particularly in the development of common sense reasoning and nuanced scene understanding necessary for safe and reliable autonomous driving.

259
09 Nov 2023

Counting Manatee Aggregations using Deep Neural Networks and Anisotropic Gaussian Kernel

yeyimilk/deep-learning-for-manatee-counting 4 Nov 2023

In this paper, we propose a deep learning based crowd counting approach to automatically count number of manatees within a region, by using low quality images as input.

1
04 Nov 2023

A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval

jaychempan/PIR ACMMM 2023

Our highlight is the proposal of a paradigm that draws on prior knowledge to instruct adaptive learning of vision and text representations.

17
27 Oct 2023

DisasterNets: Embedding Machine Learning in Disaster Mapping

hydropml/disasternets 16 Jun 2023

It consists of two stages, space granulation and attribute granulation.

5
16 Jun 2023

NarrativeXL: A Large-scale Dataset For Long-Term Memory Models

r-seny/narrativexl 23 May 2023

We show that our questions 1) adequately represent the source material 2) can be used to diagnose a model's memory capacity 3) are not trivial for modern language models even when the memory demand does not exceed those models' context lengths.

3
23 May 2023

SRRM: Semantic Region Relation Model for Indoor Scene Recognition

ChuanxinSong/SRRM 15 May 2023

Despite the remarkable success of convolutional neural networks in various computer vision tasks, recognizing indoor scenes still presents a significant challenge due to their complex composition.

2
15 May 2023

Designing Deep Networks for Scene Recognition

zn-qiao/deep-narrow-network 13 Mar 2023

Most deep learning backbones are evaluated on ImageNet.

0
13 Mar 2023

CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets

mcg-nju/comae 13 Feb 2023

Our CoMAE presents a curriculum learning strategy to unify the two popular self-supervised representation learning algorithms: contrastive learning and masked image modeling.

29
13 Feb 2023

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

deepmind/dm_nevis 15 Nov 2022

A shared goal of several machine learning communities like continual learning, meta-learning and transfer learning, is to design algorithms and models that efficiently and robustly adapt to unseen tasks.

94
15 Nov 2022