Scene Recognition
64 papers with code • 8 benchmarks • 15 datasets
Benchmarks
These leaderboards are used to track progress in Scene Recognition
Latest papers with no code
Non-negative Subspace Feature Representation for Few-shot Learning in Medical Imaging
Extensive empirical studies are conducted in terms of validating the effectiveness of NMF, especially its supervised variants (e. g., discriminative NMF, and supervised and constrained NMF with sparseness), and the comparison with principal component analysis (PCA), i. e., the collaborative representation-based dimensionality reduction technique derived from eigenvectors.
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Taking advantage of the fine-tuned language model on scene recognition benchmarks and the paradigm of text block detection, extensive experiments demonstrate the superior performance of our scene text spotter across multiple public benchmarks.
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
This survey explores the adaptation of visual transformer models in Autonomous Driving, a transition inspired by their success in Natural Language Processing.
Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery
In light of that, reliable automated tools that can securely and efficiently deal with this data are paramount.
Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems
By mitigating the bias in the computer vision pipelines, we can ensure fairer and more equitable outcomes for applied computer vision, including home valuation and smart home security systems.
Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition
With the explosive growth of video data in real-world applications, a comprehensive representation of videos becomes increasingly important.
Inter-object Discriminative Graph Modeling for Indoor Scene Recognition
Leveraging object information within scenes to enhance the distinguishability of feature representations has emerged as a key approach in this domain.
VisPercep: A Vision-Language Approach to Enhance Visual Perception for People with Blindness and Low Vision
By combining the prompt and input image, a large vision-language model (i. e., InstructBLIP) generates detailed and comprehensive descriptions of the environment and identifies potential risks in the environment by analyzing the environmental objects and scenes, relevant to the prompt.
Video Scene Location Recognition with Neural Networks
This paper provides an insight into the possibility of scene recognition from a video sequence with a small set of repeated shooting locations (such as in television series) using artificial neural networks.
Semantic-embedded Similarity Prototype for Scene Recognition
Due to the high inter-class similarity caused by the complex composition and the co-existing objects across scenes, numerous studies have explored object semantic knowledge within scenes to improve scene recognition.