Region Proposal
137 papers with code • 1 benchmark • 5 datasets
Libraries
Use these libraries to find Region Proposal models and implementations
Latest papers with no code
Event Camera as Region Proposal Network
The number of rods is much higher than the cones, which means that most human vision processing is done in monochrome.
Towards Precise Weakly Supervised Object Detection via Interactive Contrastive Learning of Context Information
In spite of intensive research on deep learning (DL) approaches over the past few years, there is still a significant performance gap between WSOD and fully supervised object detection.
[CLS] Token is All You Need for Zero-Shot Semantic Segmentation
Based on that, we build upon the CLIP model as a backbone which we extend with a One-Way [CLS] token navigation from text to the visual branch that enables zero-shot dense prediction, dubbed ClsCLIP.
MOST: Multiple Object localization with Self-supervised Transformers for object discovery
In this work, we present Multiple Object localization with Self-supervised Transformers (MOST) that uses features of transformers trained using self-supervised learning to localize multiple objects in real world images.
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Unlike conventional STVQA models, which take the linguistic semantics and visual semantics in scene text as two separate features, in this paper we propose a paradigm of "Locate Then Generate" (LTG), which explicitly unifies these two semantics with the spatial bounding box as a bridge connecting them.
End-to-end Semantic Object Detection with Cross-Modal Alignment
This paper presents an extension of existing object detection models for semantic image search that considers the semantic alignment between object proposals and text queries, with a focus on searching for objects within images.
σ-Adaptive Decoupled Prototype for Few-Shot Object Detection
To provide precise information for the query image, the prototype is decoupled into task-specific ones, which provide tailored guidance for 'where to look' and 'what to look for', respectively.
Multi-level and multi-modal feature fusion for accurate 3D object detection in Connected and Automated Vehicles
The fused features are shared by a two-stage network: the region proposal network (RPN) and the detection head (DH).
Multimodal Query-guided Object Localization
In such a scenario, a hand-drawn sketch of the object could be a choice for a query.
ARISE: Graph Anomaly Detection on Attributed Networks via Substructure Awareness
The average node-pair similarity can be regarded as the topology anomaly degree of nodes within substructures.
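The idea in the last snippet, scoring a substructure by the average similarity of its node pairs, can be sketched in a few lines. This is only an illustration under stated assumptions: the snippet does not say which similarity measure is used or how substructures are found, so cosine similarity over node feature vectors is assumed here, and the function names are hypothetical rather than taken from the paper.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_u * norm_v)


def average_pair_similarity(vectors):
    """Mean similarity over all unordered node pairs in one substructure."""
    pairs = list(combinations(vectors, 2))
    if not pairs:
        raise ValueError("need at least two nodes to form a pair")
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


# Hypothetical substructure of three nodes with 2-D feature vectors.
substructure = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
score = average_pair_similarity(substructure)
```

How this value maps to an anomaly degree (for example, whether a lower average marks a more anomalous substructure) is not stated in the snippet and is left open here.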