Search Results for author: Jerome Revaud

Found 18 papers, 8 papers with code

DUSt3R: Geometric 3D Vision Made Easy

1 code implementation • 21 Dec 2023 • Shuzhe Wang, Vincent Leroy, Yohann Cabon, Boris Chidlovskii, Jerome Revaud

Our formulation directly provides a 3D model of the scene as well as depth information, but interestingly, we can seamlessly recover from it, pixel matches, relative and absolute camera.

3D Reconstruction Camera Calibration +2

4,144

Paper
Code

MFOS: Model-Free & One-Shot Object Pose Estimation

no code implementations • 3 Oct 2023 • Jongmin Lee, Yohann Cabon, Romain Brégier, Sungjoo Yoo, Jerome Revaud

Existing learning-based methods for object pose estimation in RGB images are mostly model-specific or category based.

Object Pose Estimation

Paper
Add Code

Win-Win: Training High-Resolution Vision Transformers from Two Windows

no code implementations • 1 Oct 2023 • Vincent Leroy, Jerome Revaud, Thomas Lucas, Philippe Weinzaepfel

It is 4 times faster to train than a full-resolution network, and it is straightforward to use at test time compared to existing approaches.

Depth Estimation Depth Prediction +2

Paper
Add Code

SACReg: Scene-Agnostic Coordinate Regression for Visual Localization

no code implementations • 21 Jul 2023 • Jerome Revaud, Yohann Cabon, Romain Brégier, Jongmin Lee, Philippe Weinzaepfel

Instead of encoding the scene coordinates into the network weights, our model takes as input a database image with some sparse 2D pixel to 3D coordinate annotations, extracted from e. g. off-the-shelf Structure-from-Motion or RGB-D data, and a query image for which are predicted a dense 3D coordinate map and its confidence, based on cross-attention.

Image Retrieval regression +2

Paper
Add Code

Robust Automatic Monocular Vehicle Speed Estimation for Traffic Surveillance

no code implementations • ICCV 2021 • Jerome Revaud, Martin Humenberger

Experimental results conducted on three diverse benchmarks demonstrate excellent speed estimation accuracy that could enable the wide use of CCTV cameras for traffic analysis, even in challenging conditions where state-of-the-art methods completely fail.

Camera Calibration Keypoint Detection +1

Paper
Add Code

SuperLoss: A Generic Loss for Robust Curriculum Learning

2 code implementations • NeurIPS 2020 • Thibault Castells, Philippe Weinzaepfel, Jerome Revaud

The key idea is to somehow estimate the importance (or weight) of each sample directly during training based on the observation that easy and hard samples behave differently and can therefore be separated.

Image Classification Image Retrieval +4

Paper
Code

R2D2: Reliable and Repeatable Detector and Descriptor

2 code implementations • NeurIPS 2019 • Jerome Revaud, Cesar De Souza, Martin Humenberger, Philippe Weinzaepfel

We thus propose to jointly learn keypoint detection and description together with a predictor of the local descriptor discriminativeness.

Ranked #2 on Camera Localization on Aachen Day-Night benchmark

Atari Games Camera Localization +4

443

Paper
Code

Learning with Average Precision: Training Image Retrieval with a Listwise Loss

2 code implementations • ICCV 2019 • Jerome Revaud, Jon Almazan, Rafael Sampaio de Rezende, Cesar Roberto de Souza

Recent deep models for image retrieval have outperformed traditional methods by leveraging ranking-tailored loss functions, but important theoretical and practical problems remain.

Image Retrieval Retrieval

609

Paper
Code

R2D2: Repeatable and Reliable Detector and Descriptor

1 code implementation • 14 Jun 2019 • Jerome Revaud, Philippe Weinzaepfel, César De Souza, Noe Pion, Gabriela Csurka, Yohann Cabon, Martin Humenberger

In this work, we argue that salient regions are not necessarily discriminative, and therefore can harm the performance of the description.

Interest Point Detection Keypoint Detection +1

438

Paper
Code

Did It Change? Learning to Detect Point-Of-Interest Changes for Proactive Map Updates

no code implementations • CVPR 2019 • Jerome Revaud, Minhyeok Heo, Rafael S. Rezende, Chanmi You, Seong-Gyun Jeong

Maps are an increasingly important tool in our daily lives, yet their rich semantic content still largely depends on manual input.

Change Detection Metric Learning

Paper
Add Code

End-to-end Learning of Deep Visual Representations for Image Retrieval

4 code implementations • 25 Oct 2016 • Albert Gordo, Jon Almazan, Jerome Revaud, Diane Larlus

Second, we build on the recent R-MAC descriptor, show that it can be interpreted as a deep and differentiable architecture, and present improvements to enhance it.

Ranked #13 on Image Retrieval on ROxford (Medium)

Image Retrieval Quantization +1

609

Paper
Code

Deep Image Retrieval: Learning global representations for image search

3 code implementations • 5 Apr 2016 • Albert Gordo, Jon Almazan, Jerome Revaud, Diane Larlus

We propose a novel approach for instance-level image retrieval.

Ranked #3 on Image Retrieval on Oxf105k

Image Retrieval Region Proposal +1

76,589

Paper
Code

Beat-Event Detection in Action Movie Franchises

no code implementations • 15 Aug 2015 • Danila Potapov, Matthijs Douze, Jerome Revaud, Zaid Harchaoui, Cordelia Schmid

While important advances were recently made towards temporally localizing and recognizing specific human actions or activities in videos, efficient detection and classification of long video chunks belonging to semantically defined categories such as "pursuit" or "romance" remains challenging. We introduce a new dataset, Action Movie Franchises, consisting of a collection of Hollywood action movie franchises.

Classification Event Detection +1

Paper
Add Code

DeepMatching: Hierarchical Deformable Dense Matching

1 code implementation • 25 Jun 2015 • Jerome Revaud, Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid

We introduce a novel matching algorithm, called DeepMatching, to compute dense correspondences between images.

Ranked #4 on Dense Pixel Correspondence Estimation on HPatches

Dense Pixel Correspondence Estimation Optical Flow Estimation

Paper
Code

Learning to Detect Motion Boundaries

no code implementations • CVPR 2015 • Philippe Weinzaepfel, Jerome Revaud, Zaid Harchaoui, Cordelia Schmid

We compare the results obtained with several state-of-the-art optical flow approaches and study the impact of the different cues used in the random forest. Furthermore, we introduce a new dataset, the YouTube Motion Boundaries dataset (YMB), that comprises 60 sequences taken from real-world videos with manually annotated motion boundaries.

Boundary Detection Optical Flow Estimation

Paper
Add Code

EpicFlow: Edge-Preserving Interpolation of Correspondences for Optical Flow

no code implementations • CVPR 2015 • Jerome Revaud, Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid

We propose a novel approach for optical flow estimation , targeted at large displacements with significant oc-clusions.

Optical Flow Estimation

Paper
Add Code

Transformation Pursuit for Image Classification

no code implementations • CVPR 2014 • Mattis Paulin, Jerome Revaud, Zaid Harchaoui, Florent Perronnin, Cordelia Schmid

We propose a principled algorithm Image Transformation Pursuit (ITP) for the automatic selection of a compact set of transformations.

Classification General Classification +1

Paper
Add Code

Event Retrieval in Large Video Collections with Circulant Temporal Encoding

no code implementations • CVPR 2013 • Jerome Revaud, Matthijs Douze, Cordelia Schmid, Herve Jegou

Furthermore, we extend product quantization to complex vectors in order to compress our descriptors, and to compare them in the compressed domain.

Copy Detection Quantization +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.