Search Results for author: Jerome Revaud

Found 18 papers, 8 papers with code

DUSt3R: Geometric 3D Vision Made Easy

1 code implementation21 Dec 2023 Shuzhe Wang, Vincent Leroy, Yohann Cabon, Boris Chidlovskii, Jerome Revaud

Our formulation directly provides a 3D model of the scene as well as depth information, but interestingly, we can seamlessly recover from it, pixel matches, relative and absolute camera.

3D Reconstruction Camera Calibration +2

MFOS: Model-Free & One-Shot Object Pose Estimation

no code implementations3 Oct 2023 Jongmin Lee, Yohann Cabon, Romain Brégier, Sungjoo Yoo, Jerome Revaud

Existing learning-based methods for object pose estimation in RGB images are mostly model-specific or category based.

Object Pose Estimation

Win-Win: Training High-Resolution Vision Transformers from Two Windows

no code implementations1 Oct 2023 Vincent Leroy, Jerome Revaud, Thomas Lucas, Philippe Weinzaepfel

It is 4 times faster to train than a full-resolution network, and it is straightforward to use at test time compared to existing approaches.

Depth Estimation Depth Prediction +2

SACReg: Scene-Agnostic Coordinate Regression for Visual Localization

no code implementations21 Jul 2023 Jerome Revaud, Yohann Cabon, Romain Brégier, Jongmin Lee, Philippe Weinzaepfel

Instead of encoding the scene coordinates into the network weights, our model takes as input a database image with some sparse 2D pixel to 3D coordinate annotations, extracted from e. g. off-the-shelf Structure-from-Motion or RGB-D data, and a query image for which are predicted a dense 3D coordinate map and its confidence, based on cross-attention.

Image Retrieval regression +2

Robust Automatic Monocular Vehicle Speed Estimation for Traffic Surveillance

no code implementations ICCV 2021 Jerome Revaud, Martin Humenberger

Experimental results conducted on three diverse benchmarks demonstrate excellent speed estimation accuracy that could enable the wide use of CCTV cameras for traffic analysis, even in challenging conditions where state-of-the-art methods completely fail.

Camera Calibration Keypoint Detection +1

SuperLoss: A Generic Loss for Robust Curriculum Learning

2 code implementations NeurIPS 2020 Thibault Castells, Philippe Weinzaepfel, Jerome Revaud

The key idea is to somehow estimate the importance (or weight) of each sample directly during training based on the observation that easy and hard samples behave differently and can therefore be separated.

Image Classification Image Retrieval +4

Learning with Average Precision: Training Image Retrieval with a Listwise Loss

2 code implementations ICCV 2019 Jerome Revaud, Jon Almazan, Rafael Sampaio de Rezende, Cesar Roberto de Souza

Recent deep models for image retrieval have outperformed traditional methods by leveraging ranking-tailored loss functions, but important theoretical and practical problems remain.

Image Retrieval Retrieval

R2D2: Repeatable and Reliable Detector and Descriptor

1 code implementation14 Jun 2019 Jerome Revaud, Philippe Weinzaepfel, César De Souza, Noe Pion, Gabriela Csurka, Yohann Cabon, Martin Humenberger

In this work, we argue that salient regions are not necessarily discriminative, and therefore can harm the performance of the description.

Interest Point Detection Keypoint Detection +1

End-to-end Learning of Deep Visual Representations for Image Retrieval

4 code implementations25 Oct 2016 Albert Gordo, Jon Almazan, Jerome Revaud, Diane Larlus

Second, we build on the recent R-MAC descriptor, show that it can be interpreted as a deep and differentiable architecture, and present improvements to enhance it.

Image Retrieval Quantization +1

Beat-Event Detection in Action Movie Franchises

no code implementations15 Aug 2015 Danila Potapov, Matthijs Douze, Jerome Revaud, Zaid Harchaoui, Cordelia Schmid

While important advances were recently made towards temporally localizing and recognizing specific human actions or activities in videos, efficient detection and classification of long video chunks belonging to semantically defined categories such as "pursuit" or "romance" remains challenging. We introduce a new dataset, Action Movie Franchises, consisting of a collection of Hollywood action movie franchises.

Classification Event Detection +1

Learning to Detect Motion Boundaries

no code implementations CVPR 2015 Philippe Weinzaepfel, Jerome Revaud, Zaid Harchaoui, Cordelia Schmid

We compare the results obtained with several state-of-the-art optical flow approaches and study the impact of the different cues used in the random forest. Furthermore, we introduce a new dataset, the YouTube Motion Boundaries dataset (YMB), that comprises 60 sequences taken from real-world videos with manually annotated motion boundaries.

Boundary Detection Optical Flow Estimation

Transformation Pursuit for Image Classification

no code implementations CVPR 2014 Mattis Paulin, Jerome Revaud, Zaid Harchaoui, Florent Perronnin, Cordelia Schmid

We propose a principled algorithm – Image Transformation Pursuit (ITP) – for the automatic selection of a compact set of transformations.

Classification General Classification +1

Event Retrieval in Large Video Collections with Circulant Temporal Encoding

no code implementations CVPR 2013 Jerome Revaud, Matthijs Douze, Cordelia Schmid, Herve Jegou

Furthermore, we extend product quantization to complex vectors in order to compress our descriptors, and to compare them in the compressed domain.

Copy Detection Quantization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.