Search Results for author: Fabien Baradel

Found 14 papers, 7 papers with code

Cross-view and Cross-pose Completion for 3D Human Understanding

no code implementations15 Nov 2023 Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez

Human perception and understanding is a major domain of computer vision which, like many other vision subdomains recently, stands to gain from the use of large models pre-trained on large datasets.

Human Mesh Recovery Self-Supervised Learning

SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction

no code implementations19 Sep 2023 Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Bregier, Matthieu Armando, Jean-Sebastien Franco, Gregory Rogez

Recent hand-object interaction datasets show limited real object variability and rely on fitting the MANO parametric model to obtain groundtruth hand shapes.

3D Reconstruction Benchmarking +1

PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting

1 code implementation19 Oct 2022 Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Grégory Rogez

The discrete and compressed nature of the latent space allows the GPT-like model to focus on long-range signal, as it removes low-level redundancy in the input signal.

Human-Object Interaction Detection Quantization

PoseBERT: A Generic Transformer Module for Temporal 3D Human Modeling

1 code implementation22 Aug 2022 Fabien Baradel, Romain Brégier, Thibault Groueix, Philippe Weinzaepfel, Yannis Kalantidis, Grégory Rogez

It is simple, generic and versatile, as it can be plugged on top of any image-based model to transform it in a video-based model leveraging temporal information.

Pose Estimation Pose Prediction

Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel Space

no code implementations ICLR 2022 Steeven Janny, Fabien Baradel, Natalia Neverova, Madiha Nadri, Greg Mori, Christian Wolf

Learning causal relationships in high-dimensional data (images, videos) is a hard task, as they are often defined on low dimensional manifolds and must be extracted from complex signals dominated by appearance, lighting, textures and also spurious correlations in the data.

counterfactual Counterfactual Reasoning +1

CoPhy: Counterfactual Learning of Physical Dynamics

1 code implementation ICLR 2020 Fabien Baradel, Natalia Neverova, Julien Mille, Greg Mori, Christian Wolf

Understanding causes and effects in mechanical systems is an essential component of reasoning in the physical world.

counterfactual Video Prediction

Learning Video Representations using Contrastive Bidirectional Transformer

no code implementations13 Jun 2019 Chen Sun, Fabien Baradel, Kevin Murphy, Cordelia Schmid

This paper proposes a self-supervised learning approach for video features that results in significantly improved performance on downstream tasks (such as video classification, captioning and segmentation) compared to existing methods.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Object Level Visual Reasoning in Videos

1 code implementation ECCV 2018 Fabien Baradel, Natalia Neverova, Christian Wolf, Julien Mille, Greg Mori

Human activity recognition is typically addressed by detecting key concepts like global and local motion, features related to object classes present in the scene, as well as features related to the global context.

Human Activity Recognition Object +3

Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points

1 code implementation CVPR 2018 Fabien Baradel, Christian Wolf, Julien Mille, Graham W. Taylor

No spatial coherence is forced on the glimpse locations, which gives the module liberty to explore different points at each frame and better optimize the process of scrutinizing visual information.

Action Recognition Activity Prediction +3

Human Action Recognition: Pose-based Attention draws focus to Hands

no code implementations20 Dec 2017 Fabien Baradel, Christian Wolf, Julien Mille

We propose a new spatio-temporal attention based mechanism for human action recognition able to automatically attend to the hands most involved into the studied action and detect the most discriminative moments in an action.

Action Recognition Temporal Action Localization

Pose-conditioned Spatio-Temporal Attention for Human Action Recognition

no code implementations29 Mar 2017 Fabien Baradel, Christian Wolf, Julien Mille

We show that it is of high interest to shift the attention to different hands at different time steps depending on the activity itself.

Action Recognition Human Activity Recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.