Search Results for author: Davide Scaramuzza

Found 119 papers, 66 papers with code

Event Cameras Meet SPADs for High-Speed, Low-Bandwidth Imaging

no code implementations • 17 Apr 2024 • Manasi Muglikar, Siddharth Somasundaram, Akshat Dave, Edoardo Charbon, Ramesh Raskar, Davide Scaramuzza

Traditional cameras face a trade-off between low-light performance and high-speed imaging: longer exposure times to capture sufficient light results in motion blur, whereas shorter exposures result in Poisson-corrupted noisy images.

Image Reconstruction Sensor Fusion

Paper
Add Code

3D scene generation from scene graphs and self-attention

no code implementations • 2 Apr 2024 • Pietro Bonazzi, Mengqi Wang, Diego Martin Arroyo, Fabian Manhardt, Nico Messikomer, Federico Tombari, Davide Scaramuzza

Synthesizing realistic and diverse indoor 3D scene layouts in a controllable fashion opens up applications in simulated navigation and virtual reality.

Scene Generation

Paper
Add Code

Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features

no code implementations • 1 Apr 2024 • Pietro Bonazzi, Marie-Julie Rakatosaona, Marco Cannici, Federico Tombari, Davide Scaramuzza

Existing deep learning methods for the reconstruction and denoising of point clouds rely on small datasets of 3D shapes.

Denoising Point cloud reconstruction

Paper
Add Code

An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

no code implementations • 1 Apr 2024 • Ling Gao, Daniel Gehrig, Hang Su, Davide Scaramuzza, Laurent Kneip

To recover the full linear camera velocity we fuse observations from multiple lines with a novel velocity averaging scheme that relies on a geometrically-motivated residual, and thus solves the problem more efficiently than previous schemes which minimize an algebraic residual.

Motion Estimation

Paper
Add Code

Mitigating Motion Blur in Neural Radiance Fields with Events and Frames

1 code implementation • 28 Mar 2024 • Marco Cannici, Davide Scaramuzza

Neural Radiance Fields (NeRFs) have shown great potential in novel view synthesis.

Novel View Synthesis

Paper
Code

Learning Quadruped Locomotion Using Differentiable Simulation

no code implementations • 21 Mar 2024 • Yunlong Song, Sangbae Kim, Davide Scaramuzza

This work provides several important insights into using differentiable simulations for legged locomotion in the real world.

Paper
Add Code

Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

no code implementations • 18 Mar 2024 • Jiaxu Xing, Angel Romero, Leonard Bauersfeld, Davide Scaramuzza

Our experiments in both simulated and real-world environments demonstrate that our approach achieves superior performance and robustness than IL or RL alone in navigating a quadrotor through a racing course using only visual information without explicit state estimation.

Imitation Learning reinforcement-learning +1

Paper
Add Code

State Space Models for Event Cameras

1 code implementation • 23 Feb 2024 • Nikola Zubić, Mathias Gehrig, Davide Scaramuzza

We address this challenge by introducing state-space models (SSMs) with learnable timescale parameters to event-based vision.

Event-based vision

Paper
Code

Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning

no code implementations • 17 Oct 2023 • Yunlong Song, Angel Romero, Matthias Mueller, Vladlen Koltun, Davide Scaramuzza

A central question in robotics is how to design a control system for an agile mobile robot.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A 5-Point Minimal Solver for Event Camera Relative Motion Estimation

no code implementations • ICCV 2023 • Ling Gao, Hang Su, Daniel Gehrig, Marco Cannici, Davide Scaramuzza, Laurent Kneip

Event-based cameras are ideal for line-based motion estimation, since they predominantly respond to edges in the scene.

Event-based Motion Estimation Motion Estimation

Paper
Add Code

Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight

no code implementations • 18 Sep 2023 • Jiaxu Xing, Leonard Bauersfeld, Yunlong Song, Chunwei Xing, Davide Scaramuzza

The utility of a robot greatly depends on its ability to perform a task in the real world, outside of a well-controlled lab environment.

Contrastive Learning Representation Learning

Paper
Add Code

End-to-end Learned Visual Odometry with Events and Frames

no code implementations • 18 Sep 2023 • Roberto Pellerito, Marco Cannici, Daniel Gehrig, Joris Belhadj, Olivier Dubois-Matra, Massimo Casasco, Davide Scaramuzza

Visual Odometry (VO) is crucial for autonomous robotic navigation, especially in GPS-denied environments like planetary terrains.

Visual Odometry

Paper
Add Code

Contrastive Initial State Buffer for Reinforcement Learning

1 code implementation • 18 Sep 2023 • Nico Messikommer, Yunlong Song, Davide Scaramuzza

In Reinforcement Learning, the trade-off between exploration and exploitation poses a complex challenge for achieving efficient learning from limited samples.

reinforcement-learning

Paper
Code

Seeing Behind Dynamic Occlusions with Event Cameras

no code implementations • 28 Jul 2023 • Rong Zou, Manasi Muglikar, Nico Messikommer, Davide Scaramuzza

We present the first large-scale dataset consisting of synchronized images and event sequences to evaluate our approach.

Image Inpainting

Paper
Add Code

E-Calib: A Fast, Robust and Accurate Calibration Toolbox for Event Cameras

1 code implementation • 15 Jun 2023 • Mohammed Salah, Abdulla Ayyad, Muhammad Humais, Daniel Gehrig, Abdelqader Abusafieh, Lakmal Seneviratne, Davide Scaramuzza, Yahya Zweiri

However, conventional image-based calibration techniques are not applicable due to the asynchronous, binary output of the sensor.

Image Reconstruction

Paper
Code

Revisiting Token Pruning for Object Detection and Instance Segmentation

1 code implementation • 12 Jun 2023 • Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza

In relation to the dense counterpart that utilizes all tokens, our method realizes an increase in inference speed, achieving up to 34% faster performance for the entire network and 46% for the backbone.

Image Classification Instance Segmentation +4

Paper
Code

From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection

1 code implementation • ICCV 2023 • Nikola Zubić, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

However, selecting the appropriate representation for the task traditionally requires training a neural network for each representation and selecting the best one based on the validation score, which is very time-consuming.

Ranked #1 on Object Detection on GEN1 Detection

Event-based vision object-detection +3

Paper
Code

Neuromorphic Optical Flow and Real-time Implementation with Event Cameras

no code implementations • 14 Apr 2023 • Yannick Schnider, Stanislaw Wozniak, Mathias Gehrig, Jules Lecomte, Axel von Arnim, Luca Benini, Davide Scaramuzza, Angeliki Pantazi

Optical flow provides information on relative motion that is an important component in many computer vision pipelines.

Event-based vision Optical Flow Estimation

Paper
Add Code

A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception

1 code implementation • 24 Mar 2023 • Asude Aydin, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

Our hybrid ANN-SNN model thus combines the best of both worlds: It does not suffer from long state transients and state decay thanks to the ANN, and can generate predictions with high temporal resolution, low latency, and low power thanks to the SNN.

3D Human Pose Estimation

Paper
Code

COVERED, CollabOratiVE Robot Environment Dataset for 3D Semantic segmentation

1 code implementation • 24 Feb 2023 • Charith Munasinghe, Fatemeh Mohammadi Amin, Davide Scaramuzza, Hans Wernher van de Venn

Despite the importance of semantic understanding for such applications, 3D semantic segmentation of collaborative robot workspaces lacks sufficient research and dedicated datasets.

3D Semantic Segmentation Real-Time Semantic Segmentation

Paper
Code

Improving safety in physical human-robot collaboration via deep metric learning

no code implementations • 23 Feb 2023 • Maryam Rezayati, Grammatiki Zanni, Ying Zaoshi, Davide Scaramuzza, Hans Wernher van de Venn

This work uses the Deep Metric Learning (DML) approach to distinguish between non-contact robot movement, intentional contact aimed at physical human-robot interaction, and collision situations.

Metric Learning

Paper
Add Code

Event-based Shape from Polarization

no code implementations • CVPR 2023 • Manasi Muglikar, Leonard Bauersfeld, Diederik Paul Moeys, Davide Scaramuzza

Our method uses the continuous event stream caused by the rotation to reconstruct relative intensities at multiple polarizer angles.

Paper
Add Code

Recurrent Vision Transformers for Object Detection with Event Cameras

1 code implementation • CVPR 2023 • Mathias Gehrig, Davide Scaramuzza

By revisiting the high-level design of recurrent vision backbones, we reduce inference time by a factor of 6 while retaining similar performance.

Event-based vision object-detection +1

288

Paper
Code

SLAM for Visually Impaired People: a Survey

no code implementations • 9 Dec 2022 • Marziyeh Bamdad, Davide Scaramuzza, Alireza Darvishy

In recent decades, several assistive technologies have been developed to improve the ability of blind and visually impaired individuals to navigate independently and safely.

Navigate Simultaneous Localization and Mapping

Paper
Add Code

Data-driven Feature Tracking for Event Cameras

1 code implementation • CVPR 2023 • Nico Messikommer, Carter Fang, Mathias Gehrig, Davide Scaramuzza

Because of their high temporal resolution, increased resilience to motion blur, and very sparse output, event cameras have been shown to be ideal for low-latency and low-bandwidth feature tracking, even in challenging scenarios.

132

Paper
Code

Pushing the Limits of Asynchronous Graph-based Object Detection with Event Cameras

no code implementations • 22 Nov 2022 • Daniel Gehrig, Davide Scaramuzza

A recent line of work tackles this issue by modeling events as spatiotemporally evolving graphs that can be efficiently and asynchronously processed using graph neural networks.

object-detection Object Detection

Paper
Add Code

Cracking Double-Blind Review: Authorship Attribution with Deep Learning

1 code implementation • 14 Nov 2022 • Leonard Bauersfeld, Angel Romero, Manasi Muglikar, Davide Scaramuzza

In this work, we present a transformer-based, neural-network architecture that only uses the text content and the author names in the bibliography to attribute an anonymous manuscript to an author.

Attribute Authorship Attribution

Paper
Code

Learning Deep Sensorimotor Policies for Vision-based Autonomous Drone Racing

no code implementations • 26 Oct 2022 • Jiawei Fu, Yunlong Song, Yan Wu, Fisher Yu, Davide Scaramuzza

The resulting policy directly infers control commands with feature representations learned from raw images, forgoing the need for globally-consistent state estimation, trajectory planning, and handcrafted control design.

Contrastive Learning Trajectory Planning

Paper
Add Code

Learning Perception-Aware Agile Flight in Cluttered Environments

no code implementations • 4 Oct 2022 • Yunlong Song, Kexin Shi, Robert Penicka, Davide Scaramuzza

Recently, neural control policies have outperformed existing model-based planning-and-control methods for autonomously navigating quadrotors through cluttered environments in minimum time.

Imitation Learning Reinforcement Learning (RL)

Paper
Add Code

Training Efficient Controllers via Analytic Policy Gradient

1 code implementation • 26 Sep 2022 • Nina Wiedemann, Valentin Wüest, Antonio Loquercio, Matthias Müller, Dario Floreano, Davide Scaramuzza

Conversely, learning-based offline optimization approaches, such as Reinforcement Learning (RL), allow fast and efficient execution on the robot but hardly match the accuracy of MPC in trajectory tracking tasks.

Model Predictive Control Reinforcement Learning (RL)

106

Paper
Code

Data-Efficient Collaborative Decentralized Thermal-Inertial Odometry

1 code implementation • 14 Sep 2022 • Vincenzo Polizzi, Robert Hewitt, Javier Hidalgo-Carrió, Jeff Delaune, Davide Scaramuzza

Our system back-end uses a covariance-intersection fusion strategy to neglect the cross-correlation between agents so as to lower memory usage and computational cost.

Paper
Code

E-NeRF: Neural Radiance Fields from a Moving Event Camera

1 code implementation • 24 Aug 2022 • Simon Klenk, Lukas Koestler, Davide Scaramuzza, Daniel Cremers

We also show that combining events and frames can overcome failure cases of NeRF estimation in scenarios where only a few input views are available without requiring additional regularization.

Paper
Code

Event-aided Direct Sparse Odometry

no code implementations • CVPR 2022 • Javier Hidalgo-Carrió, Guillermo Gallego, Davide Scaramuzza

This opens the door to low-power motion-tracking applications where frames are sparingly triggered "on demand" and our method tracks the motion in between.

Monocular Visual Odometry

Paper
Add Code

Exploring Event Camera-based Odometry for Planetary Robots

no code implementations • 12 Apr 2022 • Florian Mahlknecht, Daniel Gehrig, Jeremy Nash, Friedrich M. Rockenbauer, Benjamin Morrell, Jeff Delaune, Davide Scaramuzza

Due to their resilience to motion blur and high robustness in low-light and high dynamic range conditions, event cameras are poised to become enabling sensors for vision-based exploration on future Mars helicopter missions.

Paper
Add Code

AEGNN: Asynchronous Event-based Graph Neural Networks

no code implementations • CVPR 2022 • Simon Schaefer, Daniel Gehrig, Davide Scaramuzza

For this reason, recent works have adopted Graph Neural Networks (GNNs), which process events as ``static" spatio-temporal graphs, which are inherently "sparse".

Paper
Add Code

Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion

no code implementations • CVPR 2022 • Stepan Tulyakov, Alfredo Bochicchio, Daniel Gehrig, Stamatios Georgoulis, Yuanyou Li, Davide Scaramuzza

Recently, video frame interpolation using a combination of frame- and event-based cameras has surpassed traditional image-based methods both in terms of performance and memory efficiency.

Motion Estimation Video Frame Interpolation

Paper
Add Code

Are High-Resolution Event Cameras Really Needed?

no code implementations • 28 Mar 2022 • Daniel Gehrig, Davide Scaramuzza

We provide both empirical and theoretical evidence for this claim, which indicates that high-resolution event cameras exhibit higher per-pixel event rates, leading to higher temporal noise in low-illumination conditions and at high speeds.

Image Reconstruction Optical Flow Estimation +2

Paper
Add Code

Dense Continuous-Time Optical Flow from Events and Frames

1 code implementation • 25 Mar 2022 • Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

To the best of our knowledge, our model is the first method that can regress dense pixel trajectories from event data.

Optical Flow Estimation

132

Paper
Code

ESS: Learning Event-based Semantic Segmentation from Still Images

1 code implementation • 18 Mar 2022 • Zhaoning Sun, Nico Messikommer, Daniel Gehrig, Davide Scaramuzza

Nonetheless, semantic segmentation with event cameras is still in its infancy which is chiefly due to the lack of high-quality, labeled datasets.

Ranked #5 on Event-based Object Segmentation on MVSEC-SEG

Event-based Object Segmentation Segmentation +2

Paper
Code

Real-time Neural-MPC: Deep Learning Model Predictive Control for Quadrotors and Agile Robotic Platforms

2 code implementations • 15 Mar 2022 • Tim Salzmann, Elia Kaufmann, Jon Arrizabalaga, Marco Pavone, Davide Scaramuzza, Markus Ryll

Our experiments, performed in simulation and the real world onboard a highly agile quadrotor platform, demonstrate the capabilities of the described system to run learned models with, previously infeasible, large modeling capacity using gradient-based online optimization MPC.

Model Predictive Control

244

Paper
Code

Multi-Bracket High Dynamic Range Imaging with Event Cameras

no code implementations • 13 Mar 2022 • Nico Messikommer, Stamatios Georgoulis, Daniel Gehrig, Stepan Tulyakov, Julius Erbach, Alfredo Bochicchio, Yuanyou Li, Davide Scaramuzza

Modern high dynamic range (HDR) imaging pipelines align and fuse multiple low dynamic range (LDR) images captured at different exposure times.

valid Vocal Bursts Intensity Prediction

Paper
Add Code

Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study

1 code implementation • 17 Feb 2022 • Giovanni Cioffi, Titus Cieslewski, Davide Scaramuzza

In the context of this work, we developed, and open source, a modular and efficient software architecture containing state-of-the-art algorithms to solve the SLAM problem in discrete and continuous time.

188

Paper
Code

Perception-Aware Perching on Powerlines with Multirotors

1 code implementation • 13 Feb 2022 • Julio L. Paneque, Jose Ramiro Martínez de Dios, Aníbal Ollero. Drew Hanover, Sihao Sun, Ángel Romero, Davide Scaramuzza

Multirotor aerial robots are becoming widely used for the inspection of powerlines.

Paper
Code

Visual Attention Prediction Improves Performance of Autonomous Drone Racing Agents

no code implementations • 7 Jan 2022 • Christian Pfeiffer, Simon Wengeler, Antonio Loquercio, Davide Scaramuzza

This work investigates whether neural networks capable of imitating human eye gaze behavior and attention can improve neural network performance for the challenging task of vision-based autonomous drone racing.

Imitation Learning

Paper
Add Code

Policy Search for Model Predictive Control with Application to Agile Drone Flight

no code implementations • 7 Dec 2021 • Yunlong Song, Davide Scaramuzza

In this work, we provide an answer by using policy search for automatically choosing high-level decision variables for MPC, which leads to a novel policy-search-for-model-predictive-control framework.

Model Predictive Control

Paper
Add Code

ESL: Event-based Structured Light

no code implementations • 30 Nov 2021 • Manasi Muglikar, Guillermo Gallego, Davide Scaramuzza

Event cameras are bio-inspired sensors providing significant advantages over standard cameras such as low latency, high temporal resolution, and high dynamic range.

3D Reconstruction

Paper
Add Code

Event Guided Depth Sensing

no code implementations • 20 Oct 2021 • Manasi Muglikar, Diederik Paul Moeys, Davide Scaramuzza

The depth estimation is achieved by an event-based structured light system consisting of a laser point projector coupled with a second event-based sensor tuned to detect the reflection of the laser from the scene.

Autonomous Driving Depth Estimation

Paper
Add Code

Learning High-Speed Flight in the Wild

1 code implementation • 11 Oct 2021 • Antonio Loquercio, Elia Kaufmann, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza

Indeed, the subtasks are executed sequentially, leading to increased processing latency and a compounding of errors through the pipeline.

Vocal Bursts Intensity Prediction

526

Paper
Code

Jointly Learning Identification and Control for Few-Shot Policy Adaptation

no code implementations • 29 Sep 2021 • Nina Wiedemann, Antonio Loquercio, Matthias Müller, Rene Ranftl, Davide Scaramuzza

We evaluate our approach on several complex systems and tasks, and experimentally analyze the advantages over model-free and model-based methods in terms of performance and sample efficiency.

Paper
Add Code

The Hilti SLAM Challenge Dataset

no code implementations • 23 Sep 2021 • Michael Helmberger, Kristian Morin, Beda Berner, Nitish Kumar, Giovanni Cioffi, Davide Scaramuzza

The results of the challenge, which are summarized in this paper, show that the proposed dataset is an important asset in the development of new SLAM algorithms that are ready to be deployed in the real-world.

Pose Estimation Sensor Fusion +1

Paper
Add Code

Bridging the Gap between Events and Frames through Unsupervised Domain Adaptation

1 code implementation • 6 Sep 2021 • Nico Messikommer, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

However, event-based vision has been held back by the shortage of labeled datasets due to the novelty of event cameras.

Event-based vision object-detection +2

Paper
Code

E-RAFT: Dense Optical Flow from Event Cameras

1 code implementation • 24 Aug 2021 • Mathias Gehrig, Mario Millhäusler, Daniel Gehrig, Davide Scaramuzza

Modern frame-based optical flow methods heavily rely on matching costs computed from feature correlation.

Feature Correlation Optical Flow Estimation

102

Paper
Code

Time-Optimal Planning for Quadrotor Waypoint Flight

1 code implementation • 10 Aug 2021 • Philipp Foehn, Angel Romero, Davide Scaramuzza

However, this time allocation is a priori unknown and renders previous works incapable of producing truly time-optimal trajectories.

Paper
Code

Event-driven Vision and Control for UAVs on a Neuromorphic Chip

no code implementations • 8 Aug 2021 • Antonio Vitale, Alpha Renner, Celine Nauer, Davide Scaramuzza, Yulia Sandamirskaya

Here, we explore how an event-based vision algorithm can be implemented as a spiking neuronal network on a neuromorphic chip and used in a drone controller.

Drone Controller Event-based vision

Paper
Add Code

Time Lens: Event-Based Video Frame Interpolation

1 code implementation • CVPR 2021 • Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza

However, while these approaches can capture non-linear motions they suffer from ghosting and perform poorly in low-texture regions with few events.

Optical Flow Estimation Video Frame Interpolation

589

Paper
Code

TimeLens: Event-based Video Frame Interpolation

1 code implementation • 14 Jun 2021 • Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza

State-of-the-art frame interpolation methods generate intermediate frames by inferring object motions in the image from consecutive key-frames.

Optical Flow Estimation Video Frame Interpolation

589

Paper
Code

How to Calibrate Your Event Camera

1 code implementation • 26 May 2021 • Manasi Muglikar, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

We propose a generic event camera calibration framework using image reconstruction.

Camera Calibration Image Reconstruction

142

Paper
Code

Autonomous Overtaking in Gran Turismo Sport Using Curriculum Reinforcement Learning

no code implementations • 26 Mar 2021 • Yunlong Song, HaoChih Lin, Elia Kaufmann, Peter Duerr, Davide Scaramuzza

Professional race-car drivers can execute extreme overtaking maneuvers.

Car Racing reinforcement-learning +1

Paper
Add Code

AutoTune: Controller Tuning for High-Speed Flight

1 code implementation • 19 Mar 2021 • Antonio Loquercio, Alessandro Saviolo, Davide Scaramuzza

To answer the first question, we study the relationship between parameters and performance and find out that the faster the maneuver, the more sensitive a controller becomes to its parameters.

Vocal Bursts Intensity Prediction

Paper
Code

Autonomous Drone Racing with Deep Reinforcement Learning

no code implementations • 15 Mar 2021 • Yunlong Song, Mats Steinweg, Elia Kaufmann, Davide Scaramuzza

In many robotic tasks, such as autonomous drone racing, the goal is to travel through a set of waypoints as fast as possible.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

DSEC: A Stereo Event Camera Dataset for Driving Scenarios

1 code implementation • 10 Mar 2021 • Mathias Gehrig, Willem Aarents, Daniel Gehrig, Davide Scaramuzza

To address these challenges, we propose, DSEC, a new dataset that contains such demanding illumination conditions and provides a rich set of sensory data.

Autonomous Driving

119

Paper
Code

Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction

1 code implementation • 18 Feb 2021 • Daniel Gehrig, Michelle Rüegg, Mathias Gehrig, Javier Hidalgo Carrio, Davide Scaramuzza

However, events only measure the varying component of the visual signal, which limits their ability to encode scene context.

Depth Prediction Monocular Depth Estimation

Paper
Code

Data-Driven MPC for Quadrotors

1 code implementation • 10 Feb 2021 • Guillem Torrente, Elia Kaufmann, Philipp Foehn, Davide Scaramuzza

Aerodynamic forces render accurate high-speed trajectory tracking with quadrotors extremely challenging.

Gaussian Processes Robotics

217

Paper
Code

Primal-Dual Mesh Convolutional Neural Networks

1 code implementation • NeurIPS 2020 • Francesco Milano, Antonio Loquercio, Antoni Rosinol, Davide Scaramuzza, Luca Carlone

Recent works in geometric deep learning have introduced neural networks that allow performing inference tasks on three-dimensional geometric data by defining convolution, and sometimes pooling, operations on triangle meshes.

Clustering

103

Paper
Code

Learning Monocular Dense Depth from Events

1 code implementation • 16 Oct 2020 • Javier Hidalgo-Carrió, Daniel Gehrig, Davide Scaramuzza

Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous events instead of intensity frames.

Depth Estimation Depth Prediction +1

103

Paper
Code

Unsupervised Feature Learning for Event Data: Direct vs Inverse Problem Formulation

no code implementations • 23 Sep 2020 • Dimche Kostadinov, Davide Scaramuzza

Due to the asynchronous nature, efficient learning of compact representation for event data is challenging.

Object Recognition Representation Learning

Paper
Add Code

Flightmare: A Flexible Quadrotor Simulator

3 code implementations • 1 Sep 2020 • Yunlong Song, Selim Naji, Elia Kaufmann, Antonio Loquercio, Davide Scaramuzza

State-of-the-art quadrotor simulators have a rigid and highly-specialized structure: either are they really fast, physically accurate, or photo-realistic.

reinforcement-learning Reinforcement Learning (RL) +1

1,083

Paper
Code

Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning

no code implementations • 18 Aug 2020 • Florian Fuchs, Yunlong Song, Elia Kaufmann, Davide Scaramuzza, Peter Duerr

Autonomous car racing is a major challenge in robotics.

Car Racing reinforcement-learning +1

Paper
Add Code

Fisher Information Field: an Efficient and Differentiable Map for Perception-aware Planning

1 code implementation • 7 Aug 2020 • Zichao Zhang, Davide Scaramuzza

However, computing the Fisher information from a set of sparse landmarks (i. e., a point cloud), which is the most common map for visual localization, is inefficient.

Robotics

179

Paper
Code

Learning High-Level Policies for Model Predictive Control

1 code implementation • 20 Jul 2020 • Yunlong Song, Davide Scaramuzza

In this work, we leverage probabilistic decision-making approaches and the generalization capability of artificial neural networks to the powerful online optimization by learning a deep high-level policy for the MPC (High-MPC).

Decision Making Model Predictive Control +2

375

Paper
Code

Deep Drone Acrobatics

1 code implementation • 10 Jun 2020 • Elia Kaufmann, Antonio Loquercio, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza

In this paper, we propose to learn a sensorimotor policy that enables an autonomous quadrotor to fly extreme acrobatic maneuvers with only onboard sensing and computation.

Robotics

200

Paper
Code

AlphaPilot: Autonomous Drone Racing

no code implementations • 26 May 2020 • Philipp Foehn, Dario Brescianini, Elia Kaufmann, Titus Cieslewski, Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

This paper presents a novel system for autonomous, vision-based drone racing combining learned data abstraction, nonlinear filtering, and time-optimal trajectory planning.

Navigate Trajectory Planning

Paper
Add Code

Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis

1 code implementation • 11 May 2020 • Zichao Zhang, Torsten Sattler, Davide Scaramuzza

Visual Localization is one of the key enabling technologies for autonomous driving and augmented reality.

Autonomous Driving Benchmarking +1

328

Paper
Code

Faster than FAST: GPU-Accelerated Frontend for High-Speed VIO

1 code implementation • 30 Mar 2020 • Balazs Nagy, Philipp Foehn, Davide Scaramuzza

While most steps of a VIO pipeline work on visual features, they rely on image data for detection and tracking, of which both steps are well suited for parallelization.

feature selection Vocal Bursts Intensity Prediction

420

Paper
Code

Event-based Asynchronous Sparse Convolutional Networks

1 code implementation • ECCV 2020 • Nico Messikommer, Daniel Gehrig, Antonio Loquercio, Davide Scaramuzza

However, these approaches discard the spatial and temporal sparsity inherent in event data at the cost of higher computational complexity and latency.

object-detection Object Detection +1

120

Paper
Code

Reducing the Sim-to-Real Gap for Event Cameras

1 code implementation • ECCV 2020 • Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, Robert Mahony

We present strategies for improving training data for event based CNNs that result in 20-40% boost in performance of existing state-of-the-art (SOTA) video reconstruction networks retrained with our method, and up to 15% for optic flow networks.

Ranked #3 on Video Reconstruction on MVSEC

Event-Based Video Reconstruction Video Reconstruction

Paper
Code

AirSim Drone Racing Lab

2 code implementations • 12 Mar 2020 • Ratnesh Madaan, Nicholas Gyde, Sai Vemprala, Matthew Brown, Keiko Nagami, Tim Taubner, Eric Cristofalo, Davide Scaramuzza, Mac Schwager, Ashish Kapoor

Autonomous drone racing is a challenging research problem at the intersection of computer vision, planning, state estimation, and control.

Benchmarking Optical Flow Estimation

340

Paper
Code

Event-Based Angular Velocity Regression with Spiking Networks

1 code implementation • 5 Mar 2020 • Mathias Gehrig, Sumit Bam Shrestha, Daniel Mouritzen, Davide Scaramuzza

Due to their spike-based computational model, SNNs can process output from event-based, asynchronous sensors without any pre-processing at extremely lower power unlike standard artificial neural networks.

regression

108

Paper
Code

Redesigning SLAM for Arbitrary Multi-Camera Systems

no code implementations • 4 Mar 2020 • Juichung Kuo, Manasi Muglikar, Zichao Zhang, Davide Scaramuzza

We adapt a state-of-the-art visual-inertial odometry with these modifications, and experimental results show that the modified pipeline can adapt to a wide range of camera setups (e. g., 2 to 6 cameras in one experiment) without the need of sensor-specific modifications or tuning.

Paper
Add Code

Voxel Map for Visual SLAM

no code implementations • 4 Mar 2020 • Manasi Muglikar, Zichao Zhang, Davide Scaramuzza

We propose a voxel-map representation to efficiently retrieve map points for visual SLAM.

Paper
Add Code

Learning Depth With Very Sparse Supervision

no code implementations • 2 Mar 2020 • Antonio Loquercio, Alexey Dosovitskiy, Davide Scaramuzza

Motivated by the astonishing capabilities of natural intelligent agents and inspired by theories from psychology, this paper explores the idea that perception gets coupled to 3D properties of the world via interaction with the environment.

Depth Estimation

Paper
Add Code

Augmenting Visual Place Recognition with Structural Cues

no code implementations • 29 Feb 2020 • Amadeus Oertel, Titus Cieslewski, Davide Scaramuzza

In this paper, we propose to augment image-based place recognition with structural cues.

Visual Place Recognition

Paper
Add Code

Registration made easy -- standalone orthopedic navigation with HoloLens

no code implementations • 17 Jan 2020 • Florentin Liebmann, Simon Roner, Marco von Atzigen, Florian Wanivenhaus, Caroline Neuhaus, José Spirig, Davide Scaramuzza, Reto Sutter, Jess Snedeker, Mazda Farshad, Philipp Fürnstahl

In surgical navigation, finding correspondence between preoperative plan and intraoperative anatomy, the so-called registration task, is imperative.

Anatomy

Paper
Add Code

Video to Events: Recycling Video Datasets for Event Cameras

1 code implementation • CVPR 2020 • Daniel Gehrig, Mathias Gehrig, Javier Hidalgo-Carrió, Davide Scaramuzza

Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous "events" instead of intensity frames.

Object Recognition Semantic Segmentation

286

Paper
Code

Towards Low-Latency High-Bandwidth Control of Quadrotors using Event Cameras

no code implementations • 11 Nov 2019 • Rika Sugimoto Dimitrova, Mathias Gehrig, Dario Brescianini, Davide Scaramuzza

In particular, this paper addresses the problem of one-dimensional attitude tracking using a dualcopter platform equipped with an event camera.

Robotics Systems and Control Systems and Control

Paper
Add Code

Global-Local Network for Learning Depth with Very Sparse Supervision

no code implementations • 25 Sep 2019 • Antonio Loquercio, Alexey Dosovitskiy, Davide Scaramuzza

Natural intelligent agents learn to perceive the three dimensional structure of the world without training on large datasets and are unlikely to have the precise equations of projective geometry hard-wired in the brain.

Depth Estimation Inductive Bias

Paper
Add Code

A General Framework for Uncertainty Estimation in Deep Learning

1 code implementation • 16 Jul 2019 • Antonio Loquercio, Mattia Segù, Davide Scaramuzza

Current approaches for uncertainty estimation of neural networks require changes to the network and optimization process, typically ignore prior knowledge about the data, and tend to make over-simplifying assumptions which underestimate uncertainty.

Autonomous Driving Bayesian Inference +1

217

Paper
Code

High Speed and High Dynamic Range Video with an Event Camera

1 code implementation • 15 Jun 2019 • Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

In this work we propose to learn to reconstruct intensity images from event streams directly from data instead of relying on any hand-crafted priors.

Ranked #3 on Event-based Object Segmentation on MVSEC-SEG

Event-based Object Segmentation Video Reconstruction +1

318

Paper
Code

EVDodgeNet: Deep Dynamic Obstacle Dodging with Event Cameras

2 code implementations • 7 Jun 2019 • Nitin J. Sanket, Chethan M. Parameshwara, Chahat Deep Singh, Ashwin V. Kuruttukulam, Cornelia Fermüller, Davide Scaramuzza, Yiannis Aloimonos

To our knowledge, this is the first deep learning -- based solution to the problem of dynamic obstacle avoidance using event cameras on a quadrotor.

Motion Estimation

Paper
Code

Visual-Inertial Odometry of Aerial Robots

no code implementations • 7 Jun 2019 • Davide Scaramuzza, Zichao Zhang

Visual-Inertial odometry (VIO) is the process of estimating the state (pose and velocity) of an agent (e. g., an aerial robot) by using only the input of one or more cameras plus one or more Inertial Measurement Units (IMUs) attached to it.

Robotics

Paper
Add Code

CED: Color Event Camera Dataset

no code implementations • 24 Apr 2019 • Cedric Scheerlinck, Henri Rebecq, Timo Stoffregen, Nick Barnes, Robert Mahony, Davide Scaramuzza

Event cameras are novel, bio-inspired visual sensors, whose pixels output asynchronous and independent timestamped spikes at local intensity changes, called 'events'.

Event-based vision Image Reconstruction

Paper
Add Code

Event-based Vision: A Survey

1 code implementation • 17 Apr 2019 • Guillermo Gallego, Tobi Delbruck, Garrick Orchard, Chiara Bartolozzi, Brian Taba, Andrea Censi, Stefan Leutenegger, Andrew Davison, Joerg Conradt, Kostas Daniilidis, Davide Scaramuzza

Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (in the order of microseconds), very high dynamic range (140 dB vs. 60 dB), low power consumption, and high pixel bandwidth (on the order of kHz) resulting in reduced motion blur.

Event-based vision

2,642

Paper
Code

End-to-End Learning of Representations for Asynchronous Event-Based Data

1 code implementation • ICCV 2019 • Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza

Event cameras are vision sensors that record asynchronous streams of per-pixel brightness changes, referred to as "events".

Ranked #15 on Robust classification on N-ImageNet

Classification Object Recognition +2

130

Paper
Code

Events-to-Video: Bringing Modern Computer Vision to Event Cameras

no code implementations • CVPR 2019 • Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

Since the output of event cameras is fundamentally different from conventional cameras, it is commonly accepted that they require the development of specialized algorithms to accommodate the particular nature of events.

Paper
Add Code

Focus Is All You Need: Loss Functions For Event-based Vision

1 code implementation • CVPR 2019 • Guillermo Gallego, Mathias Gehrig, Davide Scaramuzza

The proposed loss functions allow bringing mature computer vision tools to the realm of event cameras.

Event-based vision Motion Compensation +1

Paper
Code

Event-Based Motion Segmentation by Motion Compensation

1 code implementation • ICCV 2019 • Timo Stoffregen, Guillermo Gallego, Tom Drummond, Lindsay Kleeman, Davide Scaramuzza

In contrast to traditional cameras, whose pixels have a common exposure time, event-based cameras are novel bio-inspired sensors whose pixels work independently and asynchronously output intensity changes (called "events"), with microsecond resolution.

Event Segmentation Motion Compensation +2

Paper
Code

Unsupervised Moving Object Detection via Contextual Information Separation

1 code implementation • CVPR 2019 • Yanchao Yang, Antonio Loquercio, Davide Scaramuzza, Stefano Soatto

We propose an adversarial contextual model for detecting moving objects in images.

Moving Object Detection Object +3

259

Paper
Code

Matching Features without Descriptors: Implicitly Matched Interest Points

1 code implementation • 26 Nov 2018 • Titus Cieslewski, Michael Bloesch, Davide Scaramuzza

The extraction and matching of interest points is a prerequisite for many geometric computer vision problems.

Pose Estimation valid

Paper
Code

Asynchronous, Photometric Feature Tracking using Events and Frames

1 code implementation • ECCV 2018 • Daniel Gehrig, Henri Rebecq, Guillermo Gallego, Davide Scaramuzza

By contrast, standard cameras provide intensity measurements (frames) that do not depend on motion direction.

123

Paper
Code

Semi-Dense 3D Reconstruction with a Stereo Event Camera

2 code implementations • ECCV 2018 • Yi Zhou, Guillermo Gallego, Henri Rebecq, Laurent Kneip, Hongdong Li, Davide Scaramuzza

Event cameras are bio-inspired sensors that offer several advantages, such as low latency, high-speed and high dynamic range, to tackle challenging scenarios in computer vision.

3D Reconstruction Simultaneous Localization and Mapping

412

Paper
Code

A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones

3 code implementations • 4 May 2018 • Daniele Palossi, Antonio Loquercio, Francesco Conti, Eric Flamand, Davide Scaramuzza, Luca Benini

As part of our general methodology we discuss the software mapping techniques that enable the state-of-the-art deep convolutional neural network presented in [1] to be fully executed on-board within a strict 6 fps real-time constraint with no compromise in terms of flight results, while all processing is done with only 64 mW on average.

Autonomous Navigation Visual Navigation

473

Paper
Code

SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning

1 code implementation • 3 May 2018 • Titus Cieslewski, Konstantinos G. Derpanis, Davide Scaramuzza

In certain cases, our detector is able to obtain an equivalent amount of inliers with as little as 60% of the amount of points of other detectors.

Interest Point Detection Pose Estimation +1

Paper
Code

Event-based Vision meets Deep Learning on Steering Prediction for Self-driving Cars

no code implementations • CVPR 2018 • Ana I. Maqueda, Antonio Loquercio, Guillermo Gallego, Narciso Garcia, Davide Scaramuzza

Event cameras are bio-inspired vision sensors that naturally capture the dynamics of a scene, filtering out redundant information.

Ranked #12 on Robust classification on N-ImageNet

Classification Event-based vision +4

Paper
Add Code

A Unifying Contrast Maximization Framework for Event Cameras, with Applications to Motion, Depth, and Optical Flow Estimation

2 code implementations • CVPR 2018 • Guillermo Gallego, Henri Rebecq, Davide Scaramuzza

We present a unifying framework to solve several computer vision problems with event cameras: motion, depth and optical flow estimation.

Optical Flow Estimation

Paper
Code

A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing

1 code implementation • 8 Jan 2018 • Riccardo Spica, Davide Falanga, Eric Cristofalo, Eduardo Montijano, Davide Scaramuzza, Mac Schwager

To be successful in multi-player drone racing, a player must not only follow the race track in an optimal way, but also compete with other drones through strategic blocking, faking, and opportunistic passing while avoiding collisions.

Robotics

340

Paper
Code

Differential Flatness of Quadrotor Dynamics Subject to Rotor Drag for Accurate Tracking of High-Speed Trajectories

no code implementations • 6 Dec 2017 • Matthias Faessler, Antonio Franchi, Davide Scaramuzza

In this paper, we prove that the dynamical model of a quadrotor subject to linear rotor drag effects is differentially flat in its position and heading.

Robotics

Paper
Add Code

Fast, Autonomous Flight in GPS-Denied and Cluttered Environments

no code implementations • 6 Dec 2017 • Kartik Mohta, Michael Watterson, Yash Mulgaonkar, Sikang Liu, Chao Qu, Anurag Makineni, Kelsey Saulnier, Ke Sun, Alex Zhu, Jeffrey Delmerico, Konstantinos Karydis, Nikolay Atanasov, Giuseppe Loianno, Davide Scaramuzza, Kostas Daniilidis, Camillo Jose Taylor, Vijay Kumar

One of the most challenging tasks for a flying robot is to autonomously navigate between target locations quickly and reliably while avoiding obstacles in its path, and with little to no a-priori knowledge of the operating environment.

Robotics

Paper
Add Code

Data-Efficient Decentralized Visual SLAM

1 code implementation • 16 Oct 2017 • Titus Cieslewski, Siddharth Choudhary, Davide Scaramuzza

In this work, we integrate state-of-the-art decentralized SLAM components into a new, complete decentralized visual SLAM system.

Robotics

276

Paper
Code

Ultimate SLAM? Combining Events, Images, and IMU for Robust Visual SLAM in HDR and High Speed Scenarios

no code implementations • 19 Sep 2017 • Antoni Rosinol Vidal, Henri Rebecq, Timo Horstschaefer, Davide Scaramuzza

Event cameras are bio-inspired vision sensors that output pixel-level brightness changes instead of standard intensity frames.

Paper
Add Code

Learning-based Image Enhancement for Visual Odometry in Challenging HDR Environments

no code implementations • 5 Jul 2017 • Ruben Gomez-Ojeda, Zichao Zhang, Javier Gonzalez-Jimenez, Davide Scaramuzza

One of the main open challenges in visual odometry (VO) is the robustness to difficult illumination conditions or high dynamic range (HDR) environments.

Image Enhancement Visual Odometry

Paper
Add Code

Independent Motion Detection with Event-driven Cameras

no code implementations • 27 Jun 2017 • Valentina Vasco, Arren Glover, Elias Mueggler, Davide Scaramuzza, Lorenzo Natale, Chiara Bartolozzi

In this paper, we propose a method for segmenting the motion of an independently moving object for event-driven cameras.

Motion Detection Visual Tracking

Paper
Add Code

Efficient Decentralized Visual Place Recognition From Full-Image Descriptors

1 code implementation • 30 May 2017 • Titus Cieslewski, Davide Scaramuzza

As we show, casting this to a key-value lookup problem can be achieved with k-means clustering, and results in a much simpler system than [1].

Clustering Visual Place Recognition

276

Paper
Code

PL-SLAM: a Stereo SLAM System through the Combination of Points and Line Segments

3 code implementations • 26 May 2017 • Ruben Gomez-Ojeda, David Zuñiga-Noël, Francisco-Angel Moreno, Davide Scaramuzza, Javier Gonzalez-Jimenez

This paper proposes PL-SLAM, a stereo visual SLAM system that combines both points and line segments to work robustly in a wider variety of scenarios, particularly in those where point features are scarce or not well-distributed in the image.

C++ code Descriptive +1

721

Paper
Code

Continuous-Time Visual-Inertial Odometry for Event Cameras

no code implementations • 23 Feb 2017 • Elias Mueggler, Guillermo Gallego, Henri Rebecq, Davide Scaramuzza

Recent work has shown that a continuous-time representation of the event camera pose can deal with the high temporal resolution and asynchronous nature of this sensor in a principled way.

Visual Odometry

Paper
Add Code

The Event-Camera Dataset and Simulator: Event-based Data for Pose Estimation, Visual Odometry, and SLAM

2 code implementations • 26 Oct 2016 • Elias Mueggler, Henri Rebecq, Guillermo Gallego, Tobi Delbruck, Davide Scaramuzza

New vision sensors, such as the Dynamic and Active-pixel Vision sensor (DAVIS), incorporate a conventional global-shutter camera and an event-based sensor in the same pixel array.

Motion Estimation Pose Estimation +1

Paper
Code

Event-based, 6-DOF Camera Tracking from Photometric Depth Maps

1 code implementation • 12 Jul 2016 • Guillermo Gallego, Jon E. A. Lund, Elias Mueggler, Henri Rebecq, Tobi Delbruck, Davide Scaramuzza

Event cameras are bio-inspired vision sensors that output pixel-level brightness changes instead of standard intensity frames.

2,642

Paper
Code

Past, Present, and Future of Simultaneous Localization And Mapping: Towards the Robust-Perception Age

2 code implementations • 19 Jun 2016 • Cesar Cadena, Luca Carlone, Henry Carrillo, Yasir Latif, Davide Scaramuzza, Jose Neira, Ian Reid, John J. Leonard

Simultaneous Localization and Mapping (SLAM)consists in the concurrent construction of a model of the environment (the map), and the estimation of the state of the robot moving within it.

Robotics

Paper
Code

On-Manifold Preintegration for Real-Time Visual-Inertial Odometry

2 code implementations • 8 Dec 2015 • Christian Forster, Luca Carlone, Frank Dellaert, Davide Scaramuzza

However, real-time optimization quickly becomes infeasible as the trajectory grows over time, this problem is further emphasized by the fact that inertial measurements come at high rate, hence leading to fast growth of the number of variables in the optimization.

Robotics

1,439

Paper
Code

Event-based Camera Pose Tracking using a Generative Event Model

2 code implementations • 7 Oct 2015 • Guillermo Gallego, Christian Forster, Elias Mueggler, Davide Scaramuzza

Event-based vision sensors mimic the operation of biological retina and they represent a major paradigm shift from traditional cameras.

Camera Localization Event-based vision +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.