Search Results for author: Davide Scaramuzza

Found 119 papers, 66 papers with code

Event Cameras Meet SPADs for High-Speed, Low-Bandwidth Imaging

no code implementations17 Apr 2024 Manasi Muglikar, Siddharth Somasundaram, Akshat Dave, Edoardo Charbon, Ramesh Raskar, Davide Scaramuzza

Traditional cameras face a trade-off between low-light performance and high-speed imaging: longer exposure times to capture sufficient light results in motion blur, whereas shorter exposures result in Poisson-corrupted noisy images.

Image Reconstruction Sensor Fusion

3D scene generation from scene graphs and self-attention

no code implementations2 Apr 2024 Pietro Bonazzi, Mengqi Wang, Diego Martin Arroyo, Fabian Manhardt, Nico Messikomer, Federico Tombari, Davide Scaramuzza

Synthesizing realistic and diverse indoor 3D scene layouts in a controllable fashion opens up applications in simulated navigation and virtual reality.

Scene Generation

An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

no code implementations1 Apr 2024 Ling Gao, Daniel Gehrig, Hang Su, Davide Scaramuzza, Laurent Kneip

To recover the full linear camera velocity we fuse observations from multiple lines with a novel velocity averaging scheme that relies on a geometrically-motivated residual, and thus solves the problem more efficiently than previous schemes which minimize an algebraic residual.

Motion Estimation

Mitigating Motion Blur in Neural Radiance Fields with Events and Frames

1 code implementation28 Mar 2024 Marco Cannici, Davide Scaramuzza

Neural Radiance Fields (NeRFs) have shown great potential in novel view synthesis.

Novel View Synthesis

Learning Quadruped Locomotion Using Differentiable Simulation

no code implementations21 Mar 2024 Yunlong Song, Sangbae Kim, Davide Scaramuzza

This work provides several important insights into using differentiable simulations for legged locomotion in the real world.

Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

no code implementations18 Mar 2024 Jiaxu Xing, Angel Romero, Leonard Bauersfeld, Davide Scaramuzza

Our experiments in both simulated and real-world environments demonstrate that our approach achieves superior performance and robustness than IL or RL alone in navigating a quadrotor through a racing course using only visual information without explicit state estimation.

Imitation Learning reinforcement-learning +1

State Space Models for Event Cameras

1 code implementation23 Feb 2024 Nikola Zubić, Mathias Gehrig, Davide Scaramuzza

We address this challenge by introducing state-space models (SSMs) with learnable timescale parameters to event-based vision.

Event-based vision

Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight

no code implementations18 Sep 2023 Jiaxu Xing, Leonard Bauersfeld, Yunlong Song, Chunwei Xing, Davide Scaramuzza

The utility of a robot greatly depends on its ability to perform a task in the real world, outside of a well-controlled lab environment.

Contrastive Learning Representation Learning

End-to-end Learned Visual Odometry with Events and Frames

no code implementations18 Sep 2023 Roberto Pellerito, Marco Cannici, Daniel Gehrig, Joris Belhadj, Olivier Dubois-Matra, Massimo Casasco, Davide Scaramuzza

Visual Odometry (VO) is crucial for autonomous robotic navigation, especially in GPS-denied environments like planetary terrains.

Visual Odometry

Contrastive Initial State Buffer for Reinforcement Learning

1 code implementation18 Sep 2023 Nico Messikommer, Yunlong Song, Davide Scaramuzza

In Reinforcement Learning, the trade-off between exploration and exploitation poses a complex challenge for achieving efficient learning from limited samples.

reinforcement-learning

Seeing Behind Dynamic Occlusions with Event Cameras

no code implementations28 Jul 2023 Rong Zou, Manasi Muglikar, Nico Messikommer, Davide Scaramuzza

We present the first large-scale dataset consisting of synchronized images and event sequences to evaluate our approach.

Image Inpainting

Revisiting Token Pruning for Object Detection and Instance Segmentation

1 code implementation12 Jun 2023 Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza

In relation to the dense counterpart that utilizes all tokens, our method realizes an increase in inference speed, achieving up to 34% faster performance for the entire network and 46% for the backbone.

Image Classification Instance Segmentation +4

From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection

1 code implementation ICCV 2023 Nikola Zubić, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

However, selecting the appropriate representation for the task traditionally requires training a neural network for each representation and selecting the best one based on the validation score, which is very time-consuming.

Event-based vision object-detection +3

A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception

1 code implementation24 Mar 2023 Asude Aydin, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

Our hybrid ANN-SNN model thus combines the best of both worlds: It does not suffer from long state transients and state decay thanks to the ANN, and can generate predictions with high temporal resolution, low latency, and low power thanks to the SNN.

3D Human Pose Estimation

COVERED, CollabOratiVE Robot Environment Dataset for 3D Semantic segmentation

1 code implementation24 Feb 2023 Charith Munasinghe, Fatemeh Mohammadi Amin, Davide Scaramuzza, Hans Wernher van de Venn

Despite the importance of semantic understanding for such applications, 3D semantic segmentation of collaborative robot workspaces lacks sufficient research and dedicated datasets.

3D Semantic Segmentation Real-Time Semantic Segmentation

Improving safety in physical human-robot collaboration via deep metric learning

no code implementations23 Feb 2023 Maryam Rezayati, Grammatiki Zanni, Ying Zaoshi, Davide Scaramuzza, Hans Wernher van de Venn

This work uses the Deep Metric Learning (DML) approach to distinguish between non-contact robot movement, intentional contact aimed at physical human-robot interaction, and collision situations.

Metric Learning

Event-based Shape from Polarization

no code implementations CVPR 2023 Manasi Muglikar, Leonard Bauersfeld, Diederik Paul Moeys, Davide Scaramuzza

Our method uses the continuous event stream caused by the rotation to reconstruct relative intensities at multiple polarizer angles.

Recurrent Vision Transformers for Object Detection with Event Cameras

1 code implementation CVPR 2023 Mathias Gehrig, Davide Scaramuzza

By revisiting the high-level design of recurrent vision backbones, we reduce inference time by a factor of 6 while retaining similar performance.

Event-based vision object-detection +1

SLAM for Visually Impaired People: a Survey

no code implementations9 Dec 2022 Marziyeh Bamdad, Davide Scaramuzza, Alireza Darvishy

In recent decades, several assistive technologies have been developed to improve the ability of blind and visually impaired individuals to navigate independently and safely.

Navigate Simultaneous Localization and Mapping

Data-driven Feature Tracking for Event Cameras

1 code implementation CVPR 2023 Nico Messikommer, Carter Fang, Mathias Gehrig, Davide Scaramuzza

Because of their high temporal resolution, increased resilience to motion blur, and very sparse output, event cameras have been shown to be ideal for low-latency and low-bandwidth feature tracking, even in challenging scenarios.

Pushing the Limits of Asynchronous Graph-based Object Detection with Event Cameras

no code implementations22 Nov 2022 Daniel Gehrig, Davide Scaramuzza

A recent line of work tackles this issue by modeling events as spatiotemporally evolving graphs that can be efficiently and asynchronously processed using graph neural networks.

object-detection Object Detection

Cracking Double-Blind Review: Authorship Attribution with Deep Learning

1 code implementation14 Nov 2022 Leonard Bauersfeld, Angel Romero, Manasi Muglikar, Davide Scaramuzza

In this work, we present a transformer-based, neural-network architecture that only uses the text content and the author names in the bibliography to attribute an anonymous manuscript to an author.

Attribute Authorship Attribution

Learning Deep Sensorimotor Policies for Vision-based Autonomous Drone Racing

no code implementations26 Oct 2022 Jiawei Fu, Yunlong Song, Yan Wu, Fisher Yu, Davide Scaramuzza

The resulting policy directly infers control commands with feature representations learned from raw images, forgoing the need for globally-consistent state estimation, trajectory planning, and handcrafted control design.

Contrastive Learning Trajectory Planning

Learning Perception-Aware Agile Flight in Cluttered Environments

no code implementations4 Oct 2022 Yunlong Song, Kexin Shi, Robert Penicka, Davide Scaramuzza

Recently, neural control policies have outperformed existing model-based planning-and-control methods for autonomously navigating quadrotors through cluttered environments in minimum time.

Imitation Learning Reinforcement Learning (RL)

Training Efficient Controllers via Analytic Policy Gradient

1 code implementation26 Sep 2022 Nina Wiedemann, Valentin Wüest, Antonio Loquercio, Matthias Müller, Dario Floreano, Davide Scaramuzza

Conversely, learning-based offline optimization approaches, such as Reinforcement Learning (RL), allow fast and efficient execution on the robot but hardly match the accuracy of MPC in trajectory tracking tasks.

Model Predictive Control Reinforcement Learning (RL)

Data-Efficient Collaborative Decentralized Thermal-Inertial Odometry

1 code implementation14 Sep 2022 Vincenzo Polizzi, Robert Hewitt, Javier Hidalgo-Carrió, Jeff Delaune, Davide Scaramuzza

Our system back-end uses a covariance-intersection fusion strategy to neglect the cross-correlation between agents so as to lower memory usage and computational cost.

E-NeRF: Neural Radiance Fields from a Moving Event Camera

1 code implementation24 Aug 2022 Simon Klenk, Lukas Koestler, Davide Scaramuzza, Daniel Cremers

We also show that combining events and frames can overcome failure cases of NeRF estimation in scenarios where only a few input views are available without requiring additional regularization.

Event-aided Direct Sparse Odometry

no code implementations CVPR 2022 Javier Hidalgo-Carrió, Guillermo Gallego, Davide Scaramuzza

This opens the door to low-power motion-tracking applications where frames are sparingly triggered "on demand" and our method tracks the motion in between.

Monocular Visual Odometry

Exploring Event Camera-based Odometry for Planetary Robots

no code implementations12 Apr 2022 Florian Mahlknecht, Daniel Gehrig, Jeremy Nash, Friedrich M. Rockenbauer, Benjamin Morrell, Jeff Delaune, Davide Scaramuzza

Due to their resilience to motion blur and high robustness in low-light and high dynamic range conditions, event cameras are poised to become enabling sensors for vision-based exploration on future Mars helicopter missions.

AEGNN: Asynchronous Event-based Graph Neural Networks

no code implementations CVPR 2022 Simon Schaefer, Daniel Gehrig, Davide Scaramuzza

For this reason, recent works have adopted Graph Neural Networks (GNNs), which process events as ``static" spatio-temporal graphs, which are inherently "sparse".

Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion

no code implementations CVPR 2022 Stepan Tulyakov, Alfredo Bochicchio, Daniel Gehrig, Stamatios Georgoulis, Yuanyou Li, Davide Scaramuzza

Recently, video frame interpolation using a combination of frame- and event-based cameras has surpassed traditional image-based methods both in terms of performance and memory efficiency.

Motion Estimation Video Frame Interpolation

Are High-Resolution Event Cameras Really Needed?

no code implementations28 Mar 2022 Daniel Gehrig, Davide Scaramuzza

We provide both empirical and theoretical evidence for this claim, which indicates that high-resolution event cameras exhibit higher per-pixel event rates, leading to higher temporal noise in low-illumination conditions and at high speeds.

Image Reconstruction Optical Flow Estimation +2

Dense Continuous-Time Optical Flow from Events and Frames

1 code implementation25 Mar 2022 Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

To the best of our knowledge, our model is the first method that can regress dense pixel trajectories from event data.

Optical Flow Estimation

ESS: Learning Event-based Semantic Segmentation from Still Images

1 code implementation18 Mar 2022 Zhaoning Sun, Nico Messikommer, Daniel Gehrig, Davide Scaramuzza

Nonetheless, semantic segmentation with event cameras is still in its infancy which is chiefly due to the lack of high-quality, labeled datasets.

Event-based Object Segmentation Segmentation +2

Real-time Neural-MPC: Deep Learning Model Predictive Control for Quadrotors and Agile Robotic Platforms

2 code implementations15 Mar 2022 Tim Salzmann, Elia Kaufmann, Jon Arrizabalaga, Marco Pavone, Davide Scaramuzza, Markus Ryll

Our experiments, performed in simulation and the real world onboard a highly agile quadrotor platform, demonstrate the capabilities of the described system to run learned models with, previously infeasible, large modeling capacity using gradient-based online optimization MPC.

Model Predictive Control

Multi-Bracket High Dynamic Range Imaging with Event Cameras

no code implementations13 Mar 2022 Nico Messikommer, Stamatios Georgoulis, Daniel Gehrig, Stepan Tulyakov, Julius Erbach, Alfredo Bochicchio, Yuanyou Li, Davide Scaramuzza

Modern high dynamic range (HDR) imaging pipelines align and fuse multiple low dynamic range (LDR) images captured at different exposure times.

valid Vocal Bursts Intensity Prediction

Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study

1 code implementation17 Feb 2022 Giovanni Cioffi, Titus Cieslewski, Davide Scaramuzza

In the context of this work, we developed, and open source, a modular and efficient software architecture containing state-of-the-art algorithms to solve the SLAM problem in discrete and continuous time.

Visual Attention Prediction Improves Performance of Autonomous Drone Racing Agents

no code implementations7 Jan 2022 Christian Pfeiffer, Simon Wengeler, Antonio Loquercio, Davide Scaramuzza

This work investigates whether neural networks capable of imitating human eye gaze behavior and attention can improve neural network performance for the challenging task of vision-based autonomous drone racing.

Imitation Learning

Policy Search for Model Predictive Control with Application to Agile Drone Flight

no code implementations7 Dec 2021 Yunlong Song, Davide Scaramuzza

In this work, we provide an answer by using policy search for automatically choosing high-level decision variables for MPC, which leads to a novel policy-search-for-model-predictive-control framework.

Model Predictive Control

ESL: Event-based Structured Light

no code implementations30 Nov 2021 Manasi Muglikar, Guillermo Gallego, Davide Scaramuzza

Event cameras are bio-inspired sensors providing significant advantages over standard cameras such as low latency, high temporal resolution, and high dynamic range.

3D Reconstruction

Event Guided Depth Sensing

no code implementations20 Oct 2021 Manasi Muglikar, Diederik Paul Moeys, Davide Scaramuzza

The depth estimation is achieved by an event-based structured light system consisting of a laser point projector coupled with a second event-based sensor tuned to detect the reflection of the laser from the scene.

Autonomous Driving Depth Estimation

Learning High-Speed Flight in the Wild

1 code implementation11 Oct 2021 Antonio Loquercio, Elia Kaufmann, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza

Indeed, the subtasks are executed sequentially, leading to increased processing latency and a compounding of errors through the pipeline.

Vocal Bursts Intensity Prediction

Jointly Learning Identification and Control for Few-Shot Policy Adaptation

no code implementations29 Sep 2021 Nina Wiedemann, Antonio Loquercio, Matthias Müller, Rene Ranftl, Davide Scaramuzza

We evaluate our approach on several complex systems and tasks, and experimentally analyze the advantages over model-free and model-based methods in terms of performance and sample efficiency.

The Hilti SLAM Challenge Dataset

no code implementations23 Sep 2021 Michael Helmberger, Kristian Morin, Beda Berner, Nitish Kumar, Giovanni Cioffi, Davide Scaramuzza

The results of the challenge, which are summarized in this paper, show that the proposed dataset is an important asset in the development of new SLAM algorithms that are ready to be deployed in the real-world.

Pose Estimation Sensor Fusion +1

Bridging the Gap between Events and Frames through Unsupervised Domain Adaptation

1 code implementation6 Sep 2021 Nico Messikommer, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

However, event-based vision has been held back by the shortage of labeled datasets due to the novelty of event cameras.

Event-based vision object-detection +2

E-RAFT: Dense Optical Flow from Event Cameras

1 code implementation24 Aug 2021 Mathias Gehrig, Mario Millhäusler, Daniel Gehrig, Davide Scaramuzza

Modern frame-based optical flow methods heavily rely on matching costs computed from feature correlation.

Feature Correlation Optical Flow Estimation

Time-Optimal Planning for Quadrotor Waypoint Flight

1 code implementation10 Aug 2021 Philipp Foehn, Angel Romero, Davide Scaramuzza

However, this time allocation is a priori unknown and renders previous works incapable of producing truly time-optimal trajectories.

Event-driven Vision and Control for UAVs on a Neuromorphic Chip

no code implementations8 Aug 2021 Antonio Vitale, Alpha Renner, Celine Nauer, Davide Scaramuzza, Yulia Sandamirskaya

Here, we explore how an event-based vision algorithm can be implemented as a spiking neuronal network on a neuromorphic chip and used in a drone controller.

Drone Controller Event-based vision

Time Lens: Event-Based Video Frame Interpolation

1 code implementation CVPR 2021 Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza

However, while these approaches can capture non-linear motions they suffer from ghosting and perform poorly in low-texture regions with few events.

Optical Flow Estimation Video Frame Interpolation

TimeLens: Event-based Video Frame Interpolation

1 code implementation14 Jun 2021 Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza

State-of-the-art frame interpolation methods generate intermediate frames by inferring object motions in the image from consecutive key-frames.

Optical Flow Estimation Video Frame Interpolation

AutoTune: Controller Tuning for High-Speed Flight

1 code implementation19 Mar 2021 Antonio Loquercio, Alessandro Saviolo, Davide Scaramuzza

To answer the first question, we study the relationship between parameters and performance and find out that the faster the maneuver, the more sensitive a controller becomes to its parameters.

Vocal Bursts Intensity Prediction

Autonomous Drone Racing with Deep Reinforcement Learning

no code implementations15 Mar 2021 Yunlong Song, Mats Steinweg, Elia Kaufmann, Davide Scaramuzza

In many robotic tasks, such as autonomous drone racing, the goal is to travel through a set of waypoints as fast as possible.

reinforcement-learning Reinforcement Learning (RL)

DSEC: A Stereo Event Camera Dataset for Driving Scenarios

1 code implementation10 Mar 2021 Mathias Gehrig, Willem Aarents, Daniel Gehrig, Davide Scaramuzza

To address these challenges, we propose, DSEC, a new dataset that contains such demanding illumination conditions and provides a rich set of sensory data.

Autonomous Driving

Data-Driven MPC for Quadrotors

1 code implementation10 Feb 2021 Guillem Torrente, Elia Kaufmann, Philipp Foehn, Davide Scaramuzza

Aerodynamic forces render accurate high-speed trajectory tracking with quadrotors extremely challenging.

Gaussian Processes Robotics

Primal-Dual Mesh Convolutional Neural Networks

1 code implementation NeurIPS 2020 Francesco Milano, Antonio Loquercio, Antoni Rosinol, Davide Scaramuzza, Luca Carlone

Recent works in geometric deep learning have introduced neural networks that allow performing inference tasks on three-dimensional geometric data by defining convolution, and sometimes pooling, operations on triangle meshes.

Clustering

Learning Monocular Dense Depth from Events

1 code implementation16 Oct 2020 Javier Hidalgo-Carrió, Daniel Gehrig, Davide Scaramuzza

Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous events instead of intensity frames.

Depth Estimation Depth Prediction +1

Flightmare: A Flexible Quadrotor Simulator

3 code implementations1 Sep 2020 Yunlong Song, Selim Naji, Elia Kaufmann, Antonio Loquercio, Davide Scaramuzza

State-of-the-art quadrotor simulators have a rigid and highly-specialized structure: either are they really fast, physically accurate, or photo-realistic.

reinforcement-learning Reinforcement Learning (RL) +1

Fisher Information Field: an Efficient and Differentiable Map for Perception-aware Planning

1 code implementation7 Aug 2020 Zichao Zhang, Davide Scaramuzza

However, computing the Fisher information from a set of sparse landmarks (i. e., a point cloud), which is the most common map for visual localization, is inefficient.

Robotics

Learning High-Level Policies for Model Predictive Control

1 code implementation20 Jul 2020 Yunlong Song, Davide Scaramuzza

In this work, we leverage probabilistic decision-making approaches and the generalization capability of artificial neural networks to the powerful online optimization by learning a deep high-level policy for the MPC (High-MPC).

Decision Making Model Predictive Control +2

Deep Drone Acrobatics

1 code implementation10 Jun 2020 Elia Kaufmann, Antonio Loquercio, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza

In this paper, we propose to learn a sensorimotor policy that enables an autonomous quadrotor to fly extreme acrobatic maneuvers with only onboard sensing and computation.

Robotics

AlphaPilot: Autonomous Drone Racing

no code implementations26 May 2020 Philipp Foehn, Dario Brescianini, Elia Kaufmann, Titus Cieslewski, Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

This paper presents a novel system for autonomous, vision-based drone racing combining learned data abstraction, nonlinear filtering, and time-optimal trajectory planning.

Navigate Trajectory Planning

Faster than FAST: GPU-Accelerated Frontend for High-Speed VIO

1 code implementation30 Mar 2020 Balazs Nagy, Philipp Foehn, Davide Scaramuzza

While most steps of a VIO pipeline work on visual features, they rely on image data for detection and tracking, of which both steps are well suited for parallelization.

feature selection Vocal Bursts Intensity Prediction

Event-based Asynchronous Sparse Convolutional Networks

1 code implementation ECCV 2020 Nico Messikommer, Daniel Gehrig, Antonio Loquercio, Davide Scaramuzza

However, these approaches discard the spatial and temporal sparsity inherent in event data at the cost of higher computational complexity and latency.

object-detection Object Detection +1

Reducing the Sim-to-Real Gap for Event Cameras

1 code implementation ECCV 2020 Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, Robert Mahony

We present strategies for improving training data for event based CNNs that result in 20-40% boost in performance of existing state-of-the-art (SOTA) video reconstruction networks retrained with our method, and up to 15% for optic flow networks.

Event-Based Video Reconstruction Video Reconstruction

AirSim Drone Racing Lab

2 code implementations12 Mar 2020 Ratnesh Madaan, Nicholas Gyde, Sai Vemprala, Matthew Brown, Keiko Nagami, Tim Taubner, Eric Cristofalo, Davide Scaramuzza, Mac Schwager, Ashish Kapoor

Autonomous drone racing is a challenging research problem at the intersection of computer vision, planning, state estimation, and control.

Benchmarking Optical Flow Estimation

Event-Based Angular Velocity Regression with Spiking Networks

1 code implementation5 Mar 2020 Mathias Gehrig, Sumit Bam Shrestha, Daniel Mouritzen, Davide Scaramuzza

Due to their spike-based computational model, SNNs can process output from event-based, asynchronous sensors without any pre-processing at extremely lower power unlike standard artificial neural networks.

regression

Redesigning SLAM for Arbitrary Multi-Camera Systems

no code implementations4 Mar 2020 Juichung Kuo, Manasi Muglikar, Zichao Zhang, Davide Scaramuzza

We adapt a state-of-the-art visual-inertial odometry with these modifications, and experimental results show that the modified pipeline can adapt to a wide range of camera setups (e. g., 2 to 6 cameras in one experiment) without the need of sensor-specific modifications or tuning.

Voxel Map for Visual SLAM

no code implementations4 Mar 2020 Manasi Muglikar, Zichao Zhang, Davide Scaramuzza

We propose a voxel-map representation to efficiently retrieve map points for visual SLAM.

Learning Depth With Very Sparse Supervision

no code implementations2 Mar 2020 Antonio Loquercio, Alexey Dosovitskiy, Davide Scaramuzza

Motivated by the astonishing capabilities of natural intelligent agents and inspired by theories from psychology, this paper explores the idea that perception gets coupled to 3D properties of the world via interaction with the environment.

Depth Estimation

Augmenting Visual Place Recognition with Structural Cues

no code implementations29 Feb 2020 Amadeus Oertel, Titus Cieslewski, Davide Scaramuzza

In this paper, we propose to augment image-based place recognition with structural cues.

Visual Place Recognition

Registration made easy -- standalone orthopedic navigation with HoloLens

no code implementations17 Jan 2020 Florentin Liebmann, Simon Roner, Marco von Atzigen, Florian Wanivenhaus, Caroline Neuhaus, José Spirig, Davide Scaramuzza, Reto Sutter, Jess Snedeker, Mazda Farshad, Philipp Fürnstahl

In surgical navigation, finding correspondence between preoperative plan and intraoperative anatomy, the so-called registration task, is imperative.

Anatomy

Video to Events: Recycling Video Datasets for Event Cameras

1 code implementation CVPR 2020 Daniel Gehrig, Mathias Gehrig, Javier Hidalgo-Carrió, Davide Scaramuzza

Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous "events" instead of intensity frames.

Object Recognition Semantic Segmentation

Towards Low-Latency High-Bandwidth Control of Quadrotors using Event Cameras

no code implementations11 Nov 2019 Rika Sugimoto Dimitrova, Mathias Gehrig, Dario Brescianini, Davide Scaramuzza

In particular, this paper addresses the problem of one-dimensional attitude tracking using a dualcopter platform equipped with an event camera.

Robotics Systems and Control Systems and Control

Global-Local Network for Learning Depth with Very Sparse Supervision

no code implementations25 Sep 2019 Antonio Loquercio, Alexey Dosovitskiy, Davide Scaramuzza

Natural intelligent agents learn to perceive the three dimensional structure of the world without training on large datasets and are unlikely to have the precise equations of projective geometry hard-wired in the brain.

Depth Estimation Inductive Bias

A General Framework for Uncertainty Estimation in Deep Learning

1 code implementation16 Jul 2019 Antonio Loquercio, Mattia Segù, Davide Scaramuzza

Current approaches for uncertainty estimation of neural networks require changes to the network and optimization process, typically ignore prior knowledge about the data, and tend to make over-simplifying assumptions which underestimate uncertainty.

Autonomous Driving Bayesian Inference +1

High Speed and High Dynamic Range Video with an Event Camera

1 code implementation15 Jun 2019 Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

In this work we propose to learn to reconstruct intensity images from event streams directly from data instead of relying on any hand-crafted priors.

Event-based Object Segmentation Video Reconstruction +1

EVDodgeNet: Deep Dynamic Obstacle Dodging with Event Cameras

2 code implementations7 Jun 2019 Nitin J. Sanket, Chethan M. Parameshwara, Chahat Deep Singh, Ashwin V. Kuruttukulam, Cornelia Fermüller, Davide Scaramuzza, Yiannis Aloimonos

To our knowledge, this is the first deep learning -- based solution to the problem of dynamic obstacle avoidance using event cameras on a quadrotor.

Motion Estimation

Visual-Inertial Odometry of Aerial Robots

no code implementations7 Jun 2019 Davide Scaramuzza, Zichao Zhang

Visual-Inertial odometry (VIO) is the process of estimating the state (pose and velocity) of an agent (e. g., an aerial robot) by using only the input of one or more cameras plus one or more Inertial Measurement Units (IMUs) attached to it.

Robotics

CED: Color Event Camera Dataset

no code implementations24 Apr 2019 Cedric Scheerlinck, Henri Rebecq, Timo Stoffregen, Nick Barnes, Robert Mahony, Davide Scaramuzza

Event cameras are novel, bio-inspired visual sensors, whose pixels output asynchronous and independent timestamped spikes at local intensity changes, called 'events'.

Event-based vision Image Reconstruction

Event-based Vision: A Survey

1 code implementation17 Apr 2019 Guillermo Gallego, Tobi Delbruck, Garrick Orchard, Chiara Bartolozzi, Brian Taba, Andrea Censi, Stefan Leutenegger, Andrew Davison, Joerg Conradt, Kostas Daniilidis, Davide Scaramuzza

Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (in the order of microseconds), very high dynamic range (140 dB vs. 60 dB), low power consumption, and high pixel bandwidth (on the order of kHz) resulting in reduced motion blur.

Event-based vision

Events-to-Video: Bringing Modern Computer Vision to Event Cameras

no code implementations CVPR 2019 Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

Since the output of event cameras is fundamentally different from conventional cameras, it is commonly accepted that they require the development of specialized algorithms to accommodate the particular nature of events.

Event-Based Motion Segmentation by Motion Compensation

1 code implementation ICCV 2019 Timo Stoffregen, Guillermo Gallego, Tom Drummond, Lindsay Kleeman, Davide Scaramuzza

In contrast to traditional cameras, whose pixels have a common exposure time, event-based cameras are novel bio-inspired sensors whose pixels work independently and asynchronously output intensity changes (called "events"), with microsecond resolution.

Event Segmentation Motion Compensation +2

Matching Features without Descriptors: Implicitly Matched Interest Points

1 code implementation26 Nov 2018 Titus Cieslewski, Michael Bloesch, Davide Scaramuzza

The extraction and matching of interest points is a prerequisite for many geometric computer vision problems.

Pose Estimation valid

Asynchronous, Photometric Feature Tracking using Events and Frames

1 code implementation ECCV 2018 Daniel Gehrig, Henri Rebecq, Guillermo Gallego, Davide Scaramuzza

By contrast, standard cameras provide intensity measurements (frames) that do not depend on motion direction.

Semi-Dense 3D Reconstruction with a Stereo Event Camera

2 code implementations ECCV 2018 Yi Zhou, Guillermo Gallego, Henri Rebecq, Laurent Kneip, Hongdong Li, Davide Scaramuzza

Event cameras are bio-inspired sensors that offer several advantages, such as low latency, high-speed and high dynamic range, to tackle challenging scenarios in computer vision.

3D Reconstruction Simultaneous Localization and Mapping

A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones

3 code implementations4 May 2018 Daniele Palossi, Antonio Loquercio, Francesco Conti, Eric Flamand, Davide Scaramuzza, Luca Benini

As part of our general methodology we discuss the software mapping techniques that enable the state-of-the-art deep convolutional neural network presented in [1] to be fully executed on-board within a strict 6 fps real-time constraint with no compromise in terms of flight results, while all processing is done with only 64 mW on average.

Autonomous Navigation Visual Navigation

SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning

1 code implementation3 May 2018 Titus Cieslewski, Konstantinos G. Derpanis, Davide Scaramuzza

In certain cases, our detector is able to obtain an equivalent amount of inliers with as little as 60% of the amount of points of other detectors.

Interest Point Detection Pose Estimation +1

A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing

1 code implementation8 Jan 2018 Riccardo Spica, Davide Falanga, Eric Cristofalo, Eduardo Montijano, Davide Scaramuzza, Mac Schwager

To be successful in multi-player drone racing, a player must not only follow the race track in an optimal way, but also compete with other drones through strategic blocking, faking, and opportunistic passing while avoiding collisions.

Robotics

Differential Flatness of Quadrotor Dynamics Subject to Rotor Drag for Accurate Tracking of High-Speed Trajectories

no code implementations6 Dec 2017 Matthias Faessler, Antonio Franchi, Davide Scaramuzza

In this paper, we prove that the dynamical model of a quadrotor subject to linear rotor drag effects is differentially flat in its position and heading.

Robotics

Fast, Autonomous Flight in GPS-Denied and Cluttered Environments

no code implementations6 Dec 2017 Kartik Mohta, Michael Watterson, Yash Mulgaonkar, Sikang Liu, Chao Qu, Anurag Makineni, Kelsey Saulnier, Ke Sun, Alex Zhu, Jeffrey Delmerico, Konstantinos Karydis, Nikolay Atanasov, Giuseppe Loianno, Davide Scaramuzza, Kostas Daniilidis, Camillo Jose Taylor, Vijay Kumar

One of the most challenging tasks for a flying robot is to autonomously navigate between target locations quickly and reliably while avoiding obstacles in its path, and with little to no a-priori knowledge of the operating environment.

Robotics

Data-Efficient Decentralized Visual SLAM

1 code implementation16 Oct 2017 Titus Cieslewski, Siddharth Choudhary, Davide Scaramuzza

In this work, we integrate state-of-the-art decentralized SLAM components into a new, complete decentralized visual SLAM system.

Robotics

Ultimate SLAM? Combining Events, Images, and IMU for Robust Visual SLAM in HDR and High Speed Scenarios

no code implementations19 Sep 2017 Antoni Rosinol Vidal, Henri Rebecq, Timo Horstschaefer, Davide Scaramuzza

Event cameras are bio-inspired vision sensors that output pixel-level brightness changes instead of standard intensity frames.

Learning-based Image Enhancement for Visual Odometry in Challenging HDR Environments

no code implementations5 Jul 2017 Ruben Gomez-Ojeda, Zichao Zhang, Javier Gonzalez-Jimenez, Davide Scaramuzza

One of the main open challenges in visual odometry (VO) is the robustness to difficult illumination conditions or high dynamic range (HDR) environments.

Image Enhancement Visual Odometry

Independent Motion Detection with Event-driven Cameras

no code implementations27 Jun 2017 Valentina Vasco, Arren Glover, Elias Mueggler, Davide Scaramuzza, Lorenzo Natale, Chiara Bartolozzi

In this paper, we propose a method for segmenting the motion of an independently moving object for event-driven cameras.

Motion Detection Visual Tracking

Efficient Decentralized Visual Place Recognition From Full-Image Descriptors

1 code implementation30 May 2017 Titus Cieslewski, Davide Scaramuzza

As we show, casting this to a key-value lookup problem can be achieved with k-means clustering, and results in a much simpler system than [1].

Clustering Visual Place Recognition

PL-SLAM: a Stereo SLAM System through the Combination of Points and Line Segments

3 code implementations26 May 2017 Ruben Gomez-Ojeda, David Zuñiga-Noël, Francisco-Angel Moreno, Davide Scaramuzza, Javier Gonzalez-Jimenez

This paper proposes PL-SLAM, a stereo visual SLAM system that combines both points and line segments to work robustly in a wider variety of scenarios, particularly in those where point features are scarce or not well-distributed in the image.

C++ code Descriptive +1

Continuous-Time Visual-Inertial Odometry for Event Cameras

no code implementations23 Feb 2017 Elias Mueggler, Guillermo Gallego, Henri Rebecq, Davide Scaramuzza

Recent work has shown that a continuous-time representation of the event camera pose can deal with the high temporal resolution and asynchronous nature of this sensor in a principled way.

Visual Odometry

The Event-Camera Dataset and Simulator: Event-based Data for Pose Estimation, Visual Odometry, and SLAM

2 code implementations26 Oct 2016 Elias Mueggler, Henri Rebecq, Guillermo Gallego, Tobi Delbruck, Davide Scaramuzza

New vision sensors, such as the Dynamic and Active-pixel Vision sensor (DAVIS), incorporate a conventional global-shutter camera and an event-based sensor in the same pixel array.

Motion Estimation Pose Estimation +1

Event-based, 6-DOF Camera Tracking from Photometric Depth Maps

1 code implementation12 Jul 2016 Guillermo Gallego, Jon E. A. Lund, Elias Mueggler, Henri Rebecq, Tobi Delbruck, Davide Scaramuzza

Event cameras are bio-inspired vision sensors that output pixel-level brightness changes instead of standard intensity frames.

Past, Present, and Future of Simultaneous Localization And Mapping: Towards the Robust-Perception Age

2 code implementations19 Jun 2016 Cesar Cadena, Luca Carlone, Henry Carrillo, Yasir Latif, Davide Scaramuzza, Jose Neira, Ian Reid, John J. Leonard

Simultaneous Localization and Mapping (SLAM)consists in the concurrent construction of a model of the environment (the map), and the estimation of the state of the robot moving within it.

Robotics

On-Manifold Preintegration for Real-Time Visual-Inertial Odometry

2 code implementations8 Dec 2015 Christian Forster, Luca Carlone, Frank Dellaert, Davide Scaramuzza

However, real-time optimization quickly becomes infeasible as the trajectory grows over time, this problem is further emphasized by the fact that inertial measurements come at high rate, hence leading to fast growth of the number of variables in the optimization.

Robotics

Event-based Camera Pose Tracking using a Generative Event Model

2 code implementations7 Oct 2015 Guillermo Gallego, Christian Forster, Elias Mueggler, Davide Scaramuzza

Event-based vision sensors mimic the operation of biological retina and they represent a major paradigm shift from traditional cameras.

Camera Localization Event-based vision +1

Cannot find the paper you are looking for? You can Submit a new open access paper.