Search Results for author: Shaojie Shen

Found 51 papers, 26 papers with code

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

1 code implementation Under review for Transaction 2024 Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, Hao Chen, Kaixuan Wang, Gang Yu, Chunhua Shen, Shaojie Shen

Our method benefits various applications including in-the-wild metrology monocular-SLAM, and 3D reconstruction, which highlight the versatility of Metric3D v2 models as geometric foundation models.

 Ranked #1 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

3D Reconstruction Monocular Depth Estimation +3

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

no code implementations18 Mar 2024 Xiao Fu, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long

We introduce GeoWizard, a new generative foundation model designed for estimating geometric attributes, e. g., depth and normals, from single images.

3D Reconstruction

MASSTAR: A Multi-Modal and Large-Scale Scene Dataset with a Versatile Toolchain for Surface Prediction and Completion

no code implementations18 Mar 2024 Guiyong Zheng, Jinqi Jiang, Chen Feng, Shaojie Shen, Boyu Zhou

To bridge this research gap, we propose MASSTAR: a Multi-modal lArge-scale Scene dataset with a verSatile Toolchain for surfAce pRediction and completion.

Descriptive

G3Reg: Pyramid Graph-based Global Registration using Gaussian Ellipsoid Model

2 code implementations22 Aug 2023 Zhijian Qiao, Zehuan Yu, Binqian Jiang, Huan Yin, Shaojie Shen

Utilizing these GEMs, we present a distrust-and-verify scheme based on a Pyramid Compatibility Graph for Global Registration (PAGOR).

Pyramid Semantic Graph-based Global Point Cloud Registration with Low Overlap

2 code implementations22 Jul 2023 Zhijian Qiao, Zehuan Yu, Huan Yin, Shaojie Shen

In this paper, we propose a graph-theoretic framework to address the problem of global point cloud registration with low overlap.

Point Cloud Registration Pose Estimation

Towards View-invariant and Accurate Loop Detection Based on Scene Graph

no code implementations24 May 2023 Chuhao Liu, Shaojie Shen

The current semantic-aided loop detection embeds the topology between semantic instances to search a loop.

Descriptive Simultaneous Localization and Mapping

Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning

no code implementations7 Mar 2023 Pengqin Wang, Meixin Zhu, Shaojie Shen

It models the probability distribution of the environment dynamics and reward function to capture aleatoric uncertainty and treats epistemic uncertainty as a learnable noise parameter.

Continuous Control Offline RL +4

Are All Point Clouds Suitable for Completion? Weakly Supervised Quality Evaluation Network for Point Cloud Completion

no code implementations3 Mar 2023 Jieqi Shi, Peiliang Li, Xiaozhi Chen, Shaojie Shen

In this paper, we propose a quality evaluation network to score the point clouds and help judge the quality of the point cloud before applying the completion model.

Autonomous Driving Point Cloud Completion

Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation

1 code implementation13 Feb 2023 Binqian Jiang, Shaojie Shen

This paper proposes \textit{Contour Context}, a simple, effective, and efficient topological loop closure detection pipeline with accurate 3-DoF metric pose estimation, targeting the urban utonomous driving scenario.

Loop Closure Detection Pose Estimation +1

The Devil is in the Wrongly-classified Samples: Towards Unified Open-set Recognition

1 code implementation8 Feb 2023 Jun Cen, Di Luan, Shiwei Zhang, Yixuan Pei, Yingya Zhang, Deli Zhao, Shaojie Shen, Qifeng Chen

Recently, Unified Open-set Recognition (UOSR) has been proposed to reject not only unknown samples but also known but wrongly classified samples, which tends to be more practical in real-world applications.

Open Set Learning

You Only Label Once: 3D Box Adaptation from Point Cloud to Image via Semi-Supervised Learning

no code implementations17 Nov 2022 Jieqi Shi, Peiliang Li, Xiaozhi Chen, Shaojie Shen

The image-based 3D object detection task expects that the predicted 3D bounding box has a ``tightness'' projection (also referred to as cuboid), which fits the object contour well on the image while still keeping the geometric attribute on the 3D space, e. g., physical dimension, pairwise orthogonal, etc.

3D Object Detection Attribute +1

Temporal Point Cloud Completion with Pose Disturbance

no code implementations7 Feb 2022 Jieqi Shi, Lingyun Xu, Peiliang Li, Xiaozhi Chen, Shaojie Shen

With the help of gated recovery units(GRU) and attention mechanisms as temporal units, we propose a point cloud completion framework that accepts a sequence of unaligned and sparse inputs, and outputs consistent and aligned point clouds.

Point Cloud Completion

Graph-Guided Deformation for Point Cloud Completion

no code implementations11 Nov 2021 Jieqi Shi, Lingyun Xu, Liang Heng, Shaojie Shen

In this paper, we propose a Graph-Guided Deformation Network, which respectively regards the input data and intermediate generation as controlling and supporting points, and models the optimization guided by a graph convolutional network(GCN) for the point cloud completion task.

Autonomous Driving Point Cloud Completion

Event-based Motion Segmentation by Cascaded Two-Level Multi-Model Fitting

no code implementations5 Nov 2021 Xiuyuan Lu, Yi Zhou, Shaojie Shen

In this paper, we present a cascaded two-level multi-model fitting method for identifying independently moving objects (i. e., the motion segmentation problem) with a monocular event camera.

Clustering Motion Segmentation +1

Trajectory Prediction with Graph-based Dual-scale Context Fusion

1 code implementation2 Nov 2021 Lu Zhang, Peiliang Li, Jing Chen, Shaojie Shen

In this paper, we present a graph-based trajectory prediction network named the Dual Scale Predictor (DSP), which encodes both the static and dynamical driving context in a hierarchical manner.

Motion Forecasting motion prediction +1

Event-based Motion Segmentation with Spatio-Temporal Graph Cuts

1 code implementation16 Dec 2020 Yi Zhou, Guillermo Gallego, Xiuyuan Lu, SiQi Liu, Shaojie Shen

We develop a method to identify independently moving objects acquired with an event-based camera, i. e., to solve the event-based motion segmentation problem.

Motion Segmentation Scene Understanding

Tracking from Patterns: Learning Corresponding Patterns in Point Clouds for 3D Object Tracking

no code implementations20 Oct 2020 Jieqi Shi, Peiliang Li, Shaojie Shen

A robust 3D object tracker which continuously tracks surrounding objects and estimates their trajectories is key for self-driving vehicles.

3D Object Tracking Motion Estimation +2

An Augmented Reality Interaction Interface for Autonomous Drone

no code implementations5 Aug 2020 Chuhao Liu, Shaojie Shen

In this work, we build an AR interface that displays the reconstructed 3D map from the drone on physical surfaces in front of the operator.

Human-Computer Interaction Robotics

Event-based Stereo Visual Odometry

2 code implementations30 Jul 2020 Yi Zhou, Guillermo Gallego, Shaojie Shen

We present a solution to the problem of visual odometry from the data acquired by a stereo event-based camera rig.

3D Reconstruction Pose Estimation +1

RAPTOR: Robust and Perception-aware Trajectory Replanning for Quadrotor Fast Flight

1 code implementation6 Jul 2020 Boyu Zhou, Jie Pan, Fei Gao, Shaojie Shen

In this paper, we present RAPTOR, a robust and perception-aware replanning framework to support fast and safe flight.

Robotics

FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications

no code implementations5 Jun 2020 Jieru Zhao, Tingyuan Liang, Liang Feng, Wenchao Ding, Sharad Sinha, Wei zhang, Shaojie Shen

To reduce the design effort and achieve the right balance, we propose FP-Stereo for building high-performance stereo matching pipelines on FPGAs automatically.

C++ code Depth Estimation +1

Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

no code implementations CVPR 2020 Peiliang Li, Jieqi Shi, Shaojie Shen

Directly learning multiple 3D objects motion from sequential images is difficult, while the geometric bundle adjustment lacks the ability to localize the invisible object centroid.

3D Object Tracking Benchmarking +2

PiP: Planning-informed Trajectory Prediction for Autonomous Driving

1 code implementation ECCV 2020 Haoran Song, Wenchao Ding, Yuxuan Chen, Shaojie Shen, Michael Yu Wang, Qifeng Chen

Moreover, our approach enables a novel pipeline which couples the prediction and planning, by conditioning PiP on multiple candidate trajectories of the ego vehicle, which is highly beneficial for autonomous driving in interactive scenarios.

Autonomous Driving Future prediction +1

Efficient Uncertainty-aware Decision-making for Automated Driving Using Guided Branching

2 code implementations5 Mar 2020 Lu Zhang, Wenchao Ding, Jing Chen, Shaojie Shen

Decision-making in dense traffic scenarios is challenging for automated vehicles (AVs) due to potentially stochastic behaviors of other traffic participants and perception uncertainties (e. g., tracking noise and prediction errors, etc.).

Robotics

Robust Real-time UAV Replanning Using Guided Gradient-based Optimization and Topological Paths

2 code implementations29 Dec 2019 Boyu Zhou, Fei Gao, Jie Pan, Shaojie Shen

A path-guided optimization (PGO) approach is devised to tackle infeasible local minima, which improves the replanning success rate significantly.

Robotics

Flow-Motion and Depth Network for Monocular Stereo and Beyond

1 code implementation12 Sep 2019 Kaixuan Wang, Shaojie Shen

Third, beyond two-view depth estimation, we further extend the above networks to fuse depth information from multiple target images and estimate the depth map of the source image.

Depth Estimation Optical Flow Estimation

Multi-Sensor 3D Object Box Refinement for Autonomous Driving

no code implementations11 Sep 2019 Peiliang Li, Si-Qi Liu, Shaojie Shen

We propose a 3D object detection system with multi-sensor refinement in the context of autonomous driving.

3D Object Detection Autonomous Driving +2

Real-time Scalable Dense Surfel Mapping

1 code implementation10 Sep 2019 Kaixuan Wang, Fei Gao, Shaojie Shen

First, superpixels extracted from both intensity and depth images are used to model surfels in the system.

3D Point Cloud Reconstruction Superpixels

Safe Trajectory Generation for Complex Urban Environments Using Spatio-temporal Semantic Corridor

2 code implementations24 Jun 2019 Wenchao Ding, Lu Zhang, Jing Chen, Shaojie Shen

Planning safe trajectories for autonomous vehicles in complex urban environments is challenging since there are numerous semantic elements (such as dynamic agents, traffic lights and speed limits) to consider.

Autonomous Vehicles Benchmarking

An Efficient B-spline-Based Kinodynamic Replanning Framework for Quadrotors

no code implementations24 Jun 2019 Wenchao Ding, Wenliang Gao, Kaixuan Wang, Shaojie Shen

Our framework starts with an efficient B-spline-based kinodynamic (EBK) search algorithm which finds a feasible trajectory with minimum control effort and time.

Motion Planning

Learning Whole-Image Descriptors for Real-time Loop Detection andKidnap Recovery under Large Viewpoint Difference

3 code implementations15 Apr 2019 Manohar Kuse, Shaojie Shen

Finally, we present the fully functional system with relative computation andhandling of multiple world co-ordinate system which is able to reduce odometry drift, recover fromcomplicated kidnap scenarios and random odometry failures.

Robotics

Probabilistic Dense Reconstruction from a Moving Camera

no code implementations26 Mar 2019 Yonggen Ling, Kaixuan Wang, Shaojie Shen

This paper presents a probabilistic approach for online dense reconstruction using a single monocular camera moving through the environment.

Robotics

Flying through a narrow gap using neural network: an end-to-end planning and control approach

1 code implementation21 Mar 2019 Jiarong Lin, Luqi Wang, Fei Gao, Shaojie Shen, Fu Zhang

To this end, we propose an end-to-end policy network, which imitates from the traditional pipeline and is fine-tuned using reinforcement learning.

Robotics

FIESTA: Fast Incremental Euclidean Distance Fields for Online Motion Planning of Aerial Robots

2 code implementations6 Mar 2019 Luxin Han, Fei Gao, Boyu Zhou, Shaojie Shen

We integrate FIESTA into a completed quadrotor system and validate it by both simulation and onboard experiments.

Robotics

Online Vehicle Trajectory Prediction using Policy Anticipation Network and Optimization-based Context Reasoning

no code implementations3 Mar 2019 Wenchao Ding, Shaojie Shen

In this paper, we present an online two-level vehicle trajectory prediction framework for urban autonomous driving where there are complex contextual factors, such as lane geometries, road constructions, traffic regulations and moving agents.

Autonomous Driving Trajectory Prediction

Predicting Vehicle Behaviors Over An Extended Horizon Using Behavior Interaction Network

no code implementations3 Mar 2019 Wenchao Ding, Jing Chen, Shaojie Shen

In this paper, we uncover that clues to vehicle behaviors over an extended horizon can be found in vehicle interaction, which makes it possible to anticipate the likelihood of a certain behavior, even in the absence of any clear maneuver pattern.

Autonomous Vehicles

A General Optimization-based Framework for Global Pose Estimation with Multiple Sensors

4 code implementations11 Jan 2019 Tong Qin, Shaozu Cao, Jie Pan, Shaojie Shen

We highlight that our system is a general framework, which can easily fuse various global sensors in a unified pose graph optimization.

Pose Estimation Sensor Fusion

A General Optimization-based Framework for Local Odometry Estimation with Multiple Sensors

4 code implementations11 Jan 2019 Tong Qin, Jie Pan, Shaozu Cao, Shaojie Shen

We validate the performance of our system on public datasets and through real-world experiments with multiple sensors.

Visual Odometry

Estimating Metric Poses of Dynamic Objects Using Monocular Visual-Inertial Fusion

no code implementations21 Aug 2018 Kejie Qiu, Tong Qin, Hongwen Xie, Shaojie Shen

By introducing an additional constraint in the time domain, our monocular visual-inertial tracking system can obtain continuous six degree of freedom (6-DoF) pose estimation without scale ambiguity.

3D Object Tracking Object +2

Online Temporal Calibration for Monocular Visual-Inertial Systems

1 code implementation2 Aug 2018 Tong Qin, Shaojie Shen

Visual and inertial fusion is a popular technology for 6-DOF state estimation in recent years.

Autonomous Driving Robot Navigation +2

MVDepthNet: Real-time Multiview Depth Estimation Neural Network

1 code implementation23 Jul 2018 Kaixuan Wang, Shaojie Shen

In this paper, we present MVDepthNet, a convolutional network to solve the depth estimation problem given several image-pose pairs from a localized monocular camera in neighbor viewpoints.

Data Augmentation Depth Estimation

Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous Driving

no code implementations ECCV 2018 Peiliang Li, Tong Qin, Shaojie Shen

We propose a stereo vision-based approach for tracking the camera ego-motion and 3D semantic objects in dynamic autonomous driving scenarios.

Autonomous Driving Motion Estimation +3

Relocalization, Global Optimization and Map Merging for Monocular Visual-Inertial SLAM

1 code implementation5 Mar 2018 Tong Qin, Perliang Li, Shaojie Shen

In this paper, we propose a monocular visual-inertial SLAM system, which can relocalize camera and get the absolute pose in a previous-built map.

Pose Estimation

Learning Unmanned Aerial Vehicle Control for Autonomous Target Following

no code implementations24 Sep 2017 Siyi Li, Tianbo Liu, Chi Zhang, Dit-yan Yeung, Shaojie Shen

While deep reinforcement learning (RL) methods have achieved unprecedented successes in a range of challenging problems, their applicability has been mainly limited to simulation or game domains due to the high sample complexity of the trial-and-error learning process.

reinforcement-learning Reinforcement Learning (RL)

VINS-Mono: A Robust and Versatile Monocular Visual-Inertial State Estimator

11 code implementations13 Aug 2017 Tong Qin, Peiliang Li, Shaojie Shen

A monocular visual-inertial system (VINS), consisting of a camera and a low-cost inertial measurement unit (IMU), forms the minimum sensor suite for metric six degrees-of-freedom (DOF) state estimation.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.