Search Results for author: Weihao Yuan

Found 22 papers, 7 papers with code

An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models

no code implementations22 Mar 2024 Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, QiXing Huang

In particular, the third and fourth stages are iterated, with the cuts obtained in the fourth stage encouraging non-rigid alignment in the third stage to focus on regions close to the cuts.

OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation

no code implementations19 Mar 2024 Junhao Cai, Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, Qifeng Chen

Derived from OmniObject3D, OO3D-9D is the largest and most diverse dataset in the field of category-level object pose and size estimation.

Object

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

no code implementations18 Mar 2024 Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang

Images from video generative models are more suitable for multi-view generation because the underlying network architecture that generates them employs a temporal module to enforce frame consistency.

Denoising

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D

no code implementations28 Nov 2023 Lingteng Qiu, GuanYing Chen, Xiaodong Gu, Qi Zuo, Mutian Xu, Yushuang Wu, Weihao Yuan, Zilong Dong, Liefeng Bo, Xiaoguang Han

Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric prior and the complex entanglement of materials and lighting in natural images.

3D Generation Text to 3D

3D Former: Monocular Scene Reconstruction with 3D SDF Transformers

1 code implementation31 Jan 2023 Weihao Yuan, Xiaodong Gu, Heng Li, Zilong Dong, Siyu Zhu

In this work, we propose an SDF transformer network, which replaces the role of 3D CNN for better 3D feature aggregation.

Dense RGB SLAM with Neural Implicit Maps

no code implementations21 Jan 2023 Heng Li, Xiaodong Gu, Weihao Yuan, Luwei Yang, Zilong Dong, Ping Tan

To reach this challenging goal without depth input, we introduce a hierarchical feature volume to facilitate the implicit map decoder.

Simultaneous Localization and Mapping

${S}^{2}$Net: Accurate Panorama Depth Estimation on Spherical Surface

no code implementations14 Jan 2023 Meng Li, Senbo Wang, Weihao Yuan, Weichao Shen, Zhe Sheng, Zilong Dong

In this paper, we propose an end-to-end deep network for monocular panorama depth estimation on a unit spherical surface.

Monocular Depth Estimation

RCP: Recurrent Closest Point for Scene Flow Estimation on 3D Point Clouds

no code implementations23 May 2022 Xiaodong Gu, Chengzhou Tang, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Ping Tan

In the experiments, we evaluate the proposed method on both the 3D scene flow estimation and the point cloud registration task.

Motion Estimation Point Cloud Registration +1

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation

1 code implementation CVPR 2022 Weihao Yuan, Xiaodong Gu, Zuozhuo Dai, Siyu Zhu, Ping Tan

While recent works design increasingly complicated and powerful networks to directly regress the depth map, we take the path of CRFs optimization.

Depth Prediction Monocular Depth Estimation

Neural Window Fully-Connected CRFs for Monocular Depth Estimation

no code implementations CVPR 2022 Weihao Yuan, Xiaodong Gu, Zuozhuo Dai, Siyu Zhu, Ping Tan

Estimating the accurate depth from a single image is challenging since it is inherently ambiguous and ill-posed.

Monocular Depth Estimation

RCP: Recurrent Closest Point for Point Cloud

1 code implementation CVPR 2022 Xiaodong Gu, Chengzhou Tang, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Ping Tan

In the experiments, we evaluate the proposed method on both the 3D scene flow estimation and the point cloud registration task.

Motion Estimation Point Cloud Registration +1

MFuseNet: Robust Depth Estimation with Learned Multiscopic Fusion

no code implementations5 Aug 2021 Weihao Yuan, Rui Fan, Michael Yu Wang, Qifeng Chen

We design a multiscopic vision system that utilizes a low-cost monocular RGB camera to acquire accurate depth estimation.

Depth Estimation Stereo Matching

DRO: Deep Recurrent Optimizer for Video to Depth

1 code implementation24 Mar 2021 Xiaodong Gu, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Chengzhou Tang, Zilong Dong, Ping Tan

There are increasing interests of studying the video-to-depth (V2D) problem with machine learning techniques.

Self-supervised Object Tracking with Cycle-consistent Siamese Networks

1 code implementation3 Aug 2020 Weihao Yuan, Michael Yu Wang, Qifeng Chen

Self-supervised learning for visual object tracking possesses valuable advantages compared to supervised learning, such as the non-necessity of laborious human annotations and online training.

Object Region Proposal +5

Active Perception with A Monocular Camera for Multiscopic Vision

1 code implementation22 Jan 2020 Weihao Yuan, Rui Fan, Michael Yu Wang, Qifeng Chen

We design a multiscopic vision system that utilizes a low-cost monocular RGB camera to acquire accurate depth estimation for robotic applications.

Depth Estimation Stereo Matching

Towards Learning to Detect and Predict Contact Events on Vision-based Tactile Sensors

no code implementations9 Oct 2019 Yazhan Zhang, Weihao Yuan, Zicheng Kan, Michael Yu Wang

In essence, successful grasp boils down to correct responses to multiple contact events between fingertips and objects.

Contact Detection Robotic Grasping

Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning

no code implementations15 Mar 2018 Weihao Yuan, Johannes A. Stork, Danica Kragic, Michael Y. Wang, Kaiyu Hang

Usually, this is achieved by precisely modeling physical properties of the objects, robot, and the environment for explicit planning.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.