Search Results for author: Long Quan

Found 39 papers, 19 papers with code

ConTex-Human: Free-View Rendering of Human from a Single Image with Texture-Consistent Synthesis

no code implementations 28 Nov 2023 Xiangjun Gao, Xiaoyu Li, Chaopeng Zhang, Qi Zhang, YanPei Cao, Ying Shan, Long Quan

In this work, we propose a method to address the challenge of rendering a 3D human from a single image in a free-view manner.

Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion

no code implementations 27 Nov 2023 Yuanxun Lu, Jingyang Zhang, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Xun Cao, Yao Yao

The multi-view 2.5D diffusion directly models the structural distribution of 3D data, while still maintaining the strong generalization ability of the original 2D diffusion model, filling the gap between 2D diffusion-based and direct 3D diffusion-based methods for 3D content generation.

Image Generation Text to 3D

HiFi-123: Towards High-fidelity One Image to 3D Content Generation

no code implementations 10 Oct 2023 Wangbo Yu, Li Yuan, Yan-Pei Cao, Xiangjun Gao, Xiaoyu Li, WenBo Hu, Long Quan, Ying Shan, Yonghong Tian

Our contributions are twofold: First, we propose a Reference-Guided Novel View Enhancement (RGNV) technique that significantly improves the fidelity of diffusion-based zero-shot novel view synthesis methods.

Image to 3D Novel View Synthesis

JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling

no code implementations 10 Oct 2023 Jingyang Zhang, Shiwei Li, Yuanxun Lu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Yao Yao

We introduce JointNet, a novel neural network architecture for modeling the joint distribution of images and an additional dense modality (e.g., depth maps).

Depth Estimation Depth Prediction +1

NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation

no code implementations ICCV 2023 Jingyang Zhang, Yao Yao, Shiwei Li, Jingbo Liu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a novel differentiable rendering framework for joint geometry, material, and lighting estimation from multi-view images.

Lighting Estimation

ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer

1 code implementation 30 Aug 2022 Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

Generating robust and reliable correspondences across images is a fundamental task for a diversity of applications.

Homography Estimation

Critical Regularizations for Neural Surface Reconstruction in the Wild

no code implementations CVPR 2022 Jingyang Zhang, Yao Yao, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

The first one is the Hessian regularization that smoothly diffuses the signed distance values to the entire distance field given noisy and incomplete input.

Surface Reconstruction

NeILF: Neural Incident Light Field for Physically-based Material Estimation

1 code implementation 14 Mar 2022 Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a differentiable rendering framework for material and lighting estimation from multi-view images and a reconstructed geometry.

Lighting Estimation

Learning Signed Distance Field for Multi-view Surface Reconstruction

1 code implementation ICCV 2021 Jingyang Zhang, Yao Yao, Long Quan

In this work, we introduce a novel neural surface reconstruction framework that leverages the knowledge of stereo matching and feature consistency to optimize the implicit surface representation.

Stereo Matching Surface Reconstruction

Learning to Match Features with Seeded Graph Matching Network

1 code implementation ICCV 2021 Hongkai Chen, Zixin Luo, Jiahui Zhang, Lei Zhou, Xuyang Bai, Zeyu Hu, Chiew-Lan Tai, Long Quan

2) Seeded Graph Neural Network, which utilizes seed matches to pass messages within/across images and predicts assignment costs.

Graph Matching

Learning Stereo Matchability in Disparity Regression Networks

1 code implementation 11 Aug 2020 Jingyang Zhang, Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

Finally, a matchability-aware disparity refinement is introduced to improve the depth inference in weakly matchable regions.

regression Stereo Disparity Estimation +1

Stochastic Bundle Adjustment for Efficient and Scalable 3D Reconstruction

1 code implementation ECCV 2020 Lei Zhou, Zixin Luo, Mingmin Zhen, Tianwei Shen, Shiwei Li, Zhuofei Huang, Tian Fang, Long Quan

In this work, we propose a stochastic bundle adjustment algorithm which seeks to decompose the RCS approximately inside the LM iterations to improve the efficiency and scalability.

3D Reconstruction Clustering

Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency

1 code implementation ECCV 2020 Jiaxiang Shang, Tianwei Shen, Shiwei Li, Lei Zhou, Mingmin Zhen, Tian Fang, Long Quan

Recent learning-based approaches, in which models are trained by single-view images, have shown promising results for monocular 3D face reconstruction, but they suffer from the ill-posed face pose and depth ambiguity issue.

3D Face Reconstruction Depth Estimation +2

KFNet: Learning Temporal Camera Relocalization using Kalman Filtering

1 code implementation CVPR 2020 Lei Zhou, Zixin Luo, Tianwei Shen, Jiahui Zhang, Mingmin Zhen, Yao Yao, Tian Fang, Long Quan

Temporal camera relocalization estimates the pose with respect to each video frame in sequence, as opposed to one-shot relocalization which focuses on a still image.

Camera Relocalization

D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features

2 code implementations CVPR 2020 Xuyang Bai, Zixin Luo, Lei Zhou, Hongbo Fu, Long Quan, Chiew-Lan Tai

In this paper, we leverage a 3D fully convolutional network for 3D point clouds, and propose a novel and practical learning mechanism that densely predicts both a detection score and a description feature for each 3D point.

Point Cloud Registration

BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks

2 code implementations CVPR 2020 Yao Yao, Zixin Luo, Shiwei Li, Jingyang Zhang, Yufan Ren, Lei Zhou, Tian Fang, Long Quan

Compared with other computer vision tasks, it is rather difficult to collect a large-scale MVS dataset as it requires expensive active scanners and labor-intensive process to obtain ground truth 3D structures.

3D Reconstruction

Self-Supervised Learning of Depth and Motion Under Photometric Inconsistency

1 code implementation 19 Sep 2019 Tianwei Shen, Lei Zhou, Zixin Luo, Yao Yao, Shiwei Li, Jiahui Zhang, Tian Fang, Long Quan

The self-supervised learning of depth and pose from monocular sequences provides an attractive solution by using the photometric consistency of nearby frames as it depends much less on the ground-truth data.

Pose Estimation Self-Supervised Learning

Learning Two-View Correspondences and Geometry Using Order-Aware Network

1 code implementation ICCV 2019 Jiahui Zhang, Dawei Sun, Zixin Luo, Anbang Yao, Lei Zhou, Tianwei Shen, Yurong Chen, Long Quan, Hongen Liao

First, to capture the local context of sparse correspondences, the network clusters unordered input correspondences by learning a soft assignment matrix.

ContextDesc: Local Descriptor Augmentation with Cross-Modality Context

1 code implementation CVPR 2019 Zixin Luo, Tianwei Shen, Lei Zhou, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan

Most existing studies on learning local features focus on the patch-based descriptions of individual keypoints, whereas neglecting the spatial relations established from their keypoint locations.

Geometric Matching

Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference

1 code implementation CVPR 2019 Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

However, one major limitation of current learned MVS approaches is the scalability: the memory-consuming cost volume regularization makes the learned MVS hard to be applied to high-resolution scenes.

Beyond Photometric Loss for Self-Supervised Ego-Motion Estimation

1 code implementation 25 Feb 2019 Tianwei Shen, Zixin Luo, Lei Zhou, Hanyu Deng, Runze Zhang, Tian Fang, Long Quan

Accurate relative pose is one of the key components in visual odometry (VO) and simultaneous localization and mapping (SLAM).

Motion Estimation Self-Supervised Learning +2

Matchable Image Retrieval by Learning from Surface Reconstruction

1 code implementation 26 Nov 2018 Tianwei Shen, Zixin Luo, Lei Zhou, Runze Zhang, Siyu Zhu, Tian Fang, Long Quan

Convolutional Neural Networks (CNNs) have achieved superior performance on object image retrieval, while Bag-of-Words (BoW) models with handcrafted local features still dominate the retrieval of overlapping images in 3D reconstruction.

3D Reconstruction Image Retrieval +2

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

1 code implementation ECCV 2018 Zixin Luo, Tianwei Shen, Lei Zhou, Siyu Zhu, Runze Zhang, Yao Yao, Tian Fang, Long Quan

Learned local descriptors based on Convolutional Neural Networks (CNNs) have achieved significant improvements on patch-based benchmarks, whereas not having demonstrated strong generalization ability on recent benchmarks of image-based 3D reconstruction.

3D Reconstruction

Learning and Matching Multi-View Descriptors for Registration of Point Clouds

no code implementations ECCV 2018 Lei Zhou, Siyu Zhu, Zixin Luo, Tianwei Shen, Runze Zhang, Mingmin Zhen, Tian Fang, Long Quan

Critical to the registration of point clouds is the establishment of a set of accurate correspondences between points in 3D space.

Very Large-Scale Global SfM by Distributed Motion Averaging

no code implementations CVPR 2018 Siyu Zhu, Runze Zhang, Lei Zhou, Tianwei Shen, Tian Fang, Ping Tan, Long Quan

This work proposes a divide-and-conquer framework to solve very large global SfM at the scale of millions of images.

MVSNet: Depth Inference for Unstructured Multi-view Stereo

4 code implementations ECCV 2018 Yao Yao, Zixin Luo, Shiwei Li, Tian Fang, Long Quan

We present an end-to-end deep learning architecture for depth map inference from multi-view images.

Ranked #19 on Point Clouds on Tanks and Temples (Mean F1 (Intermediate) metric)

3D Reconstruction Point Clouds

Progressive Large Scale-Invariant Image Matching in Scale Space

no code implementations ICCV 2017 Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, Tian Fang, Long Quan

In this paper, we propose a scale-invariant image matching approach to tackling the very large scale variation of views.

Image Retrieval Retrieval

Distributed Very Large Scale Bundle Adjustment by Global Camera Consensus

no code implementations ICCV 2017 Runze Zhang, Siyu Zhu, Tian Fang, Long Quan

In this paper, we propose a distributed approach to coping with this global bundle adjustment for very large scale Structure-from-Motion computation.

Distributed Computing Distributed Optimization

Parallel Structure from Motion from Local Increment to Global Averaging

no code implementations 28 Feb 2017 Siyu Zhu, Tianwei Shen, Lei Zhou, Runze Zhang, Jinglu Wang, Tian Fang, Long Quan

In this paper, we tackle the accurate and consistent Structure from Motion (SfM) problem, in particular camera registration, far exceeding the memory of a single computer in parallel.

Clustering

Color Constancy with Derivative Colors

no code implementations 25 Nov 2016 Huan Lei, Guang Jiang, Long Quan

While extracting derivative colors from achromatic regions to approximate the illuminant color well is basically straightforward, the success of our extraction in highlight regions is attributed to the different rates of variation of the diffuse and specular magnitudes in the dichromatic reflection model.

Color Constancy Density Estimation

Dual-Feature Warping-Based Motion Model Estimation

no code implementations ICCV 2015 Shiwei Li, Lu Yuan, Jian Sun, Long Quan

Line segment is a prominent feature in artificial environments and it can supply sufficient geometrical and structural information of scenes, which not only helps guide to a correct warp in low-texture condition, but also prevents the undesired distortion induced by warping.

Image Stitching Video Stabilization

Higher-Order CRF Structural Segmentation of 3D Reconstructed Surfaces

no code implementations ICCV 2015 Jingbo Liu, Jinglu Wang, Tian Fang, Chiew-Lan Tai, Long Quan

In this paper, we propose a structural segmentation algorithm to partition multi-view stereo reconstructed surfaces of large-scale urban environments into structural segments.

Segmentation

Joint Camera Clustering and Surface Segmentation for Large-Scale Multi-View Stereo

no code implementations ICCV 2015 Runze Zhang, Shiwei Li, Tian Fang, Siyu Zhu, Long Quan

To solve this problem, we propose a joint optimization in a hierarchical framework to obtain the final surface segments and corresponding optimal camera clusters.

Clustering Segmentation

Low-rank SIFT: An Affine Invariant Feature for Place Recognition

no code implementations 7 Aug 2014 Chao Yang, Shengnan Caih, Jingdong Wang, Long Quan

As an extension of SIFT, our method seeks to add prior to solve the ill-posed affine parameter estimation problem and normalizes them directly, and is applicable to objects with regular structures.

feature selection Translation

Local Readjustment for High-Resolution 3D Reconstruction

no code implementations CVPR 2014 Siyu Zhu, Tian Fang, Jianxiong Xiao, Long Quan

To this end, we propose a segment-based approach to readjust the camera poses locally and improve the reconstruction for fine geometry details.

3D Reconstruction
