Search Results for author: yinda zhang

Found 64 papers, 22 papers with code

Du²Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels

no code implementations • ECCV 2020 • Yinda Zhang, Neal Wadhwa, Sergio Orts-Escolano, Christian Häne, Sean Fanello, Rahul Garg

Computational stereo has reached a high level of accuracy, but degrades in the presence of occlusions, repeated textures, and correspondence errors along edges.

Depth Estimation Stereo Matching

Paper
Add Code

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

no code implementations • 2 Apr 2024 • Chong Bao, yinda zhang, Yuan Li, Xiyu Zhang, Bangbang Yang, Hujun Bao, Marc Pollefeys, Guofeng Zhang, Zhaopeng Cui

Recently, we have witnessed the explosive growth of various volumetric representations in modeling animatable head avatars.

Paper
Add Code

CHOSEN: Contrastive Hypothesis Selection for Multi-View Depth Refinement

no code implementations • 2 Apr 2024 • Di Qiu, yinda zhang, Thabo Beeler, Vladimir Tankovich, Christian Häne, Sean Fanello, Christoph Rhemann, Sergio Orts Escolano

We propose CHOSEN, a simple yet flexible, robust and effective multi-view depth refinement framework.

Contrastive Learning Depth Estimation

Paper
Add Code

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

no code implementations • 2 Apr 2024 • Ziqian Bai, Feitong Tan, Sean Fanello, Rohit Pandey, Mingsong Dou, Shichen Liu, Ping Tan, yinda zhang

To address these challenges, we propose a novel fast 3D neural implicit head avatar model that achieves real-time rendering while maintaining fine-grained controllability and high rendering quality.

Neural Rendering

Paper
Add Code

T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image

no code implementations • 20 Mar 2024 • Shijie Zhang, Boyan Jiang, Keke He, Junwei Zhu, Ying Tai, Chengjie Wang, yinda zhang, Yanwei Fu

Pixel2Mesh (P2M) is a classical approach for reconstructing 3D shapes from a single color image through coarse-to-fine mesh deformation.

3D Reconstruction Single-View 3D Reconstruction

Paper
Add Code

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability

no code implementations • 19 Feb 2024 • Xuelin Qian, Yu Wang, Simian Luo, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue, Bo Zhao, Tiejun Huang, Yunsheng Wu, Yanwei Fu

In this paper, we extend auto-regressive models to 3D domains, and seek a stronger ability of 3D shape generation by improving auto-regressive models at capacity and scalability simultaneously.

3D Generation 3D Shape Generation +1

Paper
Add Code

One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation

no code implementations • 19 Feb 2024 • Zhixuan Yu, Ziqian Bai, Abhimitra Meka, Feitong Tan, Qiangeng Xu, Rohit Pandey, Sean Fanello, Hyun Soo Park, yinda zhang

Traditional methods for constructing high-quality, personalized head avatars from monocular videos demand extensive face captures and training time, posing a significant challenge for scalability.

Camera Calibration

Paper
Add Code

GO-NeRF: Generating Virtual Objects in Neural Radiance Fields

no code implementations • 11 Jan 2024 • Peng Dai, Feitong Tan, Xin Yu, yinda zhang, Xiaojuan Qi

To this end, we propose a new method, GO-NeRF, capable of utilizing scene context for high-quality and harmonious 3D object generation within an existing NeRF.

3D Generation Object

Paper
Add Code

InstructPipe: Building Visual Programming Pipelines with Human Instructions

no code implementations • 15 Dec 2023 • Zhongyi Zhou, Jing Jin, Vrushank Phadnis, Xiuxiu Yuan, Jun Jiang, Xun Qian, Jingtao Zhou, Yiyi Huang, Zheng Xu, yinda zhang, Kristen Wright, Jason Mayes, Mark Sherwood, Johnny Lee, Alex Olwal, David Kim, Ram Iyengar, Na Li, Ruofei Du

Our user study (N=16) showed that InstructPipe empowers novice users to streamline their workflow in creating desired ML pipelines, reduce their learning curve, and spark innovative ideas with open-ended commands.

Paper
Add Code

MVDD: Multi-View Depth Diffusion Models

no code implementations • 8 Dec 2023 • Zhen Wang, Qiangeng Xu, Feitong Tan, Menglei Chai, Shichen Liu, Rohit Pandey, Sean Fanello, Achuta Kadambi, yinda zhang

State-of-the-art results from extensive experiments demonstrate MVDD's excellent ability in 3D shape generation, depth completion, and its potential as a 3D prior for downstream tasks.

3D Shape Generation Denoising +3

Paper
Add Code

Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing

no code implementations • 5 Dec 2023 • Yushi Lan, Feitong Tan, Di Qiu, Qiangeng Xu, Kyle Genova, Zeng Huang, Sean Fanello, Rohit Pandey, Thomas Funkhouser, Chen Change Loy, yinda zhang

We present a novel framework for generating photorealistic 3D human head and subsequently manipulating and reposing them with remarkable flexibility.

Face Model

Paper
Add Code

Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor

no code implementations • ICCV 2023 • Xinyang Liu, Yijin Li, Yanbin Teng, Hujun Bao, Guofeng Zhang, yinda zhang, Zhaopeng Cui

Specifically, we propose a multi-modal implicit scene representation that supports rendering both the signals from the RGB camera and light-weight ToF sensor which drives the optimization by comparing with the raw sensor inputs.

Pose Tracking

Paper
Add Code

Self-supervised Learning of Implicit Shape Representation with Dense Correspondence for Deformable Objects

no code implementations • ICCV 2023 • Baowen Zhang, Jiahe Li, Xiaoming Deng, yinda zhang, Cuixia Ma, Hongan Wang

In this paper, we propose a novel self-supervised approach to learn neural implicit shape representation for deformable objects, which can represent shapes with a template shape and dense correspondence in 3D.

3D Shape Representation Self-Supervised Learning

Paper
Add Code

Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views

no code implementations • ICCV 2023 • Wentian Qu, Zhaopeng Cui, yinda zhang, Chenyu Meng, Cuixia Ma, Xiaoming Deng, Hongan Wang

Hand-object interaction understanding and the barely addressed novel view synthesis are highly desired in the immersive communication, whereas it is challenging due to the high deformation of hand and heavy occlusions between hand and object.

Neural Rendering Novel View Synthesis +3

Paper
Add Code

Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images

no code implementations • ICCV 2023 • Tze Ho Elden Tse, Franziska Mueller, Zhengyang Shen, Danhang Tang, Thabo Beeler, Mingsong Dou, yinda zhang, Sasa Petrovic, Hyung Jin Chang, Jonathan Taylor, Bardia Doosti

We propose a novel transformer-based framework that reconstructs two high fidelity hands from multi-view RGB images.

Hand Pose Estimation

Paper
Add Code

Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur

1 code implementation • CVPR 2023 • Peng Dai, yinda zhang, Xin Yu, Xiaoyang Lyu, Xiaojuan Qi

Rendering novel view images is highly desirable for many applications.

Neural Rendering Novel View Synthesis

Paper
Code

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

1 code implementation • CVPR 2023 • Yun He, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

Most existing point cloud upsampling methods have roughly three steps: feature extraction, feature expansion and 3D coordinate prediction.

point cloud upsampling

Paper
Code

Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

no code implementations • CVPR 2023 • Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts-Escolano, Rohit Pandey, Ping Tan, Thabo Beeler, Sean Fanello, yinda zhang

The learnt avatar is driven by a parametric face model to achieve user-controlled facial expressions and head poses.

Face Model Vocal Bursts Intensity Prediction

Paper
Add Code

Learning Versatile 3D Shape Generation with Improved AR Models

no code implementations • 26 Mar 2023 • Simian Luo, Xuelin Qian, Yanwei Fu, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue

Auto-Regressive (AR) models have achieved impressive results in 2D image generation by modeling joint distributions in the grid space.

3D Shape Generation Image Generation +1

Paper
Add Code

SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

1 code implementation • CVPR 2023 • Chong Bao, yinda zhang, Bangbang Yang, Tianxing Fan, Zesong Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

Despite the great success in 2D editing using user-friendly tools, such as Photoshop, semantic strokes, or even text prompts, similar capabilities in 3D areas are still limited, either relying on 3D modeling skills or allowing editing within only a few categories.

123

Paper
Code

Learning Versatile 3D Shape Generation with Improved Auto-regressive Models

no code implementations • ICCV 2023 • Simian Luo, Xuelin Qian, Yanwei Fu, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue

Auto-Regressive (AR) models have achieved impressive results in 2D image generation by modeling joint distributions in the grid space.

3D Shape Generation Image Generation +1

Paper
Add Code

DELTAR: Depth Estimation from a Light-weight ToF Sensor and RGB Image

no code implementations • 27 Sep 2022 • Yijin Li, Xinyang Liu, Wenqi Dong, Han Zhou, Hujun Bao, Guofeng Zhang, yinda zhang, Zhaopeng Cui

Light-weight time-of-flight (ToF) depth sensors are small, cheap, low-energy and have been massively deployed on mobile devices for the purposes like autofocus, obstacle detection, etc.

3D Reconstruction Depth Completion +2

Paper
Add Code

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling

1 code implementation • 18 Aug 2022 • Boyan Jiang, Xinlin Ren, Mingsong Dou, xiangyang xue, Yanwei Fu, yinda zhang

Recent progress in 4D implicit representation focuses on globally controlling the shape and motion with low dimensional latent vectors, which is prone to missing surface details and accumulating tracking error.

3D Shape Modeling 4D reconstruction +1

Paper
Code

PRIF: Primary Ray-based Implicit Function

no code implementations • 12 Aug 2022 • Brandon Yushan Feng, yinda zhang, Danhang Tang, Ruofei Du, Amitabh Varshney

We introduce a new implicit shape representation called Primary Ray-based Implicit Function (PRIF).

Inverse Rendering Neural Rendering +1

Paper
Add Code

NeuMesh: Learning Disentangled Neural Mesh-based Implicit Field for Geometry and Texture Editing

no code implementations • 25 Jul 2022 • Bangbang Yang, Chong Bao, Junyi Zeng, Hujun Bao, yinda zhang, Zhaopeng Cui, Guofeng Zhang

Very recently neural implicit rendering techniques have been rapidly evolved and shown great advantages in novel view synthesis and 3D scene reconstruction.

3D Scene Reconstruction Neural Rendering +1

Paper
Add Code

Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects

no code implementations • 5 May 2022 • Bangbang Yang, yinda zhang, Yijin Li, Zhaopeng Cui, Sean Fanello, Hujun Bao, Guofeng Zhang

We, as human beings, can understand and picture a familiar scene from arbitrary viewpoints given a single image, whereas this is still a grand challenge for computers.

Data Augmentation Neural Rendering +1

Paper
Add Code

Density-preserving Deep Point Cloud Compression

no code implementations • CVPR 2022 • Yun He, Xinlin Ren, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

To address this, we propose a novel deep point cloud compression method that preserves local density information.

Paper
Add Code

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

no code implementations • 21 Apr 2022 • Chao Wen, yinda zhang, Chenjie Cao, Zhuwen Li, xiangyang xue, Yanwei Fu

We study the problem of shape generation in 3D mesh representation from a small number of color images with or without camera poses.

Paper
Add Code

Efficient Virtual View Selection for 3D Hand Pose Estimation

1 code implementation • 29 Mar 2022 • Jian Cheng, Yanguang Wan, Dexin Zuo, Cuixia Ma, Jian Gu, Ping Tan, Hongan Wang, Xiaoming Deng, yinda zhang

3D hand pose estimation from single depth is a fundamental problem in computer vision, and has wide applications. However, the existing methods still can not achieve satisfactory hand pose estimation results due to view variation and occlusion of human hand.

Ranked #1 on Hand Pose Estimation on ICVL Hands

3D Hand Pose Estimation

Paper
Code

H4D: Human 4D Modeling by Learning Neural Compositional Representation

no code implementations • CVPR 2022 • Boyan Jiang, yinda zhang, Xingkui Wei, xiangyang xue, Yanwei Fu

A simple yet effective linear motion model is proposed to provide a rough and regularized motion estimation, followed by per-frame compensation for pose and geometry details with the residual encoded in the auxiliary code.

3D Reconstruction Future prediction +2

Paper
Add Code

OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas

no code implementations • 17 Feb 2022 • David Li, yinda zhang, Christian Häne, Danhang Tang, Amitabh Varshney, Ruofei Du

Immersive maps such as Google Street View and Bing Streetside provide true-to-life views with a massive collection of panoramas.

Paper
Add Code

VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting

no code implementations • 13 Jan 2022 • Feitong Tan, Sean Fanello, Abhimitra Meka, Sergio Orts-Escolano, Danhang Tang, Rohit Pandey, Jonathan Taylor, Ping Tan, yinda zhang

We propose VoLux-GAN, a generative framework to synthesize 3D-aware faces with convincing relighting.

3D-Aware Image Synthesis Data Augmentation +2

Paper
Add Code

Multiresolution Deep Implicit Functions for 3D Shape Representation

no code implementations • ICCV 2021 • Zhang Chen, yinda zhang, Kyle Genova, Sean Fanello, Sofien Bouaziz, Christian Haene, Ruofei Du, Cem Keskin, Thomas Funkhouser, Danhang Tang

To the best of our knowledge, MDIF is the first deep implicit function model that can at the same time (1) represent different levels of detail and allow progressive decoding; (2) support both encoder-decoder inference and decoder-only latent optimization, and fulfill multiple applications; (3) perform detailed decoder-only shape completion.

3D Reconstruction 3D Shape Representation

Paper
Add Code

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

no code implementations • ICCV 2021 • Bangbang Yang, yinda zhang, Yinghao Xu, Yijin Li, Han Zhou, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

In this paper, we present a novel neural scene rendering system, which learns an object-compositional neural radiance field and produces realistic rendering with editing capability for a clustered and real-world scene.

Neural Rendering Novel View Synthesis +1

Paper
Add Code

DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization

1 code implementation • ICCV 2021 • Cheng Zhang, Zhaopeng Cui, Cai Chen, Shuaicheng Liu, Bing Zeng, Hujun Bao, yinda zhang

Panorama images have a much larger field-of-view thus naturally encode enriched scene context information compared to standard perspective images, which however is not well exploited in the previous scene understanding methods.

Object Relation +1

Paper
Code

Deep Hybrid Self-Prior for Full 3D Mesh Generation

no code implementations • ICCV 2021 • Xingkui Wei, Zhengqing Chen, Yanwei Fu, Zhaopeng Cui, yinda zhang

We present a deep learning pipeline that leverages network self-prior to recover a full 3D model consisting of both a triangular mesh and a texture map from the colored 3D point cloud.

Surface Reconstruction

Paper
Add Code

Spatially-Varying Outdoor Lighting Estimation from Intrinsics

no code implementations • CVPR 2021 • Yongjie Zhu, yinda zhang, Si Li, Boxin Shi

We train a deep neural network to regress intrinsic cues with physically-based constraints and use them to conduct global and local lightings estimation.

Lighting Estimation

Paper
Add Code

HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences

1 code implementation • CVPR 2021 • Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Rohit Pandey, Cem Keskin, Ruofei Du, Deqing Sun, Sofien Bouaziz, Sean Fanello, Ping Tan, yinda zhang

In this paper, we address the problem of building dense correspondences between human images under arbitrary camera viewpoints and body poses.

Paper
Code

Learning Compositional Representation for 4D Captures with Neural ODE

no code implementations • CVPR 2021 • Boyan Jiang, yinda zhang, Xingkui Wei, xiangyang xue, Yanwei Fu

To model the motion, a neural Ordinary Differential Equation (ODE) is trained to update the initial state conditioned on the learned motion code, and a decoder takes the shape code and the updated state code to reconstruct the 3D model at each time stamp.

4D reconstruction

Paper
Add Code

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

1 code implementation • CVPR 2021 • Cheng Zhang, Zhaopeng Cui, yinda zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu

We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features.

Ranked #1 on Monocular 3D Object Detection on SUN RGB-D (using extra training data)

3D Shape Reconstruction Monocular 3D Object Detection +4

189

Paper
Code

Interacting Two-Hand 3D Pose and Shape Reconstruction From Single Color Image

no code implementations • ICCV 2021 • Baowen Zhang, Yangang Wang, Xiaoming Deng, yinda zhang, Ping Tan, Cuixia Ma, Hongan Wang

In this paper, we propose a novel deep learning framework to reconstruct 3D hand poses and shapes of two interacting hands from a single color image.

Ranked #6 on 3D Interacting Hand Pose Estimation on InterHand2.6M

3D Interacting Hand Pose Estimation

Paper
Add Code

GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes

no code implementations • ECCV 2020 • Weidong Zhang, Wei zhang, yinda zhang

The task of room layout estimation is to locate the wall-floor, wall-ceiling, and wall-wall boundaries.

Keypoint Detection Room Layout Estimation +1

Paper
Add Code

HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching

8 code implementations • CVPR 2021 • Vladimir Tankovich, Christian Häne, yinda zhang, Adarsh Kowdle, Sean Fanello, Sofien Bouaziz

Contrary to many recent neural network approaches that operate on a full cost volume and rely on 3D convolutions, our approach does not explicitly build a volume and instead relies on a fast multi-resolution initialization step, differentiable 2D geometric propagation and warping mechanisms to infer disparity hypotheses.

Ranked #1 on Stereo Depth Estimation on KITTI2015

Stereo Depth Estimation Stereo Disparity Estimation +1

32,815

Paper
Code

Deep Implicit Volume Compression

no code implementations • CVPR 2020 • Danhang Tang, Saurabh Singh, Philip A. Chou, Christian Haene, Mingsong Dou, Sean Fanello, Jonathan Taylor, Philip Davidson, Onur G. Guleryuz, yinda zhang, Shahram Izadi, Andrea Tagliasacchi, Sofien Bouaziz, Cem Keskin

We describe a novel approach for compressing truncated signed distance fields (TSDF) stored in 3D voxel grids, and their corresponding textures.

Video Compression

Paper
Add Code

Du$^2$Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels

no code implementations • 31 Mar 2020 • Yinda Zhang, Neal Wadhwa, Sergio Orts-Escolano, Christian Häne, Sean Fanello, Rahul Garg

Computational stereo has reached a high level of accuracy, but degrades in the presence of occlusions, repeated textures, and correspondence errors along edges.

Depth Estimation Stereo Matching

Paper
Add Code

Neural Pose Transfer by Spatially Adaptive Instance Normalization

1 code implementation • CVPR 2020 • Jiashun Wang, Chao Wen, Yanwei Fu, Haitao Lin, Tianyun Zou, xiangyang xue, yinda zhang

Pose transfer has been studied for decades, in which the pose of a source mesh is applied to a target mesh.

Pose Transfer Style Transfer

141

Paper
Code

DeepSFM: Structure From Motion Via Deep Bundle Adjustment

1 code implementation • ECCV 2020 • Xingkui Wei, yinda zhang, Zhuwen Li, Yanwei Fu, xiangyang xue

The explicit constraints on both depth (structure) and pose (motion), when combined with the learning components, bring the merit from both traditional BA and emerging deep learning technology.

Pose Estimation

308

Paper
Code

Neural Point Cloud Rendering via Multi-Plane Projection

1 code implementation • CVPR 2020 • Peng Dai, yinda zhang, Zhuwen Li, Shuaicheng Liu, Bing Zeng

The input to the network is the raw point cloud of a scene and the output are image or image sequences from a novel view or along a novel camera trajectory.

Paper
Code

DIST: Rendering Deep Implicit Signed Distance Function with Differentiable Sphere Tracing

1 code implementation • CVPR 2020 • Shaohui Liu, yinda zhang, Songyou Peng, Boxin Shi, Marc Pollefeys, Zhaopeng Cui

We propose a differentiable sphere tracing algorithm to bridge the gap between inverse graphics methods and the recently proposed deep learning based implicit signed distance function.

215

Paper
Code

Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation

2 code implementations • ICCV 2019 • Chao Wen, yinda zhang, Zhuwen Li, Yanwei Fu

We study the problem of shape generation in 3D mesh representation from a few color images with known camera poses.

352

Paper
Code

DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene from Sparse LiDAR Data and Single Color Image

1 code implementation • CVPR 2019 • Jiaxiong Qiu, Zhaopeng Cui, yinda zhang, Xingdi Zhang, Shuaicheng Liu, Bing Zeng, Marc Pollefeys

In this paper, we propose a deep learning architecture that produces accurate dense depth for the outdoor scene from a single color image and a sparse depth.

Depth Completion Depth Estimation +1

241

Paper
Code

ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems

1 code implementation • ECCV 2018 • Yinda Zhang, Sameh Khamis, Christoph Rhemann, Julien Valentin, Adarsh Kowdle, Vladimir Tankovich, Michael Schoenberg, Shahram Izadi, Thomas Funkhouser, Sean Fanello

In this paper we present ActiveStereoNet, the first deep learning solution for active stereo systems.

Self-Supervised Learning Stereo Disparity Estimation +1

663

Paper
Code

Multi-level Semantic Feature Augmentation for One-shot Learning

1 code implementation • 15 Apr 2018 • Zitian Chen, Yanwei Fu, yinda zhang, Yu-Gang Jiang, xiangyang xue, Leonid Sigal

In semantic space, we search for related concepts, which are then projected back into the image feature spaces by the decoder portion of the TriNet.

Novel Concepts One-Shot Learning

Paper
Code

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images

6 code implementations • ECCV 2018 • Nanyang Wang, yinda zhang, Zhuwen Li, Yanwei Fu, Wei Liu, Yu-Gang Jiang

We propose an end-to-end deep learning architecture that produces a 3D shape in triangular mesh from a single color image.

Ranked #3 on 3D Object Reconstruction on Data3D−R2N2 (Avg F1 metric)

3D Object Reconstruction

1,607

Paper
Code

Deep Depth Completion of a Single RGB-D Image

1 code implementation • CVPR 2018 • Yinda Zhang, Thomas Funkhouser

The goal of our work is to complete the depth channel of an RGB-D image.

Depth Completion Depth Estimation

543

Paper
Code

VOCABULARY-INFORMED VISUAL FEATURE AUGMENTATION FOR ONE-SHOT LEARNING

no code implementations • ICLR 2018 • jianqi ma, Hangyu Lin, yinda zhang, Yanwei Fu, xiangyang xue

Besides directly augmenting image features, we transform the image features to semantic space using the encoder and perform the data augmentation.

Classification Data Augmentation +2

Paper
Add Code

Matterport3D: Learning from RGB-D Data in Indoor Environments

1 code implementation • 18 Sep 2017 • Angel Chang, Angela Dai, Thomas Funkhouser, Maciej Halber, Matthias Nießner, Manolis Savva, Shuran Song, Andy Zeng, yinda zhang

Access to large, diverse RGB-D datasets is critical for training RGB-D scene understanding algorithms.

General Classification Scene Understanding +1

893

Paper
Code

Hand3D: Hand Pose Estimation using 3D Neural Network

no code implementations • 7 Apr 2017 • Xiaoming Deng, Shuo Yang, yinda zhang, Ping Tan, Liang Chang, Hongan Wang

We propose a novel 3D neural network architecture for 3D hand pose estimation from a single depth image.

3D Hand Pose Estimation

Paper
Add Code

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

no code implementations • CVPR 2017 • Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, Joon-Young Lee, Hailin Jin, Thomas Funkhouser

One of the bottlenecks in training for better representations is the amount of available per-pixel ground truth data that is required for core scene understanding tasks such as semantic segmentation, normal prediction, and object edge detection.

Boundary Detection Edge Detection +4

Paper
Add Code

Joint Hand Detection and Rotation Estimation by Using CNN

no code implementations • 8 Dec 2016 • Xiaoming Deng, Ye Yuan, Yinda Zhang, Ping Tan, Liang Chang, Shuo Yang, Hongan Wang

Hand detection is essential for many hand related tasks, e. g. parsing hand pose, understanding gesture, which are extremely useful for robotics and human-computer interaction.

General Classification Hand Detection +2

Paper
Add Code

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

no code implementations • ICCV 2017 • Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao

In particular, 3D context has been shown to be an extremely important cue for scene understanding - yet very little research has been done on integrating context information with deep models.

Object Scene Understanding

Paper
Add Code

LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop

4 code implementations • 10 Jun 2015 • Fisher Yu, Ari Seff, yinda zhang, Shuran Song, Thomas Funkhouser, Jianxiong Xiao

While there has been remarkable progress in the performance of visual recognition algorithms, the state-of-the-art models tend to be exceptionally data-hungry.

517

Paper
Code

TurkerGaze: Crowdsourcing Saliency with Webcam based Eye Tracking

1 code implementation • 25 Apr 2015 • Pingmei Xu, Krista A. Ehinger, yinda zhang, Adam Finkelstein, Sanjeev R. Kulkarni, Jianxiong Xiao

Traditional eye tracking requires specialized hardware, which means collecting gaze data from many observers is expensive, tedious and slow.

Saliency Prediction

Paper
Code

FrameBreak: Dramatic Image Extrapolation by Guided Shift-Maps

no code implementations • CVPR 2013 • Yinda Zhang, Jianxiong Xiao, James Hays, Ping Tan

We analyze the self-similarity of the guide image to generate a set of allowable local transformations and apply them to the input image.

Image Generation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.