Search Results for author: Danhang Tang

Found 22 papers, 3 papers with code

One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing

no code implementations • 15 Apr 2024 • Yueyu Hu, Onur G. Guleryuz, Philip A. Chou, Danhang Tang, Jonathan Taylor, Rus Maxham, Yao Wang

In this paper, we propose a new approach to upgrade a 2D video codec to support stereo RGB-D video compression, by wrapping it with a neural pre- and post-processor pair.

Video Compression

Paper
Add Code

GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation

no code implementations • 19 Mar 2024 • Quankai Gao, Qiangeng Xu, Zhe Cao, Ben Mildenhall, Wenchao Ma, Le Chen, Danhang Tang, Ulrich Neumann

While the optimization can draw photometric reference from the input videos or be regulated by generative models, directly supervising Gaussian motions remains underexplored.

Novel View Synthesis Optical Flow Estimation

Paper
Add Code

Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers

1 code implementation • 8 Feb 2024 • Onur G. Guleryuz, Philip A. Chou, Berivan Isik, Hugues Hoppe, Danhang Tang, Ruofei Du, Jonathan Taylor, Philip Davidson, Sean Fanello

Through a variety of examples, we apply the sandwich architecture to sources with different numbers of channels, higher resolution, higher dynamic range, and perceptual distortion measures.

Video Compression

Paper
Code

MACS: Mass Conditioned 3D Hand and Object Motion Synthesis

no code implementations • 22 Dec 2023 • Soshi Shimada, Franziska Mueller, Jan Bednarik, Bardia Doosti, Bernd Bickel, Danhang Tang, Vladislav Golyanik, Jonathan Taylor, Christian Theobalt, Thabo Beeler

To improve the naturalness of the synthesized 3D hand object motions, this work proposes MACS the first MAss Conditioned 3D hand and object motion Synthesis approach.

Motion Synthesis Object

Paper
Add Code

Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement

no code implementations • 28 Nov 2023 • Jian Wang, Zhe Cao, Diogo Luvizon, Lingjie Liu, Kripasindhu Sarkar, Danhang Tang, Thabo Beeler, Christian Theobalt

In this work, we explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion.

Ranked #1 on Egocentric Pose Estimation on GlobalEgoMocap Test Dataset (using extra training data)

Egocentric Pose Estimation Hand Detection +2

Paper
Add Code

Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images

no code implementations • ICCV 2023 • Tze Ho Elden Tse, Franziska Mueller, Zhengyang Shen, Danhang Tang, Thabo Beeler, Mingsong Dou, yinda zhang, Sasa Petrovic, Hyung Jin Chang, Jonathan Taylor, Bardia Doosti

We propose a novel transformer-based framework that reconstructs two high fidelity hands from multi-view RGB images.

Hand Pose Estimation

Paper
Add Code

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

1 code implementation • CVPR 2023 • Yun He, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

Most existing point cloud upsampling methods have roughly three steps: feature extraction, feature expansion and 3D coordinate prediction.

point cloud upsampling

Paper
Code

Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

no code implementations • CVPR 2023 • Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts-Escolano, Rohit Pandey, Ping Tan, Thabo Beeler, Sean Fanello, yinda zhang

The learnt avatar is driven by a parametric face model to achieve user-controlled facial expressions and head poses.

Face Model Vocal Bursts Intensity Prediction

Paper
Add Code

Sandwiched Video Compression: Efficiently Extending the Reach of Standard Codecs with Neural Wrappers

no code implementations • 20 Mar 2023 • Berivan Isik, Onur G. Guleryuz, Danhang Tang, Jonathan Taylor, Philip A. Chou

We propose differentiable approximations to key video codec components and demonstrate that, in addition to providing meaningful compression improvements over the standard codec, the neural codes of the sandwich lead to significantly better rate-distortion performance in two important scenarios. When transporting high-resolution video via low-resolution HEVC, the sandwich system obtains 6. 5 dB improvements over standard HEVC.

Motion Compensation Video Compression

Paper
Add Code

Pixel-Aligned Non-parametric Hand Mesh Reconstruction

no code implementations • 17 Oct 2022 • Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye

The decoder aggregate both local image features in pixels and geometric features in vertices.

Paper
Add Code

PRIF: Primary Ray-based Implicit Function

no code implementations • 12 Aug 2022 • Brandon Yushan Feng, yinda zhang, Danhang Tang, Ruofei Du, Amitabh Varshney

We introduce a new implicit shape representation called Primary Ray-based Implicit Function (PRIF).

Inverse Rendering Neural Rendering +1

Paper
Add Code

Density-preserving Deep Point Cloud Compression

no code implementations • CVPR 2022 • Yun He, Xinlin Ren, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

To address this, we propose a novel deep point cloud compression method that preserves local density information.

Paper
Add Code

OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas

no code implementations • 17 Feb 2022 • David Li, yinda zhang, Christian Häne, Danhang Tang, Amitabh Varshney, Ruofei Du

Immersive maps such as Google Street View and Bing Streetside provide true-to-life views with a massive collection of panoramas.

Paper
Add Code

VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting

no code implementations • 13 Jan 2022 • Feitong Tan, Sean Fanello, Abhimitra Meka, Sergio Orts-Escolano, Danhang Tang, Rohit Pandey, Jonathan Taylor, Ping Tan, yinda zhang

We propose VoLux-GAN, a generative framework to synthesize 3D-aware faces with convincing relighting.

3D-Aware Image Synthesis Data Augmentation +2

Paper
Add Code

Multiresolution Deep Implicit Functions for 3D Shape Representation

no code implementations • ICCV 2021 • Zhang Chen, yinda zhang, Kyle Genova, Sean Fanello, Sofien Bouaziz, Christian Haene, Ruofei Du, Cem Keskin, Thomas Funkhouser, Danhang Tang

To the best of our knowledge, MDIF is the first deep implicit function model that can at the same time (1) represent different levels of detail and allow progressive decoding; (2) support both encoder-decoder inference and decoder-only latent optimization, and fulfill multiple applications; (3) perform detailed decoder-only shape completion.

3D Reconstruction 3D Shape Representation

Paper
Add Code

HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences

1 code implementation • CVPR 2021 • Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Rohit Pandey, Cem Keskin, Ruofei Du, Deqing Sun, Sofien Bouaziz, Sean Fanello, Ping Tan, yinda zhang

In this paper, we address the problem of building dense correspondences between human images under arbitrary camera viewpoints and body poses.

Paper
Code

Deep Implicit Volume Compression

no code implementations • CVPR 2020 • Danhang Tang, Saurabh Singh, Philip A. Chou, Christian Haene, Mingsong Dou, Sean Fanello, Jonathan Taylor, Philip Davidson, Onur G. Guleryuz, yinda zhang, Shahram Izadi, Andrea Tagliasacchi, Sofien Bouaziz, Cem Keskin

We describe a novel approach for compressing truncated signed distance fields (TSDF) stored in 3D voxel grids, and their corresponding textures.

Video Compression

Paper
Add Code

Real-time Background-aware 3D Textureless Object Pose Estimation

no code implementations • 22 Jul 2019 • Mang Shao, Danhang Tang, Tae-Kyun Kim

In this work, we present a modified fuzzy decision forest for real-time 3D object pose estimation based on typical template representation.

Object Pose Estimation

Paper
Add Code

Latent-Class Hough Forests for 6 DoF Object Pose Estimation

no code implementations • 3 Feb 2016 • Rigas Kouskouridas, Alykhan Tejani, Andreas Doumanoglou, Danhang Tang, Tae-Kyun Kim

In this paper we present Latent-Class Hough Forests, a method for object detection and 6 DoF pose estimation in heavily cluttered and occluded scenarios.

object-detection Object Detection +2

Paper
Add Code

Conditional Convolutional Neural Network for Modality-Aware Face Recognition

no code implementations • ICCV 2015 • Chao Xiong, Xiaowei Zhao, Danhang Tang, Karlekar Jayashree, Shuicheng Yan, Tae-Kyun Kim

Faces in the wild are usually captured with various poses, illuminations and occlusions, and thus inherently multimodally distributed in many tasks.

Face Identification Face Recognition +1

Paper
Add Code

Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose

no code implementations • ICCV 2015 • Danhang Tang, Jonathan Taylor, Pushmeet Kohli, Cem Keskin, Tae-Kyun Kim, Jamie Shotton

In this paper, we show that we can significantly improving upon black box optimization by exploiting high-level knowledge of the structure of the parameters and using a local surrogate energy function.

Hand Pose Estimation Image Generation

Paper
Add Code

Latent Regression Forest: Structured Estimation of 3D Articulated Hand Posture

no code implementations • CVPR 2014 • Danhang Tang, Hyung Jin Chang, Alykhan Tejani, Tae-Kyun Kim

In contrast to prior forest-based methods, which take dense pixels as input, classify them independently and then estimate joint positions afterwards; our method can be considered as a structured coarse-to-fine search, starting from the centre of mass of a point cloud until locating all the skeletal joints.

3D Hand Pose Estimation regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.