Search Results for author: Danhang Tang

Found 22 papers, 3 papers with code

One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing

no code implementations15 Apr 2024 Yueyu Hu, Onur G. Guleryuz, Philip A. Chou, Danhang Tang, Jonathan Taylor, Rus Maxham, Yao Wang

In this paper, we propose a new approach to upgrade a 2D video codec to support stereo RGB-D video compression, by wrapping it with a neural pre- and post-processor pair.

Video Compression

GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation

no code implementations19 Mar 2024 Quankai Gao, Qiangeng Xu, Zhe Cao, Ben Mildenhall, Wenchao Ma, Le Chen, Danhang Tang, Ulrich Neumann

While the optimization can draw photometric reference from the input videos or be regulated by generative models, directly supervising Gaussian motions remains underexplored.

Novel View Synthesis Optical Flow Estimation

Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers

1 code implementation8 Feb 2024 Onur G. Guleryuz, Philip A. Chou, Berivan Isik, Hugues Hoppe, Danhang Tang, Ruofei Du, Jonathan Taylor, Philip Davidson, Sean Fanello

Through a variety of examples, we apply the sandwich architecture to sources with different numbers of channels, higher resolution, higher dynamic range, and perceptual distortion measures.

Video Compression

MACS: Mass Conditioned 3D Hand and Object Motion Synthesis

no code implementations22 Dec 2023 Soshi Shimada, Franziska Mueller, Jan Bednarik, Bardia Doosti, Bernd Bickel, Danhang Tang, Vladislav Golyanik, Jonathan Taylor, Christian Theobalt, Thabo Beeler

To improve the naturalness of the synthesized 3D hand object motions, this work proposes MACS the first MAss Conditioned 3D hand and object motion Synthesis approach.

Motion Synthesis Object

Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement

no code implementations28 Nov 2023 Jian Wang, Zhe Cao, Diogo Luvizon, Lingjie Liu, Kripasindhu Sarkar, Danhang Tang, Thabo Beeler, Christian Theobalt

In this work, we explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion.

 Ranked #1 on Egocentric Pose Estimation on GlobalEgoMocap Test Dataset (using extra training data)

Egocentric Pose Estimation Hand Detection +2

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

1 code implementation CVPR 2023 Yun He, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

Most existing point cloud upsampling methods have roughly three steps: feature extraction, feature expansion and 3D coordinate prediction.

point cloud upsampling

Sandwiched Video Compression: Efficiently Extending the Reach of Standard Codecs with Neural Wrappers

no code implementations20 Mar 2023 Berivan Isik, Onur G. Guleryuz, Danhang Tang, Jonathan Taylor, Philip A. Chou

We propose differentiable approximations to key video codec components and demonstrate that, in addition to providing meaningful compression improvements over the standard codec, the neural codes of the sandwich lead to significantly better rate-distortion performance in two important scenarios. When transporting high-resolution video via low-resolution HEVC, the sandwich system obtains 6. 5 dB improvements over standard HEVC.

Motion Compensation Video Compression

Pixel-Aligned Non-parametric Hand Mesh Reconstruction

no code implementations17 Oct 2022 Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye

The decoder aggregate both local image features in pixels and geometric features in vertices.

PRIF: Primary Ray-based Implicit Function

no code implementations12 Aug 2022 Brandon Yushan Feng, yinda zhang, Danhang Tang, Ruofei Du, Amitabh Varshney

We introduce a new implicit shape representation called Primary Ray-based Implicit Function (PRIF).

Inverse Rendering Neural Rendering +1

Density-preserving Deep Point Cloud Compression

no code implementations CVPR 2022 Yun He, Xinlin Ren, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

To address this, we propose a novel deep point cloud compression method that preserves local density information.

OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas

no code implementations17 Feb 2022 David Li, yinda zhang, Christian Häne, Danhang Tang, Amitabh Varshney, Ruofei Du

Immersive maps such as Google Street View and Bing Streetside provide true-to-life views with a massive collection of panoramas.

Multiresolution Deep Implicit Functions for 3D Shape Representation

no code implementations ICCV 2021 Zhang Chen, yinda zhang, Kyle Genova, Sean Fanello, Sofien Bouaziz, Christian Haene, Ruofei Du, Cem Keskin, Thomas Funkhouser, Danhang Tang

To the best of our knowledge, MDIF is the first deep implicit function model that can at the same time (1) represent different levels of detail and allow progressive decoding; (2) support both encoder-decoder inference and decoder-only latent optimization, and fulfill multiple applications; (3) perform detailed decoder-only shape completion.

3D Reconstruction 3D Shape Representation

HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences

1 code implementation CVPR 2021 Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Rohit Pandey, Cem Keskin, Ruofei Du, Deqing Sun, Sofien Bouaziz, Sean Fanello, Ping Tan, yinda zhang

In this paper, we address the problem of building dense correspondences between human images under arbitrary camera viewpoints and body poses.

Real-time Background-aware 3D Textureless Object Pose Estimation

no code implementations22 Jul 2019 Mang Shao, Danhang Tang, Tae-Kyun Kim

In this work, we present a modified fuzzy decision forest for real-time 3D object pose estimation based on typical template representation.

Object Pose Estimation

Latent-Class Hough Forests for 6 DoF Object Pose Estimation

no code implementations3 Feb 2016 Rigas Kouskouridas, Alykhan Tejani, Andreas Doumanoglou, Danhang Tang, Tae-Kyun Kim

In this paper we present Latent-Class Hough Forests, a method for object detection and 6 DoF pose estimation in heavily cluttered and occluded scenarios.

object-detection Object Detection +2

Conditional Convolutional Neural Network for Modality-Aware Face Recognition

no code implementations ICCV 2015 Chao Xiong, Xiaowei Zhao, Danhang Tang, Karlekar Jayashree, Shuicheng Yan, Tae-Kyun Kim

Faces in the wild are usually captured with various poses, illuminations and occlusions, and thus inherently multimodally distributed in many tasks.

Face Identification Face Recognition +1

Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose

no code implementations ICCV 2015 Danhang Tang, Jonathan Taylor, Pushmeet Kohli, Cem Keskin, Tae-Kyun Kim, Jamie Shotton

In this paper, we show that we can significantly improving upon black box optimization by exploiting high-level knowledge of the structure of the parameters and using a local surrogate energy function.

Hand Pose Estimation Image Generation

Latent Regression Forest: Structured Estimation of 3D Articulated Hand Posture

no code implementations CVPR 2014 Danhang Tang, Hyung Jin Chang, Alykhan Tejani, Tae-Kyun Kim

In contrast to prior forest-based methods, which take dense pixels as input, classify them independently and then estimate joint positions afterwards; our method can be considered as a structured coarse-to-fine search, starting from the centre of mass of a point cloud until locating all the skeletal joints.

3D Hand Pose Estimation regression

Cannot find the paper you are looking for? You can Submit a new open access paper.