Search Results for author: Yanchao Yang

Found 30 papers, 12 papers with code

BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin

no code implementations • 12 Mar 2024 • Qihang Fang, Chengcheng Tang, Shugao Ma, Yanchao Yang

Skeleton-based motion representations are robust for action localization and understanding for their invariance to perspective, lighting, and occlusion, compared with images.

Temporal Action Localization Unsupervised Pre-training

Paper
Add Code

Towards Learning Geometric Eigen-Lengths Crucial for Fitting Tasks

no code implementations • 25 Dec 2023 • Yijia Weng, Kaichun Mo, Ruoxi Shi, Yanchao Yang, Leonidas J. Guibas

In this work, we therefore for the first time formulate and propose a novel learning problem on this question and set up a benchmark suite including tasks, data, and evaluation metrics for studying the problem.

Common Sense Reasoning

Paper
Add Code

Revisit Human-Scene Interaction via Space Occupancy

no code implementations • 5 Dec 2023 • Xinpeng Liu, Haowen Hou, Yanchao Yang, Yong-Lu Li, Cewu Lu

Human-scene Interaction (HSI) generation is a challenging task and crucial for various downstream tasks.

Paper
Add Code

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning

1 code implementation • 20 Sep 2023 • Tianbao Xie, Siheng Zhao, Chen Henry Wu, Yitao Liu, Qian Luo, Victor Zhong, Yanchao Yang, Tao Yu

Unlike inverse RL and recent work that uses LLMs to write sparse reward codes, Text2Reward produces interpretable, free-form dense reward codes that cover a wide range of tasks, utilize existing packages, and allow iterative refinement with human feedback.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Divided Attention: Unsupervised Multi-Object Discovery with Contextually Separated Slots

no code implementations • 4 Apr 2023 • Dong Lao, Zhengyang Hu, Francesco Locatello, Yanchao Yang, Stefano Soatto

We introduce a method to segment the visual field into independently moving regions, trained with no ground truth or supervision.

Motion Segmentation Multi-object discovery +2

Paper
Add Code

JacobiNeRF: NeRF Shaping with Mutual Information Gradients

1 code implementation • CVPR 2023 • Xiaomeng Xu, Yanchao Yang, Kaichun Mo, Boxiao Pan, Li Yi, Leonidas Guibas

We propose a method that trains a neural radiance field (NeRF) to encode not only the appearance of the scene but also semantic correlations between scene points, regions, or entities -- aiming to capture their mutual co-variation patterns.

Instance Segmentation Semantic Segmentation

Paper
Code

VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization

1 code implementation • CVPR 2023 • Bingfan Zhu, Yanchao Yang, Xulong Wang, Youyi Zheng, Leonidas Guibas

We propose VDN-NeRF, a method to train neural radiance fields (NeRFs) for better geometry under non-Lambertian surface and dynamic lighting conditions that cause significant variation in the radiance of a point when viewed from different angles.

Paper
Code

Structure-Aware Surface Reconstruction via Primitive Assembly

no code implementations • ICCV 2023 • Jingen Jiang, Mingyang Zhao, Shiqing Xin, Yanchao Yang, Hanxiao Wang, Xiaohong Jia, Dong-Ming Yan

We propose a novel and efficient method for reconstructing manifold surfaces from point clouds.

Surface Reconstruction

Paper
Add Code

COPILOT: Human-Environment Collision Prediction and Localization from Egocentric Videos

no code implementations • ICCV 2023 • Boxiao Pan, Bokui Shen, Davis Rempe, Despoina Paschalidou, Kaichun Mo, Yanchao Yang, Leonidas J. Guibas

In this work, we introduce the challenging problem of predicting collisions in diverse environments from multi-view egocentric videos captured from body-mounted cameras.

Collision Avoidance Synthetic Data Generation

Paper
Add Code

6D Camera Relocalization in Visually Ambiguous Extreme Environments

no code implementations • 13 Jul 2022 • Yang Zheng, Tolga Birdal, Fei Xia, Yanchao Yang, Yueqi Duan, Leonidas J. Guibas

To this end, we propose: (i) a hierarchical localization system, where we leverage temporal information and (ii) a novel environment-aware image enhancement method to boost the robustness and accuracy.

Camera Relocalization Image Enhancement

Paper
Add Code

SpOT: Spatiotemporal Modeling for 3D Object Tracking

no code implementations • 12 Jul 2022 • Colton Stearns, Davis Rempe, Jie Li, Rares Ambrus, Sergey Zakharov, Vitor Guizilini, Yanchao Yang, Leonidas J Guibas

In this work, we develop a holistic representation of traffic scenes that leverages both spatial and temporal information of the actors in the scene.

3D Multi-Object Tracking 3D Object Tracking +1

Paper
Add Code

GIMO: Gaze-Informed Human Motion Prediction in Context

1 code implementation • 20 Apr 2022 • Yang Zheng, Yanchao Yang, Kaichun Mo, Jiaman Li, Tao Yu, Yebin Liu, C. Karen Liu, Leonidas J. Guibas

We perform an extensive study of the benefits of leveraging the eye gaze for ego-centric human motion prediction with various state-of-the-art architectures.

Human motion prediction motion prediction

Paper
Code

ADeLA: Automatic Dense Labeling With Attention for Viewpoint Shift in Semantic Segmentation

no code implementations • CVPR 2022 • Hanxiang Ren, Yanchao Yang, He Wang, Bokui Shen, Qingnan Fan, Youyi Zheng, C. Karen Liu, Leonidas J. Guibas

We describe a method to deal with performance drop in semantic segmentation caused by viewpoint changes within multi-camera systems, where temporally paired images are readily available, but the annotations may only be abundant for a few typical views.

Hallucination Semantic Segmentation +1

Paper
Add Code

Domain Adaptation on Point Clouds via Geometry-Aware Implicits

1 code implementation • CVPR 2022 • Yuefan Shen, Yanchao Yang, Mi Yan, He Wang, Youyi Zheng, Leonidas Guibas

Here we propose a simple yet effective method for unsupervised domain adaptation on point clouds by employing a self-supervised task of learning geometry-aware implicits, which plays two critical roles in one shot.

Autonomous Driving Unsupervised Domain Adaptation

Paper
Code

Object Pursuit: Building a Space of Objects via Discriminative Weight Generation

no code implementations • ICLR 2022 • Chuanyu Pan, Yanchao Yang, Kaichun Mo, Yueqi Duan, Leonidas Guibas

We perform an extensive study of the key features of the proposed framework and analyze the characteristics of the learned representations.

Disentanglement Object

Paper
Add Code

IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor Scenes

no code implementations • ICLR 2022 • Qi Li, Kaichun Mo, Yanchao Yang, Hang Zhao, Leonidas Guibas

While most works focus on single-object or agent-object visual functionality and affordances, our work proposes to study a new kind of visual relationship that is also important to perceive and model -- inter-object functional relationships (e. g., a switch on the wall turns on or off the light, a remote control operates the TV).

Object

Paper
Add Code

ADeLA: Automatic Dense Labeling with Attention for Viewpoint Adaptation in Semantic Segmentation

1 code implementation • 29 Jul 2021 • Yanchao Yang, Hanxiang Ren, He Wang, Bokui Shen, Qingnan Fan, Youyi Zheng, C. Karen Liu, Leonidas Guibas

Furthermore, to resolve ambiguities in converting the semantic images to semantic labels, we treat the view transformation network as a functional representation of an unknown mapping implied by the color images and propose functional label hallucination to generate pseudo-labels in the target domain.

Hallucination Inductive Bias +2

Paper
Code

DCL: Differential Contrastive Learning for Geometry-Aware Depth Synthesis

2 code implementations • 27 Jul 2021 • Yuefan Shen, Yanchao Yang, Youyi Zheng, C. Karen Liu, Leonidas Guibas

We describe a method for unpaired realistic depth synthesis that learns diverse variations from the real-world depth scans and ensures geometric consistency between the synthetic and synthesized depth.

Contrastive Learning Image Generation

Paper
Code

Learning Semantic-Aware Dynamics for Video Prediction

no code implementations • CVPR 2021 • Xinzhu Bei, Yanchao Yang, Stefano Soatto

The appearance of the scene is warped from past frames using the predicted motion in co-visible regions; dis-occluded regions are synthesized with content-aware inpainting utilizing the predicted scene layout.

Optical Flow Estimation Video Prediction

Paper
Add Code

DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping

no code implementations • CVPR 2021 • Yanchao Yang, Brian Lai, Stefano Soatto

Then, it uses the segments to learn object models that can be used for detection in a static image.

Continual Learning Object +7

Paper
Add Code

FDA: Fourier Domain Adaptation for Semantic Segmentation

3 code implementations • CVPR 2020 • Yanchao Yang, Stefano Soatto

We describe a simple method for unsupervised domain adaptation, whereby the discrepancy between the source and target distributions is reduced by swapping the low-frequency spectrum of one with the other.

Ranked #3 on Domain Adaptation on Panoptic SYNTHIA-to-Mapillary

Segmentation Semantic Segmentation +1

13,420

Paper
Code

Learning to Manipulate Individual Objects in an Image

1 code implementation • CVPR 2020 • Yanchao Yang, Yutong Chen, Stefano Soatto

We describe a method to train a generative model with latent factors that are (approximately) independent and localized.

Disentanglement

Paper
Code

Phase Consistent Ecological Domain Adaptation

1 code implementation • CVPR 2020 • Yanchao Yang, Dong Lao, Ganesh Sundaramoorthi, Stefano Soatto

We introduce two criteria to regularize the optimization involved in learning a classifier in a domain where no annotated data are available, leveraging annotated data in a different domain, a problem known as unsupervised domain adaptation.

Segmentation Semantic Segmentation +1

Paper
Code

Dense Depth Posterior (DDP) from Single Image and Sparse Range

no code implementations • CVPR 2019 • Yanchao Yang, Alex Wong, Stefano Soatto

We present a deep learning system to infer the posterior distribution of a dense depth map associated with an image, by exploiting sparse range measurements, for instance from a lidar.

Ranked #5 on Depth Completion on VOID

Depth Completion

Paper
Add Code

Unsupervised Moving Object Detection via Contextual Information Separation

1 code implementation • CVPR 2019 • Yanchao Yang, Antonio Loquercio, Davide Scaramuzza, Stefano Soatto

We propose an adversarial contextual model for detecting moving objects in images.

Moving Object Detection Object +3

259

Paper
Code

Conditional Prior Networks for Optical Flow

1 code implementation • ECCV 2018 • Yanchao Yang, Stefano Soatto

On the other hand, fully supervised methods learn the regularity in the annotated data, without explicit regularization and with the risk of overfitting.

Optical Flow Estimation

Paper
Code

S2F: Slow-To-Fast Interpolator Flow

no code implementations • CVPR 2017 • Yanchao Yang, Stefano Soatto

We introduce a method to compute optical flow at multiple scales of motion, without resorting to multi- resolution or combinatorial methods.

Optical Flow Estimation

Paper
Add Code

Self-Occlusions and Disocclusions in Causal Video Object Segmentation

no code implementations • ICCV 2015 • Yanchao Yang, Ganesh Sundaramoorthi, Stefano Soatto

We propose a method to detect disocclusion in video sequences of three-dimensional scenes and to partition the disoccluded regions into objects, defined by coherent deformation corresponding to surfaces in the scene.

Object Semantic Segmentation +2

Paper
Add Code

Coarse-To-Fine Region Selection and Matching

no code implementations • CVPR 2015 • Yanchao Yang, Zhaojin Lu, Ganesh Sundaramoorthi

We present a new approach to wide baseline matching.

Paper
Add Code

Shape Tracking With Occlusions via Coarse-To-Fine Region-Based Sobolev Descent

no code implementations • 21 Aug 2012 • Yanchao Yang, Ganesh Sundaramoorthi

In cases of 3D object motion and viewpoint change, self-occlusions and dis-occlusions of the object are prominent, and current methods employing joint shape and appearance models are unable to adapt to new shape and appearance information, leading to inaccurate shape detection.

Object Object Tracking

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.