Search Results for author: Robby T. Tan

Found 57 papers, 23 papers with code

Object Tracking using Spatio-Temporal Networks for Future Prediction Location

no code implementations • ECCV 2020 • Yuan Liu, Ruoteng Li, Yu Cheng, Robby T. Tan, Xiubao Sui

To facilitate the future prediction ability, we follow three key observations: 1) object motion trajectory is affected significantly by camera motion; 2) the past trajectory of an object can act as a salient cue to estimate the object motion in the spatial domain; 3) previous frames contain the surroundings and appearance of the target object, which is useful for predicting the target object’s future locations.

Future prediction Object +1

Paper
Add Code

Nighttime Defogging Using High-Low Frequency Decomposition and Grayscale-Color Networks

no code implementations • ECCV 2020 • Wending Yan, Robby T. Tan, Dengxin Dai

Given an RGB foggy nighttime image, our grayscale module takes the grayscale version of the image as input, and decomposes it into high and low frequency layers.

Vocal Bursts Intensity Prediction

Paper
Add Code

CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection

1 code implementation • 28 Mar 2024 • Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli, Robby T. Tan

We propose Class-Aware Teacher (CAT) to address the class bias issue in the domain adaptation setting.

Domain Adaptation object-detection +1

Paper
Code

NightHaze: Nighttime Image Dehazing via Self-Prior Learning

no code implementations • 12 Mar 2024 • Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan

By increasing the noise values to approach as high as the pixel intensity values of the glow and light effect blended images, our augmentation becomes severe, resulting in stronger priors.

Image Dehazing Image Enhancement

Paper
Add Code

Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention

no code implementations • 15 Jan 2024 • Xin Yang, Wending Yan, Yuan Yuan, Michael Bi Mi, Robby T. Tan

They struggle to acquire new knowledge while also retaining previously learned knowledge. To address these problems, we propose a semantic segmentation method for multiple adverse weather conditions that incorporates adaptive knowledge acquisition, pseudolabel blending, and weather composition replay.

Multi-target Domain Adaptation Semantic Segmentation +1

Paper
Add Code

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

no code implementations • 29 Dec 2023 • Xin Zhang, Jinheng Xie, Yuan Yuan, Michael Bi Mi, Robby T. Tan

Further, to ensure the distinguishability among various regions, we introduce a region-level contrastive clustering loss to pull closer similar regions across images.

Object Object Discovery +2

Paper
Add Code

Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOI

no code implementations • 17 Dec 2023 • Qinqian Lei, Bo wang, Robby T. Tan

In our proposed method, we introduce novel label-uncertain query augmentation techniques to enhance the diversity of the query inputs, aiming to distinguish the positive HOI class from the negative ones.

Few-Shot Learning Human-Object Interaction Detection +1

Paper
Add Code

ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding

no code implementations • 21 Sep 2023 • Yu Cheng, Bo wang, Robby T. Tan

In 3D human shape and pose estimation from a monocular video, models trained with limited labeled data cannot generalize well to videos with occlusion, which is common in the wild videos.

Neural Rendering Novel View Synthesis +1

Paper
Add Code

Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

1 code implementation • ICCV 2023 • Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Xinchao Wang, Yanfeng Wang

To work with auxiliary tasks, we propose a novel auxiliary-adapted transformer, which can handle incomplete, corrupted motion data and achieve coordinate recovery via capturing spatial-temporal dependencies.

Human motion prediction motion prediction

Paper
Code

Enhancing Visibility in Nighttime Haze Images Using Guided APSF and Gradient Adaptive Convolution

1 code implementation • 3 Aug 2023 • Yeying Jin, Beibei Lin, Wending Yan, Yuan Yuan, Wei Ye, Robby T. Tan

In this paper, we enhance the visibility from a single nighttime haze image by suppressing glow and enhancing low-light regions.

131

Paper
Code

Dynamic Transformers Provide a False Sense of Efficiency

1 code implementation • 20 May 2023 • Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li

Despite much success in natural language processing (NLP), pre-trained language models typically lead to a high computational cost during inference.

Adversarial Attack

Paper
Code

DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment

1 code implementation • CVPR 2023 • Heyuan Li, Bo wang, Yu Cheng, Mohan Kankanhalli, Robby T. Tan

Thanks to the proposed fusion module, our method is robust not only to occlusion and large pitch and roll view angles, which is the benefit of our image space approach, but also to noise and large yaw angles, which is the benefit of our model space method.

Ranked #1 on 3D Face Reconstruction on AFLW2000-3D (Mean NME metric)

3D Face Reconstruction Face Alignment +1

Paper
Code

Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert

1 code implementation • CVPR 2023 • Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li

To address the problem, we propose using a lip-reading expert to improve the intelligibility of the generated lip regions by penalizing the incorrect generation results.

Contrastive Learning Lip Reading +1

349

Paper
Code

2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection

1 code implementation • CVPR 2023 • Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli, Robby T. Tan

Object detection at night is a challenging problem due to the absence of night image annotations.

object-detection Object Detection +1

Paper
Code

EqMotion: Equivariant Multi-agent Motion Prediction with Invariant Interaction Reasoning

1 code implementation • CVPR 2023 • Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang

In motion prediction tasks, maintaining motion equivariance under Euclidean geometric transformations and invariance of agent interaction is a critical and fundamental principle.

Ranked #1 on Human Pose Forecasting on Human3.6M

Human Pose Forecasting motion prediction +2

100

Paper
Code

Deep Homography Mixture for Single Image Rolling Shutter Correction

1 code implementation • ICCV 2023 • Weilong Yan, Robby T. Tan, Bing Zeng, Shuaicheng Liu

In this work, we adopt a more straightforward method to learn deep homography mixture motion between an RS image and its corresponding GS image, without large solution space or strict restrictions on image features.

Rolling Shutter Correction

Paper
Code

Estimating Reflectance Layer from A Single Image: Integrating Reflectance Guidance and Shadow/Specular Aware Learning

1 code implementation • 27 Nov 2022 • Yeying Jin, Ruoteng Li, Wenhan Yang, Robby T. Tan

To further enforce the reflectance layer to be independent of shadows and specularities in the second-stage refinement, we introduce an S-Aware network that distinguishes the reflectance image from the input image.

highlight removal Intrinsic Image Decomposition +1

Paper
Code

Object Detection in Foggy Scenes by Embedding Depth and Reconstruction into Domain Adaptation

1 code implementation • 24 Nov 2022 • Xin Yang, Michael Bi Mi, Yuan Yuan, Xin Wang, Robby T. Tan

In our DA framework, we retain the depth and background information during the domain feature alignment.

Domain Adaptation Object +2

Paper
Code

MIMT: Multi-Illuminant Color Constancy via Multi-Task Local Surface and Light Color Learning

no code implementations • 16 Nov 2022 • Shuwei Li, Jikai Wang, Michael S. Brown, Robby T. Tan

To have better cues of the local surface/light colors under multiple light color conditions, we design a novel multi-task learning framework.

Color Constancy Edge Detection +1

Paper
Add Code

Uncertainty-aware Gait Recognition via Learning from Dirichlet Distribution-based Evidence

no code implementations • 15 Nov 2022 • Beibei Lin, Chen Liu, Ming Wang, Lincheng Li, Shunli Zhang, Robby T. Tan, Xin Yu

Existing gait recognition frameworks retrieve an identity in the gallery based on the distance between a probe sample and the identities in the gallery.

Gait Recognition Retrieval

Paper
Add Code

DeS3: Adaptive Attention-driven Self and Soft Shadow Removal using ViT Similarity

1 code implementation • 15 Nov 2022 • Yeying Jin, Wei Ye, Wenhan Yang, Yuan Yuan, Robby T. Tan

Most existing methods rely on binary shadow masks, without considering the ambiguous boundaries of soft and self shadows.

Image Shadow Removal Shadow Removal

Paper
Code

Structure Representation Network and Uncertainty Feedback Learning for Dense Non-Uniform Fog Removal

1 code implementation • 6 Oct 2022 • Yeying Jin, Wending Yan, Wenhan Yang, Robby T. Tan

Few existing image defogging or dehazing methods consider dense and non-uniform particle distributions, which usually happen in smoke, dust and fog.

Ranked #1 on Image Dehazing on O-Haze

Image Dehazing Image Enhancement +3

139

Paper
Code

Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale Persons

no code implementations • 25 Aug 2022 • Yu Cheng, Yihao Ai, Bo wang, Xinchao Wang, Robby T. Tan

In multi-person 2D pose estimation, the bottom-up methods simultaneously predict poses for all persons, and unlike the top-down methods, do not rely on human detection.

2D Pose Estimation Human Detection +1

Paper
Add Code

DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network

1 code implementation • ICCV 2021 • Yeying Jin, Aashish Sharma, Robby T. Tan

To address the problem, in this paper, we propose an unsupervised domain-classifier guided shadow removal network, DC-ShadowNet.

Ranked #2 on Shadow Removal on SRD

Image Enhancement Image Reconstruction +3

190

Paper
Code

Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression

1 code implementation • 21 Jul 2022 • Yeying Jin, Wenhan Yang, Robby T. Tan

To address this problem, we need to suppress the light effects in bright regions while, at the same time, boosting the intensity of dark regions.

Ranked #23 on Low-Light Image Enhancement on LOL

Hallucination Image Restoration +1

362

Paper
Code

Feature-Aligned Video Raindrop Removal with Temporal Constraints

no code implementations • 29 May 2022 • Wending Yan, Lu Xu, Wenhan Yang, Robby T. Tan

Our single image module employs a raindrop removal network to generate initial raindrop removal results, and create a mask representing the differences between the input and initial output.

Optical Flow Estimation Rain Removal

Paper
Add Code

Dual networks based 3D Multi-Person Pose Estimation from Monocular Video

1 code implementation • 2 May 2022 • Yu Cheng, Bo wang, Robby T. Tan

Most of the methods focus on single persons, which estimate the poses in the person-centric coordinates, i. e., the coordinates based on the center of the target person.

Ranked #1 on 3D Human Pose Estimation on JTA

3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +4

154

Paper
Code

Nighttime Visibility Enhancement by Increasing the Dynamic Range and Suppression of Light Effects

no code implementations • CVPR 2021 • Aashish Sharma, Robby T. Tan

In this paper, given a single nighttime image as input, our goal is to enhance its visibility by increasing the dynamic range of the intensity, and thus can boost the intensity of the low light regions, and at the same time, suppress the light effects (glow, glare) simultaneously.

Paper
Add Code

Self-Aligned Video Deraining With Transmission-Depth Consistency

1 code implementation • CVPR 2021 • Wending Yan, Robby T. Tan, Wenhan Yang, Dengxin Dai

In this paper, we address the problems of rain streaks and rain accumulation removal in video, by developing a self-aligned network with transmission-depth consistency.

Optical Flow Estimation Rain Removal

Paper
Code

Human Object Interaction Detection using Two-Direction Spatial Enhancement and Exclusive Object Prior

no code implementations • 7 May 2021 • Lu Liu, Robby T. Tan

At inference, we propose a human-object regrouping approach by considering the object-exclusive property of an action, where the target object should not be shared by more than one human.

Human-Object Interaction Detection Object

Paper
Add Code

Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks

1 code implementation • CVPR 2021 • Yu Cheng, Bo wang, Bo Yang, Robby T. Tan

Besides the integration of top-down and bottom-up networks, unlike existing pose discriminators that are designed solely for single person, and consequently cannot assess natural inter-person interactions, we propose a two-person pose discriminator that enforces natural two-person interactions.

Ranked #1 on 3D Multi-Person Pose Estimation (root-relative) on MuPoTS-3D

3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +2

154

Paper
Code

Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos

1 code implementation • 22 Dec 2020 • Yu Cheng, Bo wang, Bo Yang, Robby T. Tan

To tackle this problem, we propose a novel framework integrating graph convolutional networks (GCNs) and temporal convolutional networks (TCNs) to robustly estimate camera-centric multi-person 3D poses that do not require camera parameters.

Ranked #1 on Root Joint Localization on Human3.6M

3D Absolute Human Pose Estimation 3D Multi-Person Pose Estimation (absolute) +5

Paper
Code

Object Tracking Using Spatio-Temporal Future Prediction

no code implementations • 15 Oct 2020 • YuAn Liu, Ruoteng Li, Robby T. Tan, Yu Cheng, Xiubao Sui

Our trajectory prediction module predicts the target object's locations in the current and future frames based on the object's past trajectory.

Future prediction Object +2

Paper
Add Code

Multi-Scale Networks for 3D Human Pose Estimation with Inference Stage Optimization

no code implementations • 13 Oct 2020 • Cheng Yu, Bo wang, Bo Yang, Robby T. Tan

Addressing these problems, we introduce a spatio-temporal network for robust 3D human pose estimation.

2D Pose Estimation Pose Prediction +1

Paper
Add Code

Estimation of Camera Response Function using Prediction Consistency and Gradual Refinement with an Extension to Deep Learning

no code implementations • 8 Oct 2020 • Aashish Sharma, Robby T. Tan, Loong-Fah Cheong

Second, we employ a gradual refinement scheme in which we start from a simple CRF model to generate a result which is more robust to noise but less accurate, and then we gradually increase the model's complexity to improve the result.

Paper
Add Code

3D Human Pose Estimation using Spatio-Temporal Networks with Explicit Occlusion Training

no code implementations • AAAI Conference on Artificial Intelligence, AAAI 2020 2020 • Yu Cheng, Bo Yang, Bo wang, Robby T. Tan

Estimating 3D poses from a monocular video is still a challenging task, despite the significant progress that has been made in recent years.

Ranked #3 on 3D Human Pose Estimation on HumanEva-I

Monocular 3D Human Pose Estimation valid

Paper
Add Code

Optical Flow in Dense Foggy Scenes using Semi-Supervised Learning

no code implementations • CVPR 2020 • Wending Yan, Aashish Sharma, Robby T. Tan

Initially, given a pair of synthetic fog images, its corresponding clean images and optical flow ground-truths, in one training batch we train our network in a supervised manner.

Optical Flow Estimation

Paper
Add Code

Single Image Deraining: From Model-Based to Data-Driven and Beyond

no code implementations • 16 Dec 2019 • Wenhan Yang, Robby T. Tan, Shiqi Wang, Yuming Fang, Jiaying Liu

The goal of single-image deraining is to restore the rain-free background scenes of an image degraded by rain streaks and rain accumulation.

Single Image Deraining

Paper
Add Code

Nighttime Stereo Depth Estimation using Joint Translation-Stereo Learning: Light Effects and Uninformative Regions

2 code implementations • 30 Sep 2019 • Aashish Sharma, Lionel Heng, Loong-Fah Cheong, Robby T. Tan

To address the problem, we introduce a network joining day/night translation and stereo.

Stereo Depth Estimation Translation

Paper
Code

Hyperspectral City V1.0 Dataset and Benchmark

no code implementations • 24 Jul 2019 • Shaodi You, Erqi Huang, Shuaizhe Liang, Yongrong Zheng, Yunxiang Li, Fan Wang, Sen Lin, Qiu Shen, Xun Cao, Diming Zhang, Yuanjiang Li, Yu Li, Ying Fu, Boxin Shi, Feng Lu, Yinqiang Zheng, Robby T. Tan

This document introduces the background and the usage of the Hyperspectral City Dataset and the benchmark.

Paper
Add Code

GraphX$^{NET}-$ Chest X-Ray Classification Under Extreme Minimal Supervision

no code implementations • 23 Jul 2019 • Angelica I. Aviles-Rivero, Nicolas Papadakis, Ruoteng Li, Philip Sellars, Qingnan Fan, Robby T. Tan, Carola-Bibiane Schönlieb

The task of classifying X-ray data is a problem of both theoretical and clinical interest.

Ranked #4 on Semi-supervised Medical Image Classification on Chest X-Ray14 2% labeled

Classification General Classification +2

Paper
Add Code

Heavy Rain Image Restoration: Integrating Physics Model and Conditional Adversarial Learning

1 code implementation • 10 Apr 2019 • Ruotent Li, Loong Fah Cheong, Robby T. Tan

This filtering is guided by a rain-free residue image --- its content is used to set the passbands for the two channels in a spatially-variant manner so that the background details do not get mixed up with the rain-streaks.

Image Restoration Rain Removal

149

Paper
Code

Certainty Driven Consistency Loss on Multi-Teacher Networks for Semi-Supervised Learning

no code implementations • 17 Jan 2019 • Lu Liu, Robby T. Tan

Specifically, we propose two approaches, i. e. Filtering CCL and Temperature CCL to either filter out uncertain predictions or pay less attention on them in the consistency regularization.

Paper
Add Code

Loss Guided Activation for Action Recognition in Still Images

no code implementations • 11 Dec 2018 • Lu Liu, Robby T. Tan, ShaoDi You

This requirement of bounding boxes as part of the input is needed to enable the methods to ignore irrelevant contexts and extract only human features.

Action Recognition In Still Images

Paper
Add Code

Robust Optical Flow in Rainy Scenes

no code implementations • ECCV 2018 • Ruoteng Li, Robby T. Tan, Loong-Fah Cheong

Optical flow estimation in rainy scenes is challenging due to degradation caused by rain streaks and rain accumulation, where the latter refers to the poor visibility of remote scenes due to intense rainfall.

Optical Flow Estimation

Paper
Add Code

Single Image Deraining using Scale-Aware Multi-Stage Recurrent Network

no code implementations • 19 Dec 2017 • Ruoteng Li, Loong-Fah Cheong, Robby T. Tan

Given a single input rainy image, our goal is to visually remove rain streaks and the veiling effect caused by scattering and transmission of rain streaks and rain droplets.

Single Image Deraining

Paper
Add Code

Attentive Generative Adversarial Network for Raindrop Removal from a Single Image

3 code implementations • CVPR 2018 • Rui Qian, Robby T. Tan, Wenhan Yang, Jiajun Su, Jiaying Liu

This injection of visual attention to both generative and discriminative networks is the main contribution of this paper.

Generative Adversarial Network Rain Removal

487

Paper
Code

Robust Optical Flow Estimation in Rainy Scenes

no code implementations • 18 Apr 2017 • Ruoteng Li, Robby T. Tan, Loong-Fah Cheong

To handle rain accumulation, our method decomposes the image into a piecewise-smooth background layer and a high-frequency detail layer.

Optical Flow Estimation

Paper
Add Code

Deep Joint Rain Detection and Removal from a Single Image

2 code implementations • CVPR 2017 • Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan

Based on the first model, we develop a multi-task deep learning architecture that learns the binary rain streak map, the appearance of rain streaks, and the clean background, which is our ultimate output.

Rain Removal

Paper
Code

Haze Visibility Enhancement: A Survey and Quantitative Benchmarking

no code implementations • 21 Jul 2016 • Yu Li, ShaoDi You, Michael S. Brown, Robby T. Tan

This paper provides a comprehensive survey of methods dealing with visibility enhancement of images taken in hazy or foggy scenes.

Benchmarking

Paper
Add Code

Rain Streak Removal Using Layer Priors

no code implementations • CVPR 2016 • Yu Li, Robby T. Tan, Xiaojie Guo, Jiangbo Lu, Michael S. Brown

This paper addresses the problem of rain streak removal from a single image.

Dictionary Learning

Paper
Add Code

Robust Optical Flow Estimation of Double-Layer Images under Transparency or Reflection

no code implementations • CVPR 2016 • Jiaolong Yang, Hongdong Li, Yuchao Dai, Robby T. Tan

This paper deals with a challenging, frequently encountered, yet not properly investigated problem in two-frame optical flow estimation.

Optical Flow Estimation valid

Paper
Add Code

Waterdrop Stereo

no code implementations • 4 Apr 2016 • Shaodi You, Robby T. Tan, Rei Kawakami, Yasuhiro Mukaigawa, Katsushi Ikeuchi

(2) The imagery inside a water-drop is determined by the water-drop 3D shape and total reflection at the boundary.

Depth Estimation

Paper
Add Code

Nighttime Haze Removal With Glow and Multiple Light Colors

no code implementations • ICCV 2015 • Yu Li, Robby T. Tan, Michael S. Brown

We demonstrate the effectiveness of our nighttime haze model and correction method on a number of examples and compare our results with existing daytime and nighttime dehazing methods' results.

Paper
Add Code

Water Detection through Spatio-Temporal Invariant Descriptors

no code implementations • 2 Nov 2015 • Pascal Mettes, Robby T. Tan, Remco C. Veltkamp

Experimental evaluation on the Video Water Database and the DynTex database indicates the effectiveness of the proposed algorithm, outperforming multiple algorithms for dynamic texture recognition and material recognition by ca.

Dynamic Texture Recognition Material Recognition

Paper
Add Code

Simultaneous Video Defogging and Stereo Reconstruction

no code implementations • CVPR 2015 • Zhuwen Li, Ping Tan, Robby T. Tan, Danping Zou, Steven Zhiying Zhou, Loong-Fah Cheong

We present a method to jointly estimate scene depth and recover the clear latent image from a foggy video sequence.

Image Matting Stereo Matching +1

Paper
Add Code

Adherent Raindrop Detection and Removal in Video

no code implementations • CVPR 2013 • Shaodi You, Robby T. Tan, Rei Kawakami, Katsushi Ikeuchi

First, it detects raindrops based on the motion and the intensity temporal derivatives of the input video.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.