Search Results for author: Robby T. Tan

Found 57 papers, 23 papers with code

Object Tracking using Spatio-Temporal Networks for Future Prediction Location

no code implementations ECCV 2020 Yuan Liu, Ruoteng Li, Yu Cheng, Robby T. Tan, Xiubao Sui

To facilitate the future prediction ability, we follow three key observations: 1) object motion trajectory is affected significantly by camera motion; 2) the past trajectory of an object can act as a salient cue to estimate the object motion in the spatial domain; 3) previous frames contain the surroundings and appearance of the target object, which is useful for predicting the target object’s future locations.

Future prediction Object +1

Nighttime Defogging Using High-Low Frequency Decomposition and Grayscale-Color Networks

no code implementations ECCV 2020 Wending Yan, Robby T. Tan, Dengxin Dai

Given an RGB foggy nighttime image, our grayscale module takes the grayscale version of the image as input, and decomposes it into high and low frequency layers.

Vocal Bursts Intensity Prediction

NightHaze: Nighttime Image Dehazing via Self-Prior Learning

no code implementations12 Mar 2024 Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan

By increasing the noise values to approach as high as the pixel intensity values of the glow and light effect blended images, our augmentation becomes severe, resulting in stronger priors.

Image Dehazing Image Enhancement

Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention

no code implementations15 Jan 2024 Xin Yang, Wending Yan, Yuan Yuan, Michael Bi Mi, Robby T. Tan

They struggle to acquire new knowledge while also retaining previously learned knowledge. To address these problems, we propose a semantic segmentation method for multiple adverse weather conditions that incorporates adaptive knowledge acquisition, pseudolabel blending, and weather composition replay.

Multi-target Domain Adaptation Semantic Segmentation +1

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

no code implementations29 Dec 2023 Xin Zhang, Jinheng Xie, Yuan Yuan, Michael Bi Mi, Robby T. Tan

Further, to ensure the distinguishability among various regions, we introduce a region-level contrastive clustering loss to pull closer similar regions across images.

Object Object Discovery +2

Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOI

no code implementations17 Dec 2023 Qinqian Lei, Bo wang, Robby T. Tan

In our proposed method, we introduce novel label-uncertain query augmentation techniques to enhance the diversity of the query inputs, aiming to distinguish the positive HOI class from the negative ones.

Few-Shot Learning Human-Object Interaction Detection +1

ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding

no code implementations21 Sep 2023 Yu Cheng, Bo wang, Robby T. Tan

In 3D human shape and pose estimation from a monocular video, models trained with limited labeled data cannot generalize well to videos with occlusion, which is common in the wild videos.

Neural Rendering Novel View Synthesis +1

Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

1 code implementation ICCV 2023 Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Xinchao Wang, Yanfeng Wang

To work with auxiliary tasks, we propose a novel auxiliary-adapted transformer, which can handle incomplete, corrupted motion data and achieve coordinate recovery via capturing spatial-temporal dependencies.

Human motion prediction motion prediction

Enhancing Visibility in Nighttime Haze Images Using Guided APSF and Gradient Adaptive Convolution

1 code implementation3 Aug 2023 Yeying Jin, Beibei Lin, Wending Yan, Yuan Yuan, Wei Ye, Robby T. Tan

In this paper, we enhance the visibility from a single nighttime haze image by suppressing glow and enhancing low-light regions.

Dynamic Transformers Provide a False Sense of Efficiency

1 code implementation20 May 2023 Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li

Despite much success in natural language processing (NLP), pre-trained language models typically lead to a high computational cost during inference.

Adversarial Attack

DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment

1 code implementation CVPR 2023 Heyuan Li, Bo wang, Yu Cheng, Mohan Kankanhalli, Robby T. Tan

Thanks to the proposed fusion module, our method is robust not only to occlusion and large pitch and roll view angles, which is the benefit of our image space approach, but also to noise and large yaw angles, which is the benefit of our model space method.

 Ranked #1 on 3D Face Reconstruction on AFLW2000-3D (Mean NME metric)

3D Face Reconstruction Face Alignment +1

Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert

1 code implementation CVPR 2023 Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li

To address the problem, we propose using a lip-reading expert to improve the intelligibility of the generated lip regions by penalizing the incorrect generation results.

Contrastive Learning Lip Reading +1

EqMotion: Equivariant Multi-agent Motion Prediction with Invariant Interaction Reasoning

1 code implementation CVPR 2023 Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang

In motion prediction tasks, maintaining motion equivariance under Euclidean geometric transformations and invariance of agent interaction is a critical and fundamental principle.

Human Pose Forecasting motion prediction +2

Deep Homography Mixture for Single Image Rolling Shutter Correction

1 code implementation ICCV 2023 Weilong Yan, Robby T. Tan, Bing Zeng, Shuaicheng Liu

In this work, we adopt a more straightforward method to learn deep homography mixture motion between an RS image and its corresponding GS image, without large solution space or strict restrictions on image features.

Rolling Shutter Correction

Estimating Reflectance Layer from A Single Image: Integrating Reflectance Guidance and Shadow/Specular Aware Learning

1 code implementation27 Nov 2022 Yeying Jin, Ruoteng Li, Wenhan Yang, Robby T. Tan

To further enforce the reflectance layer to be independent of shadows and specularities in the second-stage refinement, we introduce an S-Aware network that distinguishes the reflectance image from the input image.

highlight removal Intrinsic Image Decomposition +1

MIMT: Multi-Illuminant Color Constancy via Multi-Task Local Surface and Light Color Learning

no code implementations16 Nov 2022 Shuwei Li, Jikai Wang, Michael S. Brown, Robby T. Tan

To have better cues of the local surface/light colors under multiple light color conditions, we design a novel multi-task learning framework.

Color Constancy Edge Detection +1

Uncertainty-aware Gait Recognition via Learning from Dirichlet Distribution-based Evidence

no code implementations15 Nov 2022 Beibei Lin, Chen Liu, Ming Wang, Lincheng Li, Shunli Zhang, Robby T. Tan, Xin Yu

Existing gait recognition frameworks retrieve an identity in the gallery based on the distance between a probe sample and the identities in the gallery.

Gait Recognition Retrieval

DeS3: Adaptive Attention-driven Self and Soft Shadow Removal using ViT Similarity

1 code implementation15 Nov 2022 Yeying Jin, Wei Ye, Wenhan Yang, Yuan Yuan, Robby T. Tan

Most existing methods rely on binary shadow masks, without considering the ambiguous boundaries of soft and self shadows.

Image Shadow Removal Shadow Removal

Structure Representation Network and Uncertainty Feedback Learning for Dense Non-Uniform Fog Removal

1 code implementation6 Oct 2022 Yeying Jin, Wending Yan, Wenhan Yang, Robby T. Tan

Few existing image defogging or dehazing methods consider dense and non-uniform particle distributions, which usually happen in smoke, dust and fog.

Image Dehazing Image Enhancement +3

Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale Persons

no code implementations25 Aug 2022 Yu Cheng, Yihao Ai, Bo wang, Xinchao Wang, Robby T. Tan

In multi-person 2D pose estimation, the bottom-up methods simultaneously predict poses for all persons, and unlike the top-down methods, do not rely on human detection.

2D Pose Estimation Human Detection +1

Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression

1 code implementation21 Jul 2022 Yeying Jin, Wenhan Yang, Robby T. Tan

To address this problem, we need to suppress the light effects in bright regions while, at the same time, boosting the intensity of dark regions.

Hallucination Image Restoration +1

Feature-Aligned Video Raindrop Removal with Temporal Constraints

no code implementations29 May 2022 Wending Yan, Lu Xu, Wenhan Yang, Robby T. Tan

Our single image module employs a raindrop removal network to generate initial raindrop removal results, and create a mask representing the differences between the input and initial output.

Optical Flow Estimation Rain Removal

Dual networks based 3D Multi-Person Pose Estimation from Monocular Video

1 code implementation2 May 2022 Yu Cheng, Bo wang, Robby T. Tan

Most of the methods focus on single persons, which estimate the poses in the person-centric coordinates, i. e., the coordinates based on the center of the target person.

3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +4

Nighttime Visibility Enhancement by Increasing the Dynamic Range and Suppression of Light Effects

no code implementations CVPR 2021 Aashish Sharma, Robby T. Tan

In this paper, given a single nighttime image as input, our goal is to enhance its visibility by increasing the dynamic range of the intensity, and thus can boost the intensity of the low light regions, and at the same time, suppress the light effects (glow, glare) simultaneously.

Self-Aligned Video Deraining With Transmission-Depth Consistency

1 code implementation CVPR 2021 Wending Yan, Robby T. Tan, Wenhan Yang, Dengxin Dai

In this paper, we address the problems of rain streaks and rain accumulation removal in video, by developing a self-aligned network with transmission-depth consistency.

Optical Flow Estimation Rain Removal

Human Object Interaction Detection using Two-Direction Spatial Enhancement and Exclusive Object Prior

no code implementations7 May 2021 Lu Liu, Robby T. Tan

At inference, we propose a human-object regrouping approach by considering the object-exclusive property of an action, where the target object should not be shared by more than one human.

Human-Object Interaction Detection Object

Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks

1 code implementation CVPR 2021 Yu Cheng, Bo wang, Bo Yang, Robby T. Tan

Besides the integration of top-down and bottom-up networks, unlike existing pose discriminators that are designed solely for single person, and consequently cannot assess natural inter-person interactions, we propose a two-person pose discriminator that enforces natural two-person interactions.

3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +2

Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos

1 code implementation22 Dec 2020 Yu Cheng, Bo wang, Bo Yang, Robby T. Tan

To tackle this problem, we propose a novel framework integrating graph convolutional networks (GCNs) and temporal convolutional networks (TCNs) to robustly estimate camera-centric multi-person 3D poses that do not require camera parameters.

3D Absolute Human Pose Estimation 3D Multi-Person Pose Estimation (absolute) +5

Object Tracking Using Spatio-Temporal Future Prediction

no code implementations15 Oct 2020 YuAn Liu, Ruoteng Li, Robby T. Tan, Yu Cheng, Xiubao Sui

Our trajectory prediction module predicts the target object's locations in the current and future frames based on the object's past trajectory.

Future prediction Object +2

Estimation of Camera Response Function using Prediction Consistency and Gradual Refinement with an Extension to Deep Learning

no code implementations8 Oct 2020 Aashish Sharma, Robby T. Tan, Loong-Fah Cheong

Second, we employ a gradual refinement scheme in which we start from a simple CRF model to generate a result which is more robust to noise but less accurate, and then we gradually increase the model's complexity to improve the result.

Optical Flow in Dense Foggy Scenes using Semi-Supervised Learning

no code implementations CVPR 2020 Wending Yan, Aashish Sharma, Robby T. Tan

Initially, given a pair of synthetic fog images, its corresponding clean images and optical flow ground-truths, in one training batch we train our network in a supervised manner.

Optical Flow Estimation

Single Image Deraining: From Model-Based to Data-Driven and Beyond

no code implementations16 Dec 2019 Wenhan Yang, Robby T. Tan, Shiqi Wang, Yuming Fang, Jiaying Liu

The goal of single-image deraining is to restore the rain-free background scenes of an image degraded by rain streaks and rain accumulation.

Single Image Deraining

Hyperspectral City V1.0 Dataset and Benchmark

no code implementations24 Jul 2019 Shaodi You, Erqi Huang, Shuaizhe Liang, Yongrong Zheng, Yunxiang Li, Fan Wang, Sen Lin, Qiu Shen, Xun Cao, Diming Zhang, Yuanjiang Li, Yu Li, Ying Fu, Boxin Shi, Feng Lu, Yinqiang Zheng, Robby T. Tan

This document introduces the background and the usage of the Hyperspectral City Dataset and the benchmark.

Heavy Rain Image Restoration: Integrating Physics Model and Conditional Adversarial Learning

1 code implementation10 Apr 2019 Ruotent Li, Loong Fah Cheong, Robby T. Tan

This filtering is guided by a rain-free residue image --- its content is used to set the passbands for the two channels in a spatially-variant manner so that the background details do not get mixed up with the rain-streaks.

Image Restoration Rain Removal

Certainty Driven Consistency Loss on Multi-Teacher Networks for Semi-Supervised Learning

no code implementations17 Jan 2019 Lu Liu, Robby T. Tan

Specifically, we propose two approaches, i. e. Filtering CCL and Temperature CCL to either filter out uncertain predictions or pay less attention on them in the consistency regularization.

Loss Guided Activation for Action Recognition in Still Images

no code implementations11 Dec 2018 Lu Liu, Robby T. Tan, ShaoDi You

This requirement of bounding boxes as part of the input is needed to enable the methods to ignore irrelevant contexts and extract only human features.

Action Recognition In Still Images

Robust Optical Flow in Rainy Scenes

no code implementations ECCV 2018 Ruoteng Li, Robby T. Tan, Loong-Fah Cheong

Optical flow estimation in rainy scenes is challenging due to degradation caused by rain streaks and rain accumulation, where the latter refers to the poor visibility of remote scenes due to intense rainfall.

Optical Flow Estimation

Single Image Deraining using Scale-Aware Multi-Stage Recurrent Network

no code implementations19 Dec 2017 Ruoteng Li, Loong-Fah Cheong, Robby T. Tan

Given a single input rainy image, our goal is to visually remove rain streaks and the veiling effect caused by scattering and transmission of rain streaks and rain droplets.

Single Image Deraining

Robust Optical Flow Estimation in Rainy Scenes

no code implementations18 Apr 2017 Ruoteng Li, Robby T. Tan, Loong-Fah Cheong

To handle rain accumulation, our method decomposes the image into a piecewise-smooth background layer and a high-frequency detail layer.

Optical Flow Estimation

Deep Joint Rain Detection and Removal from a Single Image

2 code implementations CVPR 2017 Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan

Based on the first model, we develop a multi-task deep learning architecture that learns the binary rain streak map, the appearance of rain streaks, and the clean background, which is our ultimate output.

Rain Removal

Haze Visibility Enhancement: A Survey and Quantitative Benchmarking

no code implementations21 Jul 2016 Yu Li, ShaoDi You, Michael S. Brown, Robby T. Tan

This paper provides a comprehensive survey of methods dealing with visibility enhancement of images taken in hazy or foggy scenes.

Benchmarking

Robust Optical Flow Estimation of Double-Layer Images under Transparency or Reflection

no code implementations CVPR 2016 Jiaolong Yang, Hongdong Li, Yuchao Dai, Robby T. Tan

This paper deals with a challenging, frequently encountered, yet not properly investigated problem in two-frame optical flow estimation.

Optical Flow Estimation valid

Waterdrop Stereo

no code implementations4 Apr 2016 Shaodi You, Robby T. Tan, Rei Kawakami, Yasuhiro Mukaigawa, Katsushi Ikeuchi

(2) The imagery inside a water-drop is determined by the water-drop 3D shape and total reflection at the boundary.

Depth Estimation

Nighttime Haze Removal With Glow and Multiple Light Colors

no code implementations ICCV 2015 Yu Li, Robby T. Tan, Michael S. Brown

We demonstrate the effectiveness of our nighttime haze model and correction method on a number of examples and compare our results with existing daytime and nighttime dehazing methods' results.

Water Detection through Spatio-Temporal Invariant Descriptors

no code implementations2 Nov 2015 Pascal Mettes, Robby T. Tan, Remco C. Veltkamp

Experimental evaluation on the Video Water Database and the DynTex database indicates the effectiveness of the proposed algorithm, outperforming multiple algorithms for dynamic texture recognition and material recognition by ca.

Dynamic Texture Recognition Material Recognition

Adherent Raindrop Detection and Removal in Video

no code implementations CVPR 2013 Shaodi You, Robby T. Tan, Rei Kawakami, Katsushi Ikeuchi

First, it detects raindrops based on the motion and the intensity temporal derivatives of the input video.

Cannot find the paper you are looking for? You can Submit a new open access paper.