Search Results for author: Rui Huang

Found 69 papers, 37 papers with code

Learning Memory Augmented Cascading Network for Compressed Sensing of Images

1 code implementation • ECCV 2020 • Jiwei Chen, Yubao Sun, Qingshan Liu, Rui Huang

The IDR module is designed to reconstruct the remaining details from the residual measurement vector, and MRU is employed to update the residual measurement vector and feed it into the next IDR module.

Paper
Code

HybriMap: Hybrid Clues Utilization for Effective Vectorized HD Map Construction

no code implementations • 17 Apr 2024 • Chi Zhang, Qi Song, Feifei Li, Yongquan Chen, Rui Huang

Constructing vectorized high-definition maps from surround-view cameras has garnered significant attention in recent years.

Paper
Add Code

Towards Balanced RGB-TSDF Fusion for Consistent Semantic Scene Completion by 3D RGB Feature Completion and a Classwise Entropy Loss Function

no code implementations • 25 Mar 2024 • Laiyan Ding, Panwen Hu, Jie Li, Rui Huang

To address this RGB-TSDF distribution difference, we propose a two-stage network with a 3D RGB feature completion module that completes RGB features with meaningful values for occluded areas.

Paper
Add Code

Negative-Binomial Randomized Gamma Markov Processes for Heterogeneous Overdispersed Count Time Series

no code implementations • 29 Feb 2024 • Rui Huang, Sikun Yang, Heinz Koeppl

Modeling count-valued time series has been receiving increasing attention since count time series naturally arise in physical and social domains.

Imputation Time Series

Paper
Add Code

A Saliency Enhanced Feature Fusion based multiscale RGB-D Salient Object Detection Network

no code implementations • 22 Jan 2024 • Rui Huang, Qingyi Zhao, Yan Xing, Sihua Gao, Weifeng Xu, Yuxiang Zhang, Wei Fan

SEFF utilizes saliency maps of the neighboring scales to enhance the necessary features for fusing, resulting in more representative fused features.

object-detection RGB-D Salient Object Detection +2

Paper
Add Code

Spy-Watermark: Robust Invisible Watermarking for Backdoor Attack

no code implementations • 4 Jan 2024 • Ruofei Wang, Renjie Wan, Zongyu Guo, Qing Guo, Rui Huang

Backdoor attack aims to deceive a victim model when facing backdoor instances while maintaining its performance on benign data.

Backdoor Attack backdoor defense

Paper
Add Code

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

no code implementations • 28 Dec 2023 • Rui Huang, Songyou Peng, Ayca Takmaz, Federico Tombari, Marc Pollefeys, Shiji Song, Gao Huang, Francis Engelmann

Therefore, we explore the use of image segmentation foundation models to automatically generate training labels for 3D segmentation.

Image Segmentation Scene Segmentation +1

Paper
Add Code

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

1 code implementation • 11 Dec 2023 • Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan

Both quantitative and qualitative results on this evaluation dataset indicate that our SmartEdit surpasses previous methods, paving the way for the practical application of complex instruction-based image editing.

114

Paper
Code

C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF

1 code implementation • 5 Dec 2023 • Rui Huang, Binbin Jiang, Qingyi Zhao, William Wang, Yuxiang Zhang, Qing Guo

Our approach surpasses state-of-the-art 2D change detection and NeRF-based methods by a significant margin.

Change Detection

Paper
Code

Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models

1 code implementation • 13 Jun 2023 • Yin Fang, Xiaozhuan Liang, Ningyu Zhang, Kangwei Liu, Rui Huang, Zhuo Chen, Xiaohui Fan, Huajun Chen

Large Language Models (LLMs), with their remarkable task-handling capabilities and innovative outputs, have catalyzed significant advancements across a spectrum of fields.

Catalytic activity prediction Chemical-Disease Interaction Extraction +14

182

Paper
Code

Producing Usable Taxonomies Cheaply and Rapidly at Pinterest Using Discovered Dynamic $μ$-Topics

no code implementations • 29 Jan 2023 • Abhijit Mahabal, Jiyun Luo, Rui Huang, Michael Ellsworth, Rui Li

Creating a taxonomy of interests is expensive and human-effort intensive: not only do we need to identify nodes and interconnect them, in order to use the taxonomy, we must also connect the nodes to relevant entities such as users, pins, and queries.

Specificity

Paper
Add Code

Joint Representation Learning for Text and 3D Point Cloud

no code implementations • 18 Jan 2023 • Rui Huang, Xuran Pan, Henry Zheng, Haojun Jiang, Zhifeng Xie, Shiji Song, Gao Huang

During the pre-training stage, we establish the correspondence of images and point clouds based on the readily available RGB-D data and use contrastive learning to align the image and point cloud representations.

Contrastive Learning Instance Segmentation +4

Paper
Add Code

Learning To Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic Space

1 code implementation • CVPR 2023 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen

Specifically, cheap scene graph supervision data can be easily obtained by parsing image language descriptions into semantic graphs.

Graph Generation object-detection +3

Paper
Code

A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition

no code implementations • 27 Nov 2022 • Rui Huang, Ze Huang, Songzhi Su

We designed RepVGG-lite as the backbone network in our architecture, it is more discriminative than other general networks in the Place Recognition task.

Camera Localization Loop Closure Detection +1

Paper
Add Code

Background-Mixed Augmentation for Weakly Supervised Change Detection

1 code implementation • 21 Nov 2022 • Rui Huang, Ruofei Wang, Qing Guo, Jieda Wei, Yuxiang Zhang, Wei Fan, Yang Liu

Change detection (CD) is to decouple object changes (i. e., object missing or appearing) from background changes (i. e., environment variations) like light and season variations in two images captured in the same scene over a long time span, presenting critical applications in disaster management, urban development, etc.

Change Detection Data Augmentation +1

Paper
Code

Cross-Modal Adapter for Text-Video Retrieval

1 code implementation • 17 Nov 2022 • Haojun Jiang, Jianke Zhang, Rui Huang, Chunjiang Ge, Zanlin Ni, Jiwen Lu, Jie zhou, Shiji Song, Gao Huang

However, as pre-trained models are scaling up, fully fine-tuning them on text-video retrieval datasets has a high risk of overfitting.

Retrieval Video Retrieval

Paper
Code

Rate-Splitting for Intelligent Reflecting Surface-Aided Multiuser VR Streaming

1 code implementation • 21 Oct 2022 • Rui Huang, Vincent W. S. Wong, Robert Schober

In the proposed system, RS facilitates the exploitation of the shared interests of the users in VR streaming, and IRS creates additional propagation channels to support the transmission of high-resolution 360-degree videos.

Continuous Control Imitation Learning +1

Paper
Code

Efficient Knowledge Distillation from Model Checkpoints

1 code implementation • 12 Oct 2022 • Chaofei Wang, Qisen Yang, Rui Huang, Shiji Song, Gao Huang

Knowledge distillation is an effective approach to learn compact models (students) with the supervision of large and strong models (teachers).

Knowledge Distillation

Paper
Code

RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments

no code implementations • 26 Jul 2022 • Jiahui Zhang, Shitao Tang, Kejie Qiu, Rui Huang, Chuan Fang, Le Cui, Zilong Dong, Siyu Zhu, Ping Tan

Visual relocalization has been a widely discussed problem in 3D vision: given a pre-constructed 3D visual map, the 6 DoF (Degrees-of-Freedom) pose of a query image is estimated.

Image Retrieval Retrieval +1

Paper
Add Code

Deep Semantic Statistics Matching (D2SM) Denoising Network

1 code implementation • 19 Jul 2022 • Kangfu Mei, Vishal M. Patel, Rui Huang

The ultimate aim of image restoration like denoising is to find an exact correlation between the noisy and clear image domains.

Denoising Image Restoration +2

Paper
Code

IDET: Iterative Difference-Enhanced Transformers for High-Quality Change Detection

1 code implementation • 15 Jul 2022 • Qing Guo, Ruofei Wang, Rui Huang, Shuifa Sun, Yuxiang Zhang

Change detection (CD) aims to detect change regions within an image pair captured at different times, playing a significant role in diverse real-world applications.

Change Detection Vocal Bursts Intensity Prediction

Paper
Code

Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection

1 code implementation • CVPR 2022 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen

Such design decomposes the process of HOI set prediction into two subsequent phases, i. e., an interaction proposal generation is first performed, and then followed by transforming the non-parametric interaction proposals into HOI predictions via a structure-aware Transformer.

Ranked #3 on Human-Object Interaction Detection on V-COCO

Human-Object Interaction Detection Object

Paper
Code

Revisiting the role of heterophily in graph representation learning: An edge classification perspective

no code implementations • 23 May 2022 • Jincheng Huang, Ping Li, Rui Huang, Chen Na, Acong Zhang

Alternatively, it is possible to exploit the information about the presence of heterophilous neighbors for feature learning, so a hybrid message passing approach is devised to aggregate homophilious neighbors and diversify heterophilous neighbors based on edge classification.

Edge Classification Graph Learning +1

Paper
Add Code

Co-visual pattern augmented generative transformer learning for automobile geo-localization

no code implementations • 17 Mar 2022 • Jianwei Zhao, Qiang Zhai, Pengbo Zhao, Rui Huang, Hong Cheng

Geolocation is a fundamental component of route planning and navigation for unmanned vehicles, but GNSS-based geolocation fails under denial-of-service conditions.

Paper
Add Code

Domain Adaptation via Prompt Learning

1 code implementation • 14 Feb 2022 • Chunjiang Ge, Rui Huang, Mixue Xie, Zihang Lai, Shiji Song, Shuang Li, Gao Huang

Unsupervised domain adaption (UDA) aims to adapt models learned from a well-annotated source domain to a target domain, where only unlabeled samples are given.

Domain Adaptation

Paper
Code

Salient-to-Broad Transition for Video Person Re-Identification

1 code implementation • CVPR 2022 • Shutao Bai, Bingpeng Ma, Hong Chang, Rui Huang, Xilin Chen

To further improve SBM, an Integration-and-Distribution Module (IDM) is introduced to enhance frame-level representations.

Video-Based Person Re-Identification

Paper
Code

Fully Attentional Network for Semantic Segmentation

1 code implementation • 8 Dec 2021 • Qi Song, Jie Li, Chenghong Li, Hao Guo, Rui Huang

Recent non-local self-attention methods have proven to be effective in capturing long-range dependencies for semantic segmentation.

Computational Efficiency Segmentation +1

Paper
Code

Coupled Segmentation and Edge Learning via Dynamic Graph Propagation

no code implementations • NeurIPS 2021 • Zhiding Yu, Rui Huang, Wonmin Byeon, Sifei Liu, Guilin Liu, Thomas Breuel, Anima Anandkumar, Jan Kautz

It is therefore interesting to study how these two tasks can be coupled to benefit each other.

Edge Detection Image Segmentation +2

Paper
Add Code

Denoised Non-Local Neural Network for Semantic Segmentation

no code implementations • 27 Oct 2021 • Qi Song, Jie Li, Hao Guo, Rui Huang

Without any external training data, our proposed Denoised NL can achieve the state-of-the-art performance of 83. 5\% and 46. 69\% mIoU on Cityscapes and ADE20K, respectively.

Semantic Segmentation

Paper
Add Code

PLNet: Plane and Line Priors for Unsupervised Indoor Depth Estimation

1 code implementation • 12 Oct 2021 • Hualie Jiang, Laiyan Ding, Junjie Hu, Rui Huang

Unsupervised learning of depth from indoor monocular videos is challenging as the artificial environment contains many textureless regions.

Depth Estimation

Paper
Code

On the Importance of Gradients for Detecting Distributional Shifts in the Wild

1 code implementation • NeurIPS 2021 • Rui Huang, Andrew Geng, Yixuan Li

Detecting out-of-distribution (OOD) data has become a critical component in ensuring the safe deployment of machine learning models in the real world.

Ranked #12 on Out-of-Distribution Detection on ImageNet-1k vs SUN

Out-of-Distribution Detection

Paper
Code

Domain Composition and Attention for Unseen-Domain Generalizable Medical Image Segmentation

1 code implementation • 18 Sep 2021 • Ran Gu, Jingyang Zhang, Rui Huang, Wenhui Lei, Guotai Wang, Shaoting Zhang

First, we present a domain composition method that represents one certain domain by a linear combination of a set of basis representations (i. e., a representation bank).

Domain Generalization Image Segmentation +2

Paper
Code

Unsupervised Monocular Depth Perception: Focusing on Moving Objects

1 code implementation • 30 Aug 2021 • Hualie Jiang, Laiyan Ding, Zhenglong Sun, Rui Huang

We first propose an outlier masking technique that considers the occluded or dynamic pixels as statistical outliers in the photometric error map.

Autonomous Driving Motion Estimation

Paper
Code

IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement

no code implementations • 29 Jun 2021 • Jie Li, Laiyan Ding, Rui Huang

3D semantic scene completion and 2D semantic segmentation are two tightly correlated tasks that are both essential for indoor scene understanding, because they predict the same semantic classes, using positively correlated high-level features.

2D Semantic Segmentation 3D Semantic Scene Completion +3

Paper
Add Code

Toward Less Hidden Cost of Code Completion with Acceptance and Ranking Models

no code implementations • 26 Jun 2021 • Jingxuan Li, Rui Huang, Wei Li, Kai Yao, Weiguo Tan

We integrate this ranking scheme with two frequency models and a GPT-2 styled language model, along with the acceptance model to yield 27. 80% and 37. 64% increase in TOP1 and TOP5 accuracy, respectively.

Code Completion Language Modelling

Paper
Add Code

Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition

2 code implementations • NeurIPS 2021 • Yulin Wang, Rui Huang, Shiji Song, Zeyi Huang, Gao Huang

Inspired by this phenomenon, we propose a Dynamic Transformer to automatically configure a proper number of tokens for each input image.

Ranked #29 on Image Classification on CIFAR-100 (using extra training data)

Computational Efficiency Image Classification

241

Paper
Code

MOS: Towards Scaling Out-of-distribution Detection for Large Semantic Space

4 code implementations • CVPR 2021 • Rui Huang, Yixuan Li

Detecting out-of-distribution (OOD) inputs is a central challenge for safely deploying machine learning models in the real world.

Ranked #3 on Out-of-Distribution Detection on ImageNet-1k vs iNaturalist (using extra training data)

Out-of-Distribution Detection

Paper
Code

BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification

1 code implementation • CVPR 2021 • Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan

Detail Branch processes frames at original resolution to preserve the detailed visual clues, and Context Branch with a down-sampling strategy is employed to capture long-range contexts.

Video-Based Person Re-Identification

Paper
Code

FocusNetv2: Imbalanced Large and Small Organ Segmentation with Adversarial Shape Constraint for Head and Neck CT Images

1 code implementation • 5 Apr 2021 • Yunhe Gao, Rui Huang, Yiwei Yang, Jie Zhang, Kainan Shao, Changjuan Tao, YuanYuan Chen, Dimitris N. Metaxas, Hongsheng Li, Ming Chen

Radiotherapy is a treatment where radiation is used to eliminate cancer cells.

Organ Segmentation Segmentation

Paper
Code

SDAN: Squared Deformable Alignment Network for Learning Misaligned Optical Zoom

1 code implementation • 2 Apr 2021 • Kangfu Mei, Shenglong Ye, Rui Huang

Deep Neural Network (DNN) based super-resolution algorithms have greatly improved the quality of the generated images.

Computational Efficiency Super-Resolution

Paper
Code

Learning Camera Localization via Dense Scene Matching

1 code implementation • CVPR 2021 • Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu, Ping Tan

We present a new method for scene agnostic camera localization using dense scene matching (DSM), where a cost volume is constructed between a query image and a scene.

Camera Localization

Paper
Code

AR Mapping: Accurate and Efficient Mapping for Augmented Reality

no code implementations • 27 Mar 2021 • Rui Huang, Chuan Fang, Kejie Qiu, Le Cui, Zilong Dong, Siyu Zhu, Ping Tan

Secondly, we propose an AR mapping pipeline which takes the input from the scanning device and produces accurate AR Maps.

Paper
Add Code

AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

1 code implementation • 10 Mar 2021 • Qi Song, Kangfu Mei, Rui Huang

In this paper, we propose a new model, called Attention-Augmented Network (AttaNet), to capture both global context and multilevel semantics while keeping the efficiency high.

Scene Parsing Segmentation +1

Paper
Code

UniFuse: Unidirectional Fusion for 360$^{\circ}$ Panorama Depth Estimation

1 code implementation • 6 Feb 2021 • Hualie Jiang, Zhe Sheng, Siyu Zhu, Zilong Dong, Rui Huang

Besides, we also designed a more effective fusion module for our fusion scheme.

Ranked #1 on Depth Estimation on Matterport3D

Depth Estimation

Paper
Code

Automatic Segmentation of Organs-at-Risk from Head-and-Neck CT using Separable Convolutional Neural Network with Hard-Region-Weighted Loss

1 code implementation • 3 Feb 2021 • Wenhui Lei, Haochen Mei, Zhengwentai Sun, Shan Ye, Ran Gu, Huan Wang, Rui Huang, Shichuan Zhang, Shaoting Zhang, Guotai Wang

Despite the stateof-the-art performance achieved by Convolutional Neural Networks (CNNs) for automatic segmentation of OARs, existing methods do not provide uncertainty estimation of the segmentation results for treatment planning, and their accuracy is still limited by several factors, including the low contrast of soft tissues in CT, highly imbalanced sizes of OARs and large inter-slice spacing.

Computed Tomography (CT) Segmentation

Paper
Code

Throughput Optimization for Grant-Free Multiple Access With Multiagent Deep Reinforcement Learning

no code implementations • IEEE Transactions on Wireless Communications 2021 • Rui Huang, Vincent W.S. Wong, Robert Schober

Grant-free multiple access (GFMA) is a promising paradigm to efficiently support uplink access of Internet of Things (IoT) devices.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection

1 code implementation • ICCV 2021 • Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan

Spotting objects that are visually adapted to their surroundings is challenging for both humans and AI.

Object object-detection +2

Paper
Code

Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion

2 code implementations • 7 Dec 2020 • Xu Yan, Jiantao Gao, Jie Li, Ruimao Zhang, Zhen Li, Rui Huang, Shuguang Cui

In practice, an initial semantic segmentation (SS) of a single sweep point cloud can be achieved by any appealing network and then flows into the semantic scene completion (SSC) module as the input.

Ranked #3 on 3D Semantic Scene Completion on SemanticKITTI

3D Semantic Scene Completion from a single RGB image 3D Semantic Segmentation +3

202

Paper
Code

Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification

1 code implementation • NeurIPS 2020 • Yulin Wang, Kangchen Lv, Rui Huang, Shiji Song, Le Yang, Gao Huang

The accuracy of deep convolutional neural networks (CNNs) generally improves when fueled with high resolution images.

Computational Efficiency General Classification +1

180

Paper
Code

Concentrated Multi-Grained Multi-Attention Network for Video Based Person Re-Identification

no code implementations • 28 Sep 2020 • Panwen Hu, Jiazhen Liu, Rui Huang

The attention mechanism has been proved to be helpful in solving the occlusion problem by a large number of existing methods.

Video-Based Person Re-Identification

Paper
Add Code

CA-Net: Comprehensive Attention Convolutional Neural Networks for Explainable Medical Image Segmentation

3 code implementations • 22 Sep 2020 • Ran Gu, Guotai Wang, Tao Song, Rui Huang, Michael Aertsen, Jan Deprest, Sébastien Ourselin, Tom Vercauteren, Shaoting Zhang

Also, we propose a scale attention module implicitly emphasizing the most salient feature maps among multiple scales so that the CNN is adaptive to the size of an object.

Image Segmentation Lesion Segmentation +3

163

Paper
Code

Multi-organ Segmentation via Co-training Weight-averaged Models from Few-organ Datasets

no code implementations • 17 Aug 2020 • Rui Huang, Yuanjie Zheng, Zhiqiang Hu, Shaoting Zhang, Hongsheng Li

In most scenarios, one might obtain annotations of a single or a few organs from one training set, and obtain annotations of the the other organs from another set of training images.

Organ Segmentation

Paper
Add Code

Global Optimum Search in Quantum Deep Learning

no code implementations • 9 Aug 2020 • Lanston Hau Man Chu, Tejas Bhojraj, Rui Huang

This paper aims to solve machine learning optimization problem by using quantum circuit.

BIG-bench Machine Learning

Paper
Add Code

An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds

no code implementations • ECCV 2020 • Rui Huang, Wanyue Zhang, Abhijit Kundu, Caroline Pantofaru, David A. Ross, Thomas Funkhouser, Alireza Fathi

We use a U-Net style 3D sparse convolution network to extract features for each frame's LiDAR point-cloud.

3D Object Detection Autonomous Driving +2

Paper
Add Code

Disentangle Perceptual Learning through Online Contrastive Learning

no code implementations • 24 Jun 2020 • Kangfu Mei, Yao Lu, Qiaosi Yi, Hao-Yu Wu, Juncheng Li, Rui Huang

Perceptual learning approaches like perceptual loss are empirically powerful for such tasks but they usually rely on the pre-trained classification network to provide features, which are not necessarily optimal in terms of visual perception of image transformation.

Contrastive Learning feature selection

Paper
Add Code

DiPE: Deeper into Photometric Errors for Unsupervised Learning of Depth and Ego-motion from Monocular Videos

1 code implementation • 3 Mar 2020 • Hualie Jiang, Laiyan Ding, Zhenglong Sun, Rui Huang

Unsupervised learning of depth and ego-motion from unlabelled monocular videos has recently drawn great attention, which avoids the use of expensive ground truth in the supervised one.

Ranked #52 on Monocular Depth Estimation on KITTI Eigen split

Autonomous Driving Monocular Depth Estimation +1

Paper
Code

HighEr-Resolution Network for Image Demosaicing and Enhancing

1 code implementation • 19 Nov 2019 • Kangfu Mei, Juncheng Li, Jiajie Zhang, Hao-Yu Wu, Jie Li, Rui Huang

However, plenty of studies have shown that global information is crucial for image restoration tasks like image demosaicing and enhancing.

Demosaicking

Paper
Code

Low-Resource Sequence Labeling via Unsupervised Multilingual Contextualized Representations

1 code implementation • IJCNLP 2019 • Zuyi Bao, Rui Huang, Chen Li, Kenny Q. Zhu

Previous work on cross-lingual sequence labeling tasks either requires parallel data or bridges the two languages through word-byword matching.

Language Modelling NER +1

Paper
Code

FocusNet: Imbalanced Large and Small Organ Segmentation with an End-to-End Deep Neural Network for Head and Neck CT Images

no code implementations • 28 Jul 2019 • Yunhe Gao, Rui Huang, Ming Chen, Zhe Wang, Jincheng Deng, YuanYuan Chen, Yiwei Yang, Jie Zhang, Chanjuan Tao, Hongsheng Li

In this paper, we propose an end-to-end deep neural network for solving the problem of imbalanced large and small organ segmentation in head and neck (HaN) CT images.

Organ Segmentation Segmentation

Paper
Add Code

How Effectively Can Indoor Wireless Positioning Relieve Visual Tracking Pains: A Camera-Rao Bound Viewpoint

no code implementations • 9 Mar 2019 • Panwen Hu, Zizheng Yan, Rui Huang, Feng Yin

Visual tracking is fragile in some difficult scenarios, for instance, appearance ambiguity and variation, occlusion can easily degrade most of visual trackers to some extent.

Visual Tracking

Paper
Add Code

ClickBAIT-v2: Training an Object Detector in Real-Time

no code implementations • 27 Mar 2018 • Ervin Teng, Rui Huang, Bob Iannucci

Modern deep convolutional neural networks (CNNs) for image classification and object detection are often trained offline on large static datasets.

Image Classification Interactive Segmentation +4

Paper
Add Code

Multiple Target Tracking by Learning Feature Representation and Distance Metric Jointly

no code implementations • 9 Feb 2018 • Jun Xiang, Guoshuai Zhang, Jianhua Hou, Nong Sang, Rui Huang

Designing a robust affinity model is the key issue in multiple target tracking (MTT).

Position

Paper
Add Code

Learning Dynamic Siamese Network for Visual Object Tracking

no code implementations • ICCV 2017 • Qing Guo, Wei Feng, Ce Zhou, Rui Huang, Liang Wan, Song Wang

How to effectively learn temporal variation of target appearance, to exclude the interference of cluttered background, while maintaining real-time response, is an essential problem of visual object tracking.

Ranked #5 on Visual Object Tracking on OTB-2013

Object Visual Object Tracking

Paper
Add Code

Active Image-based Modeling with a Toy Drone

no code implementations • 2 May 2017 • Rui Huang, Danping Zou, Richard Vaughan, Ping Tan

Image-based modeling techniques can now generate photo-realistic 3D models from images.

Paper
Add Code

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

3 code implementations • ICCV 2017 • Rui Huang, Shu Zhang, Tianyu Li, Ran He

This paper proposes a Two-Pathway Generative Adversarial Network (TP-GAN) for photorealistic frontal view synthesis by simultaneously perceiving global structures and local details.

Face Recognition Generative Adversarial Network

Paper
Code

A Spatio-Temporal Appearance Representation for Viceo-Based Pedestrian Re-Identification

no code implementations • ICCV 2015 • Kan Liu, Bingpeng Ma, Wei zhang, Rui Huang

Pedestrian re-identification is a difficult problem due to the large variations in a person's appearance caused by different poses and viewpoints, illumination changes, and occlusions.

Paper
Add Code

Recognizing Focal Liver Lesions in Contrast-Enhanced Ultrasound with Discriminatively Trained Spatio-Temporal Model

1 code implementation • 3 Feb 2015 • Xiaodan Liang, Qingxing Cao, Rui Huang, Liang Lin

The aim of this study is to provide an automatic computational framework to assist clinicians in diagnosing Focal Liver Lesions (FLLs) in Contrast-Enhancement Ultrasound (CEUS).

Paper
Code

An Expressive Deep Model for Human Action Parsing from A Single Image

no code implementations • 2 Feb 2015 • Zhujin Liang, Xiaolong Wang, Rui Huang, Liang Lin

This paper aims at one newly raising task in vision and multimedia research: recognizing human actions from still images.

Action Parsing Action Understanding +2

Paper
Add Code

Exemplar-based Linear Discriminant Analysis for Robust Object Tracking

no code implementations • 24 Feb 2014 • Changxin Gao, Feifei Chen, Jin-Gang Yu, Rui Huang, Nong Sang

However, the task in tracking is to search for a specific object, rather than an object category as in detection.

Object Object Tracking

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.