Search Results for author: Zhe Huang

Found 25 papers, 11 papers with code

Private Wasserstein Distance with Random Noises

1 code implementation10 Apr 2024 Wenqian Li, Haozhi Wang, Zhe Huang, Yan Pang

Wasserstein distance is a principle measure of data divergence from a distributional standpoint.

InterLUDE: Interactions between Labeled and Unlabeled Data to Enhance Semi-Supervised Learning

no code implementations15 Mar 2024 Zhe Huang, Xiaowei Yu, Dajiang Zhu, Michael C. Hughes

In this paper, we introduce InterLUDE, a new approach to enhance SSL made of two parts that each benefit from labeled-unlabeled interaction.

Image Classification Representation Learning

VOLoc: Visual Place Recognition by Querying Compressed Lidar Map

1 code implementation25 Feb 2024 Xudong Cai, Yongcai Wang, Zhe Huang, Yu Shao, Deying Li

Then the QPC is compressed by the same GPC, and is aggregated into a global descriptor by an attention-based aggregation module, to query the compressed Lidar map in the vector space.

Pose Estimation Transfer Learning +2

MGTR: Multi-Granular Transformer for Motion Prediction with LiDAR

no code implementations5 Dec 2023 Yiqian Gan, Hao Xiao, Yizhe Zhao, Ethan Zhang, Zhe Huang, Xin Ye, Lingting Ge

Motion prediction has been an essential component of autonomous driving systems since it handles highly uncertain and complex scenarios involving moving agents of different types.

Autonomous Driving motion prediction

Split-and-Denoise: Protect large language model inference with local differential privacy

no code implementations13 Oct 2023 Peihua Mai, Ran Yan, Zhe Huang, Youjia Yang, Yan Pang

Large Language Models (LLMs) shows powerful capability in natural language understanding by capturing hidden semantics in vector space.

Language Modelling Large Language Model +2

Workshop on Document Intelligence Understanding

no code implementations31 Jul 2023 Soyeon Caren Han, Yihao Ding, Siwen Luo, Josiah Poon, HeeGuen Yoon, Zhe Huang, Paul Duuring, Eun Jung Holden

Document understanding and information extraction include different tasks to understand a document and extract valuable information automatically.

document understanding Visual Question Answering (VQA)

FSD: Fully-Specialized Detector via Neural Architecture Search

no code implementations26 May 2023 Zhe Huang, Yudian Li

Most generic object detectors are mainly built for standard object detection tasks such as COCO and PASCAL VOC.

Lesion Detection Neural Architecture Search +3

Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning

1 code implementation25 May 2023 Zhe Huang, Benjamin S. Wessler, Michael C. Hughes

To automate screening for AS, deep networks must learn to mimic a human expert's ability to identify views of the aortic valve then aggregate across these relevant images to produce a study-level diagnosis.

Contrastive Learning Multiple Instance Learning

AirBirds: A Large-scale Challenging Dataset for Bird Strike Prevention in Real-world Airports

no code implementations23 Apr 2023 Hongyu Sun, Yongcai Wang, Xudong Cai, Peng Wang, Zhe Huang, Deying Li, Yu Shao, Shuo Wang

To advance the research and practical solutions for bird strike prevention, in this paper, we present a large-scale challenging dataset AirBirds that consists of 118, 312 time-series images, where a total of 409, 967 bounding boxes of flying birds are manually, carefully annotated.

Time Series

Fix-A-Step: Semi-supervised Learning from Uncurated Unlabeled Data

1 code implementation25 Aug 2022 Zhe Huang, Mary-Joy Sidhom, Benjamin S. Wessler, Michael C. Hughes

Semi-supervised learning (SSL) promises improved accuracy compared to training classifiers on small labeled datasets by also training on many unlabeled images.

Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning

no code implementations21 Jul 2022 Adam Villaflor, Zhe Huang, Swapnil Pande, John Dolan, Jeff Schneider

Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a generic sequence modeling problem.

Autonomous Driving D4RL +2

Data Encryption based on 9D Complex Chaotic System with Quaternion for Smart Grid

no code implementations3 Jun 2022 Fangfang Zhang, Zhe Huang, Lei Kou, Yang Li, Maoyong Cao, Fengying Ma

In this paper, a new 9D complex chaotic system with quaternion is proposed for the encryption of smart grid data.

Management

V-Doc : Visual questions answers with Documents

no code implementations27 May 2022 Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han

We propose V-Doc, a question-answering tool using document images and PDF, mainly for researchers and general non-deep learning experts looking to generate, process, and understand the document visual question answering tasks.

Question Answering Question Generation +2

V-Doc: Visual Questions Answers With Documents

no code implementations CVPR 2022 Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han

We propose V-Doc, a question-answering tool using document images and PDF, mainly for researchers and general non-deep learning experts looking to generate, process, and understand the document visual question answering tasks.

Question Answering Question Generation +2

A New Semi-supervised Learning Benchmark for Classifying View and Diagnosing Aortic Stenosis from Echocardiograms

1 code implementation30 Jul 2021 Zhe Huang, Gary Long, Benjamin Wessler, Michael C. Hughes

Semi-supervised image classification has shown substantial progress in learning from limited labeled data, but recent advances remain largely untested for clinical applications.

Semi-Supervised Image Classification

Learning Sparse Interaction Graphs of Partially Detected Pedestrians for Trajectory Prediction

1 code implementation15 Jul 2021 Zhe Huang, Ruohua Li, Kazuki Shin, Katherine Driggs-Campbell

Multi-pedestrian trajectory prediction is an indispensable element of autonomous systems that safely interact with crowds in unstructured environments.

Pedestrian Trajectory Prediction Trajectory Prediction

Reducing false-positive biopsies with deep neural networks that utilize local and global information in screening mammograms

no code implementations19 Sep 2020 Nan Wu, Zhe Huang, Yiqiu Shen, Jungkyu Park, Jason Phang, Taro Makino, S. Gene Kim, Kyunghyun Cho, Laura Heacock, Linda Moy, Krzysztof J. Geras

Breast cancer is the most common cancer in women, and hundreds of thousands of unnecessary biopsies are done around the world at a tremendous cost.

Online monitoring for safe pedestrian-vehicle interactions

no code implementations12 Oct 2019 Peter Du, Zhe Huang, Tianqi Liu, Ke Xu, Qichao Gao, Hussein Sibai, Katherine Driggs-Campbell, Sayan Mitra

As autonomous systems begin to operate amongst humans, methods for safe interaction must be investigated.

Robotics Multiagent Systems Signal Processing

Gradual Network for Single Image De-raining

no code implementations20 Sep 2019 Zhe Huang, Weijiang Yu, Wayne Zhang, Litong Feng, Nong Xiao

Taking the residual result (the coarse de-rained result) between the rainy image sample (i. e. the input data) and the output of coarse stage (i. e. the learnt rain mask) as input, the fine stage continues to de-rain by removing the fine-grained rain streaks (e. g. light rain streaks and water mist) to get a rain-free and well-reconstructed output image via a unified contextual merging sub-network with dense blocks and a merging block.

Rain Removal

Cannot find the paper you are looking for? You can Submit a new open access paper.