Search Results for author: Tao He

Found 29 papers, 11 papers with code

AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving

no code implementations • 20 Mar 2024 • Xiaosong Jia, Shaoshuai Shi, Zijun Chen, Li Jiang, Wenlong Liao, Tao He, Junchi Yan

As an essential task in autonomous driving (AD), motion prediction aims to predict the future states of surrounding objects for navigation.

Motion Forecasting motion prediction +1

Towards Lifelong Scene Graph Generation with Knowledge-ware In-context Prompt Learning

no code implementations • 26 Jan 2024 • Tao He, Tongtong Wu, Dongyang Zhang, Guiduo Duan, Ke Qin, Yuan-Fang Li

Besides, extensive experiments on the two mainstream benchmark datasets, VG and Open-Image(v6), show the superiority of our proposed model to a number of competitive SGG models in terms of continuous learning and conventional settings.

Graph Generation In-Context Learning +1

A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future

1 code implementation • 27 Sep 2023 • Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Tao He, Haotian Wang, Weihua Peng, Ming Liu, Bing Qin, Ting Liu

Chain-of-thought reasoning, a cognitive process fundamental to human intelligence, has garnered significant attention in the realm of artificial intelligence and natural language processing.

Deformable Kernel Expansion Model for Efficient Arbitrary-shaped Scene Text Detection

no code implementations • 28 Mar 2023 • Tao He, Sheng Huang, Wenhao Tang, Bo Liu

DKE employs a segmentation module to segment the shrunken text region as the text kernel, then expands the text kernel contour to obtain text boundary by regressing the vertex-wise offsets.

Graph Matching Scene Text Detection +2

X-ReID: Cross-Instance Transformer for Identity-Level Person Re-Identification

no code implementations • 4 Feb 2023 • Leqi Shen, Tao He, Yuchen Guo, Guiguang Ding

In this paper, we propose to promote Instance-Level features to Identity-Level features by employing cross-attention to incorporate information from one image to another of the same identity, thus more unified and discriminative pedestrian information can be obtained.

Person Re-Identification

SoccerNet 2022 Challenges Results

7 code implementations • 5 Oct 2022 • Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization

1 code implementation • 22 Sep 2022 • Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin Tang, Tao He, Junchi Yan

Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects.

regression

Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning

no code implementations • 17 Aug 2022 • Tao He, Lianli Gao, Jingkuan Song, Yuan-Fang Li

In this paper, we introduce open-vocabulary scene graph generation, a novel, realistic and challenging setting in which a model is trained on a set of base object classes but is required to infer relations for unseen target object classes.

Graph Generation Object +1

VEM$^2$L: A Plug-and-play Framework for Fusing Text and Structure Knowledge on Sparse Knowledge Graph Completion

no code implementations • 4 Jul 2022 • Tao He, Ming Liu, Yixin Cao, Tianwen Jiang, Zihao Zheng, Jingrun Zhang, Sendong Zhao, Bing Qin

In this paper, we solve the sparse KGC from these two motivations simultaneously and handle their respective drawbacks further, and propose a plug-and-play unified framework VEM$^2$L over sparse KGs.

Knowledge Distillation Missing Elements +1

A High-Accuracy Unsupervised Person Re-identification Method Using Auxiliary Information Mined from Datasets

1 code implementation • 6 May 2022 • Hehan Teng, Tao He, Yuchen Guo, Guiguang Ding

Combined with auxiliary information exploiting modules, our methods achieve an mAP of 89.9% on DukeMTMC, where TOC, STS and SCP all contributed considerable performance improvements.

STS Unsupervised Person Re-Identification

TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification

1 code implementation • 28 Dec 2021 • Kai Chen, Weihua Chen, Tao He, Rong Du, Fan Wang, Xiuyu Sun, Yuchen Guo, Guiguang Ding

In TAGPerson, we extract information from target scenes and use them to control our parameterized rendering process to generate target-aware synthetic images, which would hold a smaller gap to the real images in the target domain.

Person Re-Identification

Camera Bias Regularization for Person Re-identification

no code implementations • 29 Sep 2021 • Tao He, Tongkun Xu, Weihua Chen, Yuchen Guo, Guiguang Ding, Zhenhua Guo

Due to the discrepancies between cameras caused by illumination, background, or viewpoint, the underlying difficulty for Re-ID is the camera bias problem, which leads to a large gap between within-identity features from different cameras.

Person Re-Identification

Unsupervised Domain-adaptive Hash for Networks

no code implementations • 20 Aug 2021 • Tao He, Lianli Gao, Jingkuan Song, Yuan-Fang Li

Abundant real-world data can be naturally represented by large-scale networks, which demands efficient and effective learning algorithms.

Link Prediction Node Classification +1

Semi-supervised Network Embedding with Differentiable Deep Quantisation

no code implementations • 20 Aug 2021 • Tao He, Lianli Gao, Jingkuan Song, Yuan-Fang Li

Learning accurate low-dimensional embeddings for a network is a crucial task as it facilitates many downstream network analytics tasks.

Link Prediction Network Embedding +2

Exploiting Scene Graphs for Human-Object Interaction Detection

1 code implementation • ICCV 2021 • Tao He, Lianli Gao, Jingkuan Song, Yuan-Fang Li

Human-Object Interaction (HOI) detection is a fundamental visual task aiming at localizing and recognizing interactions between humans and objects.

Human-Object Interaction Detection Object

A Computer-Aided Diagnosis System for Breast Pathology: A Deep Learning Approach with Model Interpretability from Pathological Perspective

no code implementations • 5 Aug 2021 • Wei-Wen Hsu, Yongfang Wu, Chang Hao, Yu-Ling Hou, Xiang Gao, Yun Shao, Xueli Zhang, Tao He, Yanhong Tai

Objective: We develop a computer-aided diagnosis (CAD) system using deep learning approaches for lesion detection and classification on whole-slide images (WSIs) with breast cancer.

Classification Lesion Classification +3

Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation

no code implementations • 13 Jun 2020 • Tao He, Lianli Gao, Jingkuan Song, Jianfei Cai, Yuan-Fang Li

Despite the huge progress in scene graph generation in recent years, its long-tail distribution in object relationships remains a challenging and pestering issue.

Graph Generation Object +2

R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object

10 code implementations • 15 Aug 2019 • Xue Yang, Junchi Yan, Ziming Feng, Tao He

Considering the shortcoming of feature misalignment in existing refined single-stage detectors, we design a feature refinement module to improve detection performance by obtaining more accurate features.

object-detection Object Detection In Aerial Images

One Network for Multi-Domains: Domain Adaptive Hashing with Intersectant Generative Adversarial Network

1 code implementation • 1 Jul 2019 • Tao He, Yuan-Fang Li, Lianli Gao, Dongxiang Zhang, Jingkuan Song

We evaluate our framework on four public benchmark datasets, all of which show that our method is superior to the other state-of-the-art methods on the tasks of object recognition and image retrieval.

Generative Adversarial Network Image Retrieval +2

Understanding the Mechanism of Deep Learning Framework for Lesion Detection in Pathological Images with Breast Cancer

no code implementations • 4 Mar 2019 • Wei-Wen Hsu, Chung-Hao Chen, Chang Hoa, Yu-Ling Hou, Xiang Gao, Yun Shao, Xueli Zhang, Jingjing Wang, Tao He, Yanghong Tai

Most of the characteristics learned by the deep learning models summarize detection rules that experienced pathologists can recognize, whereas some features may not be intuitive to domain experts but are still discriminative for machine classification.

General Classification Lesion Detection

Deep Discrete Hashing with Self-supervised Pairwise Labels

1 code implementation • 7 Jul 2017 • Jingkuan Song, Tao He, Hangbo Fan, Lianli Gao

2) how to equip the binary representation with the ability of accurate image retrieval and classification in an unsupervised way?

Deep Hashing General Classification +2

Deep Region Hashing for Efficient Large-scale Instance Search from Images

no code implementations • 26 Jan 2017 • Jingkuan Song, Tao He, Lianli Gao, Xing Xu, Heng Tao Shen

Specifically, DRH is an end-to-end deep neural network which consists of object proposal, feature extraction, and hash code generation.

Code Generation Image Retrieval +3
