Search Results for author: Hantao Zhou

Found 3 papers, 2 papers with code

Video Object Segmentation with Dynamic Query Modulation

1 code implementation18 Mar 2024 Hantao Zhou, Runze Hu, Xiu Li

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS).

Object Segmentation +3

UniHead: Unifying Multi-Perception for Detection Heads

1 code implementation23 Sep 2023 Hantao Zhou, Rui Yang, Yachao Zhang, Haoran Duan, Yawen Huang, Runze Hu, Xiu Li, Yefeng Zheng

More precisely, our approach (1) introduces deformation perception, enabling the model to adaptively sample object features; (2) proposes a Dual-axial Aggregation Transformer (DAT) to adeptly model long-range dependencies, thereby achieving global perception; and (3) devises a Cross-task Interaction Transformer (CIT) that facilitates interaction between the classification and localization branches, thus aligning the two tasks.

SemanticAC: Semantics-Assisted Framework for Audio Classification

no code implementations12 Feb 2023 Yicheng Xiao, Yue Ma, Shuyan Li, Hantao Zhou, Ran Liao, Xiu Li

In this paper, we propose SemanticAC, a semantics-assisted framework for Audio Classification to better leverage the semantic information.

Audio Classification Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.