no code implementations • 11 Oct 2023 • Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang
Audio-visual video parsing is the task of categorizing a video at the segment level using only weak labels and predicting whether each event is audible or visible.
no code implementations • ICCV 2023 • Fei Li, Linfeng Zhang, Zikun Liu, Juan Lei, Zhenbo Li
The limited receptive field of CNNs restricts their ability to capture long-range spatial-temporal dependencies, leading to unsatisfactory performance in video super-resolution.
no code implementations • 6 Oct 2021 • Weiran Li, Zhenbo Li, Fei Li, Meng Yuan, Chaojun Cen, Yanyu Qi, Qiannan Guo, You Li
Fish tracking is a key technology for obtaining movement trajectories and identifying abnormal behavior.
1 code implementation • ICCV 2021 • Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li
Most recent transformer-based models show impressive performance on vision tasks, even better than Convolutional Neural Networks (CNNs).
1 code implementation • 8 Aug 2021 • Ruohao Guo, Liao Qu, Dantong Niu, Zhenbo Li, Jun Yue
In this work, we present LeafMask, a new end-to-end neural network that delineates each leaf region and counts the number of leaves. It has two main components: 1) a mask assembly module, which merges the position-sensitive bases of each predicted box after non-maximum suppression (NMS) with the corresponding coefficients to generate the original masks; and 2) a mask refining module, which refines the leaf boundaries produced by the mask assembly module using a point selection strategy and a predictor.
Ranked #1 on Instance Segmentation on Leaf Segmentation Challenge