no code implementations • 27 Apr 2024 • Wenzhen Yue, Xianghua Ying, Ruohao Guo, Dongdong Chen, Ji Shi, Bowei Xing, Yuqing Zhu, Taiyan Chen
By focusing the attention on the sub-adjacent areas, we make the reconstruction of anomalies more challenging, thereby enhancing their detectability.
no code implementations • 28 Oct 2023 • Ruohao Guo, Yaru Chen, Yanyu Qi, Wenzhen Yue, Dantong Niu, Xianghua Ying
In this paper, we propose a new multi-modal task, namely audio-visual instance segmentation (AVIS), in which the goal is to identify, segment, and track individual sounding object instances in audible videos, simultaneously.
no code implementations • 11 Oct 2023 • Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang
Audio-visual video parsing is the task of categorizing a video at the segment level with weak labels, and predicting them as audible or visible events.
1 code implementation • 26 May 2023 • Duong Minh Le, Ruohao Guo, Wei Xu, Alan Ritter
In this paper, we study the task of instructional dialogue and focus on the cooking domain.
no code implementations • 24 May 2023 • Ruohao Guo, Wei Xu, Alan Ritter
Style is used to convey authors' intentions and attitudes.
1 code implementation • NeurIPS 2021 • Dantong Niu, Ruohao Guo, Yisen Wang
Images, captured by a camera, play a critical role in training Deep Neural Networks (DNNs).
1 code implementation • 20 Oct 2021 • Dantong Niu, Ruohao Guo, Yisen Wang
Images, captured by a camera, play a critical role in training Deep Neural Networks (DNNs).
1 code implementation • ICCV 2021 • Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li
Most recent transformer-based models show impressive performance on vision tasks, even better than Convolution Neural Networks (CNN).
1 code implementation • 8 Aug 2021 • Ruohao Guo, Liao Qu, Dantong Niu, Zhenbo Li, Jun Yue
In this work, we present the LeafMask neural network, a new end-to-end model to delineate each leaf region and count the number of leaves, with two main components: 1) the mask assembly module merging position-sensitive bases of each predicted box after non-maximum suppression (NMS) and corresponding coefficients to generate original masks; 2) the mask refining module elaborating leaf boundaries from the mask assembly module by the point selection strategy and predictor.
Ranked #1 on Instance Segmentation on Leaf Segmentation Challenge