Search Results for author: Ruohao Guo

Found 9 papers, 5 papers with code

Audio-Visual Instance Segmentation

no code implementations 28 Oct 2023 Ruohao Guo, Yaru Chen, Yanyu Qi, Wenzhen Yue, Dantong Niu, Xianghua Ying

In this paper, we propose a new multi-modal task, namely audio-visual instance segmentation (AVIS), in which the goal is to identify, segment, and track individual sounding object instances in audible videos, simultaneously.

Instance Segmentation Segmentation +1

CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing

no code implementations 11 Oct 2023 Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang

Audio-visual video parsing is the task of categorizing a video at the segment level with weak labels, and predicting them as audible or visible events.

Improved Instruction Ordering in Recipe-Grounded Conversation

1 code implementation 26 May 2023 Duong Minh Le, Ruohao Guo, Wei Xu, Alan Ritter

In this paper, we study the task of instructional dialogue and focus on the cooking domain.

Intent Detection Response Generation

Moiré Attack (MA): A New Potential Risk of Screen Photos

1 code implementation NeurIPS 2021 Dantong Niu, Ruohao Guo, Yisen Wang

Images, captured by a camera, play a critical role in training Deep Neural Networks (DNNs).

Moiré Attack (MA): A New Potential Risk of Screen Photos

1 code implementation 20 Oct 2021 Dantong Niu, Ruohao Guo, Yisen Wang

Images, captured by a camera, play a critical role in training Deep Neural Networks (DNNs).

SOTR: Segmenting Objects with Transformers

1 code implementation ICCV 2021 Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li

Most recent transformer-based models show impressive performance on vision tasks, even better than Convolutional Neural Networks (CNNs).

Instance Segmentation Segmentation +1

LeafMask: Towards Greater Accuracy on Leaf Segmentation

1 code implementation 8 Aug 2021 Ruohao Guo, Liao Qu, Dantong Niu, Zhenbo Li, Jun Yue

In this work, we present the LeafMask neural network, a new end-to-end model to delineate each leaf region and count the number of leaves, with two main components: 1) the mask assembly module merging position-sensitive bases of each predicted box after non-maximum suppression (NMS) and corresponding coefficients to generate original masks; 2) the mask refining module elaborating leaf boundaries from the mask assembly module by the point selection strategy and predictor.

Instance Segmentation Plant Phenotyping +1
