Search Results for author: Ruozhen He

Found 4 papers, 2 papers with code

Learning from Models and Data for Visual Grounding

no code implementations · 20 Mar 2024 · Ruozhen He, Paola Cascante-Bonilla, Ziyan Yang, Alexander C. Berg, Vicente Ordonez

We introduce SynGround, a novel framework that combines data-driven learning and knowledge transfer from various large-scale pretrained models to enhance the visual grounding capabilities of a pretrained vision-and-language model.

Language Modelling, Large Language Model, +2

Improved Visual Grounding through Self-Consistent Explanations

no code implementations · 7 Dec 2023 · Ruozhen He, Paola Cascante-Bonilla, Ziyan Yang, Alexander C. Berg, Vicente Ordonez

Vision-and-language models trained to match images with text can be combined with visual explanation methods to point to the locations of specific objects in an image.

Language Modelling, Large Language Model, +1

Efficient Mirror Detection via Multi-level Heterogeneous Learning

1 code implementation · 28 Nov 2022 · Ruozhen He, Jiaying Lin, Rynson W. H. Lau

We present HetNet (Multi-level Heterogeneous Network), a highly efficient mirror detection network.

Weakly-Supervised Camouflaged Object Detection with Scribble Annotations

1 code implementation · 28 Jul 2022 · Ruozhen He, Qihua Dong, Jiaying Lin, Rynson W. H. Lau

To achieve this, we first relabel 4,040 images in existing camouflaged object datasets with scribbles, which takes ~10s to label one image.

Object, object-detection, +1
