Search Results for author: Ruozhen He

Found 4 papers, 2 papers with code

Learning from Models and Data for Visual Grounding

no code implementations • 20 Mar 2024 • Ruozhen He, Paola Cascante-Bonilla, Ziyan Yang, Alexander C. Berg, Vicente Ordonez

We introduce SynGround, a novel framework that combines data-driven learning and knowledge transfer from various large-scale pretrained models to enhance the visual grounding capabilities of a pretrained vision-and-language model.

Language Modelling Large Language Model +2

Paper
Add Code

Improved Visual Grounding through Self-Consistent Explanations

no code implementations • 7 Dec 2023 • Ruozhen He, Paola Cascante-Bonilla, Ziyan Yang, Alexander C. Berg, Vicente Ordonez

Vision-and-language models trained to match images with text can be combined with visual explanation methods to point to the locations of specific objects in an image.

Language Modelling Large Language Model +1

Paper
Add Code

Efficient Mirror Detection via Multi-level Heterogeneous Learning

1 code implementation • 28 Nov 2022 • Ruozhen He, Jiaying Lin, Rynson W. H. Lau

We present HetNet (Multi-level \textbf{Het}erogeneous \textbf{Net}work), a highly efficient mirror detection network.

Paper
Code

Weakly-Supervised Camouflaged Object Detection with Scribble Annotations

1 code implementation • 28 Jul 2022 • Ruozhen He, Qihua Dong, Jiaying Lin, Rynson W. H. Lau

To achieve this, we first relabel 4, 040 images in existing camouflaged object datasets with scribbles, which takes ~10s to label one image.

Object object-detection +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.