Search Results for author: Rongjie Li

Found 7 papers, 4 papers with code

Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning

no code implementations • 1 Apr 2024 • Rongjie Li, Yu Wu, Xuming He

Generative vision-language models (VLMs) have shown impressive performance in zero-shot vision-language tasks like image captioning and visual question answering.

Image Captioning Instruction Following +5

Paper
Add Code

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

no code implementations • 1 Apr 2024 • Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He

Scene graph generation (SGG) aims to parse a visual scene into an intermediate graph representation for downstream reasoning tasks.

Graph Generation Relation +2

Paper
Add Code

SGTR+: End-to-end Scene Graph Generation with Transformer

1 code implementation • 23 Jan 2024 • Rongjie Li, Songyang Zhang, Xuming He

Moreover, we design a graph assembling module to infer the connectivity of the bipartite scene graph based on our entity-aware structure, enabling us to generate the scene graph in an end-to-end manner.

graph construction Graph Generation +1

Paper
Code

SGTR: End-to-end Scene Graph Generation with Transformer

1 code implementation • CVPR 2022 • Rongjie Li, Songyang Zhang, Xuming He

Scene Graph Generation (SGG) remains a challenging visual understanding task due to its compositional property.

graph construction Graph Generation +1

Paper
Code

SGTR: Generating Scene Graph by Learning Compositional Triplets with Transformer

no code implementations • 29 Sep 2021 • Rongjie Li, Songyang Zhang, Xuming He

We develop a decoding-and-assembling paradigm for the end-to-end scene graph generation.

Graph Generation Scene Graph Generation

Paper
Add Code

Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation

3 code implementations • CVPR 2021 • Rongjie Li, Songyang Zhang, Bo Wan, Xuming He

Scene graph generation is an important visual understanding task with a broad range of vision applications.

Graph Generation Unbiased Scene Graph Generation

Paper
Code

Pose-aware Multi-level Feature Network for Human Object Interaction Detection

1 code implementation • ICCV 2019 • Bo Wan, Desen Zhou, Yongfei Liu, Rongjie Li, Xuming He

Reasoning human object interactions is a core problem in human-centric scene understanding and detecting such relations poses a unique challenge to vision systems due to large variations in human-object configurations, multiple co-occurring relation instances and subtle visual difference between relation categories.

Human-Object Interaction Detection Object +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.