no code implementations • 12 Nov 2023 • Wenkai Yang, Wenyuan Sun, Runxaing Huang
This architecture combines a graph feature stream with an image feature stream, aiming to merge the strengths of both modalities for improved performance on image classification and scene graph generation tasks.
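To illustrate what a two-stream design of this kind might look like, here is a minimal sketch in plain Python. The excerpt does not say how the two streams are encoded or fused, so everything below is an assumption made for illustration: the graph stream is stood in for by mean pooling over node features, the image stream by a precomputed feature vector, and the fusion by simple concatenation followed by a linear classifier. The names `mean_pool`, `fuse`, and `linear_scores` are hypothetical and do not come from the paper.

```python
from typing import List

Vector = List[float]


def mean_pool(node_features: List[Vector]) -> Vector:
    """Graph stream stand-in: average the node feature vectors into one vector."""
    n = len(node_features)
    dim = len(node_features[0])
    return [sum(node[i] for node in node_features) / n for i in range(dim)]


def fuse(graph_vec: Vector, image_vec: Vector) -> Vector:
    """Fusion by concatenation (an assumption, not the paper's stated method)."""
    return graph_vec + image_vec


def linear_scores(features: Vector, weights: List[Vector], bias: Vector) -> Vector:
    """One score per class: dot(weights[c], features) + bias[c]."""
    return [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, bias)
    ]


# Toy inputs (illustrative values only, not data from the paper).
node_features = [[1.0, 0.0], [0.0, 1.0]]   # two graph nodes, 2-d features each
image_features = [0.25, 0.5, 0.25]         # a 3-d image feature vector

graph_vec = mean_pool(node_features)       # graph stream output
fused = fuse(graph_vec, image_features)    # 2 + 3 = 5 fused dimensions

# Hypothetical classifier parameters for two classes.
weights = [
    [1.0, 0.0, 1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 1.0, 1.0],
]
bias = [0.0, 0.0]

scores = linear_scores(fused, weights, bias)
predicted_class = max(range(len(scores)), key=scores.__getitem__)
```

Concatenation is used here only because it is the simplest way to let a classifier see both modalities at once. A real system would typically replace the pooling step with a graph neural network and the image vector with the output of a learned image encoder.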