Referring expression generation

13 papers with code • 0 benchmarks • 1 dataset

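Referring expression generation is the task of producing a natural-language description that unambiguously identifies a target object in a scene, such as a region of an image.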

Most implemented papers

Elysium: Exploring Object-level Perception in Videos via MLLM

hon-wong/elysium • 25 Mar 2024

Multi-modal Large Language Models (MLLMs) have demonstrated their ability to perceive objects in still images, but their application in video-related tasks, such as object tracking, remains understudied.