1 code implementation • ICCV 2023 • Yixuan Wu, Zhao Zhang, Xie Chi, Feng Zhu, Rui Zhao
To overcome this limitation, we propose a more realistic and general setting, named Group-wise Referring Expression Segmentation (GRES), which expands RES to a collection of related images, allowing the described objects to be present in a subset of input images.