no code implementations • 7 May 2024 • Yadang Chen, Wentao Zhu, Zhi-Xin Yang, Enhua Wu
Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames.
no code implementations • 24 Apr 2023 • Yadang Chen, Dingwei Zhang, Zhi-Xin Yang, Enhua Wu
For limitation 2, we first adaptively decide whether to update the memory features depending on the variation of foreground objects to reduce temporal redundancy.