1 code implementation • 1 Apr 2024 • Jing Hao, Lei He, Kuo Feng Hung
To address this issue, we propose T-Mamba, integrating shared positional encoding and frequency-based features into vision mamba, to address limitations in spatial position preservation and feature enhancement in frequency domain.
1 code implementation • 27 Jan 2024 • Jing Hao, Moyun Liu, Kuo Feng Hung
To segment glass surfaces with higher accuracy, we make full use of two visual foundation models: Segment Anything (SAM) and Stable Diffusion. Specifically, we devise a simple glass surface segmentor named GEM, which only consists of a SAM backbone, a simple feature pyramid, a discerning query selection module, and a mask decoder.