1 code implementation • 29 Mar 2024 • Peijie Qiu, Jin Yang, Sayantan Kumar, Soumyendu Sekhar Ghosh, Aristeidis Sotiras
However, we argue that the current design of the vision transformer-based UNet (ViT-UNet) segmentation models may not effectively handle the heterogeneous appearance (e.g., varying shapes and sizes) of objects of interest in medical image segmentation tasks.
Ranked #2 on Medical Image Segmentation on ACDC