no code implementations • 12 Jul 2021 • Chih-Chung Hsu, Guan-Lin Chen, Mei-Hsuan Wu
The frame-level feature is extracted from each CT slice based on any backbone network and followed by feeding the features to our within-slice-Transformer (WST) to discover the context information in the pixel dimension.