Image Model Blocks

Local Patch Interaction

Introduced by El-Nouby et al. in XCiT: Cross-Covariance Image Transformers

Local Patch Interaction, or LPI, is a module used for the XCiT layer to enable explicit communication across patches. LPI consists of two depth-wise 3×3 convolutional layers with Batch Normalization and GELU non-linearity in between. Due to its depth-wise structure, the LPI block has a negligible overhead in terms of parameters, as well as a limited overhead in terms of throughput and memory usage during inference.

Source: XCiT: Cross-Covariance Image Transformers

Papers


Paper Code Results Date Stars

Tasks


Categories