Pyramid Vision Transformer v2

Introduced by Wang et al. in PVT v2: Improved Baselines with Pyramid Vision Transformer

Pyramid Vision Transformer v2 (PVTv2) is a type of Vision Transformer for detection and segmentation tasks. It improves on PVTv1 with three design changes: (1) overlapping patch embedding, (2) convolutional feed-forward networks, and (3) linear-complexity attention layers. All three changes are orthogonal to the PVTv1 framework.

Source: PVT v2: Improved Baselines with Pyramid Vision Transformer
