[PVTv2] Improved Baselines with Pyramid Vision Transformer

{High Output Resolution, Progressive Shrinking Pyramid, Spatial-Reduction Attention (SRA)}

Paper: https://arxiv.org/abs/2106.13797

Code: https://github.com/whai362/PVT