[PVT] Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions

{High Output Resolution, A Progressive Shrinking Pyramid, Spatial-reduction Attention - SRA}

Paper: https://arxiv.org/pdf/2102.12122.pdf

Code: https://github.com/whai362/PVT