[ViT] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

{Pure transformer}

Paper: https://arxiv.org/abs/2010.11929

Code: https://github.com/lucidrains/vit-pytorch