Transformers 

Transformer architecture (or attention-based network) has achieved state-of-the-art results in many NLP (Natural Language Processing) tasks. One of the main breakthroughs with the Transformer model could be the powerful GPT-3 released in the middle of the year, which has been awarded Best Paper at NeurIPS2020. 

Overall, there are 2 major model architectures in the related work of adopting Transformers in Computer Vision tasks: