CS480/680 - Lecture 19:

Attention and Transformer Networks

[Vaswani et al., Attention is All You Need, NeurIPS, 2017]

[Link]