Generative Pretraining from Pixels (Image GPT), 

uses transformer for pixel level image completion, just like other GPT for text completion

In this post, I summary the ideas from a new paper from Google Brain.