"DreamTeacher: Pretraining Image Backbones with Deep Generative Models"
"Distilling the Knowledge in a Neural Network"
"Co-training 2^L Submodels for Visual Recognition"
https://phamdinhkhanh.github.io/2021/03/13/KnownledgeDistillation.html