DeiT:Training data-efficient image transformers & distillation through attention
2012.12877v2
PreviousDistilling the Knowledge in a Neural NetworkNextSwin Transformer:Hierarchical Vision Transformer using Shifted Windows
Last updated
2012.12877v2
Last updated