DeiT:Training data-efficient image transformers & distillation through attention

2012.12877v2

Last updated