Dynamic Resolution Vision Transformer: Faster Inference Without Extra Training
Vision Transformers run at a fixed image resolution. Easy images still go through the full high resolution pipeline, wasting time and compute.
In this post, I explain a Dynamic Vision Transformer that
prashantpandey.hashnode.dev2 min read