Abstract: Although vision transformers (ViTs) have achieved great success in computer vision, the heavy computational cost hampers their applications to dense prediction tasks such as semantic ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results