Abstract: We propose DINOBot, a novel imitation learning framework for robot manipulation, which leverages the image-level and pixel-level capabilities of features extracted from Vision Transformers ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results