Vision limits under occlusion
Zheng et al. 2025, EgoScale, arXiv:2602.16710
Evaluated 59 vision-language models on dexterous manipulation tasks. Card manipulation and force closure tasks showed the highest failure rates. These are precisely the task classes where the contact interface is occluded during execution.