Our Embodied Interpretability Paper Got Accepted by ICML 2026
Our latest work has been accepted at ICML 2026. As VLA models become a key route towards general-purpose robot policies, we should not only ask whether a robot completes a task, but also why it chooses an action. Sometimes a policy can appear to work while relying on the wrong visual cues, such as background texture, lighting, or shortcuts in the scene. Our work provides a way to test this more directly: which parts of the image actually change the robot’s action? This helps researchers diagnose VLA failure modes and better understand why some policies generalise to new environments while others do not.
Read more






