VLAs as Model Organisms for AI Safety
What Training Robot Policies Taught Me About Emergent Capabilities and Control I spent six weeks training a humanoid robot to do household tasks. Along the way, my research lead and I started noticing things about the particular failure modes of the robot that seemed to indicate some strange architectural vulnerabilities...
Jan 1816