TheSigillite

MEng student at University of Toronto. Interested in VLAs, AI safety, and Science.

VLAs as Model Organisms for AI Safety

Jan 18•15

13 karma

1 post

Member for a month

TheSigillite — LessWrong

VLAs as Model Organisms for AI Safety

TheSigillite

26d

What Training Robot Policies Taught Me About Emergent Capabilities and Control

I spent six weeks training a humanoid robot to do household tasks. Along the way, my research lead and I started noticing failure modes in the robot that seemed to point to some strange architectural vulnerabilities of VLAs as a whole.

Our work was done as part of the Stanford BEHAVIOR-1K Challenge, which involved training Vision-Language-Action (VLA) models that take in camera images and output robot motor commands to complete everyday tasks. Think tidying your bedroom, putting your dishes away, moving your Halloween decorations to storage. Our final score was a modest 1.78%, but most gains happened almost overnight after...
