Govind Pimpale
Posts
149 karma · Current safety training techniques do not fully transfer to the agent setting · 1mo · 8 comments
45 karma · ~80 Interesting Questions about Foundation Model Agent Safety · 1mo · 4 comments
69 karma · Analyzing DeepMind's Probabilistic Methods for Evaluating Agent Capabilities · Ω · 4mo · 0 comments
Wiki Contributions
Comments