Power-seeking agents will likely be developed
I am going to argue that we will likely eventually get AIs that are strongly power-seeking, much more so than current SOTA LLMs.[1]

TLDR

1. Right now SOTA LLMs are still largely in a simulator regime. This buffers against power-seeking.
2. Long-horizon RL or similar methods (applied to LLMs or...
May 2042