AI agents have become significantly more common in the last few months. They’re used for web scraping,[1][2] robotics and automation,[3] and are even being deployed for military use.[4] As we integrate these agents into critical processes, it is important to simulate their behavior in low-risk environments. In this post, I’ll...
The prevailing notion in AI safety circles is that a pivotal act—an action that decisively alters the trajectory of artificial intelligence development—requires superhuman AGI, which itself poses extreme risks. I challenge this assumption. Consider a pivotal act like "disable all GPUs globally." This could potentially be achieved through less advanced...
It's commonly accepted that pretty much every optimization target results in death. If you optimize for paperclips, humans die. If you optimize for curing cancer, humans die. What about optimizing for agency? The way I visualize this is after a superintelligence takeover, and the superintelligence is optimizing for intelligent agency,...
There are a number of ways that our AI and technological development can turn out. This is a hypothetical story about a way where AI development is stopped and a 'pivotal act' that does not require AGI occurs. The first thing I notice when I wake up is that the...
A few months ago, I built a predictive algorithm to determine the probability of romantic relationships between individuals in my periphery. The algorithm works like this: -Personal Factors. How likely are you as an individual to find yourself in a new relationship? This contains variables like initiative, expectations for a...
It is the year 2500. Humanity has overcome its challenges. AI alignment has been solved and a benevolent god watches over us, allowing us to create and share what we desire. But it no longer interferes too much; at least, not too much in this realm. There are a lot...
GATO is the most general agent we currently know about. It's a general-purpose model with transformer architecture. GATO can play atari games, speak to humans, and classify images. For the purpose of this post, I only really care about it being able to play games and speak to humans. Our...