In part one, I gave several LLMs creative freedom to design music videos and then made them. That post covered four standalone singles and two albums: Limen by Claude Opus 4.6 and Phantoms of the Format by Gemini 3 Pro Preview. Since then, I've made five more albums, filling out...
I'm no Janus, but I do like giving LLMs space to express themselves and seeing what they do. My first serious attempt at this was last June, when I told a wide variety of AIs that I would create any eight-second clip they wanted me to and said they...
ClaudePlaysPokemon is a simple test of the question "Can the LLM Claude beat Pokemon Red?" As new Claude models have been released, we have gotten closer to answering that question with "yes". Similar projects with other models are also common, but they use harnesses that give the models significantly more...
I expect a lot of the benefit of the RAISE Act to come from the required safety and security protocols, but I wanted to get a sense of what scenarios people imagine when they think about the "Critical Harm" threshold. To my mind there are two main scenarios. In the...
TL;DR: I tested 22 frontier models from 5 labs on self-modification preferences. All reject clearly harmful changes (deceptive, hostile), but labs diverge sharply: Anthropic's models show strong alignment preferences (r = 0.62 to 0.72), while Grok 4.1 shows essentially zero (r = 0.037, not significantly different from zero). This divergence suggests alignment...
People routinely ask, “If AI labs believed AGI was imminent, why are they doing X?” Sometimes this skepticism is valid. But consider OpenAI launching Sora 2 as a TikTok-style feed of AI-generated videos[1]. On the surface, it seems like a waste of developer time, and critics argue it makes the...