Adam logs into his terminal at 9:07 AM. Dust motes flutter off his keyboard, dancing in the morning light. Not that he notices – he is focused on his work for the day. He has one task today: to design an evaluation suite for measuring situational awareness in their latest...
I have tried and failed many times to write a certain essay. With inspiration from Scott Alexander and Borges, I have reframed it as a story. That reframing has been productive, and leads to the following plot. The setting is a medieval kingdom. Through circumstances not relevant to this plot,...
Lots of people seem to resonate with the idea that AI benchmarks are getting more and more meaningless – either because they're being run in a misleading way, or because they aren't tracking important things. I think the right response is for people to do their own personal evaluations of...
I'm an economist and quite new to AI alignment. In reading about the perils of persuasive AI, I was reminded of an influential model in economic theory: the Bayesian persuasion model (Kamenica and Gentzkow, 2011). It's used to model situations in which a decisionmaker wants to learn from a biased...