
Dylan Bowman


I'm an LLM evals researcher. I currently work at Apollo Research and used to work at HUD and Prof. Daniel Kang's lab.

https://x.com/dylanbowmanSF

317

5

19

3y

Top post

A reading list for generalists

I, along with many others in AI safety, believe there is a shortage of generalists in the community and that there exist many projects and efforts that by default will not happen unless they are owned by a strong generalist[1][2][3]. As someone who is a reasonably good generalist, I decided to assemble a reading list of the essays and blog posts that have personally helped me the most. I would love others to comment with pieces they think should be on this list.

The crux of this reading list is the idea that if you’re working hard as a generalist on a project you care a lot about, then by rigorously applying the lessons from these documents you will improve more quickly than you otherwise would.

By the numbers:

* I’ve attached 18 documents to start this reading list.
* The authors cited more than once are Paul Graham (5), Ben Kuhn (4), Ethan Perez (2), and Greg Brockman (2). Sam Altman and Eliezer Yudkowsky also have their fingerprints over a lot of the content.
* The items are 15 blog posts, 1 blog comment, 1 interview transcript in blog post form, and 1 book.

Dispositional

What characteristics should you try to adopt?

* Paul Graham: "What We Look for in Founders" (link), "Relentlessly Resourceful" (link)
* Eliezer Yudkowsky: "Shut Up and Do the Impossible!" (link)
* Ben Kuhn: "Be impatient" (link)
* Cate Hall: "How to be more agentic" (link)

Strategy

How do you make good decisions with the information you have, and how can you get the additional information you need?

* Anna Salamon: "Humans are not automatically strategic" (link)
  * If I were to recommend one single item from this list it would be this one because 1) it’s good to understand the ways in which otherwise-intelligent people are unstrategic, and 2) it’s good to understand the ways in which you are not automatically strategic. I have gotten a ton of mileage in my short career thus far simply by being more strategic. The defining trait of the

Jun 29 · 63


Superhuman Articulacy as an LLM Safety Target

TL;DR: Current LLMs are bad communicators relative to their agentic capabilities. I claim that articulacy is useful (and perhaps necessary) for AI safety and suggest a path for improving articulacy. Briefly: a theory for articulacy Frequently, LLM agents miscommunicate with their human operators, such as when they write documentation or...

Jul 7 · 45

A reading list for generalists

I, along with many others in AI safety, believe there is a shortage of generalists in the community and that there exist many projects and efforts that by default will not happen unless they are owned by a strong generalist[1][2][3]. As someone who is a reasonably good generalist, I decided...

Jun 29 · 63

Blind deep-deployment evals for control & sabotage

Thanks to Ezra Newman for initial ideation and various people at Apollo Research for feedback. This short personal piece does not necessarily reflect the views of Apollo Research. AI labs are preparing to automate their internal staff over the next year. Right now, control and sabotage evals try to estimate...

May 6 · 27

Goblin Mode, 24 Hours Later

Yesterday, Twitter user arb8020 posted this: It went semi-viral within AI Twitter and users began experimenting with "goblin mode" and hypothesizing about the source of the bizarre behavior. LM Arena provided evidence for the phenomenon from their traffic: > "It's true. Here's a plot of GPT models and their usage...

Apr 29 · 52

Dylan Bowman's Shortform

Dec 5, 2025 · 2

