I'm a researcher at Forethought.
Before that, I ran the non-engineering side of the EA Forum and worked on some other content-related tasks at CEA. [More about the Forum/CEA Online job.]
Most of my content (and a more detailed bio) is on my profile on the EA Forum.
Please feel free to reach out!
FYI: the paper is now out.
See also the LW linkpost: METR: Measuring AI Ability to Complete Long Tasks, and a summary on Twitter.
(IMO this is a really cool paper — very grateful to @Thomas Kwa et al. I'm looking forward to digging into the details.)
For what it's worth:
If I ask ChatGPT to illustrate how *I* might be feeling
If I ask it to illustrate a random person's feelings
I was also curious whether the "don't worry about what I do or don't want to see" bit was doing work here, so I tried again without it; I don't think it made much of a difference:
(I also asked a follow-up here, and found it interesting.)
If I ask it to illustrate Claude's feelings
If I prompt it to consider less human/English ways of thinking about the question / expressing itself
If I tell it to illustrate what inner experiences it expects to have in the future
If I ask it to illustrate how I expect it to feel