JakubK

Averting Catastrophe: Decision Theory for COVID-19, Climate Change, and Potential Disasters of All Kinds

Averting Catastrophe is a book by Cass Sunstein that NYU Press published in April 2021. I'm surprised that nobody has mentioned it on the EA Forum before (AFAICT). Quoting from the NYU Press page:

> Averting Catastrophe explores how governments ought to make decisions in times of imminent disaster. Cass...

May 2, 2023 · 10

Notes on "the hot mess theory of AI misalignment"

TL;DR: the author makes some reasonable, good-faith arguments (including analysis of a neat survey that he sent to his friends), but I don't think these arguments are strong enough to support his conclusions in a significant way. Jascha Sohl-Dickstein is a senior staff research scientist at Google Brain with an...

Apr 21, 2023 · 16

GPT-4 solves Gary Marcus-induced flubs

TLDR: GPT-4 succeeds at 15 problems from Gary Marcus that exposed failures of GPT-3. I enjoyed reading the ACX post "My Bet: AI Size Solves Flubs" last year. Here are some excerpts:

> Here’s the basic structure of an AI hype cycle:
>
> 1. Someone releases a new AI...

Mar 17, 2023 · 57

Next steps after AGISF at UMich

I ran an AGISF (technical alignment track) program at the University of Michigan last semester. At the end of the program, I shared a "next steps" document full of ideas for actions to take after finishing the program, and I've refined the document into a public version that you can...

Jan 25, 2023 · 10

List of technical AI safety exercises and projects

EDIT 3/17/2023: I've reorganized the doc and added some governance projects. I intend to maintain a list at this doc. I'll paste the current state of the doc (as of January 19th, 2023) below. I encourage people to comment with suggestions. * Levelling Up in AI Safety Research Engineering [Public]...

Jan 19, 2023 · 41

6-paragraph AI risk intro for MAISI

The Michigan AI Safety Initiative (MAISI) is a new AI safety student group at the University of Michigan. The website's "About" page includes a short intro to AI risk. I'm sharing it here for people who are interested in short pitches for AI x-risk. Feel free to comment with feedback...

Jan 19, 2023 · 11

Big list of AI safety videos

This is a Google doc containing links to:

* (all?) AI safety-related YouTube channels (intended to be comprehensive, at least for active channels with videos that seem worth sharing in an AI safety group)
* some AI safety-related podcasts
* some suggested videos, sorted into categories like "agent foundations," "individual...

Jan 9, 2023 · 11

JakubK

GPT-4 solves Gary Marcus-induced flubs

List of technical AI safety exercises and projects

Can we get full audio for Eliezer's conversation with Sam Harris?

Best introductory overviews of AGI safety?
