If a continuous function goes from value A to value B, it must pass through every value in between. In other words, tipping points must necessarily exist.
I propose a more specific idea: if you are uniformly uncertain about the fractional part of $x$, then $\mathbb{E}[\lceil x \rceil] = \mathbb{E}[x] + \tfrac{1}{2}$.
E.g., if you hurry on the way to the subway station without knowing when the next train arrives and got there 10 seconds earlier than if you didn't hurry, you win exactly the same 10 seconds in expectation.
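A quick sketch of the arithmetic behind that claim, in my own notation (assume trains depart every $T$ seconds, you arrive at time $x$, the fractional part of $x/T$ is uniform on $[0,1)$, and you board the first train at or after $x$):

$$\mathbb{E}\left[T\left\lceil \frac{x}{T} \right\rceil\right] = \mathbb{E}[x] + T\,\mathbb{E}\left[1 - \left\{\frac{x}{T}\right\}\right] = \mathbb{E}[x] + \frac{T}{2},$$

so shifting your arrival 10 seconds earlier shifts the expected boarding time exactly 10 seconds earlier, regardless of $T$.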
Untestability: you cannot safely experiment on near-ASI (I mean, you can, but you’re not guaranteed not to cross the threshold into the danger zone, and the authors believe that anything you can learn from before won’t be too useful).
I think "won't be too useful" is kinda misleading. Point is more like "it's at least as difficult as launching a rocket into space without good theory about how gravity works and what the space is". Early tests and experiments are useful! They can help you with the theory! You just want to be completely sure that you are not in your test rocket yourself.
At times the authors appeal to prominent figures as evidence that the danger is widely acknowledged. At other times, the book paints the entire ML and AI safety ecosystem as naive, reckless, or intellectually unserious.
I see no contradiction between these two statements:
People totally can know about the risk without also knowing what to do about it.
Thanks for your concern!
I think I worded it poorly. I think it is an "internally visible mental phenomenon" for me. I do know how it feels and have some access to this thing. It's different from hyperstition and different from "white doublethink"/"gamification of hyperstition". It's easy enough to summon it on command and check that, yeah, it's that thing. It's the thing that helps me jump into a lake from a 7-meter cliff, that helps me get up from a very comfy bed, that sometimes helps me overcome social anxiety. But I didn't generalise from these examples to one unified concept before.
And in the cases where I sometimes do it, my skill issues are due to the fact that the access is not easy enough:
Do you think I'm wrong and this is a different thing?
Thank you! Datapoint: I think at least some parts of this can be useful for me personally.
Somewhat connected to the first part, one of the most "internal-memetic" moments from "Project: Lawful" for me is this short exchange between Keltham and Maillol:
"For that Matter, what is the Governance budget?"
"Don't panic. Nobody knows."
"Why exactly should I not panic?"
"Because it won't actually help."
"Very sensible."
If an evil and not-very-smart bureaucrat understands it, I can too :)
The third part is the most interesting. It makes perfect sense, but I have no easy-to-access perception of this thing. Will try to do something with this skill issue. Also, "internal script / pseudo-predictive sort-of-world-model that instead connects to motor output" looks like the kind of thing that would have a word of at most 3 syllables in Baseline. Do you know a good term for it?
However, I feel that all this is much more applicable to the kinds of "going insane" that look like "person does stupid and dramatic things", and less (but nonzero) applicable to other kinds, e.g., anxiety, depression, or passive despair in the background (like a nonverbalized "meh, it doesn't really matter what I do, so I can work a little less today").
list of fiction genres encompassed by almost any randomly selected… say, twenty… non-“traditional roleplaying game” “TTRPGs”.
Hmmm... "Almost any genre ever" for Fate? (Ok, not the genres where main characters must be very incompetent.) I personally prefer systems with more narrow focus which support the tropes of the specific genre, but your statement is just false.
D&D is good for heroic fantasy and mixes of heroic fantasy with some other staff. D&D is bad for almost everything else. Of course, some modules try to do something else with D&D, but they usually would be better with some other system.
Random thought: maybe it makes sense to allow mostly-LLM-generated posts if the full prompt is provided (maybe itself in a collapsible section). Not sure.
Obviously, there are situations when Alice couldn't just buy the same thing on her own. But besides that, plausible deniability:
Would you also approve of other costly signals? Like, I dunno, cutting off a phalanx of a pinky when entering a relationship.
I think that "habitual defectors" are more likely to pretend to choose an option that is not disapproved by society.
Ok, guys, really, does anyone (Claude says probably not) track whether there are negative utilitarians in the leadership of top AI companies?
That's kinda important, don't you think?
Happy New Year, btw.
UPD: Obviously people think it's not a good point. Why? Do you think it's not important, not neglected, or that the answer is obviously "no"?