This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
Home
All Posts
Concepts
Library
Sequence Highlights
Rationality: A-Z
The Codex
HPMOR
Best Of
Community Events
AISC end of program presentations
Thu Jun 15
•
Online
Effective Altruism Virtual Programs Jul-Aug 2023
Sat Jun 17
•
Online
Argentines LW/SSC/EA/MIRIx - Call to All
Tue Apr 18
•
Online
EA/LW/SSC Argentina First Meeting!
Sun Jun 4
•
Online
Subscribe (RSS/Email)
About
FAQ
All Posts
Sorted by New
Timeframe:
All time
Daily
Weekly
Monthly
Yearly
Sorted by:
Magic (New & Upvoted)
Top
Top (Inflation Adjusted)
Recent Comments
New
Old
Filtered by:
All Posts
Frontpage
Curated
Questions
Events
Show Low Karma
Show Events
208
The Base Rate Times, news through prediction markets
vandemonian
3d
38
349
The ants and the grasshopper
Richard_Ngo
8d
32
268
Book Review: How Minds Change
bc4026bd4aaa5b7fe
16d
48
113
Trust develops gradually via making bids and setting boundaries
Richard_Ngo
12d
10
382
Steering GPT-2-XL by adding an activation vector
Ω
TurnTrout
,
Monte M
,
David Udell
,
lisathiergart
,
Ulisse Mini
1mo
Ω
79
147
When is Goodhart catastrophic?
Ω
Drake Thomas
,
Thomas Kwa
24d
Ω
18
272
Predictable updating about AI risk
Joe Carlsmith
1mo
20
313
How to have Polygenically Screened Children
GeneSmith
19d
87
404
How much do you believe your results?
Eric Neyman
1mo
13
81
Hell is Game Theory Folk Theorems
jessicata
1mo
101
257
Notes on Teaching in Prison
jsd
2mo
12
245
On AutoGPT
Zvi
2mo
45
155
What would a compute monitoring plan look like? [Linkpost]
Akash
2mo
9
155
A stylized dialogue on John Wentworth's claims about markets and optimization
Ω
So8res
2mo
Ω
21
233
More information about the dangerous capability evaluations we did with GPT-4 and Claude.
Ω
Beth Barnes
3mo
Ω
54
246
"Carefully Bootstrapped Alignment" is organizationally hard
Raemon
2mo
20
241
Discussion with Nate Soares on a key alignment difficulty
Ω
HoldenKarnofsky
2mo
Ω
37
163
Acausal normalcy
Ω
Andrew_Critch
3mo
Ω
28
266
The Parable of the King and the Random Process
moridinamael
3mo
21
200
Enemies vs Malefactors
So8res
3mo
61
180
AI alignment researchers don't (seem to) stack
So8res
3mo
38
326
Please don't throw your mind away
TsviBT
4mo
41
209
Elements of Rationalist Discourse
Rob Bensinger
2mo
41
321
Cyborgism
Ω
NicholasKees
,
janus
4mo
Ω
43
307
Childhoods of exceptional people
Henrik Karlsson
4mo
58
654
SolidGoldMagikarp (plus, prompt generation)
Ω
Jessica Rumbelow
,
mwatkins
4mo
Ω
199
241
I hired 5 people to sit behind me and make me productive for a month
Simon Berens
4mo
81
381
Focus on the places where you feel shocked everyone's dropping the ball
So8res
4mo
58
289
On not getting contaminated by the wrong obesity ideas
Natália Coelho Mendonça
3mo
63
243
Basics of Rationalist Discourse
[DEACTIVATED] Duncan Sabien
4mo
178
228
My Model Of EA Burnout
LoganStrohl
5mo
48
145
Sapir-Whorf for Rationalists
[DEACTIVATED] Duncan Sabien
5mo
48
260
We don’t trade with ants
KatjaGrace
,
gwern
5mo
108
208
Recursive Middle Manager Hell
Raemon
5mo
39
218
The Feeling of Idea Scarcity
johnswentworth
5mo
21
283
Models Don't "Get Reward"
Ω
Sam Ringer
5mo
Ω
58
86
Can we efficiently distinguish different mechanisms?
Ω
paulfchristiano
5mo
Ω
30
512
Let’s think about slowing down AI
Ω
KatjaGrace
6mo
Ω
179
273
Staring into the abyss as a core life skill
benkuhn
5mo
15
236
Sazen
[DEACTIVATED] Duncan Sabien
6mo
79
235
How "Discovering Latent Knowledge in Language Models Without Supervision" Fits Into a Broader Alignment Scheme
Ω
Collin
5mo
Ω
31
174
Finite Factored Sets in Pictures
Ω
Magdalena Wache
6mo
Ω
35
231
The Plan - 2022 Update
Ω
johnswentworth
6mo
Ω
36
157
Be less scared of overconfidence
benkuhn
6mo
22
132
Mechanistic anomaly detection and ELK
Ω
paulfchristiano
7mo
Ω
20
183
What it's like to dissect a cadaver
Alok Singh
7mo
20
271
Mysteries of mode collapse
Ω
janus
7mo
Ω
51
133
Superintelligent AI is necessary for an amazing future, but far from sufficient
Ω
So8res
7mo
Ω
47
160
The Social Recession: By the Numbers
antonomon
3mo
29
207
Introduction to abstract entropy
Alex_Altair
8mo
73