LESSWRONG
LW

69
Wikitags

Abstraction

Edited by satpugnet, adamShimi last updated 25th Jul 2025

Abstraction is the process of simplifying a system by capturing only the essential features needed for your purpose, while deliberately ignoring irrelevant details. In AI alignment, effective abstraction means creating models or concepts that genuinely reflect what matters for reasoning or control, not just convenient proxies. If the abstraction misses important structure, it can fail dramatically when optimized or applied in new situations. The challenge is to develop abstractions that remain valid and useful, even as systems scale or face new pressures.

(This is a stub, please rewrite if you have a better tag description).

Subscribe
Discussion
2
Subscribe
Discussion
2
Posts tagged Abstraction
37What is Abstraction?
Ω
johnswentworth
6y
Ω
8
29Abstraction = Information at a Distance
Ω
johnswentworth
5y
Ω
1
25What is abstraction?
Q
Adam Zerner, johnswentworth
7y
Q
11
12Whence Your Abstractions?
Eliezer Yudkowsky
17y
6
11Underconstrained Abstractions
Eliezer Yudkowsky
17y
27
177Alignment By Default
Ω
johnswentworth
5y
Ω
101
97Public Static: What is Abstraction?
Ω
johnswentworth
5y
Ω
18
90Writing Causal Models Like We Write Programs
Ω
johnswentworth
5y
Ω
11
61Pointing to a Flower
Ω
johnswentworth
5y
Ω
18
48(A -> B) -> A in Causal DAGs
Ω
johnswentworth
6y
Ω
11
43Motivating Abstraction-First Decision Theory
Ω
johnswentworth
5y
Ω
16
37Trace README
Ω
johnswentworth
6y
Ω
1
37Logical Representation of Causal Models
Ω
johnswentworth
6y
Ω
0
36The Indexing Problem
Ω
johnswentworth
5y
Ω
2
35Cartesian Boundary as Abstraction Boundary
Ω
johnswentworth
5y
Ω
3
Load More (15/90)
Add Posts