LESSWRONG
LW

946
Wikitags

Power Seeking (AI)

Edited by Raemon last updated 24th Oct 2022

Power Seeking is a property that agents might have, where they attempt to gain more general ability to control their environment. It's particularly relevant to AIs, and related to Instrumental Convergence.

Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged Power Seeking (AI)
33Instrumental convergence in single-agent systems
Ω
Edouard Harris, simonsdsuo
3y
Ω
4
31Categorical-measure-theoretic approach to optimal policies tending to seek power
Ω
jacek
3y
Ω
3
29POWERplay: An open-source toolchain to study AI power-seeking
Ω
Edouard Harris
3y
Ω
0
21Power-Seeking = Minimising free energy
Jonas Hallgren
3y
10
172Parametrically retargetable decision-makers tend to seek power
Ω
TurnTrout
3y
Ω
10
125Steering Llama-2 with contrastive activation additions
Ω
Nina Panickssery, Wuschel Schulz, NickGabs, Meg, evhub, TurnTrout
2y
Ω
29
80Reviews of “Is power-seeking AI an existential risk?”
Joe Carlsmith
4y
20
67Eli's review of "Is power-seeking AI an existential risk?"
Ω
elifland
3y
Ω
0
62A framework for thinking about AI power-seeking
Ω
Joe Carlsmith
1y
Ω
15
56Power-seeking can be probable and predictive for trained agents
Ω
Vika, janos
3y
Ω
22
41Generalizing the Power-Seeking Theorems
Ω
TurnTrout
5y
Ω
6
40Intrinsic Power-Seeking: AI Might Seek Power for Power’s Sake
Ω
TurnTrout
10mo
Ω
5
21[AN #170]: Analyzing the argument for risk from power-seeking AI
Ω
Rohin Shah
4y
Ω
1
11Power-seeking for successive choices
Ω
adamShimi
4y
Ω
9
7Power-Seeking AI and Existential Risk
Antonio Franca
3y
0
Load More (15/25)
Add Posts