x

LESSWRONG

LW

AI Capabilities — LessWrong

AI Capabilities

Edited by plex last updated 29th Aug 2021

AI Capabilities are the growing abilities of AIs to act effectively in increasingly complex environments. It is often compared to to AI Alignment, which refers to efforts to ensure that these effective actions taken by AIs are also intended by the creators and beneficial to humanity.

Add Posts

Posts tagged AI Capabilities

9

137EfficientZero: human ALE sample-efficiency w/MuZero+self-supervised

5y

52

7

89[Paper] Stress-testing capability elicitation with password-locked models

Fabien Roger, ryan_greenblatt

2y

10

7

62A small update to the Sparse Coding interim research report

Lee Sharkey, Dan Braun, beren

3y

5

7

60Memorizing weak examples can elicit strong behavior out of password-locked models

Fabien Roger, ryan_greenblatt

2y

5

5

301EfficientZero: How It Works

5y

50

5

267Getting 50% (SoTA) on ARC-AGI with GPT-4o

ryan_greenblatt

2y

50

5

58Competitive programming with AlphaCode

4y

36

3

353What DALL-E 2 can and cannot do

Swimmer963 (Miranda Dixon-Luinenburg)

4y

305

3

278Is AI Progress Impossible To Predict?

4y

39

3

93Meta AI announces Cicero: Human-Level Diplomacy play (with dialogue)

Jacy Reese Anthis

4y

64

3

79The case for a negative alignment tax

Cameron Berg, Kvee, Diogo de Lucena, Trent Hodgeson

2y

20

3

34What will the scaled up GATO look like? (Updated with questions)

4y

22

3

16[Crosspost] AlphaTensor, Taste, and the Scalability of AI

4y

4

3

15DeepMind on Stratego, an imperfect information game

4y

9

3

10Devil's Advocate: Adverse Selection Against Conscientiousness

lionhearted (Sebastian Marshall)

3y

2

Load More (15/153)

Add Posts