AI Robustness — LessWrong

AI Robustness

Edited by markov last updated 24th Oct 2022

AI Robustness is an agent's ability to maintain its goals and capabilities when exposed to different data distributions or environments.

Posts tagged AI Robustness

5

145 Robustness to Scale

Scott Garrabrant

8y

23

3

71 AI Safety in a World of Vulnerable Machine Learning Systems

AdamGleave, EuanMcLean

3y

29

2

38 Robustness to Scaling Down: More Important Than I Thought

4y

5

2

22 Do the Safety Properties of Powerful AI Systems Need to be Adversarially Robust? Why?

3y

42

2

16 Squeezing foundations research assistance out of formal logic narrow AI.

3y

1

2

16 AXRP Episode 17 - Training for Very High Reliability with Daniel Ziegler

4y

0

2

9 Desiderata for an AI

Nathan Helm-Burger

3y

0

2

-11 Random Observation on AI goals

[anonymous]

3y

2

1

41 What's new at FAR AI

AdamGleave, EuanMcLean

2y

0

1

41 Beyond the Board: Exploring AI Robustness Through Go

2y

2

1

19 Why Eliminating Deception Won’t Align AI

Priyanka Bharadwaj

10mo

6

1

18 2023 Alignment Research Updates from FAR AI

AdamGleave, EuanMcLean

2y

0

1

14 Does robustness improve with scale?

ChengCheng, niki.h, Ian McKenzie, Oskar Hollinsworth, Tom Tseng, AdamGleave

2y

0

1

11 On Interpretability's Robustness

3y

0

1

10 Robustness & Evolution [MLAISU W02]

3y

0
