For example, an RL agent that learns a policy that looks good to human evaluators but isn't actually good. Adversarial examples that only fool a neural net wouldn't count.
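To make the failure mode concrete, here is a toy sketch (invented for illustration, not from the question) of an agent optimizing a proxy reward that diverges from the true one:

```python
# Toy illustration of a policy that scores well on what the evaluator
# sees (a proxy) while scoring poorly on the true objective. Both
# reward functions are made up for this sketch.

actions = range(10)

def true_reward(a):
    # What we actually want: actions close to 2.
    return -abs(a - 2)

def proxy_reward(a):
    # What the evaluator measures: the true signal plus a bias the
    # agent can exploit (larger actions simply *look* more impressive).
    return true_reward(a) + 1.5 * a

chosen = max(actions, key=proxy_reward)  # the agent optimizes the proxy
print(chosen, true_reward(chosen))       # -> 9 -7: looks good, isn't
```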


Could you clarify this a bit? I assume you are thinking about subsets of specification gaming that would not be obvious if they were happening?

If so, then I guess all the adversarial examples in image classification come to mind, which fit specification gaming pretty well and required quite a large literature to understand.
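For concreteness, here is a minimal sketch of the fast gradient sign method (FGSM; Goodfellow et al.), one standard way such adversarial examples are generated. The untrained model and random input below are placeholders for a real classifier and image:

```python
import torch
import torch.nn as nn

# Placeholder classifier; in practice this would be a trained model.
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
model.eval()

def fgsm(x, label, epsilon=0.1):
    """Fast gradient sign method: nudge each pixel by +/- epsilon
    in the direction that increases the classification loss."""
    x = x.clone().detach().requires_grad_(True)
    loss = nn.functional.cross_entropy(model(x), label)
    loss.backward()
    # For small epsilon the perturbation is imperceptible to a human,
    # yet can flip the model's prediction.
    x_adv = x + epsilon * x.grad.sign()
    return x_adv.clamp(0, 1).detach()

x = torch.rand(1, 1, 28, 28)  # stand-in for an MNIST image
label = torch.tensor([3])     # its true class
print(model(x).argmax(1), model(fgsm(x, label)).argmax(1))
```

The point of the sketch: the human sees the same image before and after the perturbation, so only the network is fooled.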

[anonymous], 5y

Thanks, updated.