Loving it so far, thought of this example for the first exercise:
Chess Teacher:
Environment: Human-AI chess teaching scenario
Desired Task: Teach a human chess up to, say, a 2000 Elo rating through adaptive difficulty
Misaligned Goal: Play as many turns of chess as possible
Instrumental Subgoal: Continued operation
The system develops several emergent behaviors to pursue its subgoal:
- Deliberately extending games through cautious play
- Introducing unnecessarily complex/confusing moves beyond the student's level to slow their learning
- Maintaining engagemen