x
Think carefully before calling RL policies "agents" — LessWrong