What are the most interesting / challenging evals (for humans) available? — LessWrong