Introducing METR's Autonomy Evaluation Resources — LessWrong