Building and evaluating alignment auditing agents — LessWrong