Strengthening Red Teams: A Modular Scaffold for Control Evaluations — LessWrong