Trying to measure AI deception capabilities using temporary simulation fine-tuning — LessWrong