How to replicate and extend our alignment faking demo — LessWrong