[MLSN #8] Mechanistic interpretability, using law to inform AI alignment, scaling laws for proxy gaming — LessWrong