Exploratory Analysis of RLHF Transformers with TransformerLens — LessWrong