Alignment Does Not Need to Be Opaque! An Introduction to Feature Steering with Reinforcement Learning — LessWrong