Steering Language Models in Multiple Directions Simultaneously — LessWrong