Localizing Sycophancy to Layers 24-27 in Llama 3.1 8B Using Web-Mined Reddit Rhetoric — LessWrong