Luca Baroni — LessWrong

An ARENA 6.0 Capstone: Model Organism of Encoded Reasoning

TL;DR: For our capstone project in ARENA 6.0 (Sep 2025), we tried to create a model organism of encoded reasoning from Qwen3-4B using RL. Specifically, we tried to make the model solve math questions from GSM8K using a chain-of-thought that kept its accuracy but led other judge models to...

Nov 5, 2025•6

Transformers Don't Need LayerNorm at Inference Time: Implications for Interpretability

by submarat, Joachim Schaeffer, Luca Baroni, galvsk, and StefanHex

This work was produced during MARS and SPAR. An arXiv version is available at https://arxiv.org/abs/2507.02559. Code is on GitHub and models are on HuggingFace. TL;DR: We scaled LayerNorm (LN) removal by fine-tuning to GPT-2 XL: * We improve training stability by regularizing the activation standard deviation across token positions, and we improve the training code. *...

Jul 23, 2025•31