x
Assessing heterogeneity in METR's late 2025 developer productivity experiment — LessWrong