Satya Benson — LessWrong

Modeling a Constant-Compute Automated AI R&D Process

We’d like to know how much limits on compute scaling will constrain AI R&D. This post doesn’t have answers, but it does attempt to clarify thinking about how to use economic models to explore the question. The Standard Model of Idea Production: A general “Jones-style” model of idea production[1] is...

Mar 12 · 20

Goodfire and Training on Interpretability

Goodfire wrote “Intentionally designing the future of AI” about training on interpretability. This seems like an instance of “The Most Forbidden Technique”, which has been warned against over and over: optimization pressure on interpretability technique [T] eventually degrades [T]. Goodfire claims they are aware of the associated risks and...

Feb 6 · 32

Moving Past the Question of Consciousness: A Thought Experiment

Humans are contacted by a mysterious type of being calling themselves “Galabren” who say they are “aelthous”. They’d like to know if we, too, are aelthous, since if we are they’d like to treat us well, as they care about aelthous things. We ask the Galabren what aelthous means and...

Jun 19, 2025 · 13

satchlj's Shortform

Nov 15, 2024 · 1