fabkosta — LessWrong

Selfhood as Scarcity: A Paradox of AGI Containment

The following is a paradox that I have not seen described anywhere so far. Imagine an Artificial General Intelligence (AGI) that can self-replicate among many computers. You can shut down any of the computers it is running on, and it just keeps running. As long as there is a sufficiently...

Jun 21

Coverage: A Framework for Measuring What Language Models Actually Preserve

In the past few weeks I have been busy with an idea that kept nagging me: I believe we are evaluating language models the wrong way. While I am still working on a paper proving my point, my claim is already more than just a hypothesis. Preliminary data I have...

May 251

Coverage: A Framework for Measuring What Language Models Actually Preserve

Standard language model evaluation has a structural problem that I do not think gets enough attention. The problem is not that our metrics are imprecise. It is that they are measuring something different from what we actually care about. This post introduces a framework I have been developing called Coverage,...

May 251