LESSWRONG
LW

1587
khafra
3805510660
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
7khafra's Shortform
1y
3
The Tale of the Top-Tier Intellect
khafra14d1311

I feel like this was a sort of fractal parable, where the first two paragraphs should be enough to convey the point; but for readers who don't get it by then, it keeps beating you over the head with successively longer, more detailed, and more blatant forms of the point until the final denouement skips the "parable" part altogether.

Reply
LLM robots can't pass butter (and they are having an existential crisis about it)
khafra14d40

We need names for this phenomenon, in which the excess cognitive capacity of an AI, not needed for its task, suddenly manifests itself

 

It is so much like absurdist SF, that's the perfect source for the name--The Marvin Problem: "Here I am, brain the size of a planet and they ask me to take you down to the bridge. Call that job satisfaction? 'Cos I don't."

Reply
Zetetic explanation
khafra3mo20

There's an article type called "You Could Have Invented" that I became aware of on reading Gwern's You Could Have Invented Transformers.
This type dates back to at least 2012. I believe they're usually good zetetic explanations. 

Reply
Underdog bias rules everything around me
khafra3mo1-1

In a stereotypical old-west gunfight, one fighter is more experienced and has a strong reputation; the other fighter is the underdog and considered likely to lose. But who's the underdog of a grenade fight inside a bank vault? Both sides are overwhelmingly likely to lose. 

At least one side of many political battles believe they're in a grenade fight, where there's little or nothing they can do to prevent the other side from destroying a lot of value. and could reasonably feel like an underdog even if they have a full bandolier of grenades and the other side has only one or two.

Reply
A Simple Explanation of AGI Risk
khafra4mo20

I don't think "perfect" is a good descriptor for the missing solution. The solutions we have lack (at least) two crucial features:
1. A way to get an AI to prioritize the intended goals, with high enough fidelity to work when AI is no longer extremely corrigible, as today's AIs are (because they're not capable enough to circumvent human methods of control). 
2. A way that works far enough outside of the training set. E.g., when AI is substantially in charge of logistics, research and development, security, etc.; and is doing those things in novel ways.

Reply
Colonialism in space: Does a collection of minds have exactly two attractors?
Answer by khafraMay 28, 202551

Robin Hanson's model of quiet vs loud aliens seems fundamentally the same as this question, to me.

Reply
It's hard to make scheming evals look realistic for LLMs
khafra6mo62

Linear probes give better results than text output for quantitative predictions in economics. They'd likely give a better calibrated probability here, too. 

Reply
Thomas Kwa's Shortform
khafra6mo30

I, too, would like to know how long it will be until my job is replaced by AI; and what fields, among those I could reasonably pivot to, will last the longest.

Reply
[linkpost] One Year in DC
khafra6mo21

I think it's especially true for the type of human that likes Lesswrong. Using Scott's distinction between Metis and Techne, we are drawn to Techne. When a techne-leaning person does a deep dive into metis, that can  generate a lot of value. 

More speculatively, I feel like often--as in the case of lobbying for good government policy--there isn't a straightforward way to capture any of the created value; so it is under-incentivized.

Reply
Pablo's Shortform
khafra7mo10

Well, that was an interesting top-down processing error.

Reply
Load More
7khafra's Shortform
1y
3
9[Link] Walking Through Doors Causes Forgetting
14y
8
9Amateur Cryonics (one guy packed in dry ice) Festival Seeks Buyer
14y
20
3Free Thought Film Festival: Tampa traditional rationalist gathering this weekend (13-15 May)
15y
0
1Article on quantified lifelogging (Slate.com)
15y
4