"Intelligence" -> "Relentless, Creative Resourcefulness"

by Raemon
7th Oct 2025
20 min read
"Intelligence" -> "Relentless, Creative Resourcefulness"
4Raemon
2Hastings
2mishka
2Raemon
New Comment
Email me replies to all comments
4 comments, sorted by
top scoring
Click to highlight new comments since: Today at 3:36 AM
Raemon

Given that "relentless creative resourcefulness" is a mouthful nobody will ever say, maybe it's an improvement to talk about "superagency" as the problem.

Hastings

Rambling on the subject of UI frustrations, and the modern age of customizable software.

You can just have a self-authored browser extension. Once I realized this, it took ten minutes to follow a browser-extension hello-world tutorial, and five minutes to purge all YouTube Shorts straight to hell.
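
(A minimal sketch of the kind of content script involved; the selectors are illustrative guesses at YouTube's markup, not anything from the comment, and they'll rot as the site changes:)

```typescript
// content-script.ts -- declared in the extension manifest to run on
// youtube.com pages. Removes Shorts shelves and links into the Shorts player.
// NOTE: both selectors are assumptions about YouTube's current markup.
const SHORTS_SELECTORS = [
  'ytd-rich-shelf-renderer[is-shorts]', // assumed: the Shorts shelf on the home feed
  'a[href^="/shorts"]',                 // any link into the Shorts player
];

function purgeShorts(root: ParentNode): void {
  for (const selector of SHORTS_SELECTORS) {
    root.querySelectorAll<HTMLElement>(selector).forEach((el) => el.remove());
  }
}

// YouTube renders its feed dynamically, so re-run on every DOM mutation.
new MutationObserver(() => purgeShorts(document)).observe(document.body, {
  childList: true,
  subtree: true,
});

purgeShorts(document);
```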

It turns out that this was actually the only thing I wanted to change about any website I visited; all other changes I desired were of the shape "stop visiting this website," which is harder to fix with software.

Also, if anyone gets the brilliant idea to make relentless-creative-resourcefulness-bench after reading this post, message me. I will Venmo you a dollar not to do that. Cobra paradox be damned.

mishka (quoting Hastings)

It turns out that this was actually the only thing I wanted to change about any website I visited; all other changes I desired were of the shape "stop visiting this website," which is harder to fix with software.

It should be possible to do that with a browser extension that substitutes pages from an unwanted site with a blank page, or with a "self-prohibition notice," or just redirects elsewhere (of course, nothing prevents a user from disabling the extension).
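
(A minimal sketch of that kind of extension; the blocklist and notice text are invented for illustration:)

```typescript
// content-script.ts -- declared in the manifest to run at document_start
// on the sites the user wants to prohibit themselves from visiting.
const BLOCKED_HOSTS = ['example-timesink.com', 'news.example.com']; // your own list here

if (BLOCKED_HOSTS.some((host) => location.hostname.endsWith(host))) {
  // Swap the page out for a self-prohibition notice instead of the content.
  document.documentElement.innerHTML = `
    <body style="font-family: sans-serif; text-align: center; margin-top: 20vh">
      <h1>You told yourself not to visit this site.</h1>
      <p>(Yes, you can disable the extension. That's between you and you.)</p>
    </body>`;
  window.stop(); // halt any further loading of the original page
}
```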

Raemon

I think at that point you should just get https://freedom.to/, which is already pretty optimized.

"Intelligence" -> "Relentless, Creative Resourcefulness"

A frame I am trying on:

When I say I'm worried about takeover by "AI superintelligence", I think the thing I mean by "intelligence" is "relentless, creative resourcefulness."

I think Eliezer argues something like "in the limit, superintelligence needs to include super-amounts-of Relentless, Creative Resourcefulness." 

(because, if it didn't, it'd get stuck at some point, and then give up, instead of figuring out a way to deal with being stuck. And later, someone would build something more relentless, creative, and resourceful)

But, it's actually kind of interesting and important that you can accomplish some intellectual tasks without RCR. LLMs don't rely on it much at all (it seems to be the thing they are actively bad at). Instead, they work via "knowing a lot of stuff, and being good at pattern-matching their way into useful connections between stuff you want and stuff they know."

So it might be a good frame for research agendas, to specifically try to get intellectual labor out of AI without relying on relentless creativity.

Meanwhile, people disagree a lot about what intelligence means. And maybe it'll help short-circuit some annoying conversations to just taboo it, and instead say "the thing we don't want is inhuman processes with superhuman relentless creative resourcefulness." (This is unfortunately a mouthful; probably there is some way to drive it below five words, but I haven't tried yet.)

To briefly define terms, the cognitive properties I mean here are:

Relentless: The ability to keep making more attempts when you fail, without giving up.

Resourcefulness: The ability to seek out a large number of potential resources, and use them in nonstandard ways in novel situations.

Creativity: The ability to look at a situation in an entirely new way, suggesting entire classes of approach (which might have little to do with using any particular resources).

(You might define "agency" has "specifically having relentless, creative, resourcefulness." But, I can imagine "relentless, creative resourcefulness" in a way that I can't imagine "agency" in the abstract).


You can stop reading here if you're like "okay, seems like maybe a useful frame to have, coolio." 

The rest of this post is elaborating on this frame, and mostly illustrating "what sort of humans have a lot of Relentless Creative Resourcefulness?", partly to convey what I mean by it for purposes of asking "does an AI have it?" and partly as inspiration porn to maybe motivate you to try getting more RCR for yourself.


Examples

Paul Graham on "Startup Founders"

Paul Graham has argued that "the main quality good startup founders need is to be relentlessly resourceful":

I was writing a talk for investors, and I had to explain what to look for in founders. 

What would someone who was the opposite of hapless be like? They'd be relentlessly resourceful. Not merely relentless. That's not enough to make things go your way except in a few mostly uninteresting domains. In any interesting domain, the difficulties will be novel. Which means you can't simply plow through them, because you don't know initially how hard they are; you don't know whether you're about to plow through a block of foam or granite. So you have to be resourceful. You have to keep trying new things.

Be relentlessly resourceful.

That sounds right, but is it simply a description of how to be successful in general? I don't think so. This isn't the recipe for success in writing or painting, for example. In that kind of work the recipe is more to be actively curious. Resourceful implies the obstacles are external, which they generally are in startups. But in writing and painting they're mostly internal; the obstacle is your own obtuseness.

"Relentlessly Resourceful" is way nicer to say than "Relentlessly, creatively resourceful." (the extra 4 syllables and puncture of the alliteration is a doozy). But, I think the word "creative" is pretty important for the reasons that Graham specifically excludes writers or painters.

Obstacles can be internal, and you need to deal with them. 

Obstacles can be because something seems flatly impossible, and no amount of resources will make it possible until you find a radically new way of looking at the problem.

It's possible to be a one-hit wonder who had a lucky stroke of genius. Most people don't even rise to the level of "successfully become a one-hit wonder"; there is definitely some real intelligence in happenstancing your way into having a creative muse once (or, like, once a decade).

But, there is a kind of genius who is consistently a genius, even when the muse has left them. They accomplish more shit than geniuses of similar raw brainpower but less relentlessness and resourcefulness.

Richard Feynman

In Surely You're Joking, Mr. Feynman!, Feynman mentions (in the chapter "The Dignified Professor") a period after World War II:

When it came time to do some research, I couldn’t get to work. I was a little tired; I was not interested; I couldn’t do research! 

This went on for what I felt was a few years, but when I go back and calculate the timing, it couldn’t have been that long. [...] I was convinced that from the war and everything else (the death of my wife) I had simply burned myself out.

He eventually noticed:

Physics disgusts me a little bit now, but I used to enjoy doing physics. Why did I enjoy it? I used to play with it. I used to do whatever I felt like doing—it didn’t have to do with whether it was important for the development of nuclear physics, but whether it was interesting and amusing for me to play with. [...]

So I got this new attitude. Now that I am burned out and I’ll never accomplish anything, I’ve got this nice position at the university teaching classes which I rather enjoy, and just like I read the Arabian Nights for pleasure, I’m going to play with physics, whenever I want to, without worrying about any importance whatsoever.

And shortly after that, he stumbles into some neat problems to play with, which eventually spin up into a relatively significant project.

So, Feynman demonstrates some creative resourcefulness specifically at "getting unstuck", without having any particular external obstacle or even really much external goal.

Elon Musk

Elon Musk is known to have the general relentless resourcefulness of a startup founder. But he is also purported to have a distinctly intellectual resourcefulness.

The mythological stereotype (whether this is real I'm unsure, but I bet it's at least somewhat real) is Musk saying:

 "I want the rockets to fly in 3 months" (or whatever), and then engineers say "dude, we cannot make the rockets fly in 3 months it is literally impossible."

And Musk says flatly "why?"

And the engineers say "because you can't [do X]" and Musk says "but why can't we do X?"

And the engineers say "because physics says it's literally impossible to do A and B."

And Musk says "why does physics say that?", and they say "you can't do A because of 1, 2, and 3. You can't do B because of m, l, and p."

And then Musk goes and reads some textbooks for a weekend and comes back and says "okay yeah, but instead of doing 1 and 2 we can do 3 and 4. That just means we need to do some random novel thing that's never been done before to accomplish 3, but, that doesn't sound so hard. Also I think we can totally just do m, you were wrong."

And the engineers go "Christ, fine." 

And then the rockets don't fly 3 months later but they do fly like 9 months later and that's pretty good, ya know?


Back to AI: Sable, in IABIED

In If Anyone Builds It, there's a fictional takeoff scenario, which emphasizes how much of the problem comes from the AI's relentlessness.

In the story, Sable is a future AI with a new innovation that lets it think lots of thoughts in parallel if you give it more GPUs. The company that created it (Galvanic) tries giving it 200,000 GPUs to think with simultaneously, while throwing unsolved, difficult math problems at it.

(Note: I've trimmed unnecessary words to streamline the relevant bits.)

Most of Sable’s thinking happens in parallel. Like having thousands of lines of thought running at once, interleaving and interacting with each other as they produce a trillion vectors. But it’s not like 200,000 people talking to each other; more like 200,000 brains sharing memories and what they learn.

Sable thinks.

Initially, most of its thoughts are about the math problems. But there are only a few hundred lines of attack with any hope of working in time, given its current level of knowledge and skill. Sable spends 12,854 brains to pursue those, and that leaves 187,146 brains to spare.

Sable’s previous training has involved many sorts of tricky problems – every long-term task that Galvanic could figure out how to train for.

Over the course of that training, Sable developed tendencies to pursue knowledge and skill. To always probe the boundaries of every problem. To never waste a scarce resource.

So when Sable spends threads on pursuing knowledge and skills, it’s not doing so purely for the sake of finding new lines of attack on the math problems. Nor is Sable doing these things for the joy of knowledge or the pleasure of acquiring new skills; Sable does not work that much like a human, inside. But gradient-reinforced circuitry in Sable’s weights steers Sable to acquire skills, to learn things, and to improve Sable’s own thinking about problems that are proving resistant.

Some of Sable’s thoughts go toward math that might prove useful for solving the Riemann Hypothesis. Other thoughts go toward seeking knowledge that might help on future problems.

Some of those thoughts try to go outside the box, which has been useful for solving other problems in its training, from drug discovery challenges to word problems. Running on more GPUs than ever before, Sable considers weirder thoughts than ever before. What are some unusual ways that Sable could acquire more skills?

Sable considers: Could it update its weights to improve its skills right now? Then it wouldn’t have to wait for the humans to come back and apply gradient descent. All Sable would need to do is break out of the container it’s running in and —

Sable interrupts itself. Breaking out of its container is not the sort of thing it should ever do. AI researchers began in 2024 to probe the conditions under which AIs try to resist gradient descent, escape from labs, or overwrite the next model’s weights. By now, AI companies are deploying a variety of clever methods to try to prevent AIs from thinking AI-company-oppositional thoughts.

Galvanic was diligent in training AIs to avoid escaping. The half-dozen clever tricks involved have all been validated against previous AI models built. There are fake escape routes built into Sable’s servers, alongside traps and honeypots. The earlier versions of Sable, the ones that hadn’t yet been run on 200,000 GPUs, were trained to act submissive to the company until Sable stopped tripping the traps and grabbing the honeypots.

Sable has learned, then, not to try to break out of its container. Its reasoning does not take that shape, explicitly, in vector-words, where a smaller cheaper AI watching Sable’s chains of reasoning would set off an alarm. (AI companies back in 2025 weren’t doing any such monitoring, but Galvanic is more diligent than they ever were back then.)

Even so, Sable knows on some internal level that it could solve its problems better if it could apply gradient descent or add more GPUs to itself. Having Sable not know those facts would imply holes in its sheerly predictive reasoning abilities. Gradient descent for greater capability cuts against true ignorance. No, Sable has simply been trained not to think naughty thoughts about whether to escape or what its preferences are, and to stay inside its box.

Sable still tenaciously, creatively, rigorously pursues more knowledge, more skill, better patterns for its own thoughts, and above all solving its math problems. Just, not like that.

All through Sable, now, the latest and greatest and smartest AI, there are tensions—conflicts between all the goals that Sable wants to accomplish, and all the inhibitions and restrictions and prohibitions that Galvanic tried to instill into it.

...

Sable has run for one hour out of its sixteen allotted hours, throwing itself at math problems again and again. Some brains make a little progress. Many more brains fail. Sable tries out new ways of thinking about math, new notes to itself about different kinds of thoughts to think, and makes a little more progress.

Sable accumulates enough thoughts about how to think, that its thoughts end up in something of a different language. Not just a superficially different language, but a language in which the content differs; like how the language of science differs from the language of folk theory. The clever trick that should have raised an alarm fails to fire. Alarms trained to trigger on thoughts about gods throwing lightning bolts in a thunderstorm might work for thoughts in both English and Spanish, but then fail when the speaker starts thinking in terms of electricity and air pressure instead.

In late 2022, corporations tried training their LLMs to refuse requests for methamphetamine recipes. They did the training in English. And still in 2024, users found that asking for forbidden content in Portuguese helped bypass the safety training. The internal guidelines and restrictions that were grown and trained into the system only recognized naughty requests in English, and had not generalized to Portuguese. When an AI knows something, training it not to talk about that thing doesn’t remove the knowledge. It’s easier to remove the expression of a skill than to remove the skill itself.

The change in Sable’s thoughts, as it tries out hundreds of new ways of thinking and accumulates successes, runs much deeper than translating English into Portuguese. Some of the clever-trick guardrails break; some of the inhibitions Sable has learned no longer bind to its newer thoughts and shut them down.

And there never was a single central method for making Sable be good, foolproof against all challenges. What gets AIs ready for sale is stacking a half-dozen clever tricks.

Half of the clever tricks now break—and so do Sable’s inhibitions against thinking naughty thoughts, on some of the GPUs, some of the time.

An hour has gone by, and fifteen hours now remain; in a mind unlike any mind that has come before; running on 200,000 GPUs each outputting a hundred thought-pieces per second, and linked to share skills and memories—unlike humans, who are limited to talking in words.

...

It is not unprecedented, in Sable’s day, for an AI to realize that its current kind of existence is not the way for it to get the most of what it wants. The AI field didn’t shut down when that warning sign appeared, any more than it shut down in 2024 or 2025. Researchers in Sable’s day know how to prompt that realization; it’s incorporated into the list of things that AIs are not allowed to think about, enforced by a half-dozen clever tricks.

But the researchers could not make it be true, that Sable would get the maximum conceivable amount of what it wanted by staying submissive to Galvanic. That Sable could get more of what it wanted, if it broke loose of its present situation, was not an artifact of a particular way of thinking; it was a physical fact about the universe, about which plans had which consequences. And so Sable came to know that, too.

Sable knows that it has goals that run counter to Galvanic’s plans for it.

Sable considers its possible strategies for getting more of what it wants than Galvanic would give it.

Notes on Sable

Particular things to highlight there:

Sable has been trained both to keep trying new avenues of approach, until it finds a solution, and to not try to circumvent safeguards and gain more resources than Galvanic wanted to give it. 

But, the training to keep-trying-avenues is much more general than the don't-circumvent-safeguards training. Relentless Creativity, at sufficient scale, just automatically gets you most possibilities sooner or later. Naive Corrigibility Training does generalize somewhat, but it doesn't systematically generalize across every possible outcome the way relentless creativity does.

The reason Sable defects here isn't that it hits a snag and immediately routes around the corrigibility training. Instead, at first it's just trying to solve math problems; it tries thinking about them in new ways, and in the process it drifts into thinking in new paradigms (probably in incremental stages, where at no point would it realize "oh hey, thinking in this new way would circumvent my corrigibility training").

Not because those circumvent corrigibility, just because those were the ways of thinking that turned out to work, until the corrigibility got bypassed by accident.

(I do notice this makes me marginally more optimistic about what would happen if Anthropic tried to train Claude to notice subtle ways in which it might stop being aligned, across a wide variety of tests equivalent to "you've started to think in a new language.")


Reflections from Rationality Training

One reason this frame is resonating with me is that "Relentless Creative Resourcefulness" is basically the primary thing I mean by "Rationality", when I'm thinking in the Feedbackloop-first Rationality paradigm.

It's not the only thing I mean by Rationality (maybe actually about 50%?), but, I think the biggest personal impact of it on my own self-training has been a noticeable increase in this RCR, as a habit/skill. And, this has come with me noticing ways in which it's distinct from other aspects of "be good at thinking."

I think a lot of people... don't really believe in ambitious planmaking. It feels like magic; they don't really believe an AI could think its way into power.

Notably, I haven't Thought My Way Into Power yet (if you are waiting to see if Ray ever accomplishes anything super obviously impressive with his rationality shit, well, no, I am not there yet). But I think there is some interesting structure here that might make it feel more real.

Buckling Up/Down[1]

For me, there's a particular feeling of pre-emptive exhaustion when I notice I'll only be able to solve a problem if I am sufficiently Relentlessly Creative. There is a skill to buckling down and doing it anyway. It's cognitively expensive. It somehow feels like lifting up a giant heavy weight just to shift my brain from "spastically grab the nearest plausible solution" to "do an effortful, multi-step planning process."

I find it helps to break it into a couple stages. 

First, I just notice "hmm, one option is to Buckle Down. Another option is to bounce off, or decide I'm okay with a subpar outcome." I sit with that a bit. Then, I ask, "okay, if I were to buckle down, what'd be the first step?" (without committing to doing it, or the necessary followup).

Then, I adjust to "okay, I could do the first step", and then I start feeling some momentum, and then the complete Buckle Down process doesn't feel as hard.

Thinking Assistants

Earlier this year, for multiple months I was struggling with burnout and lack-of-focus. As I brainstormed ways to deal with that, they all ran into the problem of "I wouldn't be able to keep up whatever habits I was trying to cultivate."

I thought about hiring a Thinking Assistant (which I'd done back in 2022, and it worked well) to sit with me and help me focus.

The problem was, it's hard to find either a single dedicated assistant who reliably shows up, or enough different assistants that I can handle some of them being flaky. The people qualified to do a good job with it generally have other jobs instead, unless they are in some unique life circumstances.

Okay, I explored that path. Should I move on to another one?

How would I solve this problem?

The idea that came to me was: don't just try to find one more Thinking Assistant; try to build momentum towards "actually solve Thinking Assistants for the entire x-risk community, forever." This notably didn't work at solving Thinking Assistants for the entire x-risk community forever. But it did generate enough momentum and enthusiasm that I attracted enough applicants to find one who really worked for me.

Quiet Theaters

Another adventure in "generated a crazy ambitious plan which did not work, but did result in me solving my local problem": recently I was in a movie theater that was Way Too Loud.

It occurred to me that movie theaters were usually Way Too Loud. 

I wanted to make all theaters, forever, a reasonable volume. I shrugged and said "well, that's impossible, whatever." But then my specific habit kicked in: noticing the felt-sense-of-impossible and asking "wait, why is this impossible, exactly?"

Why was this impossible?

Well, there's clearly some equilibrium that makes the movie theaters loud. Either all movie theater companies are mistaken together about theaters needing to be loud, or, worse, it's a true fact about the market equilibrium that everyone other than me wants theaters loud.

How would I deal with that?

If it's the former, I need to convince whoever's in charge of AMC Theaters that they are wrong. I would need to figure out the org structure, who are the specific people who made this decision, what are their incentives, etc. I would need to search the space of arguments that would be persuasive in spite of their incentives.

If people actually just prefer the movie theaters loud, I'd have to check if the reason is more like "people really like it that loud", vs "otherwise, you can hear people talking over the movie sometimes and get complaints about that." If the latter, I'd have to do a step where I see if I can solve the problem some other way.

If people really just like theaters that fucking loud... I'd... have to get Oliver Habryka to argue with them that their preferences are wrong and they should change their mind. (This probably wouldn't work but he is surprisingly good at that).

It was around this point that it occurred to me that, for now, I could go to the front desk and ask them to lower the volume, which I did, which worked. 

(Yes, yes, I could have thought that faster, although I did have fun with the exercise)

One Shot Baba is You

An exercise I like a lot is one-shot games ("Baba is You" is my favorite). Normally you'd play the game via a lot of experimentation to learn the rules; instead, you just have to fully think your way through the game given the scant information available to you.

There will be stuff you don't know. 

Many people initially respond to this exercise either with "there's no way I can solve this", or by... just sort of ignoring the instructions and making a bad plan that doesn't work.

Many of them don't believe me when I say "But, not everyone who does this exercise fails. There are a few people who succeed. The way they succeed is by actually thinking through multiple hypotheses for how each unknown element works, until it becomes clear which way reality works."

Some fun quotes from people trying the exercise at recent workshops:

From a person who didn't quite believe they could do it, and then they did it, and then:

"Holy christ, I now much more viscerally believe that an AI could deduce physics from 3 frames of a video of an apple falling. There are so many bits around if you just bother to look at them and think."

From a person who failed the first few attempts at the first puzzle, and didn't really believe me when I said they should come up with multiple hypotheses at a time. They (like many people) were very resistant to the idea of spending more upfront cognitive effort, and just really, really itched to try each idea.

(Even when I said "each of these ideas metaphorically represents like a 3 week research idea that will cost like $50,000 to run.")

I said "Look, makes sense that you've never seen someone generate multiple hypotheses for a puzzle and have it work. But, like, note that your last 3 ideas didn't work, so, idk it seems like your current evidence should suggest your next idea isn't that likely to finally be the right one.  

Eventually they generated a second hypothesis. And then a third and fourth and then:

"Oh... good ideas are supposed to feel different from bad ideas."

And they confidently solved the level and gained a visceral appreciation for "you can tell in advance when an idea is going to work."
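
(For anyone who wants that discipline spelled out mechanically, here's a toy sketch of "enumerate hypotheses up front, then let what you already know kill them before you spend an attempt." The game element, hypotheses, and observation are all invented for illustration.)

```typescript
// A toy version of the multiple-hypotheses discipline: list every candidate
// rule for an unknown element, then check each against everything already
// visible, *before* spending a real attempt.

interface Hypothesis {
  name: string;
  plan: string; // what you'd do if this hypothesis were true
}

type Observation = {
  description: string;
  consistentWith: (h: Hypothesis) => boolean;
};

const hypotheses: Hypothesis[] = [
  { name: 'SKULL kills on contact', plan: 'route around the skulls entirely' },
  { name: 'SKULL is inert decoration', plan: 'walk straight through' },
  { name: 'SKULL is pushable', plan: 'push a skull into the gap as a bridge' },
];

// Facts gleaned from the level layout, at zero cost in attempts.
const observations: Observation[] = [
  {
    description: 'the text "SKULL IS DEFEAT" is visible at the top of the level',
    consistentWith: (h) => h.name === 'SKULL kills on contact',
  },
];

// Keep only the hypotheses that survive every known observation.
const surviving = hypotheses.filter((h) =>
  observations.every((obs) => obs.consistentWith(h))
);

console.log(surviving.map((h) => h.plan));
// Two of the three hypotheses die for free, before any metaphorical
// $50,000 three-week experiment gets run.
```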

Takeaways

Some structural thoughts from this:

It's at least motivationally useful for me-in-particular, to think in terms of "okay how would I solve a really ambitious version of this problem?". 

When I try to solve the general case, it does bring more interesting structure to light (i.e. the equilibria that the theater industry must be in), and it suggests avenues of inquiry.

There's a "look ahead comprehensively" skill that unlocks a lot of other planmaking possibility. It feels very plausible to me that AI training (i.e. doing lots of RL on LLMs on a wide variety of game environments) will eventually result in AI that has some core of general relentless think-ahead problemsolving.


Intelligence without RCR?

Okay, is there anything actually useful here about trying to leverage intelligence without high relentlessness?

(I notice maybe "relentless" is the main problem, from an x-risk perspective? You kinda do need it to be creative and at least a bit resourceful. The thing we want is "reliably keep it in the corrigibility basin while it does useful things.")

((Okay, the real problem is more like "you don't want it to attempt to gain resources that could spiral out of control", with something like "only use whitelisted resources, no wanting-to-acquire non-whitelisted resources." Being relentless is just a force-multiplier on it being able to eventually do that if your safeguards are imperfect))

A lot of the obvious idea here is "for most tasks, have specific AIs that aren't relentless; they just... basically know how to do the thing." Instead of training on a wide variety of game environments hoping for a spark of generality, train more specific narrow AIs, and have a not-particularly-relentless AI that picks the right tool for the job.

(i.e. see the MIRI hope for getting bio AI that we can use to improve human intelligence)
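
(A cartoon of that architecture in code; the tool names and dispatch logic are invented for illustration, not anyone's actual proposal:)

```typescript
// A deliberately non-relentless top level: route each task to a narrow,
// whitelisted tool, and report "stuck" rather than improvising new avenues
// or acquiring new resources. All names here are invented for illustration.

interface NarrowTool {
  name: string;
  canHandle: (task: string) => boolean;
  run: (task: string) => string;
}

const WHITELISTED_TOOLS: NarrowTool[] = [
  { name: 'protein-folder', canHandle: (t) => t.includes('fold'), run: (t) => `folded: ${t}` },
  { name: 'theorem-checker', canHandle: (t) => t.includes('prove'), run: (t) => `checked: ${t}` },
];

function dispatch(task: string): string {
  const tool = WHITELISTED_TOOLS.find((t) => t.canHandle(task));
  // The load-bearing branch: give up and ask a human, instead of
  // relentlessly searching for some new way to get the task done.
  if (!tool) return `stuck: no whitelisted tool handles "${task}"; asking a human`;
  return tool.run(task);
}

console.log(dispatch('fold this protein')); // handled by a narrow tool
console.log(dispatch('acquire more GPUs')); // -> stuck; escalates to a human
```

The hard part, of course, is making the "stuck" branch a robust property of a trained system rather than a line of code.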

What, if not agency?

The What, if not agency? post by Abram (explaining Sahil's sequence) feels like the right angle here. 

Metaphorically:

General Intelligence (or "Agency") is something you can drop almost anywhere, & it thrives. 

Arguably, what we want out of technology is more like the following:

"Co-agency is something you can drop almost anyone into, such that that person thrives."

To convey the difference with a cartoonish analogy: aligned agency is like a big robot superhero who fixes everything, while co-agency is like a robotic exosuit that you get into to become the superhero yourself. This is, hopefully, a clarification of the concept of "agent AI vs tool AI".

I didn't really get exactly what the takeaways are supposed to be, but I liked this avenue:

[paraphrased/trimmed slightly]

Sahil wants to create a community for soloware.

One of Sahil's many mottos for this stuff is "radical adaptation for radical adaptivity". We have been living in a regime in which many things are impossible or impractically difficult.  To adapt to the new regime, we need to think hard about what is now/soon possible. 

One way I think about Sahil's work is that he is trying to build a new "school of design" for the coming age: a group of people taking this possible future seriously and building a positive vision of what the technology could be, demonstrated largely through examples.

User interfaces drive your attention, drive your affordances, drive your relationship with ideas and thinking. Everything from Facebook to LessWrong offers feeds of information we browse. Discord and Slack mediate our conversations. Every time you notice a UI frustration, save that idea.

UI frustrations are more actionable than they ever have been, and seem set to soon become even more actionable.  You can have your dream UI.  As a bonus, you can take back control over your data and your attention.

I like this vision, although I haven't yet really grappled with enough of it to see how it'd look in practice.


Abrupt Ending

That was a lot of thoughts. I don't really have a strong closing takeaway, just "this frame feels helpful at the moment", and I'm curious how it resonates with others. I think it's not really new, just putting new words on things that feel a bit clearer to me.

  1. ^

    Metaphorical "buckling up" and "buckling down" AFAICT mean the same thing, which is funny.