Compounding Resource X

[-]ryan_b3y40

Alternative frame: I've been poking at the idea of quantum resource theories periodically, literally on the strength of a certain word-similarity between quantum stuff and alignment stuff.

The root inspiration for this comes from Scott Aaronson's Quantum Computing Since Democritus, specifically two things: one, the "certain generalization of probability" lens pretty directly liberates me to throw QM ideas at just about anything, the same way I might with regular probability; two, the introduction of negative probability and through that "cancelling out" possibilities is super cool and feels like a useful way to think about certain problems.

So, babbling: can we loot resource theories from quantum thermodynamics as a way to reason more precisely about the constraints we want for alignment?

A Quanta article animating the thought: https://www.quantamagazine.org/physicists-trace-the-rise-in-entropy-to-quantum-information-20220526/

Direct quote -

“A resource theory is a simple model for any situation in which the actions you can perform and the systems you can access are restricted for some reason,” said the physicist Nicole Yunger Halpern of the National Institute of Standards and Technology.

This sounds like a good match for alignment-ish problems on the face of it. In the alignment case, the "some reason" for the restrictions is so that it doesn't kill us. There are two elements to a resource theory: first, a set of free operations and states that we assume can be reached at no cost; second, valuable resources like entanglement, purity, and asymmetry, which are states that can only be reached at a cost (and are therefore limited). The gist is: what if we swapped out words like entanglement and purity for words like corrigibility and interpretability?

[-]the gears to ascension3y22

quantum probability is a very specific thing; I agree that it's an incredibly interesting metaphor, and I also think there's something to be had there, but I'd caution against applying it too literally without care. the kinds of interference patterns at quantum scale are in fact qualitatively different from the ones at larger spatial scales under most conditions.

neural networks are not usually complex valued, for starters. and not because it hasn't been tried.

[-][anonymous]3y10

Which areas of neural network would fit under the complex number paradigm?

[-]the gears to ascension3y41

anything processing complex valued phenomena or modeling reality in high enough resolution that the network should learn small-scale complex valued patterns; so, chemistry, fluid waves eg sound, electricity, etc. some very solid results: https://arxivxplorer.com/?query=complex+valued+neural+networks

[-]Espedair Street2y10

Thank you for this post.

The only thing about which I want to encourage more reflection is the 'Have more "Move fast and break things" attitude' point [admittedly there is a bit of context I'm not edit-pasting here, but you do seem to favour this approach to a fair extent].

My gentle nudge here is based on my sense that 'moving fast and breaking things' can have pretty bad consequences if you're (collaboratively) exploring AI safety research tracks that recklessly put into circulation knowledge that can be used to increase capabilities over/without safety 'points'.

[-]rpglover643y10

This seems like a useful special case of "conditions-consequences" reasoning. I wonder whether:

Avoiding meddling is a useful subskill in this context (probably not)
There is another useful special case


Re "Be Goal Oriented in Your Research": People I respect keep warning me about being too goal oriented. Trying too hard may cause you to lose the forest for the trees, or fail to have the kind of curiosity that's needed to really think the most important thoughts, or end up goodharting, etc. I'm not sure how to navigate this tension. It sure seems like both sides are important. It seems kinda obviously good to reflect on those tradeoffs, find the healthy middle, and experiment with third alternatives that get the best of both worlds. (That process seems like the kind of meta-reflection this section is all about.)

Re "Move Fast and Break Things": I think people often get annoyed at the rationalsphere for being more on the 'move fast and break things' end of the philosophical spectrum. I think it's doing useful work, but I think a reasonable case can be made that you at least need multiple processes going, some of which are optimized for robust legibility.

