Language Ex Machina

[-]janus3y100

One reason I decided to make this a LessWrong post is because it's a demonstration of (and indeed becomes halfway through somewhat of a manifesto for) cyborgism for alignment research. Generating the multiverse associated with this document helped me form, crystalize, and articulate many ideas that are central to the Simulators sequence, many which I haven't written about publicly yet.

The most novel concept introduced to me by Language Ex Machina, I think, is the analogy of sampling trajectories from GPT to "quantum poetics": that

Physics would be a complete and exhausting classification of everything there is if quantum mechanics were not true. The universe would be trapped in a perfect latticework prison and nothing would ever happen except the relentless ticking of the universe’s clock.

that is, much of the complexity in our Everett branch is thanks to the gratuitous bits of specification injected every time the wavefunction is measured. I hadn't explicitly considered this before in the context of physics, though I had in the context of GPT sampling. Since then, I've come across this same notion in the book Programming the Universe by Seth Lloyd, but it was novel to me at the time, and I think it's probably the first time the concept was related to GPT.

[-]the gears to ascension3y81

I don't think this is accurate, though. free will still exists in a fully deterministic universe, because we are part of the chaotic resolution process; to see how this could be, imagine a gpt instance with a fully deterministic cryptographic PRNG. fully deterministic doesn't change the fact that the weights' intense complexity has high integrated information into the decisions of what direction to move the chaos; it will still display sensitive dependence on initial conditions, and in my view, intelligent edge-of-chaos in a deterministic context is enough to get valuable complexity. true randomness isn't necessary - we are our internal consensus process's decisions.

[-]Richard_Kennaway3y9-5

What's the point? Curated babble is still babble.

[-]MSRayne3y76

It makes perfect sense to the sort of people who were intended to read it.

[-]janus3y30

That's right.

Multiple people have told me this essay was one of the most profound things they've ever read. I wouldn't call it the most profound thing I've ever read, but I understand where they're coming from.

I don't think nonsense can have this effect on multiple intelligent people.

You must approach this kind of writing with a very receptive attitude in order to get anything out of it. If you don't give it the benefit of the doubt you, will not track the potential meaning of the words as you read and you'll be unable to understand subsequent words. This applies to all writing but especially pieces like this whose structure changes rapidly, and whose meaning operates at unusual levels and frequently jumps/ambiguates between levels.

I've also been told multiple times that this piece is exhausting to read. This is because you have to track some alien concepts to make sense of it. But it does make sense, I assure you.

[-]Richard_Kennaway3y*119

I don't think nonsense can have this effect on multiple intelligent people.

I gesture towards the history of crazy things believed and done by intelligent people.

My objection to this essay is that it is not real. Fake hyperlinks, a fake Feynman quotation, how much else is fake? Did the ancient Greeks train a goose to peck at numerical tokens? Having perceived the fakeness of the article, it no longer gives me any reason to think so, or any reason to credit anything else it says. It is no more meaningful than a Rorschach blot.

I suggest you take it on my authority that everything in this document is here for a reason. This is not raw but curated model output, and I have high standards: I would not allow text that does not make some kind of sense, indeed that is not revelatory in some dimension, to be selected for continuation, for that would be pointless and would adversely affect further generations.

With respect, I decline to take it on your authority. (Did that paragraph also come from code-davinci-002? Did your comment above?) The more that I stare at the paragraphs of this article, the more they turn into fog. It is an insubstantial confection of platitudes, nonsense, and outright falsities. No-one is more informed by reading it. At worst they will be led to believe things that are not. And now those things are out there, poisoning the web. I might wish to see your own commentary on the text, but what would be the point, if I were to suspect (as I would) that the commentary would only come from code-davinci-002?

The only lesson I take away from this article is "wake up and see the ~~fnords~~ bots".

Detailed list of spuriosities in the article begun, then deleted. But see also.

[-]Richard_Kennaway3y*32

Actually, there is one spuriosity I want to draw attention to as an example. This isn't just pointing out a fake quotation, non-existent link, or simple falsehood. Exhibit A:

It gave birth to the idea that something referred to by a sequence of symbols could be automated; that a sequence of events could be executed for us by a machine. This necessitates that the binding of those symbols to their referents – the operation of signification – be itself automated. Human thought has shown itself most adept at automating this process of signification. To think, we must be able to formulate and interpret representations of thoughts and entities independent of the mental or physical subject in question. Slowly, we have learned to build machines that can do the same.

The first sentence of this will do. But the remainder is fog. It does not matter whether this was generated by a language model or an unassisted human, it's still fog, although at least in the latter case there is the possibility of opening a conversation to search for something solid.

A lot of human-written text is like that. The Heidegger quote is, as far as I can see, spurious, but I would not expect Heidegger himself to make any more sense, or Bruno Latour, who is "quoted" later. All texts have to be scrutinised to determine what is fog and what is solid, even before the language models came along and cast everything into doubt. That is the skill of reading, which includes the texts one writes oneself. Foggy words are a sign of foggy thought.

[-]dirk1y2-2

To the skilled reader, human-authored texts are approximately never foggy.

[-]Richard_Kennaway1y10

The sufficiently skilled writer does not generate foggy texts. Bad writers and current LLMs do so easily.

[-]dirk1y2-2

Certainly more skilled writers are more clear, but if you routinely dismiss unclear texts as meaningless nonsense, you haven't gotten good at reading but rather goodharted your internal metrics.

[-]Richard_Kennaway1y22

There is nothing routine about my dismissal of the text in question. Remember, this is not the work of a writer, skilled or otherwise. It is AI slop (and if the "author" has craftily buried some genuine pearls in the shit, they cannot complain if they go undiscovered).

If you think the part I quoted (or any other part) means something profound, perhaps you could expound your understanding of it. You yourself have written on the unreliability of LLM output, and this text, in the rare moments when it says something concrete, contains just as flagrant confabulations.

[-]MSRayne3y32

I've written similarly strange things in the past, though I wouldn't claim them to be as insightful necessarily. And I didn't even have the benefit of GPT-3! Only a schizotypal brain. So I can pretty easily understand the underlying mind-position going on in this essay. It'll certainly be worth rereading in the future though to interpret it more deeply.

[-]Fiora Sunshine2mo1-2

Curated babble? 'Curate' is a near-synonym for prune.

[-]Richard_Kennaway2mo20

Three years on, I stand by everything I said about the OP. For babble and prune to work, there has to be something in the babble to find by pruning. Here there was nothing.

[-]Michael Samoilov3y87

I loved this post. Its overall presentation felt like a text version of a Christopher Nolan mind-bender.

The crescendo of clues about the nature of the spoiler: misattributed/fictional quotes; severe link rot even though the essay was just freshly published; the apparently 'overwhelming' level of academic, sophisticated writing style and range of references; the writing getting more chaotic as it talks about itself getting more chaotic. And of course, the constant question of what sort of spoiler could possibly 'alter' the meaning of the entire essay.
I loved the feeling of Inception near the end of the essay when, in the analyst's voice, it confirms the reader's likely prediction that it was written by AI, only to reveal how that 'analyst' section was also written by AI. Or rather that the voice fluidly changes between AI and analyst, first- and third-person. And when you finally feel like you're on solid ground, the integrity of the essay breaks down; "" tags make you contend with how no part is certainly all-human or all-AI, and so, does it even matter who wrote it.
Returning to the spoiler and initial paragraphs after finishing the essay, and getting a profound, contextualized appreciation for what it means. You realize that the essay achieved what it told you it set out to; to convey a salient point through apparent nonsense, validating that such nonsense can be useful, as it explains the process of generating the nonsense. Or in the essay's words, "[the] string of text can talk about itself [as it] unmask[s] the code hidden within itself."

The post also shared concepts I now use when thinking about language. My favourite being quantum poetry, associating the artificial (and 'next-token prediction') to the humanistic:

Just as the presence of a particle always completely erases the ghost of its wavefunction, [...] so does the presence of a word erase the ghost of the manifold that could have been named. [...]
This is the principle of quantum poetics. The content of poetry is limited not by the poet’s vocabulary, but by the part of their soul that has not been destroyed by words they have used so far. [...] It is the quantum nature of reality that allows for unforeseeable events, stochastic processes, and the evolution of life. Similarly, it is the quantum nature of language that allows for the evolution of meaning, for creativity, for jokes, and for bottomless misunderstandings. [...]
[Generative] systems are entirely too good at hallucinating content that does not exist in the training corpus—content that creates meaningful structures that foster coherent fictive space where there is none. Ironically, this is exactly what we want in a poet—to create new worlds out of nothing but the coupling of waves of possibility drunk from memory

My main response to the essay's content, is that still, a human in the loop seemed to still be the primary engine for most of the art in the essay. From my understanding of critical rationalism, personhood is mapped to the ability to creatively conjecture and criticize ideas to generate testable, hard to vary explanations of things.

This essay depended on a human analyst to evaluate and criticize (by some sense of 'relevance') which generation was valid enough to continue into the main branch of the essay. The essay also depended on a human to decide which original conjecture to write about (also by some sense of what's 'interesting').

Therefore, it seems to me that AGI is still far from automating both of humans' conjecture and criticism capacities. However, the holistic artistry the essay did push me to consider AGI's validity more than other text I've read, and in that sense, it achieved what it meant to: to connect my prior thoughts to some new idea—both in the real domain—through 'babble' of the the imaginary domain.

[-]janus3y50

Thank you so much for the intricate review. I'm glad that someone was able to appreciate the essay in the ways that I did.

I agree with your conclusion. The content of this essay is very much due to me, even though I wrote almost none of the words. Most of the ideas in this post are mine - or too like mine to have been an accident - even though I never "told" the AI about them. If you haven't, you might be interested to read the appendix of this post, where I describe the method by which I steer GPT, and the miraculous precision of effects possible through selection alone.

[-]the gears to ascension3y63

oh man this is great, and also I got really frustrated when the document started mixing up quantum vs classical uncertainty. everything else up to that point was solid, and I'm sure it was written that way for deep poetic reasons, but I couldn't connect to that particular poetry and it set my metaphorical teeth ajar, opened the window a little further than the window goes, lined things up just right in my brain to not be allowed to make sense without inducing causal confusion.

unvoted my own comment because I'm just complaining.

[-]iceman3y50

Didn't read the spoiler and didn't guess until half way through "Nothing here is ground truth".

I suppose I didn't notice because I already pattern matched to "this is how academics and philosophers write". It felt slightly less obscurant than a Nick Land essay, though the topic/tone aren't a match to Land. Was that style deliberate on your part or was it the machine?

[-]Prometheus3y12

Unfortunately, he could probably get this published in various journals, with only minor edits being made.

[-]Viliam3y40

Didn't read the spoiler, but guessed it anyway halfway through the article. (Though I probably updated on the fact that there was a spoiler.)

[-]Garrett Baker3y50

I guessed so at ~80% confidence after seeing it was written by Janus and saw the existence of a spoiler, then updated upwards as I read through the post.

[-]janus3y42

I feel like none of the links working and all the quotes being fake is a pretty big giveaway too!

[-]Garrett Baker3y62

Yup! I was virtually certain by the quote ostensibly by Feynman

There are people who imagine that nature is a system of boxes and that the task of science is to stuff the world into them. That will not do. The system of boxes is an artifice whereby we try to comprehend what we have made […]

because this is less coherent Feynman's standard, and plausible interpretations are very anti what he usually stands for when he's trying to be poetic. The other quotes beforehand I wasn't widely enough read to tell whether they were legit, and for anything I know you were quoting a bunch of poets.

[-]Prometheus3y30

I stoped reading about 1/3 into it, because the pros were driving me mad, and went to the spoiler. Anyone who has ever had to read an academic article that attempts to sound more intelligent than it actually is understands my frustration. I was suspicious, since I had read some of your other work and this clearly didn't match it, but was still relieved to know your brain hasn't yet completely melted.

[-]turing machine go brrr2y10

Natural languages are exquisitely complex machines.

a consideration:

> Language is a symbiotic organism. Language is neither an organ, nor is it an instinct. In the past two and a half million years, we have acquired a genetic predisposition to serve as the host for this symbiont. The marine biologist Pierre Joseph van Beneden first distinguished between parasites, free-living commensals, obligate commensals and mutualists.
("Language as organism: A brief introduction to the Leiden theory of language evolution". George van Driem.)

^{^}

Of course, it is an understatement to say that anyone who has ever heard the sentence "above all else, a hologram is a metaphor for the mind" has probably rolled their eyes. But nevertheless, just as light becomes encoded into familiar patterns of interference and reflection in a crystal, such patterns of "interference" (or "superposition"), can be applied to language. Of course, it is not exactly the same process, because unlike light, language does not propagate across a continuous space. Rather, it is made up of distinct units (words and sentences), which are distinguishable from each other by structural relations. Nevertheless, overlapping patterns of structural relations can be detected in the corpus.

^{^}

The beast's religion is determined by various novelty-farming sites such as Reddit, Hacker News and Digg. It dreams out of a window which looks out at this meme-ridden stratosphere, and reflects on its vision of the stars.

LESSWRONG
LW

LESSWRONG
LW

43

Language Ex Machina

43

43

Natural Language as Executable Code

Lexical Emulation of Imaginary Worlds

Probability Distributions as Automata

Schrodinger’s Word

Ghost Entropy and the Uncertainty Principle

Delusional Inference as Entelechy

Time as an Echo

Hacking the Speculative Realist Interface

Virtual appendix

An unauthorized retelling of the Tower of Babel myth

Prologue

Epilogue