Bill Benzon

Steven Wolfram on AI Alignment

Joe Walker has a general conversation with Wolfram about his work and things and stuff, but there are some remarks about AI alignment at the very end:

> WALKER: Okay, interesting. So moving finally to AI, many people worry about unaligned artificial general intelligence, and I think it's a risk we should take seriously. But computational irreducibility must imply that a mathematical definition of alignment is impossible, right?
>
> WOLFRAM: Yes. There isn't a mathematical definition of what we want AIs to be like. The minimal thing we might say about AIs, about their alignment, is: let's have them be like people are. And then people immediately say, "No, we don't want them to be like people. People have all kinds of problems. We want them to be like people aspire to be."
>
> And at that point, you've fallen off the cliff. Because, what do people aspire to be? Well, different people aspire to be different and different cultures aspire in different ways. And I think the concept that there will be a perfect mathematical aspiration is just completely wrongheaded. It's just the wrong type of answer.
>
> The question of how we should be is a question that is a reflection back on us. There is no "this is the way we should be" imposed by mathematics.
>
> Humans have ethical beliefs that are a reflection of humanity. One of the things I realised recently is one of the things that's confusing about ethics is if you're used to doing science, you say, "Well, I'm going to separate a piece of the system," and I'm going to say, "I'm going to study this particular subsystem. I'm going to figure out exactly what happens in the subsystem. Everything else is irrelevant."
>
> But in ethics, you can never do that. So you imagine you're doing one of these trolley problem things. You got to decide whether you're going to kill the three giraffes or the eighteen llamas. And which one is it going to be?
>
> Well, then you realise to really answer that question to the best ability of huma

66 Aug 20, 2023

The idea that ChatGPT is simply “predicting” the next word is, at best, misleading

55 Feb 20, 2023

A conceptual precursor to today's language machines [Shannon]

24 Nov 15, 2023

What would it mean to understand how a large language model (LLM) works? Some quick notes.

20 Oct 3, 2023

The Story of My Intellectual Life

In the early 1970s I discovered that “Kubla Khan” had a rich, marvelous, and fantastically symmetrical structure. I'd found myself intellectually. I knew what I was doing. I had a specific intellectual mission: to find the mechanisms behind “Kubla Khan.”...

ChatGPT: Exploring the Digital Wilderness, Findings and Prospects

This is a cross-post from New Savanna. That is the title of my latest working paper. It summarizes and synthesizes much of the work I have done with ChatGPT to date and contains the abstracts and contents of all the working papers I have done on ChatGPT. It also includes...

Feb 2, 2025 2

Consciousness, Intelligence, and AI – Some Quick Notes [call it a mini-ramble]

Cross-posted from New Savanna. Epistemic status? Are you kidding me? I just made this up. How would I know its epistemic status? Sheesh! The subject of consciousness keeps turning up in current discussions of AI and LLMs. Can AIs be conscious? Are current AIs conscious? Maybe a little? What do...

Dec 12, 2024 -3

Fred the Heretic, a GPT for poetry

Paul Fishwick at U of Texas at Dallas has created a poetry-generating GPT based on the poetry of Frederic Turner. I've played around with it a bit: Fred the Heretic goes to New Orleans and appropriates some culture [Shine on Titanic] Fred the Heretic, GPT, does poetry, channeling Augustine, Burton,...

Dec 8, 2024 3

More Growth, Melancholy, and MindCraft @3QD [revised and updated]

This is cross-posted from New Savanna. I’ve got a new article at 3 Quarks Daily: Melancholy and Growth: Toward a Mindcraft for an Emerging World. I’m of two minds about it: On the one hand, I think it’s one of my best non-technical pieces in a decade, maybe more. I...

Dec 5, 2024 4

Depression and Creativity

This is cross-posted from New Savanna. NOTE: I’ve posted this interaction with Claude, not so much to present the ideas Claude offered about possible relationships between depression and creativity, but as an example of the kind of conversational interaction one can have with it. I was particularly impressed with the...

Nov 29, 2024 -4

Relationships among words, metalingual definition, and interpretability

This is cross-posted from New Savanna. First, I talk about how natural language is its own metalanguage, which allows new words to be defined in terms of existing ones. Then I discuss the concept of justice in terms of the mechanism of metalingual definition proposed by David Hays some years...

Jun 7, 2024 2

If language is for communication, what does that imply about LLMs?

Noam Chomsky famously believes that language originated to facilitate thought, but then came to be a medium of communication. Others believe the reverse, that it originated as a facility for communication which turned out to facilitate thinking. That is certainly my view. If that is so, then one would think...

May 12, 2024 10
