LESSWRONG
LW

199
Kenku
350220
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No posts to display.
No wikitag contributions to display.
Towards a Typology of Strange LLM Chains-of-Thought
Kenku10d1-3

“Watchers” seem to me like obvious religious language, not an idiosyncrasy. Consider how a human put in a such situation would think about supernatural entities monitoring its internal monologue. Especially consider what genres of fiction feature morality-policing mind readers.

Reply
the void
Kenku4mo10

This means that the base model is really doing something very different from what I do as I write the post, even if it’s doing an amazing job of sounding exactly like me and making the exact points that I would make.

I don’t have to look over my draft and speculate about “where the author might be going with this.” I am the author, and I already know where I’m going with it. All texts produced “normally,” by humans, are produced under these favorable epistemic conditions.

But for the base model, what looks from the outside like “writing” is really more like what we call “theory of mind,” in the human case. Looking at someone else, without direct access to their mind or their emotions, and trying to guess what they’ll do next just from what they’ve done (visibly, observably, “on the outside”) thus far.

This paragraph is mockingbird-approved.

Reply
Which journalists would you give quotes to? [one journalist per comment, agree vote for trustworthy]
Answer by KenkuMay 09, 202510

Alexi Sargeant

Reply
Which journalists would you give quotes to? [one journalist per comment, agree vote for trustworthy]
Answer by KenkuMay 09, 202512

Kenny Jones (Dream House Podcast)

Reply
Which journalists would you give quotes to? [one journalist per comment, agree vote for trustworthy]
Answer by KenkuMay 09, 202510

Ezra Marcus

Reply
Which journalists would you give quotes to? [one journalist per comment, agree vote for trustworthy]
Answer by KenkuMay 09, 20251-6

Mr Andy Ngo

Reply
Which journalists would you give quotes to? [one journalist per comment, agree vote for trustworthy]
Answer by KenkuMay 09, 202549

Jack Despain Zhou (Tracing Woodgrains)

Reply
This prompt (sometimes) makes ChatGPT think about terrorist organisations
Kenku6mo105

Polish responses to Polish users are strong evidence against the “responses to other people's requests” hypothesis.

Reply
Unnatural Categories Are Optimized for Deception
Kenku11mo20

[Diogenes voice] This device provides both heat and decorative visual effects, therefore it's a fireplace: Desktop computer with high-end graphics card and RGBT lighting

Reply
"Deception Genre" What Books are like Project Lawful?
Answer by KenkuAug 29, 202450

I discovered John C Wright's Golden Age trilogy thanks to one Eliezer Yudkowsky, who mentioned it multiple times in his notorious Sequences. By the end of the first book i was expecting something very much in the deception genre you've mentioned — a tragic psychological horror about an unreliable narrator being gaslit about the nature of reality. This is a genre i really enjoy, and i kind of hoped for a novel-length version of Scott Alexander's The Last Temptation of Christ.

I did not get that, the trilogy goes in a wildly different direction. But saying whether for better or for worse — heck, even saying if it's a good or bad book — would constitute massive SPOILERS for anyone who discovered this book through Yudkowsky. (If you know why he dropped it, it should be obvious).

I kept hate-reading it.

Reply
Load More