Ben Pace — LessWrong

Reminder that spoiler tags exist, like this:

I think this is better for hiding spoilers than the long dots... because when I saw this post in recent discussion, I saw all the dots and also some of the first paragraph after them.

You make spoiler tags by adding >! at the front of the para.

Alignment will happen by default. What’s next?

Ben Pace1d77

Huh, I quite like the crystallized/fluid split in describing what the LLMs are good and bad at. I'm not sure if it's an analogy or just a literal description.

Why Not Just Train For Interpretability?

Ben Pace1d20

Aside: This is why subtweeting is bad. It makes people paranoid that people are subtweeting them when they aren't.

Avoid Fooling Yourself By Believing Two Opposing Things At Once

Ben Pace1d20

It is mistaken to hold the 'reasonable' middle position that everything is neither 'great' nor 'inadequate' but 'middling'. In fact, some things are at one extreme, and some things are at the other!

Why Talk to Journalists

Ben Pace3d20

Just checking, did you first record your conversation with him due to my recommending that course of action to you?

8 Questions for the Future of Inkhaven

Ben Pace5d20

Normal mode: 500+ word post each day

Hard mode: Above, and 200+ word comment each day

Nightmare mode: Above, and 3,500+ word post each week (replacing one of the 500+ word posts)

7 Vicious Vices of Rationalists

Ben Pace5d40

I don't know. Some frames:

Scope sensitivity: Some amount of it should be able to outweigh a certain amount of meaning.
Virtue ethics: I am willing to push through a lot of suffering if it means something; the simple ratio of the two does not determine whether the overall thing is worthwhile.
Deontology: it does kind of differ on whether you're responsible for the suffering happening or not.

It's also plausible to me that I am more coming at this from a deontological feeling of "One should not kill everyone if one has a good reason" rather than "The world is net positive".

Evrart Claire: A Case Study in Anti-Epistemology

Ben Pace5d40

I talked with Zvi Mowshowitz, who is quite skilled at seeing all the things being communicated at once, about the last section of dialogue above, where Evrart Claire talks about being transparent with Joyce Messier. This befuddled me for a while.

He said that what is being communicated is:

1. I know that you're talking with my primary political opponent.
2. That's okay with me, I'm still happy to work with you.
3. Also you haven't got anything on me; nothing I've ever said is actually incriminating (as per the plausible deniability point).
   1. (By implication: And of course, if we're going to talk about shady stuff, don't forget that I've got stuff on you.)

I didn't quite notice that 1 and 2 were intended to be communicated. I don't think of myself as someone to hide, so number 1 was not something I noticed.

Anyway, I am personally kind of annoyed that both (a) many things are being communicated at once, and (b) not all of them can be said explicitly (and in fact the explicit content of the words is kind of the opposite of true and of what's being discussed). I wish either everyone stopped doing this, or I were better at tracking it all. (Zvi roughly said "That's what your brain is built for, tracking this all.")

Paranoia: A Beginner's Guide

Ben Pace6d20

I think he said to me that he disagrees with them, on their practical utility.

Dominance: The Standard Everyday Solution To Akrasia

Ben Pace6d352
