hairyfigment — LessWrong

See also: https://en.wikipedia.org/wiki/Uzumaki

That's exactly what I mean. You aren't special. It's a mistake to act like nobody else is using the same method to make decisions.

Saying Goodbye

hairyfigment3mo20

See: self-reference paradoxes.

Saying Goodbye

hairyfigment3mo1717

I do have to note that "secure her slice of the lightcone" seems like laughable nonsense to me, unless it's a fancy way of saying 'live comfortably until everyone dies.' A rationally selfish entity would be trying to delay AGI until they had some hope of understanding what they were doing. What I see happening is, instead, the behavior of contemptibly stupid and short-sighted primates.

Thane Ruthenis's Shortform

hairyfigment5mo64

This sounds like a question which can be addressed after we figure out how to avoid extinction.

I do note that you were the one who brought in "biological humans," as if that meant the same as "ourselves" in the grandparent. That could already be a serious disagreement, in some other world where it mattered.

Am I confused about the "malign universal prior" argument?

hairyfigment1y44

I don't see how any of it can be right. Getting one algorithm to output Spongebob wouldn't cause the SI to watch Spongebob -even a less silly claim in that vein would still be false. The Platonic agent would know the plan wouldn't work, and thus wouldn't do it.

Since no individual Platonic agent could do anything meaningful alone, and they plainly can't communicate with each other, they can only coordinate by means of reflective decision theory. That's fine, we'll just assume that's the obvious way for intelligent minds to behave. But then the SI works the same way, and knows the Platonic agents will think that way, and per RDT it refuses to change its behavior based on attempts to game the system. So none of this ever happens in the first place.

(This is without even considering the serious problems with assuming Platonic agents would share a goal to coordinate on. I don't think I buy it. You can't evolve a desire to come into existence, nor does an arbitrary goal seem to require it. Let me assure you, there can exist intelligent minds which don't want worlds like ours to exist.)

Optimistic Assumptions, Longterm Planning, and "Cope"

hairyfigment1y20

https://arxiv.org/abs/1712.05812

It's directly about inverse reinforcement learning, but that should be strictly stronger than RLHF. Seems incumbent on those who disagree to explain why throwing away information here would be enough of a normative assumption (contrary to every story about wishes.)

TurnTrout's shortform feed

hairyfigment1y20

this always helps in the short term,

You seem to have 'proven' that evolution would use that exact method if it could, since evolution never looks forward and always must build on prior adaptations which provided immediate gain. By the same token, of course, evolution doesn't have any knowledge, but if "knowledge" corresponds to any simple changes it could make, then that will obviously happen.

I was raised by devout Mormons, AMA [&|] Soliciting Advice

hairyfigment2y20

Well that's disturbing in a different way. How often do they lose a significant fraction of their savings, though? How many are unvaccinated, which isn't the same as loudly complaining about the shot's supposed risks? The apparent lack of Flat Earthers could point to them actually expecting reality to conform to their words, and having a limit on the silliness of the claims they'll believe. But if they aren't losing real money, that could point to it being a game (or a cost of belonging).

I was raised by devout Mormons, AMA [&|] Soliciting Advice

hairyfigment2y20

The answer might be unhelpful due to selection bias, but I'm curious to learn your view of QAnon. Would you say it works like a fandom for people who think they aren't allowed to read or watch fiction? I get the strong sense that half the appeal - aside from the fun of bearing false witness - is getting to invent your own version of how the conspiracy works. (In particular, the pseudoscientific FNAF-esque idea at the heart of it isn't meant to be believed, but to inspire exegesis like that on the Kessel Run.) This would be called fanfic or "fanwank" if they admitted it was based on a fictional setting. Is there something vital you think I'm missing?

LESSWRONG
LW

LESSWRONG
LW

Posts

Wikitag Contributions

Comments