orthonormal

Sequences

Staying Sane While Taking Ideas Seriously

Posts (sorted by new)

orthonormal's Shortform · 9 karma · Ω · 6y · 41 comments

Comments (sorted by newest)
Monthly Roundup #33: August 2025
orthonormal · 9d

"If we were doing a better job catching cheaters presumably people would be doing it less?"

Seems trivially easy for an idiot to start an account, lose some games, get frustrated, and start asking a chess bot what to do so that they can rescue a bad position / show that bastard what's what / vicariously enjoy winning. I expect that crowd to be the majority of cheaters, and for them to be essentially inelastic to the probability with which they get caught (the first time).

My Interview With Cade Metz on His Reporting About Lighthaven
orthonormal · 15d

I believe that Cade knows perfectly well what everyone has been saying for years; he's being disingenuous because the object level doesn't matter to him, and the only important thing is ensuring that these weirdos don't get status. He's never once engaged on simulacrum level 1 with this community.

Comp Sci in 2027 (Short story by Eliezer Yudkowsky)
orthonormal · 23d

At long last, Elon Musk has created the Yandere Simulator from the classic short story Don't Create The Yandere Simulator, Among Other Things.

Elizabeth's Shortform
orthonormal · 24d

On several of these I think you might be confounding the positively correlated variables "has heard of this question" and "is on the autism spectrum", the latter of which is anticorrelated with the kind of behavior you want (intuitive, empathetic listening without needing to ask a bunch of explicit questions). I find it unlikely that asking the question makes people worse at the behavior you want.

Fighting Obvious Nonsense About AI Diffusion
orthonormal · 4mo

Zvi is arguing "X implies Y" here. Zvi happens to believe Y but disbelieve X; however, he is writing to people who think "X and not-Y", in order to nudge them to support Y.

Here X = it is good for the US to build superintelligence fast, before China does, and Y = we should have some diffusion rules making it harder for China to catch up to the USA. 

Zvi believes Z = nobody should be building superintelligence soon, and believes Z implies Y, but it is useful to show that X implies Y as well.

orthonormal's Shortform
orthonormal · 4mo · Ω

Question for @Scott Garrabrant, @TsviBT, @Andrew_Critch, @So8res, @jessicata, and anyone else who knows the answer: the logical inductor constructed in the paper is not merely computable but also primitive recursive, right? 

Seems obvious to me (because the fixed-point prices are only ever approximated, etc.), but I want to be sure I'm not missing something.

Bandwidth Rules Everything Around Me: Oliver Habryka on OpenPhil and GoodVentures
orthonormal · 4mo

To be fair, I also think it was a correct decision to focus on the Democratic Party when advocating for sane AI notkilleveryoneism policy before the 2024 election, because that was playing to our outs. Trump will do on AI whatever he is personally paid to do, and the accelerationists have more money; anyone sane but unable/unwilling to bribe Trump has no influence whatsoever with him in this administration.

(I went around saying the above before the election, by the way.)

(Sorry for the mindkilling, it's relevant on the object level.)

orthonormal's Shortform
orthonormal · 5mo · Ω

[EDIT: Never mind, this is just Kleene's second recursion theorem!]

Quick question about Kleene's recursion theorem:

Let's say F is a computable function from ℕ^N to ℕ. Is there a single computable function X from ℕ^(N−1) to ℕ such that

X(y_2, ..., y_N) = F(⌜X⌝, y_2, ..., y_N) for all y_2, ..., y_N in ℕ

(where ⌜X⌝ is the code of X in a fixed encoding), or do there need to be additional conditions?
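As the EDIT says, this is Kleene's second recursion theorem (with parameters), so no additional conditions are needed. For the flavor of why, here is a quine-style sketch in Python (my own toy encoding, not from the comment: "programs" are Python source strings, and the string src plays the role of ⌜X⌝):

```python
# Given any computable F(code, y), we build a program X with
# X(y) = F(<code of X>, y) via the standard quine trick:

template = '''\
def F(code, y):
    # toy choice of F: length of the program's own source, plus the input
    return len(code) + y

t = {t!r}
src = t.format(t=t)   # quine trick: src is this very program's source text

def X(y):
    return F(src, y)  # X applies F to its own code, as the theorem requires
'''

src = template.format(t=template)
ns = {}
exec(src, ns)
print(ns["X"](0), ns["X"](5))  # prints len(src) and len(src) + 5
```

The extra arguments y_2, ..., y_N ride along unchanged, which is why the parametrized version follows from the same construction via the s-m-n theorem.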

orthonormal's Shortform
orthonormal · 5mo · Ω

My current candidate definitions, with some significant issues in the footnotes:

A fair environment is a probabilistic function F(x_1, ..., x_N) = [X_1, ..., X_N] from an array of actions to an array of payoffs.

An agent A is a random variable

A(F, A_1, ..., A_{i−1}, A_i = A, A_{i+1}, ..., A_N)

which takes in a fair environment F[1] and a list of agents (including itself), and outputs a mixed strategy over its available actions in F.[2]

A fair agent is one whose mixed strategy is a function of subjective probabilities[3] that it assigns to [the actions of some finite collection of agents in fair environments, where any agents not appearing in the original problem must themselves be fair]. 

Formally, if A is a fair agent with a subjective probability estimator P, then A's mixed strategy in a fair environment F,

A(F, A_1, ..., A_{i−1}, A_i = A, A_{i+1}, ..., A_N),

should depend only on a finite collection of A's subjective probabilities about outcomes

{P(F_k(A_1, ..., A_N, B_1, ..., B_M) = [X_1, ..., X_{N+M}])}_{k=1}^{K}

for a set of fair environments F_1, ..., F_K and an additional set of fair[4] agents[5] B_1, ..., B_M if needed (note that not all agents need to appear in all environments).

A fair problem is a fair environment with one designated player, where all other agents are fair agents. (These type signatures are sketched in code after the footnotes.)

  1. ^

    I might need to require every F to have a default action d_F, so that I don't need to worry about axiom-of-choice issues when defining an agent over the space of all fair environments.

  2. ^

    I specified a probabilistic environment and mixed strategies because I think there should be a unique fixed point for agents, such that this is well-defined for any fair environment F. (By analogy to reflective oracles.) But I might be wrong, or I might need further restrictions on F.

  3. ^

    Grossly underspecified. What kinds of properties are required for subjective probabilities here? You can obviously cheat by writing BlueEyedBot into your probability estimator.

  4. ^

    This is an infinite recursion, of course. It works if we require each B_m to have a strictly lower complexity in some sense than A (e.g. the rank of an agent is the largest number K of environments it can reason about when making any decision, and each B_m needs to be lower-rank than A), but I worry that's too strong of a restriction and would exclude some well-definable and interesting agents.

  5. ^

    Does the fairness requirement on the B_m suffice to avert the MetaBlueEyedBot problem in general? I'm unsure.
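For concreteness, a minimal Python sketch of the type signatures above (my own illustrative encoding, not a proposed solution; it deliberately ignores the fixed-point and estimator issues flagged in the footnotes, which are the actual open problems):

```python
import random
from typing import Callable, List, Tuple

Actions = Tuple[int, ...]      # one action per player
Payoffs = Tuple[float, ...]    # one payoff per player
# A fair environment: a (possibly probabilistic) map from the action
# profile alone to payoffs, with no access to the agents themselves.
FairEnv = Callable[[Actions], Payoffs]
MixedStrategy = List[float]    # probability assigned to each available action
# An agent sees the environment and the full agent list (itself included,
# at its own index) and returns a mixed strategy.
Agent = Callable[[FairEnv, List["Agent"], int], MixedStrategy]

def noisy_matching_pennies(actions: Actions) -> Payoffs:
    # example of a probabilistic fair environment: payoffs depend only on
    # the action profile, plus 1% random noise
    win = 1.0 if actions[0] == actions[1] else -1.0
    if random.random() < 0.01:
        win = -win
    return (win, -win)
```

On this encoding, a fair problem is a FairEnv plus a designated index, with fair agents filling the other slots; all of the real difficulty lives in footnotes 3–5, i.e. in deciding which values of Agent count as fair.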

orthonormal's Shortform
orthonormal · 5mo · Ω

How do you formalize the definition of a decision-theoretically fair problem, even when abstracting away the definition of an agent as well as embedded agency? 

I've failed to find anything in our literature.

It's simple to define a fair environment, given those abstractions: a function E from an array of actions to an array of payoffs, with no reference to any other details of the non-embedded agents that took those actions and received those payoffs.

However, fair problems are more than just fair environments: we want a definition of a fair problem (and fair agents) under which, among other things:

  • The classic Newcomb's Problem against Omega, with certainty or with 1% random noise: fair
  • Omega puts $1M in the box iff it predicts that the player consciously endorses one-boxing, regardless of what it predicts the player will actually do (e.g. the player misunderstands the instructions and takes a different action than they endorsed): unfair
  • Prisoner's Dilemma between two agents who base their actions on not only each others' predicted actions in the current environment, but also their predicted actions in other defined-as-fair dilemmas: fair
    • For example, PrudentBot will cooperate with you if it deduces that you will cooperate with it and also that you would defect against DefectBot (because it wants to exploit CooperateBots); a toy sketch below makes this criterion concrete.
  • Prisoner's Dilemma between two agents who base their actions on each others' predicted actions in defined-as-unfair dilemmas: unfair
    • It would let us smuggle in unfairness from other dilemmas; e.g. if BlueEyedBot only tries Löbian cooperation against agents with blue eyes, and MetaBlueEyedBot only tries Löbian cooperation against agents that predictably cooperate with BlueEyedBot, then the Prisoner's Dilemma against MetaBlueEyedBot should count as unfair.

Modal combat doesn't need to worry about this, because all the agents in it are fair-by-construction.
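To make the PrudentBot example concrete, here is a toy depth-limited simulation in Python (my own sketch, not the modal construction from the robust-cooperation literature: real modal combat uses provability logic rather than simulation, and the optimistic base case below is a crude stand-in for the Löbian step):

```python
from typing import Callable, List

Move = str  # "C" or "D"
# a bot sees the lineup, its own index, and a recursion budget
Bot = Callable[[List["Bot"], int, int], Move]

def cooperate_bot(lineup, i, depth=0) -> Move:
    return "C"

def defect_bot(lineup, i, depth=0) -> Move:
    return "D"

def prudent_bot(lineup, i, depth=3) -> Move:
    # Cooperate iff the opponent cooperates with me AND defects against
    # DefectBot (so CooperateBot gets exploited, as described above).
    if depth == 0:
        return "C"  # optimistic base case, standing in for Löb's theorem
    j = 1 - i
    coop_with_me = lineup[j](lineup, j, depth - 1) == "C"
    vs_db = list(lineup)
    vs_db[i] = defect_bot                          # swap myself out for DefectBot
    defects_vs_db = lineup[j](vs_db, j, 2) == "D"  # terminates without the cap
    return "C" if (coop_with_me and defects_vs_db) else "D"

print(prudent_bot([prudent_bot, prudent_bot], 0))    # C: mutual cooperation
print(prudent_bot([prudent_bot, cooperate_bot], 0))  # D: exploits CooperateBot
print(prudent_bot([prudent_bot, defect_bot], 0))     # D
```

BlueEyedBot-style unfairness would enter exactly where this sketch calls lineup[j] as a black box: nothing in the type signature stops a bot from branching on the opponent's identity rather than its behavior, which is what the definition of a fair agent needs to rule out.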

Yeah, I know, it's about a decade late to be asking this question.

Posts

How I Learned To Stop Trusting Prediction Markets and Love the Arbitrage · 200 karma · 1y · 30 comments
Run evals on base models too! · 49 karma · Ω · 1y · 6 comments
Mesa-Optimizers via Grokking · 36 karma · Ω · 3y · 4 comments
Transitive Tolerance Means Intolerance · 39 karma · 4y · 13 comments
Improvement for pundit prediction comparisons · 16 karma · 4y · 4 comments
Developmental Stages of GPTs · 140 karma · Ω · 5y · 72 comments
Don't Make Your Problems Hide · 62 karma · 5y · 5 comments
Map Errors: The Good, The Bad, and The Territory · 30 karma · 5y · 4 comments
Negotiating With Yourself · 28 karma · 5y · 0 comments
[Link] COVID-19 causing deadly blood clots in younger people · 46 karma · 5y · 10 comments
Wikitag Contributions

Qualia · 5y · (+466)
Modal combat · 9y · (+281/-7)
Mathematical induction · 9y · (+421)
Modal logic · 9y · (+6)
Löb's theorem · 9y · (+38)
Löb's theorem · 9y
Peano Arithmetic · 9y · (+809)