Preserving and continuing alignment research through a severe global catastrophe

[-]UHMWPE-UwU4y170

I wrote about this on EA Forum a few days ago. I'm glad others are starting to think about this. I do think archiving all existing alignment work is very important and perhaps equally important as efforts to keep alive people who represent existing experts & talent in the field. It would be much better for them to be able to continue their work than for new people to attempt to pick off where they left off, especially since many things like intuitions honed over time etc. may not be readily learnable.

I'm increasingly inclined to think that a massive "shock" in the near future (like a nuclear war or a severe pandemic) which effectively halts economic progress, perhaps for a few decades or more, then restarts it at a lower baseline, may be one of the few remaining scenarios we can reasonably expect to survive AGI, taking into account the grim present strategic situation as Eliezer outlined in the recent sequence. Such a world might especially favour alignment since AI work (prosaic AI especially) seems to be much more capital intensive than alignment work, so in a post-shock world with less capital available it would be disadvantaged or impossible to continue carrying out at all. There are a few other reasons such a catastrophic shock may actually increase our collective odds of success re: AI risk, such as a greatly reduced population implying fewer AGI projects & race pressures, etc., morbid as it is.

Given this, the OP's project is doubly important.

[-]Yitz4y10

Assuming your beliefs as stated above are truly held, why shouldn’t I be worried that you’ll try to deliberately induce such a “shock,” and thereby undertake action to kill a significant percentage of the (currently living) population?

(Apologies for being horribly blunt, not sure how else to word this)

[This comment is no longer endorsed by its author]Reply

[-]Yitz4y40

Interesting! One potential downside my mind immediately goes to is public perception, in the (hopefully probable) case that such a contingency plan isn’t needed. In popular culture, the idea of a privileged (usually very wealthy) class of people escaping to an “ark” as the world ends for everyone else is generally considered a classic evil villain trope. For instance, in Don’t Look Up (a recent Hollywood blockbuster involving a GCR), the good guy scientists are offered a refuge in the evil president’s secret escape spaceship, but refuse. This is presented as the heroic and correct thing to do, even though refusing was an effective act of suicide (within the context of the movie). Not that your idea is actually in any way a bad one, but I would wager that the similarities between your proposition and what evil Hollywood villains stereotypically do is likely to increase the public perception of EA folks being cult-like (if your plan captures any press attention), which could potentially drive talent away, and discourage outsiders from cooperating with the community. All that being said, this is ultimately a rather minor concern compared to, say, the possibility of human extinction, so take the above with a grain of salt. If you do plan on going ahead with this on a large scale, I would definitely talk to some people outside the community with PR experience, so as to minimize any possible negative social effects. Good luck!!!

[-]A_donor4y50

I'm hopeful that most people would see the difference between "rich people trying to save their own skin" and "allowing researchers who are trying to make sure humanity has a long-term future at all to continue their work", but I would be very happy to have leads on who to talk to about presenting this well.

[-]magic9mushroom2y10

Should be noted that while there are indeed tons of people who will fault you for taking steps to survive GCR, in the aftermath of a GCR most of those people will be dead (or at the very least, hypocrites who did the thing they're upset about) and thus not able to fault you for anything. History is written by, if not the winners, at least the survivors.

Admittedly, this is contingent on the GCR happening, but I think there's a pretty-high chance of nuclear war in particular in the near future (the Paul Symon interview in particular has me spooked; a random saying that a "linear path" leads to "major-power conflict" would be meh, but a Five Eyes intelligence chief saying it - well, I might be right or wrong about my guesses at what's prompting that, but I'll take the oracle statement at face value and that's P(WWIII) ~> 0.5).

[+][comment deleted]4y10

[-]jrincayc2y10

Sounds interesting.

For somewhat related reasons, I made a 2 column version of Rationality from AI to Zombies https://github.com/jrincayc/rationality-ai-zombies which is easier to print than the original version, and have printed multiple copies in the hope that some survive a global catastrophe.

[-]TLW4y10

Various forms of embossing/etc on metal sheeting can also be decent, although beware the tradeoff of 'cheap metals corrode; expensive metals have a tendency to get melted down because they are expensive'.

[-]Donald Hobson4y30

Stainless steel is not that expensive, and pretty corrosion resistant. Although laser etched glass may be a better option.

[-]TLW4y10

Stainless steel is an option. It does still corrode long-term^[1]. It works fine over decade-to-century timescales for structural applications^[2]; I don't know if we can trust it to retain fine details^[3] over long timescales^[4].

Laser etched glass is interesting, though brittle.

^{^}
Said corrosion is slow, especially in proper conditions (in still dry air, no other metals around for galvanic corrosion, etc); it is not, and cannot be, non-existent.
^{^}
...most of the time. Salt water destroys everything.
^{^}
1mm of corrosion in a 2cm-deep structural member is far less of a problem than 1mm of corrosion on 0.1mm-deep lettering.
^{^}
The longest study I found on atmospheric exposure of stainless steel was 10 years. Somewhat surprising considering that stainless steel has been around for ~180y at this point (1840s or so).

[-]Dustin4y10

Footnote #5 seems as if it cuts off too soon.

[-]A_donor4y10

Fixed, thanks.

[+][comment deleted]4y10

^{^}

"The best time to plant a tree is twenty years ago. The second best time is now." - Quote

^{^}

They want to use it to train language models to help with alignment research, but it aims to contain exactly what we'd want.

^{^}

Work In Progress

^{^}

Pull Request - A way of suggesting changes to a repository using version control, usually used in programming.

^{^}

Global Catastrophic Risk - An event which causes massive global disruption, such as a severe pandemic or nuclear war.

^{^}

The website is unclear on whether it's immediately available.

^{^}

If you're a researcher and want to be on the list, feel free to contact me with your location and I'll keep track of everyone's requests. We might possibly use Alignment EigenKarma as an unbiased metric to prioritize if that exists in time.

^{^}

Unless anyone knows of good places which might be joinable already, if you do please message me!

^{^}

They are compatible with Rationalist/EA culture, more likely than most to be able to create stable communities, and some of them like the idea of building strong community for the benefit of all of humanity.

^{^}

I have a reasonably strong track record as a Mentor/Manager/Mysterious Old Wizard/Funder package deal. If you're enthusiastic and bright don't worry if the task seems overwhelming, I can help you pick up the skills and decompose tasks.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

48

Preserving and continuing alignment research through a severe global catastrophe

48

48

Introduction

Preserving alignment knowledge through a global catastrophe

What data do we want to store?

How do we want to store it?

Where do we store it?

Continuing alignment research after a global catastrophe

Evacuation plans

Designing havens

Call to action