Focus on the places where you feel shocked everyone's dropping the ball

Hell yeah!

This matches my internal experience that caused me to bring a ton of resources into existence in the alignment ecosystem (with various collaborators):

aisafety.info - Man, there really should be a single point of access that lets people self-onboard into the effort. (Helped massively by Rob Miles's volunteer community, soon to launch a paid distillation fellowship)
aisafety.training - Maybe we should have a unified place with all the training programs and conferences so people can find what to apply to? (AI Safety Support had a great database that just needed a frontend)
aisafety.world - Let's make a map of everything in AI existential safety so people know what orgs, blogs, funding sources, resources, etc exist, in a nice sharable format. (Hamish did the coding, Superlinear funded it)
ea.domains - Wow, there sure are a lot of vital domains that could get grabbed by squatters. Let's step in and save them for good orgs and projects.
aisafety.community - There's no up-to-date list of online communities. This is an obvious missing resource.
Rob Miles videos are too rare, almost entirely bottlenecked on the research and scriptwriting process. So I built some infrastructure which allows volunteers to collaborate as teams on scripts for him, being tested now.
Ryan Kidd said there should be a nice professional site which lists all the orgs in a format which helps people leaving SERI MATS decide where to apply. aisafety.careers is my answer, though it's not quite ready yet. Volunteers wanted to help write up descriptions for orgs in the Google Docs we have auto-syncing with the site!
Nonlinear wanted a prize platform, and that seemed pretty useful as a way to usefully use the firehose of money while FTXFF was still a thing, so I built Superlinear.
There are a lot of obvious low-hanging fruit here. I need more hands. Let's make a monthly call and project database so I can easily pitch these to all the people who want to help save the world and don't know what to do. A bunch of great devs joined!
and 6+ more major projects as well as a ton of minor ones, but that's enough to list here.

I do worry I might be neglecting my actual highest EV thing though, which is my moonshot formal alignment proposal (low chance of the research direction working out, but much more direct if it does). Fixing the alignment ecosystem is just so obviously helpful though, and has nice feedback loops.

[-]romeostevensit3y2211

I've kept updating in the direction of do a bunch of little things that don't seem blocked/tangled on anything even if they seem trivial in the grand scheme of things. In the process of doing those you will free up memory and learn a bunch about the nature of the bigger things that are blocked while simultaneously revving your own success spiral and action-bias.

[-]plex3y60

Yeah, that makes a lot of sense and fits my experience of what works.

[-]Garrett Baker3y5714

I like this post, with one exception: I don't think putting out fires feels like putting out fires. I think it feels like you're utterly confused, and instead of nodding your head when you explain the confusion & people try to resolve it but you don't understand them, continuing to actively notice & chase the confusion no matter how much people decrease your status due to you not being able to understand what they're saying. It feels far more similar to going to school wearing a clown suit than heroically putting out obvious-to-you fires.

[-]LawrenceC3y3521

Upvoted but strong disagree.

I think "putting out fires" has more of the correct connotations -- insofar as I'm correctly identifying what Nate means, it feels more like defiance and agency than anything about status. I know fully well that most of the fires I'm addressing/have addressed are not considered fires by other people (or they would've put them out already)! It feels like being infuriated that no one is doing the obvious thing and almost everyone I talk to is horribly unreasonable about this, so it's time to roll up my sleeves and go to work.

On the other hand, I think going to school wearing a clown suit has many wrong connotations. It brings in feelings of shame and self-consciousness, when the appropriate emotion is (imo) defiance and doing the blazing obvious thing! I don't think the Shard Theory folk think they are wearing a clown suit; in my interactions with him I feel like Alex Turner tends to be more defiant or infuriated than self-conscious. (Feel free to correct me if this is not the case!)

[-]TurnTrout3y229

Shard theory did have some clown suit energy at first. Shard theory / disagreeing strongly with Eliezer (!) felt like wearing a clown suit, to some part of me, but the rest of me didn't care.

I also felt something like "if I can't communicate these ideas or am not willing to burn status to get eyes on them, there was no point in my having had status anyways." From Looking back on my alignment PhD:

I realized my gut expectations were that he was broadly correct and that this theory of alignment could actually be right. However, I realized I wasn't consciously letting myself think that because it would be Insufficiently Skeptical to actually think the alignment problem is solvable. This seemed obviously stupid to me, so I quickly shut that line of thinking down and second-order updated towards optimism so that I would stop predictably getting more optimistic about Quintin's theory.
I realized I assigned about 5% credence^[1] to "this line of thinking marks a direct and reasonably short path to solving alignment." Thus, on any calculation of benefits and harms, I should be willing to stake some reputation to quickly get more eyeballs on the theory, even though I expected to end up looking a little silly (with about 95% probability). With my new attitude, I decided "whatever, let's just get on with it and stop wasting time."
The old "don't leave any avenue of being criticized!" attitude would have been less loyal to my true beliefs: "This could work, but there are so many parts I don't understand yet. If I figure those parts out first, I can explain it better and avoid having to go out on a limb in the process." Cowardice and social anxiety, dressed up as prudence and skepticism.

These days, I do feel more defiant/irritated, and not like I'm wearing a clown suit.

^{^}
"Is there really something there with shard theory?" does not feel like a live question to me anymore, because it resolved "yes", in my view. But I also have closed off the more optimistic ends of my uncertainty, where I thought there was a ~5% chance of quickly and knowably-to-me solving alignment.

[-]LawrenceC3y104

I also felt something like "if I can't communicate these ideas or am not willing to burn status to get eyes on them, there was no point in my having had status anyways."

Yeah, I resonate very strongly with this feeling as well! The whole reason to have generic resources is to spend them on useful things!

[-]Adam Zerner3y*2720

Upvoted but disagreed. It isn't my model of what putting out fires feels like most of the time but I'm not sure, it's plausible, and if it's true it's important.

It also makes me think that maybe it's super, critically important to have social norms that make wearing a clown suit not so bad. There are downsides to this of course but if the importance of wearing a clown suit is that high it probably outweighs the downsides enough such that the optimal point on the spectrum is pretty close to "not too uncomfortable wearing the suit".

[-]Viliam3y50

Sometimes people are confused because their model is worse that everyone else's (i.e. of everyone involved in given situation). Sometimes people are confused because their model is better... and they noticed a problem that other's do not see yet, but they do not yet know how to solve the problem themselves.

What you describe sounds to me like someone who sees a problem, cannot solve it fully, but at least has a few guesses how to reduce it in short term, so keeps doing at least that. While other people either genuinely do not see the problem, or have a vague idea but also see very clearly the status cost of acting worried while everyone else remains calm.

[-]Elizabeth3y5020

This was a big driver behind my vegan nutrition project: I could not believe people were strongly pushing drastic diet changes without any concern for nutrition, when the mitigations were costless morally and almost so resource-wise.

[-]MSRayne3y10

Ooh, this could be useful to me, thank you!

[-]janus3y3119

I adore this post.

“optimize for your own understanding” chase the things that feel alive and puzzling to you, as opposed to dutifully memorizing other people’s questions and ideas. “[D]on’t ask “What are the open questions in this field?” Ask: “What are my questions in this field?””

Basically everything I've done that I think has been helpful at all has been the result of chasing the things that feel alive and puzzling to me.

When I feel stagnated, I very often find that it's because I've been thinking too much in the frame of "the alignment problem as other people see it".

[-]Said Achmiz3y2216

For a concrete example, consider Devansh. Devansh came to me last year and said something to the effect of, “Hey, wait, it sounds like you think Eliezer does a sort of alignment-idea-generation that nobody else does, and he’s limited here by his unusually low stamina, but I can think of a bunch of medical tests that you haven’t run, are you an idiot or something?” And I was like, “Yes, definitely, please run them, do you need money”.

What sort of tests might these be, can you say? Eliezer is certainly not the only one with “low stamina” problems, and if there are medical tests to run that he wouldn’t already have had done, I’d like to know about them!

[-]devansh3y221

This list is quite good - https://mecfsroadmap.altervista.org/ Feel free to DM me if you want to chat more.

[-]johnlawrenceaspden3y154

Me too! Is there a list somewhere of 'tests to run in case of fatigue/low energy'?

There is thyroid, diabetes, anaemia, sleep apnoea, and....

[-]rchplg3y73

Relatedly on "obviously dropping the ball": has Eliezer tried harder stimulants? With his P(doom) & timelines, there's relatively little downside to this done in reasonable quantities I think. Seems very likely to help with fatigue

From what I've read, main warning would be to get harder blocks on whatever sidetracks eliezer (e.g. use friends to limit access, have a child lock given to a trusted person, etc)

[-]mic3y30

@devansh

[-]Richard Korzekwa3y210

Like, keep your eye out. For sure, keep your eye out.

I think this is related to my relative optimism about people spending time on approaches to alignment that are clearly not adequate on their own. It's not that I'm particularly bullish on the alignment schemes themselves, it's that don't think I'd realized until reading this post that I had been assuming we all understood that we don't know wtf we're doing so the most important thing is that we all keep an eye out for more promising threads (or ways to support the people following those threads, or places where everyone's dropping the ball on being prepared for a miracle, or whatever). Is this... not what's happening?

[-]Linda Linsefors3y60

Is this... not what's happening?

No by default.

I did not have this mindset right away. When I was new to AI Safety I though it would require much more experience before I was qualified to question the consensus, because that is the normal situation, in all the old sciences. I knew AI Safety was young, but I did not understand the implications at first. I needed someone to prompt me to get started.

Because I've run various events and co-founded AI Safety Support, I've talked to loooots of AI Safety newbies. Most people are too causes when it comes to believing themselves and too ready to follow authorities. It's usually only takes a short conversation pointing out how incredibly young AI Safety is, and what that means, but many people do need this one push.

[-]Raemon3y154

Curated.

I don't have as strong intuitions about this as So8res does, but I do think this is a useful heuristic. The post feels both useful epistemically, and motivationally. I liked reading comments from other people describing plans they embarked on because it seemed like "holy shit, is nobody doing this?"

I do kinda wish I had more than vague intuitions backing this up. Ironically, while I'd be interested in someone studying "What motivations tend to drive the largest effect sizes on humanity? How do you control for survivorship bias? Is So8res right about this being a useful prompt?"... it neither gives me a strong sense of "geez, everyone is idiotically dropping the ball on this, I believe in my heart this is the best thing" nor seem really like the top result of a measured, careful spreadsheet of possible goals.

(But, you're reading this and thinking either "man, people are idiotically dropping the ball not having done a rigorous analysis of this" or "man I think So8res is wrong that you need to believe in your goals for them to be particularly useful, and my careful spreadsheet of goals says that measuring this effect is the best use of my time", um, I'm interested in what you find)

[-]Richard Korzekwa3y21

What motivations tend to drive the largest effect sizes on humanity?

FWIW, I think questions like "what actually causes globally consequential things to happen or not happen" are one of the areas in which we're most dropping the ball. (AI Impacts has been working on a few related question, more like "why do people sometimes not do the consequential thing?")

How do you control for survivorship bias?

I think it's good to at least spot check and see if there are interesting patterns. If "why is nobody doing X???" is strongly associated with large effects, this seems worth knowing, even if it doesn't constitute a measure of expected effect sizes.

[-]Tor Økland Barstad3y*140

For the record: The kind of internal experience you describe is a good description of how things currently feel to me (when I look at alignment discourse).

My internal monologue is sort of like:

Here are various ideas/concepts/principles that seem to me like low-hanging fruit and potentially very important for AI aligment. It seems kind of weird that ideas along these lines aren't already being discussed extensively. Weird enough that it makes me experience significant cognitive dissonance.
Are these ideas maybe less new compared to what it seems like to me? Or are they maybe misguided in ways I don't realize? Maybe. But it really seems like groups of smart people are overlooking ideas/concepts/considerations in ways that I feel to be somewhat baffling/surprising/weird.

I'm working on posts with more well-developed versions of these ideas, where I also try to explain things better and more quickly than I've done previously. In the meantime, the best summary that I currently can point people to are these tweet-threads:

[-]Adam Zerner3y*149

For a concrete example, consider Devansh. Devansh came to me last year and said something to the effect of, “Hey, wait, it sounds like you think Eliezer does a sort of alignment-idea-generation that nobody else does, and he's limited here by his unusually low stamina, but I can think of a bunch of medical tests that you haven't run, are you an idiot or something?" And I was like, "Yes, definitely, please run them, do you need money".

I've always wondered about things in this general area. Higher levels of action that improve the productivity of alignment researchers (well not just researchers, anyone in the field) seems like a very promising avenue to explore.

For example, I know that for me personally, "dealing with dinner" often takes way longer than I hope, consumes a lot of my time, and makes me less productive. That's a problem that could easily be solved with money (which I'm working towards). Do alignment researchers also face that problem? If so it seems worth solving.

Continuing that thought, some people find cooking to be relaxing and restorative but what about things like cleaning, paperwork, and taxes? Most people find that to be somewhat stressful, right? And reducing stress helps with productivity, right? So maybe some sort of personal assistant a la The 4-Hour Work Week for alignment researchers would make sense.

And for medical stuff, some sort of white glove membership like what Tim Urban describes + resurrecting something like MetaMed to be available as a service for higher-impact people like Eliezer also sounds like it'd make sense.

Or basically anything else that can improve productivity. I was gonna say "at a +ROI" or something, but I feel like it almost always will be. Improved productivity is so valuable, and things like personal assistants are relatively so cheap. It reminds me of something I heard once about rich businesspeople needing private yachts: if the yacht leads to just one more closed deal at the margin then it paid for itself and so is easily worth it. Maybe alignment researchers should be a little more "greedy" in that way.

A different way to improve productivity would be through better pedagogy. Something I always think back to is that in dath ilan "One hour of instruction on a widely-used subject got the same kind of attention that an hour of prime-time TV gets on Earth". I don't get the sense that AI safety material is anywhere close to that level. Bringing it to that point would mean that researchers -- senior, junior, prospective -- would have an easier time going through the material, which would improve their productivity.

I'm not sure how impactful it would be to attract new researchers vs empowering existing ones, but if attracting new researchers is something that would be helpful I suspect that career guidance sorts of things would really yield a lot of new researchers.

Well, I had "smart SWE at Google who is interested in doing alignment research" in mind here. Another angle is recruiting top mathematicians and academics like Terry Tao. I know that's been discussed before and perhaps pursued lightly, but I don't get the sense that it's been pursued heavily. Being able to recruit people like Terry seems incredibly high impact though. At the very least it seems worth exploring the playbooks of people in related fields like executive recruiters and look for anything actionable.

Probably more though. If you try to recruit an individual like Terry there's an X% chance of having a Y% impact. OTOH, if you come across a technique regarding such recruitment more generally, then it's an X% chance of finding a technique that has a Y% chance of working on Z different people. Multiplying by Z seems kinda big, and so learning how to "do recruitment" seems pretty worthwhile.

A lot of this stuff requires money. Probably a lot of it. But that's a very tractable problem, I think. And maybe establishing that ~$X would yield ~Y% more progress would inspire donors. Is that something that has been discovered before in other fields?

Or maybe funding is something that is already in abundance? I recall hearing that this is the case and that the limitation is ideas. That never made sense to me though. I see a lot of things like those white glove medical memberships that seem obviously worthwhile. Are all the alignment researchers in NYC already members of The Lanby for $325/month? Do they have someone to clean their apartments? If not and if funding truly is abundant, then I "feel shocked that everyone's dropping the ball".

[-]Linda Linsefors3y1110

Funding is not truly abundant.

There are people who have above zero chance of helping that don't get upskilling grants or research grants.
There are several AI Safety orgs that are for profit in order to get investment money, and/or to be self sufficient, because given their particular network, it was easier to get money that way (I don't know the details of their reasoning).
I would be more efficient if I had some more money and did not need to worry about budgeting in my personal life.

I don't know to what extent this is due to the money not existing, or it's due to grant evaluation is hard, and there are some reason to not give out money to easily.

[-]Viliam3y42

Cooking is a great example. People eat every day; even small costs (both time and money) are multiplied by 365. Rationalists in Bay Area are likely to either live together, or work together, so the distribution could also be trivial: bring your lunch box to work. So if you are bad at research, but good at cooking, you could contribute indirectly by preparing some tasty and healthy meals for the researchers.

(Possible complications: some people would want vegan meals, or paleo meals, could have food allergies, etc. Still, if you cooked for 80% of them, that could make them more productive.)

Or generally, thinking about things, and removing trivial inconveniences. Are people more likely to exercise during a break, if you bring them some weights?

Sometimes money alone is not enough, because you still have the principal-agent problem.

Another angle is recruiting top mathematicians and academics like Terry Tao. I know that's been discussed before and perhaps pursued lightly, but I don't get the sense that it's been pursued heavily.

Yeah, the important thing, if he was approached and refused, would be to know why. Then maybe we can do something about it, and maybe we can't. But if we approach 10 people, hopefully we will be able to make at least one of them happy somehow.

[-]Adam Zerner3y30

Or generally, thinking about things, and removing trivial inconveniences. Are people more likely to exercise during a break, if you bring them some weights?

Ah, great point. That makes a lot of sense. I was thinking about things that are known to be important like exercise and sleep but wasn't really seeing ways to help people with that but trivial inconveniences seem like a problem that people have and is worth solving. I'd think the first step would be either a) looking at existing research/findings for what these trivial inconveniences are likely to be or maybe b) user interviews.

Yeah, the important thing, if he was approached and refused, would be to know why. Then maybe we can do something about it, and maybe we can't.

Yes, absolutely. It reminds me a little bit of Salesforce. Have a list of leads; talk to them; or the ones that don't work out add notes discussing why; over time go through the notes and look for any learnings or insights. (I'm not actually sure if salespeople do this currently.)

[-]Linda Linsefors3y30

I "feel shocked that everyone's dropping the ball".

Maybe not everyone
The Productivity Fund (nonlinear.org)
Although this project has been "Coming soon!" for several months now. If you want to help with the non-dropping of this ball, you could check in with them to see if they could use some help.

[-]MichaelDickens6mo123

In the version of this mental motion I’m proposing here, you keep your eye out for ways that everyone's being totally inept and incompetent, ways that maybe you could just do the job correctly if you reached in there and mucked around yourself.

I would go even further than that—you don't have to think you can do the job correctly. Some of my most fruitful projects started with me thinking something like

Everyone is dropping the ball on this. I really don't think I can do the job correctly, but someone should at least give it a shot, so I might as well do it.

[-]Metacelsus3y123

I feel shocked that so little effort is being put into human genetic enhancement, relative to its potential. Everyone here seems focused on AI!

[-]GeneSmith3y5-3

Strongly agree with this one. It's pretty clear from plant breeding and husbandry that one can push any given trait tens of standard deviations from its natural mean even just using brain-dead selective breeding techniques. Research from Shai Carmi, Steve Hsu and others has shown that most traits are relatively independent from another (meaning most alleles that affect one trait don't affect another trait). And most genes have a linear effect: they increase or decrease some trait by an amount, and don't require some gene-gene interaction term to model.

Together these suggest that we could likely increase positive traits in humans such as prosocial behavior, intelligence, health and others by gigantic amounts simultaneously.

This is already possible to a limited degree using IVF and polygenic predictors already available. A gain of perhaps 0.2-1 standard deviations on a variety of traits is already feasible using simple embryo selection alone.

I've been working on a guide for people to do this for almost a year now. It has been an incredibly involved research project, mostly because I've spent a huge amount of time trying to quantify which IVF clinics in the US are best and how large of an advantage picking a really good one can give you.

Embryo selection to reduce disease, increase intelligence, and reduce dark tetrad traits in future generations is just such an obvious no-brainer. The expected medical savings alone are in the hundreds of thousands of dollars. Throw in extra earnings from a higher IQ, and reduced societal costs from less crime, greater community bonds etc and you may understand why I think genetic engineering holds such incredible promise.

[-]Nathan Helm-Burger3y31

Not enough time. I was researching intelligence enhancement of adults via genetically engineering delivered hy viral vectors. Then my timelines for AGI got shorter than my time estates for having a testable prototype. I switched to directly studying machine learning. That was like 7 years ago. We probably have less than 5 years left. That's not much time for genetic engineering projects.

[-]GeneSmith3y32

That’s probably true. I’m taking a gamble that only pays off in world where biological brains matter for at least another 30 years. But given the size of the potential impact and the neglected mess, I think it is a gamble worth taking.

[-]Matt Vincent3y11

I would be completely on-board with this if there was a method of improvement other than IVF embryo selection, since I consider human embryos to have moral value. Even if you don't, unless you're very sure of your position, I'd ask you to reconsider on the basis of the precautionary principle alone--i.e. if you're wrong, then you'd be creating a huge problem.

[-]GeneSmith3y2-1

I’d give us 50% odds of developing the technology capable of human genetic enhancement without excess embryos in the next decade. Editing looks like the most plausible candidate, though chromosome selection also looks pretty feasible.

I’ve given a lot of thought to the question of whether discarding embryos is acceptable. Maybe I’ll write a post about this at some point, but I’ll try to give a quick summary:

At the time of selection, human embryos have about 100 cells. They have no brain, no heart, and no organs. They don’t even have a nervous system. If they stopped development and never grew into humans, we would give them zero moral weight. Unless you believe that the soul enters the embryo during fertilization, the moral importance of an embryo is entirely down to its potential to develop into a human.
The potential of any given pairing of egg and sperm is almost unchanged after fertilization. A given pairing of sperm and egg will produce the same genome every time. I don’t see a clear line at fertilization regarding the potential of a particular sperm/egg coupling.
Roughly a third of regular non-IVF pregnancies end in miscarriage; usually before the mother even knows she’s pregnant. The rate of miscarriage approaches 100% towards a woman’s late 40s. If embryos are morally equivalent to babies, there is a huge ongoing preventable moral disaster going on during normal conception, to the point where one could make a case that unprotected sex between 40 and menopause is immoral.

[-]Matt Vincent2y41

Thanks for the response. I realize that this is a very belated reply, and that it would have done a lot more good prior to the release of your How-To-PSC essay. Nevertheless, I'll respond to a few of your points.

For one thing, an embryo that was conceived from the gametes of two humans doesn't "grow into a human" or "develop into a human"; it is a human. I'm not saying that this necessarily confers moral worth, but it does jog the question of which trait does, and you don't provide a strong alternative.

In defense of the ZEF's potentiality: before fertilization, an arbitrary pair of sperm and egg isn't a coherent object any more than union of my left sock and the moon is a coherent object. In contrast, after fertilization, it's the sperm and the egg that cease to be coherent objects. The egg releases chemical signals to reject additional sperm, the successful sperm's cell membrane disintegrates, and the former contents of the gametes are bound together within one structure: the zygote.

I think that natural pregnancies are more nuanced than that, although I do agree that it involves an ongoing moral disaster to some extent. I don't think it's immoral for a woman to become pregnant despite the high miscarriage rate--just as I don't believe it was immoral for a woman living 1,000 years ago to become pregnant, even though a third of her children who were born would die by the age of 5. Instead, I think that there's an imperative on society to develop medical technology that prevents (pre)natal deaths.

[-]GeneSmith2y20

Interesting viewpoint. I think your point about the morality of having children despite the high natural miscarriage rate is a good one.

My basic view is that human moral value develops throughout pregnancy (and indeed continues to develop after birth). I don't think there's a simple binary switch from "no value" to "value". I'd treat it more like a gradual ramp-up beginning with brain development during pregnancy.

I'm curious how you feel about culturing of naive embryonic stem cells. It's possible to culture cells from a very early embryo and maintain their epigenomic state. One might then perform some editing on each, then grow each into a colony of perhaps 100 cells before destructively sequencing some of the stem cells and then performing subsequent edits on the stem cells in which the edits successfully took place.

If done correctly, the process would result in an embryo with much better prospects for a healthy and happy life. One embryo goes in and one embryo goes out. But the sequencing in the interim steps would require the destruction of naive embryonic stem cells.

Would you consider such a process morally permissible?

[-]Noosphere893y10

Question, can we ever get somatic gene editing that is as good or better than having to edit the gametes?

[-]GeneSmith3y11

No. You might get something that works, but it will never be as good as intervening at the gamete or embryo stage simply because half the genes you’d want to change are only active during development (ie before adulthood).

I’m I get the same effects you would need really crazily advanced biotech that could somehow edit the genes and replay the development stage of life without interfering with the current functioning of the organism. I don’t see anything like that being developed in the next 50 years without some kind of strong intelligence (whether artificial or biological in nature).

[-]Noosphere893y10

Then we have the reason why so little effort is going to genetic engineering on LW: The viable options here are way too slow, and the fast options are very weak, relative to how fast the world is changing.

Thus, it's worth waiting so that we can reconsider our options later.

[-]GeneSmith3y10

Yes, I completely understand why there is MORE interest in alignment, engineered pandemics and nuclear war. I think that is correct. But I don’t think the balance is quite right. Genetic engineering could be a meta-level solution to all those problems given enough time.

That seems like something worth working on for a larger chunk of people than those currently involved.

[-]the gears to ascension3y20

We don't have enough time, and by the time the relevant amount of time has passed, ai will have blasted genetic augmentation into a new era. Existential AI alignment is necessary to do any significant amount of genetic modding.

[-]Metacelsus3y10

I disagree. I could do a moderate but substantial amount of human genetic engineering right now, if I had more resources and if the police wouldn't arrest me. AI is not required for this.

[-]Raemon3y20

Can we do genetic engineering that is immediately useful, as opposed to "at a minimum wait ~ 10 years for an infant to become Ender Wiggin?"

[-]Noosphere893y20

Given the responses to a similar question, I think the answer is no, that is I would expect basically no genetic editing/IVF breakthroughs to transfer to the somatic cells.

[-]Metacelsus3y10

No, probably not. But I think it's still a good idea that most people are ignoring.

[-]Raemon3y20

I think that's a fine position, but doesn't seem to be addressing' gears' point. ("We don't know for sure how much time we have and this seems like a thing that's worth working on" seems like a fine answer though)

[-]Eli Tyre11mo95Review for 2023 Review

I think about this post several times a year when evaluating plans.

(Or actually, I think about a nearby concept that Nate voiced in person to me, about doing things that you actually believe in, in your heart. But this is the public handle for that.)

[-]rpglover643y86

English doesn’t have great words for me to describe what I mean here, but it’s something like: your visualization machinery says that it sees no obstacle to success, such that you anticipate either success or getting a very concrete lesson.

One piece of advice/strategy I've received that's in this vein is "maximize return on failure". So prefer to fail in ways that you learn a lot, and to fail quickly, cheaply, and conclusively, and produce positive externalities from failure. This is not so much a good search strategy but a good guiding principle and selection heuristic.

[-]Screwtape11mo70Review for 2023 Review

I'm glad I read this, and it's been a repeating line in my head when I've tried to make long term plans. I'd like this to be included in the Best Of LessWrong posts.

Even if you are doing something fairly standard and uncomplicated, there are likely multiple parts to what you do. A software engineer can look at a bunch of tickets, some code reviews, the gap where good documentation can be, and the deployment pipeline before deciding that the team is dropping the ball on documentation. A schoolteacher might look at the regular classes, the extracurricular programs, the recess policy, and decide that the school is dropping the ball by focusing too much on sports and not having any theatre program.

If you're doing something weird and unusual, this gets more useful. While I haven't done it, my understanding is that finding a place where everyone else is dropping the ball is a good way to find startup ideas. If you're a new non-profit trying to help people, looking for the underserved parts of your community seems like a good way to make an impact.

I don't know how idiosyncratic my situation is, and suspect there should be a contrasting essay. Sometimes the reason it looks like everyone is dropping the ball is because there's some hidden hard part. Inadequate Equilibria might be the long version of that essay, but I'd like a short one to put next to Focus On The Places Where You Feel Shocked Everyone's Dropping The Ball.

[-]tricky_labyrinth3y43

https://twitter.com/carmenleelau/status/1593354133146402816 is another recent formulation of ~the same idea.

[-]Nathan Helm-Burger3y55

Meta note: I strongly dislike Twitter and wish that people would just copy the raw text they want to share instead of a link.

[-]Richard Korzekwa3y30

Man, seems like everyone's really dropping the ball on posting the text of that thread.

Make stuff only you can make. Stuff that makes you sigh in resignation after waiting for someone else to make happen so you can enjoy it, and realizing that’s never going to happen so you have to get off the couch and do it yourself

Do it the entire time with some exasperation. It’ll be great. Happy is out. “I’m so irritated this isn’t done already, we deserve so much better as a species” with a constipated look on your face is in. Hayao Miyazaki “I’m so done with this shit” style

(There's an image of an exasperated looking Miyazaki)

You think Miyazaki wants to be doing this? The man drinks Espresso and makes ramen in the studio. Nah he hates this more than any of his employees, he just can’t imagine doing anything else with his life. It’s heavy carrying the entire animation industry on his shoulders

I meant Nespresso sry I don’t drink coffee

[-]rchplg3y*40

Relatedly on "obviously dropping the ball": has Eliezer tried harder prescription stimulants? With his P(doom) & timelines, there's relatively little downside to this done in reasonable quantities I think. They can be prescribed. Seems extremely likely to help with fatigue

From what I've read, the main warning would be to get harder blocks on whatever sidetracks eliezer (e.g. use friends to limit access, have a child lock given to a trusted person, etc)

Seems like this hasn't been tried much beyond a basic level, and I'm really curious why not given high Eliezer/Nate P(doom)s. There are several famously productive researchers who did this

[-]gilch3y46

I can think of a bunch of medical tests that you haven't run, are you an idiot or something?

Not that I've paid much attention, but has anyone checked him for iron overload? Insulin resistance? Thyroid?

[-]Yoav Ravid11mo30Review for 2023 Review

I think I was already doing what this post suggested before it was published, but the distilled phrase was good and I thought about it quite often since.

Where it meets me personally - I'm shocked at how Liberals are dropping the ball on Liberalism. It is incredibly important, and yet Liberals don't properly understand it and don't know how to defend it, at a time where it's under an onslaught by anti-liberals. To be slightly glib, I basically believe that everyone is wrong about Liberalism. I don't know of anyone who shares my understanding of it. So I'm trying to finally pick up the ball by writing a book about how to fix Liberalism (and actually, a year ago today is exactly when I began writing it).

[-]Max D'Ambrosio2y30

I’ve been wondering: Regarding the kinds of policy changes by governments and legitimate businesses intended to stall AI development… won’t such policies simply ensure that illegitimate actors eventually develop AI outside the law, without states and more well-intended actors being able to counter them? Or is there no such thing as a sufficiently large AI data centre that can be kept secret? I would imagine, at the very least, that some states would have the resources to keep such secrets effectively, though maybe not for long. By turning to secrecy, doesn’t the AI race shift from being one of “who’s the fastest at achieving foom” to “who’s the best at keeping secrets for longer?” If so, does that shift actually advantage people who are better at achieving friendly AI? It seems obvious that the endgame is to race to understanding friendliness generally before actualizing strong AI, so that someone can achieve the former before the latter. Perhaps better minds than me have concluded that secrecy will actually lend more momentum to friendliness while effectively delaying progress towards AI foom, but given that progress still seems possible in secrecy, I don’t understand how. Hopefully I’m not jeopardizing such a use of secrecy just by saying this, but if that were possible then lots of other people are capable of jeopardizing it anyway, and it’s better to draw attention to it.

[-]Matt Vincent3y35

I find this post very encouraging, but I can't shake a particular concern about the approach that it recommends.

From extrapolating past experiences, it seems like every time I try (or even succeed) at something ambitious, I soon find that somebody else already did that thing, or proved why that thing can't work, and they did it better than I would have unless I put in ten times as much effort as I did. In other words, I struggle to know what's already been done.

I notice that this happens a lot less often with mathematics than it used to. Perhaps part of it is that I became less ambitious, but I also think that part of it was formal education. (I finished a BS in math a few years ago.) I do think one of the major benefits of formal education is that it gives the student a map of the domain they're interested in, so that they can find their way to the boundary with minimal wasted effort.

[-]Review Bot2y*10

The LessWrong Review runs every year to select the posts that have most stood the test of time. This post is not yet eligible for review, but will be at the end of 2024. The top fifty or so posts are featured prominently on the site throughout the year.

Hopefully, the review is better than karma at judging enduring value. If we have accurate prediction markets on the review results, maybe we can have better incentives on LessWrong today. Will this post make the top fifty?

[-]Chris Lakin2y10

This is what I'm trying to do with my current research:

«Boundaries/Membranes» and AI safety compilation
«Boundaries» for formalizing a bare-bones morality
more posts soon
(also maybe this comment?)

[-]Thoth Hermes3y10

I feel like a lot of this advice is telling me to do what I was going to do anyway. Which is why I wonder if you're actually telling me not to do what I was going to do anyway, because it makes sense for advice posts to normally be about telling people to do something other than what they were normally going to do:

I don't see ways to really help in a sizable way right now. I'm keeping my eyes open, and I'm churning through a giant backlog of things that might help a nonzero amount—but I think it's important not to confuse this with taking meaningful bites out of a core problem the world is facing, and I won’t pretend to be doing the latter when I don’t see how to.

Since most of this post is interspersed with things like this, which say that the problem seems to you to be intractable, and that therefore most avenues of research will end up being dead-ends, it seems like you're also advising people not to worry about things too much, just let the experts handle it, who also say that the problem is intractable and too hard.

If you're saying "look at where everyone else has dropped the ball", and I notice that everyone else has dropped the ball, on purpose, because they think the problem is intractable, then, I have to disagree strongly with that, if I am to follow your advice.

It's just odd that you would say this and have it very openly apply to everything you're saying as well.

[-]catubc3y1-5

Thank you so much for this effectiveness focused post. I thought I would add another perspective, namely "against the lone wolf" approach, i.e. that AI-safety will come down to one person, or a few persons, or an elite group of engineers somewhere. I agree for now there are some individuals who are doing more conceptual AI-framing than others, but in my view I am "shocked that everyone's dropping the ball" by putting up walls and saying that general public is not helpful. Yes, they might not be helpful now, but we need to work on this!... Maybe someone with the right skill will come along :)

I also view academia as almost hopeless (it's where I work). But it feels that if a few of us can get some stable jobs/positions/funding - we can start being politically active within academia and the return on investment there could be tremendous.

[-]Aorou3y15

Maybe I don't know what I'm talking about and obviously we've tried this already.

I've heard Eliezer mention that the ability to understand AI risk is linked to Security Mindset.
Security Mindset is basically: you can think like a hacker, of exploits, how to abuse rules etc. So you can defend against hacks & exploits. You don't stop at basic "looks safe to me!"

There are a lot of examples of this Security/Hacker Mindset in HPMOR. When Harry learns of rates between magical coins vs his known prices for gold, silver, etc, he instantly thinks of a scheme to trade between the magical and the muggle world to make infinite money.

Eliezer also said that Security Mindset is something you either got or not.
I remember thinking: that can't be true!

Are we bottlenecking AI alignment on not having enough people with Eliezer-level Security Mindset, and saying "Oh well, it can't be taught!"?!
(That's where I've had the "people are dropping the ball" feeling. But maybe I just don't know enough.)

Two things seem obvious to me:
- Couldn't one devise a Security Mindset test, and get the high scorers to work on alignment?
(So even if we can't teach it, we get more people who have it)(I assume it was a similar process to find Superforecasters).
- Have we already tried really hard to teach Security Mindset, so that we're sure it can't be taught?
Presumably, Eliezer did try, and concluded it wasn't teachable?

I won't be the one doing this, since I'm unclear on whether I'm Security gifted myself (I think a little, and I think more than I used to, but I'm too low g to play high level games).

[-]Thane Ruthenis3y20

Security Mindset is basically: you can think like a hacker, of exploits, how to abuse rules etc. So you can defend against hacks & exploits. You don't stop at basic "looks safe to me!"

Mm, that's not exactly how I'd summarize it. That seems more like ordinary paranoia:

Lots of programmers have the ability to imagine adversaries trying to threaten them. They imagine how likely it is that the adversaries are able to attack them a particular way, and then they try to block off the adversaries from threatening that way. Imagining attacks, including weird or clever attacks, and parrying them with measures you imagine will stop the attack; that is ordinary paranoia.

My understanding is that Security Mindset-style thinking doesn't actually rest on your ability to invent a workable plan of attack. Instead, it's more like imagining that there exists a method for unstoppably breaking some (randomly-chosen) element of your security, and then figuring out how to make your system secure despite that. Or... that it's something like the opposite of fence-post security, where you're trying to make sure that for your system to be broken, several conditionally independent things need to go wrong or be wrong.

[-]Aorou3y12

Ok, thanks for the correction! My definition was wrong but the argument still stands that it should be teachable, or at least testable.

[+][comment deleted]3y10

LESSWRONG
LW

LESSWRONG
LW

469

Focus on the places where you feel shocked everyone's dropping the ball

469

469