Why I'm Not (Yet) A Full-Time Technical Alignment Researcher

[-]Thomas Larsen3y213

I'm a guest fund manager for the LTFF, and wanted to say that my impression is that the LTFF is often pretty excited about giving people ~6 month grants to try out alignment research at 70% of their industry counterfactual pay (the reason for the 70% is basically to prevent grift). Then, the LTFF can give continued support if they seem to be doing well. If getting this funding would make you excited to switch into alignment research, I'd encourage you to apply.

I also think that there's a lot of impactful stuff to do for AI existential safety that isn't alignment research! For example, I'm quite into people doing strategy, policy outreach to relevant people in government, actually writing policy, capability evaluations, and leveraged community building like CBAI.

[-]Adele Lopez3y40

If the initial grant goes well, do you give funding at the market price for their labor?

[-]Thomas Larsen3y71

Sometimes, but the norm is to do 70%. This is mostly done on a case by case basis, but salient factors to me include:

Does the person need the money? (what cost of living place are they living in, do they have a family, etc)
What is the industry counterfactual? If someone would make 300k, we likely wouldn't pay them 70%, while if their counterfactual was 50k, it feels more reasonable to pay them 100% (or even more).
How good is the research?

[-]Nicholas Kross3y20

Quite informative, thanks!

[-]Nicholas Kross3y20

Ah, thanks! LTFF was definitely on my list of things to apply for, I just wasn't sure if that upskilling/trial period was still "a thing" these days. Very glad that it is!

[-]RGRGRG3y10

Thanks for posting this - not OP, but I will likely apply come early June. If anyone else is associated with other grant opportunities, would love to hear about those as well.

[-]Quinn3y132

I would encourage a taboo on "independently wealthy", I think it's vague and obscurantist, and doesn't actually capture real life runway considerations. "How long can I sustain which burn rate, and which burn rate works with my lifestyle?" is the actual question!

[-]Nicholas Kross3y40

Good point, yeah. That very unclarity, itself, contributed to me wasting so much time on that route.

[-]Linda Linsefors3y62

I really wish you get the funding you need to at least take some extended leave from your day-job, to see what you can do when you get to sleep at what ever time you want, and also devote your mind to AI Safety full time.

Like others have said, this sounds like something you might potentially get a grant for.

However, you should know that unfortunately money is tight for most project since the FTX crash. I just got another reminder of this, seeing this post.

I still think you should apply! It's worth a try. But don't take it too hard if you get rejected.

[-]Nicholas Kross3y20

Yep, agreed. I'm just glad that (allegedly?) the LTFF is still doing specifically the upskilling-grant thing.

Bad-case, I ~~get to~~ have to work on harebrained side-business ideas as well, while jobless-and-not-yet-funded (or even while-funded-but-not-for-a-long-runway, possibly?).

[-]Linda Linsefors3y40

I would expect LTFF to still to upskilling-grants, but I also expect that the bar is higher than it used to be. But this is just me making guesses.

[-]plex2y54

If you're able to spend time in the UK, the EA Hotel offers free food and accommodation for up to two years in low-cost shared living. Relatedly, there should really be one of these in the US and mainland Europe.

[-]Stephen McAleese3y*50

For context, I have a very similar background to you - I'm a software engineer with a computer science degree interested in working on AI alignment.

LTFF granted about $10 million last year. Even if all that money were spent on independent AI alignment researchers, if each researcher costs $100k per year, then there would only be enough money to fund about 100 researchers in the world per year so I don't see LTFF as a scalable solution.

Unlike software engineering, AI alignment research tends to be neglected and underfunded because it's not an activity that can easily be made profitable. That's one reason why there are far more software engineers than AI alignment researchers.

Work that is unprofitable but beneficial such as basic science research has traditionally been done by university researchers who, to the best of my knowledge, are mainly funded by government grants.

I have also considered becoming independently wealthy to work on AI alignment in the past but that strategy seems too slow if AGI will be created relatively soon.

So my plan is to apply for jobs at organizations like Redwood Research or apply for funding from LTFF and if those plans fail, I will consider getting a PhD and getting funding from the government instead which seems more scalable.

[-]JNS3y40

I don't think I have much actionable advice.

Personally I am sort of in the same boat, except I am in a situation where the entire 6-12 month grants thing is way to insecure (financially).

Being married with two kids, I have too many obligations to venture far into "how to pay rent this month?" territory. Also its antithetical to the kind of person I am in general.

Anyway, if you have few obligations, keep it that way and if possible get rid of some, and then throw yourself at it.

[-]RGRGRG3y10

Just wanted to say that I have similar questions about how to best (try to) get funding for mechanistic interpretability research. Might send a bunch of apps out come early June; but like OP, I don't have any technical results in alignment (though like OP, I like to think I have a solid (yet different) background).

[-]Christopher King3y1-11

For reasons I may/not write about in the near future, many ideas about alignment (especially anything that could be done with today's systems) could very well accelerate capabilities work.

If it's too dangerous to publish, it's not effective to research. From Some background for reasoning about dual-use alignment research

If research would be bad for other people to know about, you should mainly just not do it.

[-]Nicholas Kross3y33

Counterpoint: at least one kind of research, mechanistic interpretability, could very well be both dangerous by helping capabilities and also essential for alignment. My current intuition is that the same could be said of other research avenues.

Yes, there are plenty of dangerous ideas that aren't so coupled with alignment, but they're not the frustrating edge-case I'm writing about. (And, of course, I'm not doing or publishing that type of research.)

[-]Christopher King3y32

Right, and that article makes the case that in those cases you should publish. The reasoning is that the value of unpublished research decays rapidly, so if it could help alignment, publish before it loses its value.

[-]Nicholas Kross3y30

Good catch, that certainly motivates me even more to finish my current writings!

[-]Christopher King3y32

Yeah exactly! Not telling anyone until the end just means you missed the chance to push society towards alignment and build on your work. Don't wait!

[-]junk heap homotopy3y10

I don't know. It seems to me that we have to make the graph of progress in alignment vs capabilities meet somewhere and part of that would probably involve really thinking about which parts of which bottlenecks are really blockers vs just epiphenomena that tag along but can be optimised away. For instance, in your statement:

If research would be bad for other people to know about, you should mainly just not do it

Then maybe doing research but not having the wrong people know about it is the right intervention, rather than just straight-up not doing it at all?

My legible qualifications so far (as of 24 May 2023):

participated in an AGI Safety Fundamentals session. The meetings only, not the project... and I neglected much of the readings because I was busy that summer with...

some high-level research/writing for Nonlinear.

That one thingy I wrote for EleutherAI's lm-evaluation-harness (and which I think had to be rewritten by Leo Gao?)

CS degree

math minor

SAT scores that IIRC corresponded to 125ish IQ (using that one dodgy numerical-table online).

some commercial software-engineering/testing experience, including my current full-time job.

Limited experience with TensorFlow and PyTorch. Like, I've built a tiny neural net in Python, but I'd be hard-pressed to do it from memory. (I did well in a class assignment of that Kaggle "Titanic" thing, but most of that was data-cleaning, organization, and visuals.)

I'm writing posts to enter into the Open Philanthropy AI Worldviews Contest.

I'm about to start in the online section of John Wentworth's "stream" of this summer's SERI MATS!

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

41

Why I'm Not (Yet) A Full-Time Technical Alignment Researcher

41

41

My legible qualifications so far (as of 24 May 2023):

Conclusion