AI Safety Needs Great Engineers

[-]Multicore4y350

That 80k guide seems aimed at people who don't yet have any software engineering experience. I'm curious what you think the path is from "Average software engineer with 5+ years experience" to the kind of engineer you're looking for, since that's the point I'm starting from.

[-]Insub4y50

I'm in a similar place, and had the exact same thought when I looked at the 80k guide.

[-]Alex_Altair4y30

Andy may have meant to link to this article instead, which also has this podcast companion.

[-]Ben Pace4y330

Can someone briefly describe what empirical AI safety work Cohere is doing? I hadn't heard of them until this post.

[-]jjbalisan4y80

This comment reflects those of me and not my employer (Cohere).

We are currently massively growing our safety team on both engineering and product sides and one of our major bottlenecks is the above technical talent. We are currently heavily focused on making our models in production as safe as possible during training and during production. One of the biggest projects to this extent is the safety harness project which should have more information coming out soon. https://docs.cohere.ai/safety-harness/. We are heavily focused on worse-case scenario's especially as anyone can use our models relatively quickly. Here are 2 of the papers the safety team has worked on in the past. We have much more in the timeline.

[-]habryka4y20

I am also interested in this.

[-]ChristianKl4y210

Given the discussion around OpenAI plausible increasing overall AI risk, why should we believe that the work will reduce in a net risk reduction?

[+]Tom Lieberum4y-60

[-]tailcalled4y150

I'm an engineer, but the positions seem to tend to require living in specific locations, so I cannot apply.

[-]banmin4y130

I'm going to take this blog post as the explanation for the rejection I got from Anthropic five mins ago for the researcher position.

[-]Randomized, Controlled4y*580

As a self-taught programmer who's dabbled in ML, but has only done front and back-end web work: it's been pretty frustrating trying to find a way to work on ML or AI safety the last four years. I think some of the very recent developments like RR's ML boot camp are promising on this front, but I'm pretty surprised that Redwood was surprised they would get 500 applications. We've been telling people explicitly "this is an emergency" for years now, but tacitly "but you can't do anything about it unless you're a 99th percentile programmer and also positioned in the right place at the right time to apply and live in the bay area." Or, that's how it's felt to me.

[-]Richard_Ngo4y270

I wonder if some subset of the people who weren't accepted to the Redwood thing could organise a remote self-taught version. They note that "the curriculum emphasises collaborative problem solving and pair programming", so I think that the supervision Redwood provides would be helpful but not crucial. Probably the biggest bottleneck here would be someone stepping up to organise it (assuming Redwood would be happy to share their curriculum for this version).

[-]Jozdien4y90

I agree that this would be helpful if Redwood shares their curriculum. If someone is willing to take up lead organizing, I'd be happy to help out as much as I can (and I suspect this would be true for a non-insignificant number of people who applied to the thing). I'd do it myself, but I expect not to have the free time to commit to that and do it right in the next few months.

[-]Randomized, Controlled4y*50

[-]Walter Laurito4y*30

Same here (Not sure yet if I get accepted to AISC though). But I would be happy with helping or co-organizing something like Richard_Ngo suggested. (Although I've never organized something like that before) Maybe a virtual version in (Continental?) Europe, if there are enough people

[-]Walter Laurito4y50

Maybe, we could also send out an invitation to all the people who got rejected to join a Slack channel. (I could set that up, if necessary. Since I don't have the emails, though, someone would need to send the invitations). There, based on the curriculum, people could form self-study groups on their own with others close-by (or remotely) and talk about difficulties, bugs, etc. Maybe, even the people who got not rejected could join the slack and help to answer questions (if they like and have time, of course)?

[-]Walter Laurito4y*20

I've created a discord for the people interested in organizing / collaborating / self-study: https://discord.gg/Ckj4BKUChr People could start with the brief curriculum published in this document, until a full curriculum might be available :)

[-]Alex_Altair4y20

FYI That invite link has now expired!

[-]Walter Laurito4y10

Should work again :)

[+][comment deleted]4y10

[-]RobertM4y20

Redwood was surprised they would get 500 applications

I'm curious what this is referring to - was there public communication to that effect?

[-]hath4y30

From Redwood's application update (rejecting those who didn't make the cut):

We had many more applicants than I expected, and even though we expanded the program to have space for 30 participants instead of 20, we aren't able to accept that many of our 500 applicants, including many applicants who seem very promising and competent. I am sad that we don't have space for more people.

[-]RobertM4y10

Oh, I misread, I thought they would have been surprised to get 500 applicants for an open job position.

[+][comment deleted]4y40

[-]timbuckley4y10

Sorry, but what is RR?

[-]UHMWPE-UwU4y80

Redwood research

[-]lc4y120

This might be a false alarm, but "tell me your thoughts on AI and the future" is an extremely counterproductive interview question. You're presenting it as a litmus test for engineers to apply to themselves, and that's fine as far as it goes. But if it's typical or analogous to some other test(s) you use to actually judge incoming hires, it doesn't bode well. By asking it you are, on some level, filtering for public speaking aptitude and ability to sound impressively thoughtful, two things which probably have little or nothing to do with the work you do.

I realize that might seem like a pedantic point, and you might be asking yourself: "how many smart people who want to work here can't drop impressive speeches about X? We'll just refrain from hiring that edge case population." The reason it's relevant that your interview "could" be selecting for the wrong thing is because recruitment is an adversarial process, not a random process. You are fighting against other technology companies who have better and more scientific hiring pipelines, and more time and money to build them. Those companies often diligently reject the people who can speak well but not code. The result is the candidates you're looking at will almost always seem curiously good at answering these questions, and under-performing on actual workplace tasks. Even if this were happening I'm sure you'd believe everything is fine, because your VC money lets you give enormous salaries that obscure the problem and because AI safety companies get a glut of incoming attention from sites like Lesswrong. All the more reason not to waste those things.

Worse, you have now published that question, so you will now get a large amount of people who coach their answers and practice them in front of a mirror in preparation for the interview. "Oh well, most people are honest, it'll only be like 1/2/5/10/25% of our applicants that..." - again, not necessarily true of your passing applicants, and definitely not necessarily true of applicants rejected or less-well-compensated by your competitors.

[-]Andy Jones4y120

You're presenting it as a litmus test for engineers to apply to themselves, and that's fine as far as it goes

I can reassure you that it is in fact a litmus test for engineers to apply to themselves, and that's as far as it goes.

While part of me is keen to discuss our interview design further, I'm afraid you've done a great job of laying out some of the reasons not to!

[-]lc4y20

Glad to hear that :)

[-]eugene_black4y60

Any chance that Anthropic might expand the team to remote international collaboration in the future? I would apply but I am from Ukraine. Many great software companies successfully switched to remote work and covid crysis boosted this practice a lot. So just wondering.

[-]Andy Jones4y20

It's not impossible, but it appears unlikely for the foreseeable future. We do sponsor visas, but if that doesn't suit then I'd take a look at Cohere.ai, as they're one org I know of with a safety team who are fully-onboard with remote.

[-]Brendan Long4y70

Can you add something about whether Anthropic does or doesn't allow remote work to the job listings? I'm infering from the lack of any mention of remote work that in-person is strictly required but I'm not sure if that's what you're intending.

[-]Andy Jones4y40

In-person is required. We'll add something to the job descriptions in the new year, thanks for the heads up!

[-]cata4y40

I'm an experienced engineer and EA excited to work on these things, but I am only available part time remote because I am raising my kid, so I'm not applying right now.

If I knew of useful FOSS work that was directly applicable I might be spending time doing it.

[-]Zac Hatfield-Dodds4y50

EleutherAI has a whole project board dedicated to open-source ML, both replicating published papers and doing new research on safety and interpretability.

(opinions my own, etc)

[-]cata4y20

Thanks, I was aware of Eleuther but I wasn't previously aware how much they cared about alignment-related progress.

[-]Stephen McAleese4y20

I have a background in software engineering but I would like to get into AI safety research.

A problem I have had is that I didn't know whether I should pursue the research scientist or research engineer paths which seem to be quite different. Becoming a research engineer involves lots of work with ML code whereas to become a research engineer you usually have to get a PhD and do some research.

I read in an older document that there was a bottleneck in talent for research scientists and engineers. However, this seems to have changed according to your post and now there seems to be a greater shortage of research engineers than research scientists.

As a result, I am now leaning more in favor of becoming a research engineer. Another advantage is that the research engineer path seems to have a lower barrier to entry.

[-]Lone Pine4y10

Would you be interested in a great engineer who is a skeptic about the alignment problem?

[-]Andy Jones4y20

Yes! Though that engineer might not be interested in us.

[-]Lone Pine4y10

Well the engineer might be trying to break into AGI research from a traditional software (web and gamedev) background, haha.

LESSWRONG
LW

LESSWRONG
LW

91

AI Safety Needs Great Engineers

91

Ω 27

91

Ω 27

Why engineers?

What kind of engineers?

How does engineering compare to research?

Should I apply?

Should I skill up?

Take homes