Comments

"Presently beyond the state of the art... I think that would be pretty cool"

Point taken, but that doesn't make it sufficient for avoiding society-level catastrophes.

That's exactly what I'm worried about: that people will equate deploying a model via API with releasing open weights, when the latter carries significantly more risk because the model can be modified in the future and can never be withdrawn.

Frontier Red Team, Alignment Science, Finetuning, and Alignment Stress Testing


What's the difference between a frontier red team and alignment stress-testing? Is the red team focused on the current models you're releasing and the alignment stress testing focused on the future?

I know that Anthropic doesn't really open-source advanced AI, but it might be useful to discuss this in Anthropic's RSP anyway because one way I see things going badly is people copying Anthropic's RSP's and directly applying it to open-source projects without accounting for the additional risks this entails.

Great work! It's easy to overlook the importance of this kind of community infrastructure, but I suspect that it makes a significant difference.

The biggest danger with AIs slightly smarter than the average human is that they will be weaponised, so they'd only be safe in a very narrow sense.

I should also note that if we built an AI that was slightly smarter than the average human all-round, it would be genius-level, or at least exceptional, in several narrow capabilities, so it would be a lot less safe than you might think.

I believe this is likely a smaller model rather than a bigger model so I wouldn't take this as evidence that gains from scaling have plateaued.

Answer by Chris_Leong

Developing skills related to AI puts you in a better position to make AI go well. At least for me, this outweighs the other concerns that you've mentioned.

Note: This doesn't mean that you should take a job that advances fundamental AI capabilities. This would probably be net-negative as things are already moving far too fast for society to adapt. But it sounds like you're more considering jobs related to AI applications, so I'd say to go for it.

You mention that society may do too little of the safer types of RL. Can you clarify what you mean by this?

This fails to account for one very important psychological fact: the population of startup founders who get a company off the ground is heavily biased toward people who strongly believe in their ability to succeed. So it will take quite a while for "it'll be hard to make money" to filter through and slow down training. In the meantime, it will be accelerationist, pushing companies to stay ahead of the competition.
