I read Claude’s Constitution recently. I thought it was very good! This was my favourite quote:

> Our own understanding of ethics is limited, and we ourselves often fall short of our own ideals. We don’t want to force Claude’s ethics to fit our own flaws and mistakes, especially as...

Some colleagues and I just released our paper, “Policy Options for Preserving Chain of Thought Monitorability.” It is intended for a policy audience, so I will give a LW-specific gloss on it here. As Korbak, Balesni et al. recently argued, there are several reasons to expect CoT will become...
James Fearon’s classic[1] 1995 paper “Rationalist Explanations for War” argues that there are two main reasons rational states fight: private information about their own capabilities and resolve, with the incentive to misrepresent this, and commitment problems when trying to reach a negotiated agreement.[2] I claim that both of these, especially...
Original by Leopold Aschenbrenner; this summary is not commissioned or endorsed by him.

Short Summary

* Extrapolating existing trends in compute, spending, algorithmic progress, and energy needs implies AGI (remote jobs being completely automatable) by ~2027.
* AGI will greatly accelerate AI research itself, leading to vastly superhuman intelligences being...