RFC: Philosophical Conservatism in AI Alignment Research — LessWrong