Can Current LLMs be Trusted To Produce Paperclips Safely? — LessWrong