Three Weeks In: What GPT-5 Still Gets Wrong — LessWrong