After 500 Tests, My AI System Prefers Broken “Celebrity” Models Over Ones That Actually Work — LessWrong