Systematic runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format (BioBlue)
By Roland Pihlakas, Sruthi Susan Kuriakose, Shruti Datta Gupta This research is now available as an Arxiv preprint at https://arxiv.org/abs/2509.02655 . The paper includes tables with select snippets of failure mode examples. Summary and Key Takeaways Relatively many past AI safety discussions have centered around the dangers of unbounded utility...