AI Task Length Horizons in Offensive Cybersecurity
This is a rough research note where the primary objective was my own learning. I am sharing it because I’d love feedback and I thought the results were interesting. Introduction A recent METR paper [1] showed that the length of software engineering tasks that LLMs could successfully complete appeared to...
Jul 2, 202573