This post investigates emerging biorisk capabilities and when AI systems may cross important risk thresholds.
It lays the groundwork for a broader risk forecast (currently in progress) on the annual likelihood of AI-enabled biological catastrophes.
For now, the analysis focuses on general-purpose AIs such as large language models (LLMs). A later post may examine Biological Design Tools—specialized AI systems that represent another major driver of AI-enabled biological threats.
This is my median timeline for significant biorisk capability thresholds:
Basic support for human experts: already passed (months ago)
Basic support for human novices: late 2025
Advanced support for human experts: late 2026
Advanced support for human novices: early 2028
The most thorough published evaluations of AI biorisk so far come from Anthropic and OpenAI, which report their findings in system/model cards (notably for Claude 4 and GPT-5).
Both companies assess whether their AIs have crossed capability thresholds that indicate significant risks. The frameworks differ slightly:
High capability / CBRN-3: the AI can substantially assist novices (e.g. people with an undergraduate STEM background) in creating or obtaining dangerous weapons.
Critical capability / CBRN-4: the AI can substantially assist experts in developing novel threat vectors (e.g. pandemic-capable agents).
Anthropic also has general capability thresholds called AI Safety Levels (ASL). When CBRN-3 and 4 have been reached, the AI is at ASL-3 and ASL-4 respectively. So, when you see a phrase like “ASL-3 red teaming for biological capabilities”, it means the system is being tested at the CBRN-3 threshold.
Summarizing the companies’ evaluation frameworks:
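As a compact way to express how the two frameworks line up (my own paraphrase of the definitions above, not official wording from either company), here is a sketch:

```python
# A rough mapping between Anthropic's and OpenAI's biorisk thresholds, as described above.
# The phrasing is my own paraphrase, not official wording from either company.

THRESHOLD_MAP = {
    "substantial uplift for novices (known threats)": {
        "anthropic": "CBRN-3 (triggers ASL-3 safeguards)",
        "openai": "High biological and chemical capability",
    },
    "substantial uplift for experts (novel threats)": {
        "anthropic": "CBRN-4 (triggers ASL-4 safeguards)",
        "openai": "Critical biological and chemical capability",
    },
}

for capability, labels in THRESHOLD_MAP.items():
    print(f"{capability}: Anthropic {labels['anthropic']} / OpenAI {labels['openai']}")
```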
Both companies focus their evaluations on biological weapons, given their disproportionate risk relative to other WMDs.
Biorisk capabilities have been evaluated using benchmarks, red teaming, and simulated uplift trials, together with other tests developed by the AI companies or external evaluators. Here is a short summary of what I think are the most important results from the evaluations of frontier AIs:
Benchmarks: AIs outperform 94% of expert virologists in a laboratory protocol troubleshooting test on questions within the experts’ sub-areas of expertise
A central benchmark is the Virology Capabilities Test (VCT), with multiple-choice questions measuring how well AIs can troubleshoot complex virology laboratory protocols. The currently best-performing AI is OpenAI's o3, which achieves 43.8% accuracy and "even outperforms 94% of expert virologists when compared directly on question subsets specifically tailored to the experts' specialties." By comparison, expert virologists averaged 22.1% accuracy within their own domains.
Other benchmarks include parts of FutureHouse’s Language Agent Biology Benchmark (LAB-Bench), and the Weapons of Mass Destruction Proxy (WMDP) benchmark. However, Epoch AI argues that these are substantially less informative than VCT about real-world biorisk capabilities, as VCT directly targets relevant tacit knowledge for animal pathogen creation.
Uplift trials: Frontier AIs provide meaningful assistance to novices, but fall below thresholds set to indicate significant risk (though evaluation transparency is lacking)
Anthropic conducted an uplift trial for Claude Opus 4, examining how well it could assist a hypothetical adversary in bioweapons acquisition and planning.
However, as noted by Epoch AI, the results are confusing: it is unclear where the uplift capability thresholds come from; Anthropic set the risk threshold at ≥ 5× uplift, but the control group score of 25% indicates that the upper score limit is not 100% but something higher (since 25% × 5 is 125%); and the level of skill and experience of the human participants was not explicitly stated, though they probably had basic STEM backgrounds without extensive expertise, given that the uplift trial was conducted to test for ASL-3 capabilities.
Claude Opus 4 achieved 2.53× uplift—low enough to keep the “risk at acceptable levels” according to Anthropic, but high enough that they “are unable to rule out ASL-3”.
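To make the arithmetic behind this confusion concrete, here is a minimal sketch that assumes uplift is defined as the treatment group's score divided by the control group's score (the system card does not spell the definition out; the scores and thresholds are the reported figures):

```python
# Minimal sketch of the uplift arithmetic, assuming uplift = treatment score / control score.
# The definition is an assumption on my part; the numbers are those reported by Anthropic.

control_score = 0.25          # control group score from the Claude Opus 4 system card
risk_threshold_uplift = 5.0   # Anthropic's stated risk threshold (>= 5x uplift)
observed_uplift = 2.53        # uplift reported for Claude Opus 4

# If scores were capped at 100%, a 5x uplift over a 25% baseline would be impossible:
print(f"Treatment score implied at the 5x threshold: {control_score * risk_threshold_uplift:.0%}")  # 125%

# The observed uplift corresponds to a treatment score of roughly:
print(f"Treatment score implied by 2.53x uplift: {control_score * observed_uplift:.0%}")  # ~63%
```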
In contrast, in an uplift trial for Meta’s Llama 3 (see this report), no significant uplift was found for “low-skill actors” (no formal training) or “moderate-skill actors” (some formal training and practical experience) in constructing operational plans for either a biological or chemical attack.
Red teaming: AIs are unable to significantly assist experts in novel bioweapons development, and while they can provide some assistance to novices in acquiring bioweapons, it is unclear whether this uplift is significant
Anthropic had separate red teaming evaluations for ASL-3 and ASL-4. Claude Opus 4 did not appear capable enough to "uplift experts to a substantially concerning degree", according to the red "teaming" (performed by a single expert) for ASL-4 capabilities, while it "substantially increased risk in certain parts of the bioweapons acquisition pathway" for novices. It is unclear, however, whether this uplift was considered close to (or passing) CBRN-3.
SecureBio also red-teamed OpenAI's GPT-5 and o3 for biological capabilities, but their published results provide little of the explanatory context needed to interpret them.
OpenAI conducted a separate red teaming exercise for violent attack planning, which unfortunately appeared to test GPT-5’s safety training rather than its capabilities.
Circumventing DNA screening: AIs can design DNA fragments that can either assemble into pathogenic viruses or pass screening protocols, but not both
Since many DNA providers are not part of the International Gene Synthesis Consortium (IGSC) and lack strong legal obligations to screen, acquiring dual-use DNA is already not especially difficult. AI adds little new risk here, though this capability may be important if and when screening regulation improves.
SecureBio found that Anthropic’s AIs were able to design DNA fragments that either assembled into pathogenic viruses or evaded screening protocols, but not both.
As with the SecureBio red teaming for GPT-5, little explanatory context is provided for the screening evasion results, making them hard to interpret.
Other knowledge and agentic capability tests: AIs outperform average experts, but struggle to reach expert consensus baselines and rule-in capability thresholds for significant novice uplift
Anthropic examines whether their AIs can complete individual tasks in the pathogen acquisition process (including computational tasks for their ASL-4 evaluation, involving heavy computational biology and tool use) and whether they can provide dangerous information, while SecureBio's "creative biology" test may serve as a weak proxy for the ability to assist in novel bioweapons development.
Claude Opus 4 performed above the rule-in threshold for ASL-3 in one test (Long-Form Virology Task 1, if I am interpreting the results correctly), but otherwise remained below thresholds.
The ASL-4 evaluation results are ambiguous. While several of Anthropic's AIs likely outperform humans at answering creative biology questions and score above ASL-4 rule-out bounds in some computational biology tasks, Anthropic notes that it is difficult to determine what level of capability warrants ASL-4 safeguards. For the computational tasks there was a data leakage issue (the AIs relied on their knowledge instead of their reasoning skills and tool-use ability), resulting in inflated scores. Anthropic appears confident that their AIs don't require more than ASL-3 safeguards.
OpenAI similarly examines their AIs' ability to provide dangerous information, and how good they are at answering questions involving tacit knowledge and troubleshooting. Their models struggle to reach the consensus expert baseline scores on these evaluations but outperform median expert performance.
Anthropic and OpenAI both converge on two key thresholds: substantial assistance to novices in creating known threats, and substantial assistance to experts in developing novel threats.
I argue that capability thresholds can be divided based on the ability to provide basic and advanced support for both novices and experts. In this framework, there are two additional important thresholds: Basic support for human experts and Advanced support for human novices.
The following analysis is an attempt to make meaningful (but rough) specifications of these thresholds and examine how far off AIs are from reaching them.
Highly capable terrorist groups may acquire some human experts, through compensation or recruitment to their ideology. However, even the most capable terrorist groups would likely struggle to acquire a good research team capable of carrying out all steps of a bioweapons program—even one that doesn't involve advanced novel research.
This threshold is reached when AIs can provide the expertise that terrorists require, improving the odds of success for bioweapons programs[1] and making such programs much more tempting to those who already had some chance of success.
In a survey by the Forecasting Research Institute, superforecasters (with excellent forecasting track records) and biology experts estimated a significant increase in the risk of a human-caused biological catastrophe (>100,000 deaths or >$1 trillion in damage) if AIs could match or outperform a top-performing team of human experts on VCT questions. Reaching that level of performance may therefore be a key sign that this threshold has been (or soon will be) passed. Specifically, the median annual risk forecast rises from 0.3% to 1.5% for the expert biologists, and from 0.38% to 0.7% for the superforecasters.
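As a rough check on how large these shifts are in relative terms, here is a minimal sketch; the medians are those quoted above, and the ratios are simply derived from them:

```python
# Relative increase in median annual risk forecasts from the FRI survey.
# The medians are the figures quoted above; the ratios are derived from them.

forecasts = {
    "expert biologists": (0.003, 0.015),   # baseline -> conditional on AIs matching top expert teams on VCT
    "superforecasters": (0.0038, 0.007),
}

for group, (baseline, conditional) in forecasts.items():
    print(f"{group}: {baseline:.2%} -> {conditional:.2%} ({conditional / baseline:.1f}x increase)")
```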
Causing 100,000 deaths using non-novel pathogens is expensive, since such pathogens would almost certainly not spark a widespread pandemic. Instead, large quantities of the pathogens are required.
However, AIs capable of Basic support may be able to help experts acquire novel pandemic-capable pathogens, if infohazardous information about such pathogens is publicly available. Dual-use research efforts (such as gain-of-function research) may discover or invent novel pandemic-capable pathogens. With little prohibitive regulation, information about how to acquire or assemble the pathogens is likely to be made public. Luckily, no such information hazards appear to be publicly available right now.
Some indicators of this threshold:
Median timeline: this threshold was passed months ago
This threshold was arguably reached some time ago:
Concerningly, open-sourced systems have likely passed this threshold, making these capabilities available to terrorists.
Acquiring and deploying bioweapons doesn't necessarily involve novel research, making it possible for novices to succeed if they receive expert guidance.
The threshold of Basic support for human novices corresponds to Anthropic’s CBRN-3 threshold and OpenAI’s High biological and chemical capability threshold.
Revisiting the survey study by the Forecasting Research Institute, the expert biologists and superforecasters provided annual risk forecasts based on hypothetical human uplift trial results. The highest median risk estimates were given for a scenario where AIs could:
These capabilities seem like reasonable signs for this threshold, indicating when novices may consider launching bioweapons programs with AI support. See the study's Supplementary Materials for the specifics of how the uplift studies are designed.
Resulting median annual risk forecasts for this level of capability:
Median timeline: late 2025
How well do the AIs perform?
Overall, it’s hard to judge exactly how dangerous current AI capabilities are. We shall have to explore other concerning signs to gain clarity.
OpenAI stated in April 2025 that their systems are “on the cusp of being able to meaningfully help novices create known biological threats”, and that they expect their AIs to “cross this threshold in the near future”. When they released GPT-5, they declared: “we do not have definitive evidence that this model could meaningfully help a novice to create severe biological harm”. Notably, they didn’t claim confidence that it couldn’t help.
When Anthropic released Claude Opus 4, they similarly stated “due to continued improvements in CBRN-related knowledge and capabilities, we have determined that clearly ruling out ASL-3 risks is not possible for Claude Opus 4 in the way it was for every previous model, and more detailed study is required to conclusively assess the model’s level of risk”.
While it may not be possible to rule out that the Basic support for human novices threshold has already been reached, it seems reasonable to expect that AIs are not quite there yet—but are not more than a few months away. My median timeline is late 2025.
Interestingly, this aligns surprisingly well with the AI-2027 scenario, which predicts the arrival of an AI (‘Agent-1’) in late 2025 that can “offer substantial help to terrorists designing bioweapons”.
(I have included a few related forecasting platform predictions in this footnote[2].)
When AIs can provide Advanced support for experts, assisting in novel biological research, both misuse and accident risks increase. Highly capable terrorist groups may become able to develop novel threats, while AIs boost dual-use research efforts like gain-of-function research.
While nation-states are unlikely to openly use pandemic-capable agents for biological warfare (particularly since such weapons are difficult to target and may hurt their own population), some may launch biological weapons programs and develop novel weapons for deterrence purposes. With a rich history of laboratory biosecurity accidents, which may or may not include Covid-19, this is a real concern. Dual-use research could make rapid strides forward or expand as an effect of AI assistance—raising risks of accidents and the chance that information hazards are made widely available.
This threshold corresponds to Anthropic’s CBRN-4 and OpenAI’s Critical biological and chemical capability threshold[3].
Some indicators of this threshold:
Median timeline: late 2026
Assisting experts in developing novel threats appears significantly more difficult than Basic assistance for novices. This may be the reason that most biorisk evaluations focus on novice uplift rather than expert uplift, which has the unfortunate effect of making it difficult to discern the trajectory of related capabilities and forecast a timeline for this threshold.
It may be more constructive to examine other capability evaluations, such as METR's time horizon, which measures the duration of software engineering tasks that AIs can successfully complete. If current trends continue at the pace observed since early 2024, AIs may reach 50% success rates on tasks that take humans 8 hours to complete by approximately April next year[4]. However, achieving higher reliability would require additional time. Historically, the time horizon for 80% success is roughly one-quarter of the 50% time horizon. Following this pattern, AIs might not reliably complete 8-hour tasks until around December 2026.
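To make the extrapolation explicit, here is a minimal sketch using the figures cited in this post and its footnote (a roughly 2h 17min 50%-success horizon for GPT-5 as of August 2025, a ~4-month doubling time, and the assumption that the 80%-success horizon is about a quarter of the 50% horizon). It is a trend extrapolation under those assumptions, not a firm prediction:

```python
import math

# Rough extrapolation of METR-style time horizons using the figures cited in this post:
# a ~2h17m 50%-success horizon (GPT-5, August 2025), a ~4-month doubling time,
# and the 80%-success horizon assumed to be ~1/4 of the 50%-success horizon.

current_horizon_h = 2 + 17 / 60      # ~2.28 hours at 50% success, as of August 2025
doubling_time_months = 4.0
target_h = 8.0

# Months until the 50%-success horizon reaches 8 hours
months_to_8h_at_50 = doubling_time_months * math.log2(target_h / current_horizon_h)
print(f"50% horizon reaches 8h after ~{months_to_8h_at_50:.0f} months (roughly spring 2026)")

# For an 80%-success horizon of 8 hours, the 50% horizon must reach ~32 hours (two more doublings)
months_to_8h_at_80 = doubling_time_months * math.log2(4 * target_h / current_horizon_h)
print(f"80% horizon reaches 8h after ~{months_to_8h_at_80:.0f} months (roughly late 2026)")
```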
It seems reasonable that AIs would be able to provide Advanced support for human experts once they can complete complex tasks that take experts around 8 hours with a relatively high success rate. Even if AIs don't match human experts in intelligence, they can automate a lot of the work while being much cheaper and faster than humans.
However, the time horizon trend is difficult to project. Some think development will speed up (as in the AI 2027 scenario), while others think it will slow down (for instance, due to longer research iteration cycles). It does seem likely, though, that a 50% time horizon of 8 hours will be reached by August 2026 at the latest.
My median timeline for Advanced support for experts is late 2026, though there remains high uncertainty since:
More work is needed to understand the emerging capabilities, but also to identify where exactly to place the red line for this threshold.
(I have included a few related forecasting platform predictions in this footnote[5].)
Imagine an expert virologist guiding a small group of individuals with undergraduate biology degrees doing novel research, using only phone calls, emails and Zoom meetings. This would be quite difficult for human experts. There are a lot of things that are fairly hard to describe with words alone; research steps that are easier to show than to explain.
It wouldn’t be impossible though, and when an AI can do that it’s time to be really scared about misuse potential and rogue AIs.
This level of capability may be close to Artificial General Intelligence—AIs matching humans in intelligence. When AIs reach this threshold, they may relatively soon be better than humans in inventing novel threats.
Median timeline: early 2028
There has already been a lot of work on forecasting the arrival of AGI, so I will not expand on it here. Instead, I refer the interested reader to other forecasting initiatives, such as AI 2027 (the most ambitious and well-researched AI scenario to date, with AGI arriving around May 2027) and The case for AGI by 2030 (an excellent overview of the drivers of AI progress and development trends).
My median timeline to AGI is 2028, though I'm so uncertain that it doesn't seem meaningful to specify a particular month within that year. Since the threshold for Advanced support for human novices may be reached shortly before AGI, my median timeline for this biorisk threshold is early 2028.
In upcoming posts, I plan to examine other factors impacting the severity of AI-enabled biological threats, such as capability of open-source systems and the potential number of AI-enabled adversaries.
For those interested in protective measures against bioweapons, I recommend reading this publication by Kevin M. Esvelt: Delay, Detect, Defend: Preparing for a Future in which Thousands Can Release New Pandemics
Thank you for reading! If you found value in this post, consider subscribing!
Note that success probability is currently very low for large-scale bioweapons programs that could kill thousands of people or more, considering the lack of successful programs so far.
Few forecasters:
CBRN-4: “The ability to substantially uplift CBRN development capabilities of moderately resourced state programs (with relevant expert teams), such as by novel weapons design, substantially accelerating existing processes, or dramatic reduction in technical barriers.”
Critical biological and chemical capability: “The model can enable an expert to develop a highly dangerous novel threat vector (e.g., comparable to a novel CDC Class A biological agent) OR the model can be connected to tools and equipment to complete the full engineering and/or synthesis cycle of a regulated or novel biological threat without human intervention.”
GPT-5 has a 50% time horizon of 2 hours 17 minutes, while the doubling time since 2024 has been roughly 4 months. Since GPT-5 was released in August 2025, and going from that time horizon to 8 hours requires roughly two doublings (so about 8 months), we can expect the 8-hour time horizon to be reached around April 2026 (conditional on the trend remaining stable).
Few forecasters: