Thank you for writing this, I find it very relatable. I'd heart react the post if that feature existed, so I'll heart react my comment instead.
This benchmark includes a Slay the Spire environment! When it was written, Gemini 2.5 did the best, getting roughly halfway through a non-Ascension run.
This very roughly implies that the median METR staff prediction for the 50% time horizon by EOY 2026 is a bit higher than 20 hours.
I very roughly polled METR staff (using Fatebook) what the 50% time horizon will be by EOY 2026, conditional on METR reporting something analogous to today's time horizon metric.
I got the following results: 29% average probability that it will surpass 32 hours. 68% average probability that it will surpass 16 hours.
The first question got 10 respondents and the second question got 12. Around half of the respondents were technical researchers. I expect the sample to be close to representative, but perhaps skewed a bit more toward short timelines than the rest of METR staff.
The average probability that the question doesn't resolve AMBIGUOUS is somewhere around 60%.
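For what it's worth, here's a minimal sketch of how the ~20-hour figure can be backed out from those two data points, assuming the staff-average exceedance probabilities fall roughly linearly in log(hours) between the two thresholds (that interpolation is my assumption, not how the poll was analyzed):

```python
import math

# Poll data points: average probability that the EOY-2026 50% time horizon
# exceeds each threshold (in hours).
points = {16: 0.68, 32: 0.29}

# Assume P(exceed h) falls roughly linearly in log2(h) between the two
# thresholds, and solve for where it crosses 50%.
(h_lo, p_lo), (h_hi, p_hi) = sorted(points.items())
frac = (p_lo - 0.50) / (p_lo - p_hi)  # fraction of the way from h_lo to h_hi
median_hours = h_lo * 2 ** (frac * math.log2(h_hi / h_lo))

print(f"Implied median: ~{median_hours:.0f} hours")  # ~22 hours
```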
I think my median is now 4 years, due to 2025 progress being underwhelming. I plan to write a follow-up post sometime soon.
I don't endorse the timelines in this post anymore (my median is now around EOY 2029 instead of EOY 2027) but I think the recommendations stand up.
In person, especially in 2024, many people would mention my post to me, and I think it helped people think about their career plans. I still endorse the robustly good actions.
How did my 2025 predictions hold up? Pretty well! I plan to write up a full post reviewing my predictions, but they seem pretty calibrated. I think I overestimated public attention and FrontierMath, and slightly overestimated SWE-bench Verified and OSWorld. I think all of the preparedness categories were hit.
I don't see why either of those things stop you from having a family.
I think we might be using different operationalizations of "having a family" here. I was imagining it to mean something that at least includes "raise kids from the age of ~0 to 18". If x-risk were to materialize within the next ~19 years, I would be literally stopped from "having a family" by all of us getting killed.
But under a definition of "have a family" that means "raise a child from the age of ~0 to 1", then yeah, I think P(doom) is <20% in the next 2 years and I'm probably not literally getting stopped.
Also to be clear, my P(ASI within our lifetimes) is like 85%, and my P(doom) is like 2/3.
This is because the correct answer is option three: try to modify the button to lower the 60 and raise the 15, until such time as a 1-in-5 chance of survival is a net improvement relative to your default situation.
Yes, the counterfactual I was imagining in this button world was just living a normal life and dying at the end. If indeed there's a way to shift around the probabilities I'd devote my life to it. Which is what we're doing!
It's been honestly very freeing to be able to discuss these things somewhere other than this community.
I agree. This year I've had the policy of being very direct about what I think about crazy AI futures, even with people outside of the AI safety community. I gave a PowerPoint presentation to my close family members about AGI and AI safety and how the world is going to be crazy in the coming decades. When my relatives ask me about having kids, I say "By the time I'd have had kids, if humanity is even around, who knows what the concept of kids will look like. Maybe we'll be growing them in vats. Maybe we'll all be uploaded."
Of course, I don't say all of that every time. Most of the time people aren't in the mood for those sorts of discussions. But people have started taking these arguments more seriously as AI has had more and more of an effect and appeared more and more in the news.
that's unfair; if there's no utopia, none of the other interventions work either,
You're so right! Thanks for catching this.
I think I probably want to be clearer about the units of measurement being slightly different. Every intervention except the cryonics one is naively reducing acute micromorts, which can be converted into "microutopias" by multiplying by P(utopia). The cryonics one is about increasing microutopias directly, because the counterfactual is ~purely in utopian worlds.
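To make the unit conversion concrete, here's a toy sketch with made-up numbers (the P(utopia) value is purely illustrative, not a figure from the post):

```python
# Toy illustration with made-up numbers (not from the post).
P_UTOPIA = 0.15  # hypothetical P(utopia), for illustration only

def microutopias_from_micromorts(micromorts_averted: float,
                                 p_utopia: float = P_UTOPIA) -> float:
    """Convert acutely-averted micromorts into microutopias by weighting
    by the probability that surviving lands you in a utopia at all."""
    return micromorts_averted * p_utopia

# e.g. an intervention averting 1000 acute micromorts:
print(microutopias_from_micromorts(1000))  # 150.0 microutopias
```

The cryonics intervention skips this discount, since essentially all of its counterfactual value already sits in utopian worlds.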
I think that space-based power grabs are unlikely as long as powers care about, and are equally-matched on, Earth.
This is the rough story that I think is unlikely to happen:
This story seems unlikely to me because, in this scenario, Superpower A probably still has most of its human population on Earth (relocating millions of people to space would probably be very slow). Therefore, as long as mutually assured destruction is maintained on Earth, Superpower B will retain a lot of its bargaining power despite having a disadvantage in space infrastructure.