I ran through my checklist. Looks low-risk to me. Basically a very deadly disease that nevertheless shows nil or inefficient human-to-human transmission and doesn't show signs of regional or global growth. It depends heavily on specific South American rat species for transmission to humans. The original exposure was on or around April 1. There are no new concerning mutations (see below), and so far the flight attendant who had been feared to have picked it up on a flight has tested negative, though the incubation period can run up to 6 weeks and she was exposed on April 25th, about 2 weeks ago. Three other passengers who had contact with an infected person on that flight have also tested negative so far.
The consensus public health position is that "person-to-person transmission is possible through close personal contact, such as between couples." If it turns out that this is wrong and that it's much more human-transmissible than previously believed, then the complete failure to quarantine passengers after the discovery of hantavirus on the Hondius cruise ship and first passenger death would completely alter my projection. But so far, there do not seem to be signs of that.
On an evolutionary biology level, if Andes virus were to become efficiently human-transmissible, the last place I'd expect to see it enter the human population is via tourists picking it up while bird watching in a garbage dump. That's a setting where the human-rodent interface is minimal. In that setting, there's little selection pressure on the virus to jump to humans and transmit efficiently. In a hypothetical Andes virus pandemic, I'd strongly have expected patient zero to be a Chilean or Argentinian villager living in conditions that sadly expose them to routine contact with rats. I'd expect the outbreak to spread first in a local village, not on a luxury cruise ship.
Instead, it looks to me like a couple of tourists made the risky and unusual decision to put themselves in a uniquely rodent-infested location without protection, where they likely had the misfortune to inhale a large quantity of virus-infested aerosolized rat droppings. Then they got into the close quarters of a cruise ship, which completely failed to execute the measures it should have taken to address their illness, including a failure to quarantine. This allowed limited spread to other close-quarters passengers, exactly as we have observed this virus to be capable of in the past. This, combined with the lack of new mutations, suggests we're dealing with exposure to a known-quantity, transmission-inefficient virus in a demographic that doesn't usually get exposed.
The concern stems from the fact that those passengers had travelled internationally by the time quarantining and contact tracing were initiated, combined with the deadliness of the virus. But it takes a combination of spread in international populations and transmissibility and deadliness and lack of treatment to make a pandemic, and currently we only appear to have deadliness and lack of treatment. The only reason it even gets 4 points is probably that it's so rare and such a disease of poverty that big pharma's never invested in a vaccine or treatment. The main scary thing, as with Ebola, is the very high CFR. But that is not grounds for a global pandemic. It's just a tragedy for a few hundred people a year.
Version: 0.2.4
Current score: 4/14
Last updated: 8 May 2026
The Andes-strain hantavirus outbreak aboard the MV Hondius cruise ship is tragic. But I estimate a <1% chance this becomes a pandemic. It is on track to self-extinguish within weeks.
Hard disagree. 8% is wildly overinflated and it's at 4% now. If you don't find yourself substantially revising your plans based on a 50% drop in pandemic probability, and if you didn't get seriously concerned about an 8% chance of a 40% CFR pandemic, then even you aren't taking these numbers seriously. The main utility of this prediction market is driving internet chatter on how useful prediction markets are.
I think the prediction market can be useful even if you need to apply corrections to the headline number. It makes a huge difference to me whether Manifold says 8% or 40%, even though I won't necessarily fully trust either number until it's had a while longer to shake out (and even then it depends on the market dynamics, liquidity, etc.)
I think that's an unfair deployment of xkcd mockery. 8% of a pandemic this year is not a tiny chance, which means a 50% drop is actually a big deal. The issue was interpreting the prediction market as an accurate percentage when it should just be an indication of approximate risk.
I mean, no, of course I am not changing my plans based on a 50% drop in pandemic probability. There are tons of pandemic probabilities that change from 0.1% to 0.05%, every week or so, and I am not changing my actions based on that.
In this case, knowing the probability seems to be somewhere in the 1%-10% range is already extremely helpful! I don't really need to know much more. And I have proxies I can use to evaluate the robustness of the market (like volume), so I am not miscalibrated about the noise.
Hard disagree. 8% is wildly overinflated... The main utility of this prediction market is driving internet chatter on how useful prediction markets are.
Polymarket has it at 10%. I don't know where you're located, but if you're able to buy some NO, and the WHO doesn't characterize Hantavirus as a pandemic this year, then you'll make ~11% returns.
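For what it's worth, the implied return is easy to check. This is a sketch assuming a NO share costs one minus the implied YES probability and pays out $1 on a NO resolution, ignoring fees and capital lock-up:

```python
# Sanity-checking the ~11% figure for buying NO on a 10% market.
implied_yes = 0.10
no_price = 1.0 - implied_yes                # $0.90 per NO share
gross_return = (1.0 - no_price) / no_price  # profit per dollar staked
print(f"{gross_return:.1%}")                # 11.1%
```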
In 2015:
B: "Do you think there's gonna be a Hantavirus pandemic?"
A: "uh well it seems unlikely"
B: "I disagree"
A: "totally valid"
I am not sure what you mean, do you currently think the probability is outside the 1%-10% range? Imperfect accuracy is fine, especially when it's easy to adjust for.
It sounds to me like you're expecting too much from information in general. Consider the information that physical matter is made of atoms, which are about a tenth of a nanometer across, with positively charged protons and neutral neutrons in the center and much smaller electrons whizzing around them. This has been revolutionary for society, enabling so much of engineering, chemistry, our understanding of cosmology, and so forth.
A typical teenager learning this information for the first time might ask "what utility will this give me personally?" and find it has little direct application. Yet I would not advise them that this information is not worth knowing.
That's the first point, that information doesn't need to be directly connected to an outcome to be worth knowing.
The second point is that fine gradations in information are valuable. I can imagine someone similarly saying "Why should it be valuable to anyone to know the difference between Apple stock being at $285 versus $275? Surely we should just care whether it's doing well or not? Why don't we just replace the number with the words 'Great', 'Good', 'Bad', 'Worse'?" Yet often small signs tell us something. A few percentage points of dip can imply that a new product release went poorly. A change in CEO followed by the stock price rising a few points can indicate very good things about the new CEO.
In this case, I find information like "8% chance of a pandemic" valuable in lots of ways.
Added: The third point is that public legibility is a massive value-add, and could well be most of the value. Because it's public, I'm more confident that, if the number were to rise, people would notice and warn me. Much of the news landscape is just people arguing over whether something is an emergency (which our memetic incentives perversely push toward "yes" all of the time), so how alarmed people are acting just isn't something most of us can track sensitively. The change from 5% to 50% is serious for me, and yet I don't know how to tell that difference from the tone of people on Twitter or in many media outlets, especially when they are not themselves precise.
To go into more detail on this, I had an LLM write the following so that the math checked out: you can imagine a world where every year carries a 5% pandemic risk, and another world where every 15 years carries a 75% pandemic risk. Over 60 years, both imply 3 expected pandemics, but they suggest very different prevention strategies: steady-state risk reduction in the first case, versus identifying and defusing rare high-risk transition periods in the second.
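That equivalence in expectation is easy to verify. A quick sketch using the numbers from the example above, treating each period's risk as independent:

```python
# Two risk regimes with the same expected number of pandemics over 60 years.
years = 60
steady = 0.05 * years        # 5% risk every year -> 3.0 expected
lumpy = 0.75 * (years / 15)  # 75% risk each 15-year window -> 3.0 expected
print(steady, lumpy)         # 3.0 3.0

# Yet the regimes differ in other respects, e.g. the chance of at
# least one pandemic over the 60 years:
p_steady = 1 - 0.95 ** 60    # ~0.954
p_lumpy = 1 - 0.25 ** 4      # ~0.996
print(round(p_steady, 3), round(p_lumpy, 3))
```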
Like, hooray, we have a number. What now?
I mean, so much. First of all, the number is enormously more helpful than reading a bunch of articles that give me lots of detail and then force me to build my own model of the situation to arrive at a probability.
And then the number itself of course drives all kinds of actions! Another COVID-level pandemic would be a huge deal that would change my actions drastically in hundreds of ways (this market is not about it being another COVID-level pandemic, but is a lower threshold).
Why don't you think that's useful? It seems like at least some smart people thought about it carefully, which is a great start. Are you referencing some other thread? If so, please share?
I'm referring to this post from last month (and I assume that post is why habyrka is tagging mabramov).
Sure, maybe there's some usefulness in that it got smart people thinking about the question. And it gave us a figure of 8%.
But I don't really understand what a person is to do with that number. What utility follows from that? And why is it worth, as @mabramov emphasises, nine-figure EA funding?
Thanks for clarifying.
The use is that I don't have to figure it out for myself! This saved me at least an hour, probably more.
My reaction to an 8% probability of a pandemic is currently "doing nothing," so in some sense I agree with you. If it was >50%, I would probably read up more on hantavirus and think about preparations I could make now (maybe at that point, I would have picked up on ambient freaking-out even without the prediction market, but I do like having a precise number). Maybe I "should" in fact be preparing for hantavirus even at an 8% probability, but I only have so much willpower and time in the day.
Notably, I can't immediately think of any important decisions in my life that I made differently due to prediction markets. But someone whose job depends on making quick decisions based on global events, e.g. some government role in biosecurity, would probably find calibrated estimates on this kind of thing quite useful.
Cruise ships, unfortunately, are prone to exacerbating any sort of disease outbreak: close quarters, shared dining, and a confined population all raise contact rates. For instance, back in 2020 aboard the Diamond Princess, Covid spread with an R0 roughly 5x higher than usual.
If anything, it's surprising that more people didn't get infected by this, which shows just how poor hantavirus is at human-to-human spread.
This is interesting because it's not discussed enough that AlphaGo worked the same way LLMs do: it was pretrained on a large dataset of human moves, then posttrained through reinforcement learning. The shift to AlphaZero removed the pretraining entirely, showing that pretraining data wasn't needed for the model's capabilities (if anything, the opposite: AlphaZero is superior to AlphaGo). While this coding model is still pretrained, it likewise suggests that good pretraining data doesn't matter nearly as much as one might think.
it also shows that good pretraining data doesn't matter nearly as much as one could think
I don’t think it shows that. It arguably suggests that abundant pretraining data doesn’t matter as much as one could think. As opposed to good pretraining data. I presume that the codebases + agentic coding transcripts that they SFT’d on were high quality, right? [ETA: WHOOPS SEE MATRICE REPLY]
As for data efficiency, after the pre-1930 pretraining, IIUC it takes 250 training examples ≈ 13 million tokens before “the model solves its first [SWE-bench] issue”, and 75000 training examples ≈ 4 billion tokens gets to pass@1 of 4.5%.
Is that more or less than expected? I dunno, it depends on what you were expecting. For what it's worth, Gemini says 13 million tokens is about what a human could read in 650 hours non-stop (40 hours/week for 16 weeks).
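A quick back-of-envelope from the figures above (the tokens/hour rate is inferred from the 13-million-tokens-in-650-hours claim, not a source figure):

```python
# Tokens per training example, from the two data points quoted above.
print(13e6 / 250)      # 52,000 tokens per example
print(4e9 / 75_000)    # ~53,333 tokens per example (consistent)

# Implied human reading rate behind the "650 hours" estimate:
print(13e6 / 650)      # 20,000 tokens/hour, i.e. ~5.5 tokens/second
```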
I don’t think it shows that. It arguably suggests that abundant pretraining data doesn’t matter as much as one could think. As opposed to good pretraining data.
I think it straightforwardly shows the reverse (good pretraining data doesn't matter as much as one could think, but abundant pretraining data does)? Olmedo himself notes: "What holds the 1930 model back is that it is severely undertrained (only 260B tokens), rather than its pre-training data."
I presume that the codebases + agentic coding transcripts that they SFT’d on were high quality, right?
SFT is posttraining??
Lol oops yeah SFT is posttraining, that explains why I found your comment confusing.
The quote emphasized “after just 250 training examples”, and I thought that was the context, i.e. that you were impressed by the “just 250” part and commenting on that. But I guess the “just 250” was irrelevant to your comment.
On top of that, I tend to mentally lump pretraining and SFT together because they’re algorithmically exactly the same thing, except maybe different hyperparameters. So that’s the other half of why I misread your comment.
Still, given that pretraining and SFT are algorithmically exactly the same thing, it would follow that you need no pretraining data whatsoever if you have enough of the right kind of SFT data. …In principle. Probably not in practice. But still, that’s relevant context here I think.
I think the main shock comes from none of the pretraining data having any code in it, and the model then quickly learning coding skills from only a few examples.
You say "with only a few examples", I say "after ~13 million tokens of some mix of code and agentic reasoning about code". Is it a "shock" because you were expecting it to take much more than 13 million tokens? Or is it a "shock" because you expected "number of examples" to be an important constraint independent of the length of each example? Or something else?
Hmm. I didn't realise there were that many tokens per example. I must have glossed your comment incorrectly. Sorry on my part.
I think the initial link is good to share, but I disagree with the analogy to AlphaGo/AlphaZero. The RL process for current models still involves humans heavily in creating the tasks, assuring task correctness, and deciding what kind of tasks are useful to train on. We don't have anything like self-play except maybe a small amount in math training (synthetic math data could be construed as self-play).