Seemingly Popular Covid-19 Model is Obvious Nonsense

[-]Bundle_Gerbe6y290

The model seems not far off estimating peak hospitalization date, at least for states that are currently peaking like CA and NY. The peaks in places that are close to peaking can be pretty accurately estimated just with curve fitting though, I assume that being fit to past data is why the model works OK for this.

It's clearly overly optimistic about the rate of drop-off after the peak in deaths, at least in some cases. Look at Spain and Italy. Right now here's how they look:

Italy: graph shows 610 deaths on April 9. Predicts 335 on April 10, 281 on April 11. Actual is 570 on April 10, 619 on April 11.

Spain: graph shows 683 on April 8, Predicts 372, 304, 262 on next three days. Actual 655, 634, 525.

The model for New York says deaths will be down to 48, 6% of the peak, in 15 days. Italy is 15 days from it's peak of 919 and is only down to 619, 67% of the peak.

The model for the US as a whole is a little less obviously over-optimistic, assuming the peak really was April 10. it's only predicting 40% decline in the next 15 days. California model predicts an even slower decline. It seems to think fast growth in cases in the outbreak phase leads to fast recovery, which has not been borne out thus far in Italy and Spain.

[-][anonymous]6y50

This increases my estimated odds of the federal government attempting to suppress positive test numbers via defunding and not collecting statistics.

[-]Bucky6y40

Italy seems to me to have stalled in decreasing R at about R=0.9. China and South Korea both got down to R=0.5. I have a concern that the UK has stalled at about R=1.3 (25% confidence) but I suspect that a few days more data may disprove this.

The US appears to still be on a downwards trajectory (currently just above R=1) but where exactly it stops will make a huge difference to the final tally. If I were to be making a model then this is the main place where I would focus my attention to give reasonable confidence intervals.

[-]mfoley6y30

We need a new model I think. The purpose of the IHME was to figure out how to allocate hospital resources at the peak. Now we are roughly at or past the peak and we need to figure out how to re-open and what calculated risks are worth taking to ensure that businesses don't get devastated even more. Hopefully someone is working on it.

[-]Yandong Zhang6y10

Below is a simplified COVID-19 framework:

Data acquiring ---> social engineering based on model ----> better result

Yes. A better model will be definitely helpful. However, (as pointed out indirectly earlier by someone else), to my best knowledge, there were no good and robust model for large lag dynamic systems. Such kind of model could lead to Chaos and random like result easily. Thus, I believed that increasing the data acquiring capability was the key (South Korea's approach).

[This comment is no longer endorsed by its author]Reply

[-]WilliamKiely6y170

April 17th Stat News story: Influential Covid-19 model uses flawed methods and shouldn’t guide U.S. policies, critics say:

“It’s not a model that most of us in the infectious disease epidemiology field think is well suited” to projecting Covid-19 deaths, epidemiologist Marc Lipsitch of the Harvard T.H. Chan School of Public Health told reporters this week, referring to projections by the Institute for Health Metrics and Evaluation at the University of Washington.

Others experts, including some colleagues of the model-makers, are even harsher. “That the IHME model keeps changing is evidence of its lack of reliability as a predictive tool,” said epidemiologist Ruth Etzioni of the Fred Hutchinson Cancer Center, home to several of the researchers who created the model, and who has served on a search committee for IHME. “That it is being used for policy decisions and its results interpreted wrongly is a travesty unfolding before our eyes.”

[-]Spenced6y130

“ Deaths lag positive tests by weeks.”

False. Deaths lag new infections by 3-4 weeks.

Positive tests are an extremely misleading stat and definitely do NOT represent actual infection rates except in the few places where testing is widespread (ie a few small countries that have highly prioritized testing like Iceland, Estonia, and Bahrain).

[-]jessicata6y120

Epistemic Status: Something Is Wrong On The Internet.

If you think this applies, it would seem that "The Internet" is being construed so broadly that it includes the mainstream media, policymaking, and a substantial fraction of people, such that the "Something Is Wrong On The Internet" heuristic points against correction of public disinformation in general.

This is a post that is especially informative, aligned with justice, and likely to save lives, and so it would be a shame if this heuristic were to dissuade you from writing it.

[-]Davidmanheim6y110

"If I am incorrect, and that is how any of this works I have some very, very large bets I would like to place."

Maybe you can state what bets you'd like to make? Are you predicting that the number of cases or deaths in, say, NYC will look very different from consensus estimates?

[-]Pablo6y40

An update by the OP on what bets they are willing to make would be much appreciated.

[-]WilliamKiely6y20

Zvi commenting on his The One Mistake Rule post: "E.g. if you want to bet me that there will be no American Covid-19 deaths in July, I will be very, very surprised."

[-]Davidmanheim6y50

Yes, the model isn't properly sensitive to uncertainties - but the projection that they are near zero isn't unreasonable, if transmission is stopped.

[-]Jonathan_Graehl6y100

You can be pretty sure that whatever forecast is touted by authorities is one designed to increase support+compliance with whatever measures they decided to take this time. Just like the previous was badly overestimating severity with social distancing (and probably without too), I'm willing to believe this one is optimistic about a gradual reopening of physical commerce in select areas.

[-]Zvi4y60Review for 2020 Review

This post was important to my own thinking because it solidified the concept that there exists the thing Obvious Nonsense, that Very Serious People would be saying such Obvious Nonsense, that the government and mainstream media would take it seriously and plan and talk on such a basis, and that someone like me could usefully point out that this was happening, because when we say Obvious Nonsense oh boy are they putting the Obvious in Nonsense. It's strange to look back and think about how nervous I was then about making this kind of call, even when it was this, well, obvious. Making that first correct call makes a difference.

But in terms of being part of an overall 'best of' or 'most important' collection for a community as a whole, it would only count if you think it had the same effect on you/others, and made it clear how nonsensical all the Very Serious People could be, and that you had to think for yourself. If all it did for others was point out that the Obvious Nonsense was obvious nonsense in this particular case, there's not much point.

[-]Decius6y60

Note that the model assumes that those level 4 measures remain in place forever.

What do you predict will be, assuming level 4 restrictions remain in place, the last day in which fewer than 100 people are infected who go on to become symptomatic in King County, and in CA?

Again assuming that there is a continuous level 4 quarantine, when do you predict the first non-weekend day without a C19-attributed death will be in those areas?

[-]SpicyLemonZest6y40

The money quote is misleading, because they don't actually have a mechanistic model. They're just fitting a parameterized logistic curve to all the death data in the world. They incorporate some black-box factor that causes more deaths without social distancing, and arbitrarily declare that factor's effect is 66%/33%/0 with 1/2/3+ social distancing measures. The goal isn't to claim that nobody's ever infected in the 0 case, just that the not-social-distancing factor is gone, so our course should follow the empirical progression of countries that do social distancing.

[-]paulfchristiano6y260

From a quick skim of the paper it looks like they effectively assume that implementing any 3 of those social distancing measures at the same time that Wuhan implemented their lockdown would lead to the same number of total deaths (with some adjustments).

This is less aggressive than assuming no new deaths after lockdown, but does seem quite optimistic given that the lockdown in Wuhan seems (much) more severe than school closures + travel restrictions + non-essential business closures. And this part of the model seems to be assumed rather than fit to data.

[-]spender6y30

Your rule of models is flawed. Newton's mechanical model of the universe is good enough for all practical purposes except where it isn't. Then you have to go relativistic or quantum. Probabilities only apply to future events. Once events have been observed, the probability changes.

[-]Bucky6y130

The real rules have no exceptions

In Newton’s case the real rule (or at least the practical rule) is the meta-rule of when Newton is good enough and what to use when it isn’t. Without that knowledge you can’t form a meta-rule and you don’t know when to believe the model and when not to. You can maybe assess it probabilistically but I wouldn’t want to place much on the result.

[-]Anon User6y30

They are not very explicit about it (which is a huge problem by itself), but they seem to be saying that they are only predicting the "first wave" - so they are not predicting 0 deaths after July - they just defining them to not be a part of the "first wave" anymore. So the way they present the model predictions is even more unbelievably wrong than the model itself!

[-]gallabytes6y50

Even with that as the goal this model is useless - social distancing demonstrably does not lead to 0 new infections. Even Wuhan didn't manage that, and they were literally welding people's doors shut.

[-]Anon User6y10

But don't you see - those infections are a second wave, so do not have to be counted. The model is almost tautologically true that way. But terribly misleading, and very irresponsibly so.

[-]ioannes6y10

StatNews piece on the IHME model: https://www.statnews.com/2020/04/17/influential-covid-19-model-uses-flawed-methods-shouldnt-guide-policies-critics-say/

[-]kjz6y10

Interesting that the model hasn't been updated since April 13, which was the point when daily deaths started to rise above the model's predictions.

[-]kjz6y10

Update: I'm quite surprised that the total expected deaths has gone down in today's update. I would have expected it to rise after this week's data.

[-]Davidmanheim6y10

The problem the modelers have is how to account for reduced transmission in a continuous model. If you don't set it to zero, you can end up with 1/10,000th of a person still sick, and then the virus comes back full force a couple months later, despite having literally eradicated it. So yes, setting it to zero is wrong, but not doing so is also wrong. Because all models are wrong.

Perhaps you think they should be using an entirely different and more sophisticated model, and maybe they should, but it turns out that those have other drawbacks, like needing far more data than we have to calibrate and build, or needing you to make up inputs.

[-]orthonormal6y50

With actual numbers very, very large, this isn't remotely a concern; the domain of a correct continuous model might be "so long as there are at least 100 positive tests per week" or the like. Once we're there, we obviously need to treat things more discretely.

It's just not a sufficient reason for the modelers to make this egregious an optimistic error in setting R as a function of social distancing measures.

those have other drawbacks, like needing far more data than we have to calibrate and build, or needing you to make up inputs

Those are exactly the drawbacks Zvi is pointing to! And they're not even putting distributions on the parameter values the pulled from their asses!

[-]Yandong Zhang6y00

(1)

A wrong model could be useful if the action (based on the module) can compensate the models' error effectively. Usually, you need to know some properties of the model's error.

(2)

Even a wrong model could be very useful. For example, the earth is flat. That wrong model setup the question correctly and so that people could start thinking the shape of the earth.

[This comment is no longer endorsed by its author]Reply

LESSWRONG
LW

LESSWRONG
LW

129

Seemingly Popular Covid-19 Model is Obvious Nonsense

129

129

The Baseline Scenario That Makes No Sense

They Account for Uncertainty, Right?

A Simpler Version of the Same Model

What the Model Outputs