Book Review of 5 Applied Bayesian Statistics Books

[-]Clark Benham5y51

This would be useful as a comment on https://www.lesswrong.com/posts/xg3hXCYQPJkwHyik2/the-best-textbooks-on-every-subject, not sure of best way to update existing lists though.

[-]Jan Christian Refsgaard5y10

I will write a post shilling for myself, thanks. I was waiting for the post to be 'liked', if it got -10 karma then there would be no use in shilling for it :)

[-]SarahNibs5y30

I almost didn't open it because it looked like you were asking a question, not giving an answer, and there were 0 (now 2) comments. Title change?

[-]Jan Christian Refsgaard5y20

Good point!

original: Applied Bayesian Statistics - Which book to read?

Applied Bayesian Statistics - Which book should you read?
Literature Review of 5 Applied Bayesian Statistics Books.
Book Review of 5 Applied Bayesian Statistics Books.

I picked 3, if other people have strong feeling feel free to suggest other titles

[-]wangscarpet5y*30

I'm reading BDA3 right now, and I'm on chapter 6. You described it well. It takes a lot of thinking to get through, but is very comprehensive. I like how it's explicitly not just a theory textbook. They demonstrate each major point by describing a real-world problem (measuring cancer rates across populations, comparing test-prep effectiveness), and attacking it with multiple models (usually frequentist to show limitations and then their Bayesian model more thoroughly. It has a focus on learning the tools well enough to apply them to real-world problems.

I plan to start skimming soon. It seems the first two sections are pedagogical, and the remainder covers techniques which I would like to know about but don't need in detail.

Edit: One example I really enjoyed, and which felt very relevant to today, was on estimating lung-cancer hotspots in America. It broke the country down by county, and first displayed a map of the USA with counties in the top 10% of lung-cancer rates. Much of the highlighted region was in the rural southwest and Rocky mountain region. It asked, what do you think makes these regions have such high rates? It then showed another map, this one of counties in the bottom 10% of lung-cancer rates, and the map focused on the same regions!

Turns out, this was mostly the result of these regions containing many low-population counties, which meant rare-event sampling could skew high very easily, just by chance. If the base rate is 5 per 10,000, and you have 2 cases in a county with 1,000 people, you look like a superfund site. But sample the next year and you might find 0 cases: a county full of young health-freaks.

If you model lung-cancer rates as a hierarchical model with a distribution for county cancer-rates, and each county as being sampled from this, and then sampling cancer events from it's specific rate, then you can get a Bayes-adjusted incidence rate for each county which will regress small counties to the mean.

This made me read Covid charts which showed hot-spot counties much differently. I noticed that the counties they list are frequently small. Right now, all the counties on the NYTimes list, for example have less than 20,000 people in them, which is, I believe, in the bottom 25% of counties by size roughly.

[-]Jan Christian Refsgaard5y10

I loved that example as well, I have heard it elsewhere described as "The law of small numbers", where small subsets have higher variance and therefore more frequent extreme outcomes. I think it's particularly good as the most important part of the Bayesian paragdime is the focus on uncertainty.

The appendix on HMC is also a very good supplement to gain a deeper understanding of the algorithm after having read the description in another book first.

[-]randalljellis3y20

This book used to be the bible.

What is the current bible?

[-]Jan Christian Refsgaard3y1-1

SR if you can only read one, if you do not expect to do fancy things then ROS may be better as it is very good and explains the basics better. The logic of Science should be your 5th book and is good goal to set, The logic of Science is probably the rationalist bible, much like the real bible everybody swears by it but nobody has read or understood it :)

Richard McElreath "Statistical Rethinking" ↩︎
John Kruschke "Doing Bayesian Data Analysis: A Tutorial with R, JAGS, and Stan ↩︎
Ben Lambert "A Student’s Guide to Bayesian Statistics" ↩︎
Gelman, Hill and Vehtari, “Regression and Other Stories” ↩︎
Andrew Gelman, John Carlin, Hal Stern, David Dunson, Aki Vehtari, and Donald Rubin. "Bayesian Data Analysis" ↩︎

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

68

Book Review of 5 Applied Bayesian Statistics Books

68

68

TLDR

Short review of each Book

Recommendations / Extra considerations

My Experience with the books