In January 2017, Vipul Naik obtained Google Analytics daily sessions and pageviews data for LessWrong from Kaj Sotala. Vipul asked me to write a short post giving an overview of the data, so here it is.
This post covers just the basics. Vipul and I are eager to hear thoughts on what sort of deeper analysis people are interested in; we may incorporate these ideas in future posts.
The data for both sessions and pageviews span from February 26, 2009 to January 3, 2017. LessWrong seems to have launched in February 2009, so this is close to the full duration for which LessWrong has existed.
Total pageviews recorded by Google Analytics for this period is 52.2 million.
Total sessions recorded by Google Analytics for this period is 19.7 million.
Both plots end with an upward swing, coinciding with the effort to revive LessWrong that began in late November 2016. However, as of early January 2017 (the latest period for which we have data) the scale of any recent increase in LessWrong usage is small in the context of the general decline starting in early 2012.
The top 20 posts of all time (by total pageviews), with pageviews and unique pageviews rounded to the nearest thousand, are as follows:
Note that Google Analytics reports are subject to sampling when the number of sessions is large (as it is here) so the input numbers are not exact. More details can be found in a post at LunaMetrics. This doesn’t affect the estimates for the top posts, but those wishing to work with the exported data should be aware of this.
Each post on LessWrong can have numerous URLs. In the case of posts that were renamed, a significant number of pageviews could be recorded at both the old and new URL. To take an example, the following URLs all point to lukeprog’s post “How to Be Happy”:
All that matters for identifying this particular post is that we have the substring “/lw/4su” in the URL. In the above table, I have grouped the URLs by this identifying substring and summed to get the pageview counts.
In addition, each post has two “canonical” URLs that can be obtained by clicking on the post titles: one that begins with either “/r/lesswrong/lw” or “/r/discussion/lw” and one that begins with just “/lw”. I have used the latter in linking to the posts from my table.
The data, source code used to generate the plots, as well as the Markdown source of this post are available in a GitHub Gist.
Clone the Git repository with:
git clone https://gist.github.com/cbdd400180417c689b2befbfbe2158fc.git
Here are a few related PredictionBook predictions:
Thanks to Kaj for providing the data used in this post. Thanks to Vipul for asking around for the data, for the idea of this post, and for sponsoring my work on this post.
Huh, some of the top articles are totally not what I'd have expected. "Don't Get Offended" is non-promoted and currently only has an upvote total of 32. "Advanced Placement exam cutoffs and superficial knowledge over deep knowledge" is also not promoted and has an upvote total of 4.
Would be interesting for someone to run an analysis to see how closely upvotes and page views correlate. Apparently not as much as I'd have guessed.
"Don't Get Offended" seems to rank highly on Google for the term "Don't Get Offended" which has search volume.
For all the talk about the "decline" of LessWrong, total pageviews and sessions to LessWrong have stayed 5-10 times higher than those to the Effective Altruism Forum (the EAF numbers are documented in my post).
I notice a contradiction that i don't yet understand. This post and the wiki page (https://wiki.lesswrong.com/wiki/History_of_Less_Wrong) say that LessWrong started in 2009. However, there are comments here with earlier timestamps (arbitrary example: http://lesswrong.com/lw/qd/science_isnt_strict_enough/k2t). I was under the impression lesswrong.com was an active community at least since 2007. Is the wiki's "2009" a typo?
Also, i am updating my PoV on recent LW history based on the analytics charts. I take it that pageviews have not yet dropped below 2010 levels, even if commenting rates did?
Comments and posts were ported over from Overcoming Bias and so they preceded the Less Wrong website.
Ah, the comments too! Okay, now I understand.
There are some more data (post count, comment count, vote count, etc., but not pageviews) at "History of LessWrong: Some Data Graphics".
Glad to see there's still hope.