LESSWRONG

varungodbole

Comments

Why is lesswrong blocking wget and curl (scrape)?
varungodbole, 7mo

gotcha, thanks!

Why is lesswrong blocking wget and curl (scrape)?
varungodbole, 7mo

This is something I'm curious about as well! A friend recently introduced me to LessWrong, and I've found myself really enjoying the posts here! I'd like to spend more focused time digging into them!

I'd like to create a dump of LessWrong so that I can use a tool like DocETL (https://www.docetl.org/) to better sift through articles that might be interesting to me. It's been quite some time since jimrandomh replied to this post, so I thought I'd check in before attempting to crawl the site.

Also, it looks like https://www.lesswrong.com/robots.txt disallows hitting /allPosts?
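For anyone else wondering how to check this before crawling, here's a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text below is a made-up sample so the snippet runs offline (I'm only assuming a rule like the `/allPosts` one I mentioned; the real file may differ), and the `/posts/example` URL and the `my-crawler` user agent are hypothetical too.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, NOT copied from the live site.
# It only mirrors the single rule discussed above.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /allPosts
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Build a parser from robots.txt text without touching the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, url: str, user_agent: str = "my-crawler") -> bool:
    """Return True if the rules permit `user_agent` to fetch `url`."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(SAMPLE_ROBOTS_TXT)

# The disallowed listing page versus a hypothetical individual post URL.
allowed_all_posts = is_allowed(parser, "https://www.lesswrong.com/allPosts")
allowed_other = is_allowed(parser, "https://www.lesswrong.com/posts/example")

print("allPosts allowed:", allowed_all_posts)
print("other page allowed:", allowed_other)
```

To check against the live file instead, `RobotFileParser` also has `set_url(...)` followed by `read()`, which downloads and parses the real robots.txt. Passing this check only says what the file permits; it doesn't answer whether the site's maintainers are okay with a full crawl.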

In a related post, someone mentions another website called GreaterWrong, but I'm not sure I understand the relationship between that website and this one. I'm a total newbie to this community haha.

What's the most thoughtful way to get a dump of LessWrong? Is that even something the folks who run this site would want?
