Home Page vs Search Engines
Search engine optimization is very, very complicated. I am not going to tell you how to SEO the home page in this comment. What I am going to tell you is how to avoid triggering a duplicate content penalty if you decide to make use of my idea for hooking new users on LessWrong.
No page should have more than 5% of it's content in common with other pages on the same site. There are penalties for duplicated content. Assuming that you put at least 20 times as many words on the home page as the longest excerpt, and limit the number of words in the excerpts, such that they do not count as more than 5% of the content on the home page, then you should have pretty good protection against being penalized for duplicate content. If I was going to do this on my own site, I'd comment the code up and down to warn tinkerers about the 5% rule.
Probably the easiest way to do this is show only a limited number of words from excerpts at one time, and rotate them so that every time you load the home page you get a new one (but it never scrolls away or gets replaced - that interrupts one while reading it).
This is important: If you paste 5 billion excerpts into the home page code and rotate them with java script, the search engine will still see all the ones that are hiding in there and include them as content. They can be counted toward your duplicate content limit. The best way to keep them from composing more than 5% of the text is to use CGI to output one at a time. Then the others are hidden. Your web host or software might already have a random quote generator, and it might be as simple as feeding it the excerpts and pasting a small block of code onto the home page.
You can put a little "random excerpt" link in there so that users can enjoy the addicted clicking that gets them new excerpts when they want them. The "random excerpt" link would simply refresh the home page. (Link the page back to itself. Simple.)
Another cool bonus to this is that if you add new excerpts every so often, your home page will always have new content. This can be good for getting more search engine attention as well as user attention.
Proposed rewrites can be found here. Please suggest specific improvements in the comments!
Although long-time Less Wrong users don't pay much attention to the home page, about page, and FAQ, I suspect new users pay lots of attention to them. A few times, elsewhere on the internet, I've seen people describe their impression of Less Wrong that seemed primarily gleaned from these pages--they made generalizations about Less Wrong that didn't seem true to me, but might appear to be true if all one did was read the about page and FAQ.
The about page, in particular, is called out to every new visitor. Try visiting Less Wrong in incognito mode or private browsing (i.e. without your current cookies) to see what I'm referring to.
But the current set of "newcomer pages" isn't very good, in my opinion:
I certainly don't claim to speak for all Less Wrong users. If you have any thoughts, please comment here, send me a private message, or log in to the wiki and edit the candidate pages directly.
I'm especially interested in getting feedback on the FAQ, because I took the liberty of codifying some social norms that were previously implicit: see the section Site Etiquette and Social Norms, especially the bits about Discussion vs Main, politics, and "if you never get voted down, you're not posting enough".
If you think I codified the social norms incorrectly, or you've been thinking they really should be different, please comment! The FAQ seems like a good way to broadcast preferred norms, so I suspect this is an ideal thread to discuss them.
If you've got a suggested change that's nontrivial, I encourage you to create a poll for it here using comments as poll options or HonoreDB's system.
(Previous discussion.)