If anyone wants to have a voice chat with me about a topic that I'm interested in (see my recent post/comment history to get a sense), please contact me via PM.
My main "claims to fame":
I mean greater certainty/clarity than our current understanding of mathematical reasoning, which seems to me far from complete (e.g., realism vs formalism is unsettled, what is the deal with Berry's paradox, etc). By the time we have a good meta-philosophy, I expect our philosophy of math will be much improved too.
There may not be a good meta-philosophy to find, even in the sense of matching/exceeding our current level of understanding of mathematical reasoning. I think that's plausible, but it would be a seemingly very strange and confusing state of affairs, as it would mean that in all or most fields of philosophy there is no objective or commonly agreed way to determine how good an argument is, or whether some statement is true or false, even given infinite compute or subjective time, including fields that seemingly should have objective answers like philosophy of math or meta-ethics. (Lots of people claim that morality is subjective, but almost nobody claims that "morality is subjective" is itself subjective!)
If after lots and lots of research (ideally with enhanced humans), we just really can't find a good meta-philosophy, I would hope that we can at least find some clues as to why this is the case, or some kind of explanation that makes the situation less confusing, and then use those clues to guide us as to what to do next, as far as how to handle super-persuasion, etc.
IMO, it’s hard to get a consensus for Heuristic C at the moment even though it kind of seems obvious.
Consider that humanity couldn't achieve a consensus around banning or not using cigarettes, leaded gasoline, or ozone-destroying chemicals, until they had done a huge amount of highly visible damage. There must have been plenty of arguments about their potential danger based on established science, and clear empirical evidence of the damage that they actually caused, far earlier, but such consensus still failed to form until much later, after catastrophic amounts of damage had already been done. The consensus against drunk driving also only formed after extremely clear and undeniable evidence about its danger (based on accident statistics) became available.
I'm skeptical that more intentionally creating ethical design patterns could have helped such consensus form earlier in those cases, or in the case of AI x-safety, as it just doesn't seem to address the main root causes or bottlenecks for the lack of such consensus or governance failures, which IMO are things like:
Something that's more likely to work is "persuasion design patterns", like what helped many countries pass anti-GMO legislation despite the lack of clear scientific evidence of harm from GMOs, but I think we're all loath to use such tactics.
I've been reading a lot of web content, including this post, after asking my favorite LLM[1] to "rewrite it in Wei Dai's style" which I find tends to make it shorter and easier for me to read, while still leaving most of the info intact (unlike if I ask for a summary). Before I comment, I'll check the original to make sure the AI's version didn't miss a key point (or read the original in full if I'm sufficiently interested), and also ask the AI to double-check that my comment is sensible.
currently Gemini 2.5 Pro because it's free through AI Studio, and the rate limit is high enough that I've never hit it ↩︎
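For anyone who wants to replicate this workflow programmatically rather than through the AI Studio UI, here is a minimal sketch using the `google-generativeai` Python SDK. The model id, prompts, and helper names are illustrative assumptions, not an exact record of what I do:

```python
# Minimal sketch of the reading workflow described above.
# Assumptions: the google-generativeai SDK, an API key in GOOGLE_API_KEY,
# and "gemini-2.5-pro" as the model id; adjust to whatever you actually use.
import os
import google.generativeai as genai

genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
model = genai.GenerativeModel("gemini-2.5-pro")

def rewrite_in_style(text: str, style: str = "Wei Dai") -> str:
    """Ask for a style rewrite rather than a summary, so most info is kept."""
    prompt = f"Rewrite the following in {style}'s style, keeping all key points:\n\n{text}"
    return model.generate_content(prompt).text

def sanity_check_comment(original_post: str, draft_comment: str) -> str:
    """Second pass: check a draft comment against the original post."""
    prompt = (
        "Here is a post and a draft comment on it. Point out anything in the "
        "comment that misreads or misses a key point of the post.\n\n"
        f"POST:\n{original_post}\n\nDRAFT COMMENT:\n{draft_comment}"
    )
    return model.generate_content(prompt).text
```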
Thanks for the suggested readings.
> I’m trying not to die here.
There are lots of ways to cash out "trying not to die", many of which imply that solving AI alignment (or getting uploaded) isn't even the most important thing. For instance under theories of modal or quantum immortality, dying is actually impossible. Or consider that most copies of you in the multiverse or universe are probably living in simulations of Earth rather than original physical entities, so the most important thing from a survival-defined-indexically perspective may be to figure out what the simulators want, or what's least likely to cause them to want to turn off the simulation or most likely to "rescue" you after you die here. Or, why aim for a "perfectly aligned" AI instead of one that cares just enough about humans to keep us alive in a comfortable zoo after the Singularity (which they may already do by default because of acausal trade, or maybe the best way to ensure this is to increase the cosmic resources available to aligned AI so they can do more of this kind of trade)?
> And because I don’t believe in “correct” values.
The above was in part trying to point out that even something like not wanting to die is very ill-defined, so if there are no correct values, not even relative to a person or a set of initial fuzzy non-preferences, then that's actually a much more troubling situation than you seem to think.
> I don’t know how to build a safe philosophically super-competent assistant/oracle
That's in part why I'd want to attempt this only after a long pause (i.e., at least multiple decades) to develop the necessary ideas, and probably only after enhancing human intelligence.
I've been talking about the same issue in various posts and comments, most prominently in Two Neglected Problems in Human-AI Safety. It feels like an obvious problem that (confusingly) almost no one talks about, so it's great to hear another concerned voice.
A potential solution I've been mooting is "metaphilosophical paternalism", or having AI provide support and/or error correction for humans' philosophical reasoning, based on a true theory of metaphilosophy (i.e., understanding of what philosophy is and what constitutes correct philosophical reasoning), to help them defend against memetic attacks and internal errors. So this is another reason I've been advocating for research into metaphilosophy, and for pausing AI (presumably for at least multiple decades) until metaphilosophy (and not just AI alignment, unless broadly defined to imply a solution to this problem) can be solved.
On your comment about "centrally enforced policy" being "kind of fucked up and illiberal", I think there is some hope that, given enough time and effort, there can be a relatively uncontroversial solution to metaphilosophy[1] that most people can agree on at the end of the AI pause, so central enforcement wouldn't be needed. Failing that, perhaps we should take a look at what the metaphilosophy landscape looks like after a lot of further development, and then collectively make a decision on how to proceed.
I'm curious if this addresses your concern, or if you see a differently shaped potential solution.
similar to how there's not a huge amount of controversy today about what constitutes correct mathematical or scientific reasoning, although I'd want to aim for even greater certainty/clarity than that ↩︎
> Why is it a breaking issue if some uploads don’t work out exactly what they “should” want? This is already true for many people.
I'm scared of people doing actively terrible things with the resources of entire stars or galaxies at their disposal (a kind of s-risk), and concerned about wasting astronomical potential (if they do something not terrible but just highly suboptimal). See Morality is Scary and Two Neglected Problems in Human-AI Safety for some background on my thinking about this.
> At worst it just requires that the initial few batches of uploads are carefully selected for philosophical competence (pre-upload) so that some potential misconception is not locked in.
This would relieve the concern I described, but bring up other issues, like being opposed by many because the candidates' values/views are not representative of humanity or of the objectors themselves. (For example, philosophical competence is highly correlated with or causes atheism, which would make atheists highly overrepresented among the initial candidates.)
I was under the impression that your advocated plan is to upload everyone at the same time (or as close to that as possible); otherwise how could you ensure that you personally would be uploaded, i.e., why would the initial batches of uploads necessarily decide to upload everyone else once they've gained power? Maybe I should have clarified this with you first.
My own "plan" (if you want something to compare with) is to pause AI until metaphilosophy is solved in a clear way, and then build some kind of philosophically super-competent assistant/oracle AI to help fully solve alignment and the associated philosophical problems. Uploading carefully selected candidates also seems somewhat ok albeit a lot scarier (due to "power corrupts", or selfish/indexical values possibly being normative or convergent) if you have a way around the social/political problems.
> better understood through AIT and mostly(?) SLT
Any specific readings or talks you can recommend on this topic?
> I think 4 is basically right
Do you think it's ok to base an AI alignment idea/plan on a metaethical assumption, given that there is a large spread of metaethical positions (among both amateur and professional philosophers) and it looks hard to impossible to resolve or substantially reduce the disagreement in a relevant timeframe? (I noted that the assumption is weight-bearing, since you can arrive at the opposite conclusion of "non-upload necessity" given a different assumption.)
(Everyone seems to do this, and I'm trying to better understand people's thinking/psychology around it, not picking on you personally.)
> I suppose that a pointer to me is probably a lot simpler than a description/model of me, but that pointer is very difficult to construct, whereas I can see how to construct a model using imitation learning (obviously this is a “practical” consideration).
Not sure if you can or want to explain this more, but I'm pretty skeptical, given that distributional shift / OOD generalization has been a notorious problem for ML/DL (hence probably not neglected), and I haven't heard of much theoretical or practical progress on this topic.
> Also, the model of me is then the thing that becomes powerful, which satisfies my values much more than my values can be satisfied by an external alien thing rising to power (unless it just uploads me right away I suppose).
What about people whose values are more indexical (they want themselves to be powerful/smart/whatever, not a model/copy of them), or less personal (they don't care about themselves or a copy being powerful, they're fine with an external Friendly AI taking over the world and ensuring a good outcome for everyone)?
> I’m not sure that even an individual’s values always settle down into a unique equilibrium, I would guess this depends on their environment.
Yeah, this is covered under position 5 in the above linked post.
> unrelatedly, I am still not convinced we live in a mathematical multiverse
Not completely unrelated. If this is false, and an ASI acts as if it's true, then it could waste a lot of resources e.g. doing acausal trading with imaginary counterparties. And I also don't think uncertainty about this philosophical assumption can be reduced much in a relevant timeframe by human philosophers/researchers, so safety/alignment plans shouldn't be built upon it either.
> Definition (Strong upload necessity). It is impossible to construct a perfectly aligned successor that is not an emulation. [...] In fact, I think there is a decent chance that strong upload necessity holds for nearly all humans
What's the main reason(s) that you think this? For example one way to align an AI[1] that's not an emulation was described in Towards a New Decision Theory: "we'd need to program the AI with preferences over all mathematical structures, perhaps represented by an ordering or utility function over conjunctions of well-formed sentences in a formal set theory. The AI will then proceed to "optimize" all of mathematics, or at least the parts of math that (A) are logically dependent on its decisions and (B) it can reason or form intuitions about." Which part is the main "impossible" thing in your mind, "how to map fuzzy human preferences to well-defined preferences" or creating an AI that can optimize the universe according to such well-defined preferences?
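To make the quoted proposal slightly more concrete (this is just my own gloss, not notation from that post): let $T$ be a formal set theory, $\mathcal{A}$ the AI's possible outputs, and $U$ a utility function over sets of sentences. Then the AI chooses

$$a^* \in \operatorname*{arg\,max}_{a \in \mathcal{A}} \; U\Big(\big\{\phi : T \cup \{\text{“AI}() = a\text{”}\} \vdash \phi\big\}\Big),$$

i.e., it optimizes the mathematical facts that are logically dependent on its decision (clause A), to the extent it can reason about them (clause B).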
I currently suspect it's the former, and it's because of your metaethical beliefs/credences. Consider these 2 metaethical positions (from Six Plausible Meta-Ethical Alternatives):
- 3 There aren't facts about what everyone should value, but there are facts about how to translate non-preferences (e.g., emotions, drives, fuzzy moral intuitions, circular preferences, non-consequentialist values, etc.) into preferences. These facts may include, for example, what is the right way to deal with ontological crises. The existence of such facts seems plausible because if there were facts about what is rational (which seems likely) but no facts about how to become rational, that would seem like a strange state of affairs.
- 4 None of the above facts exist, so the only way to become or build a rational agent is to just think about what preferences you want your future self or your agent to hold, until you make up your mind in some way that depends on your psychology. But at least this process of reflection is convergent at the individual level so each person can reasonably call the preferences that they endorse after reaching reflective equilibrium their morality or real values.
If 3 is true, then we can figure out and use the "facts about how to translate non-preferences into preferences" to "map fuzzy human preferences to well-defined preferences" but if 4 is true, then running the human as an emulation becomes the only possible way forward (as far as building an aligned agent/successor). Is this close to what you're thinking?
I also want to note that if 3 (or some of the other metaethical alternatives) is true, then "strong non-upload necessity", i.e., that it is impossible to construct a perfectly aligned successor that is an emulation, becomes very plausible for many humans, because an emulation of a human might find it impossible to make the necessary philosophical progress to figure out the correct normative facts about how to turn their own "non-preferences" into preferences, or might simply lack the inclination/motivation to do so.
which I don't endorse as something we should currently try to do, see Three Approaches to "Friendliness" ↩︎
Yeah I think this outcome is quite plausible, which is in part why I only claimed "some hope". But
Basically my hope is that things become a lot clearer after we have a better understanding of metaphilosophy, as it seems to be a major obstacle to determining what should be done about the kind of problem described in the OP. I'm still curious whether you have any other solutions or approaches in mind.