AnnaSalamon — LessWrong

Help keep AI under human control: Palisade Research 2026 fundraiser

I do think Palisade is operating in the realm of "trying to persuade people of stuff, and that is pretty fraught"

I haven't had that much contact with Palisade, but I interpreted them as more like "trying to interview people, see how they think, and provide them info they'll find useful, and let their curiosities/updates/etc be the judge of what they'll find useful", which is ... not fraught.

Or rather, as somewhere in between this and "trying to persuade people of stuff", but close enough to the former that I'm in favor, which I'm usually not for "persuasion" orgs.

Am I wrong?

Announcing RoastMyPost: LLMs Eval Blog Posts and More

AnnaSalamon4d60

Thanks for building; I'm looking forward to trying it. A main thing I keep wanting from LLM writing assistance (I'm not sure how hard this is; I've tried prompting LLMs myself, and failed to get the quality I wanted, but I didn't try with much patience or skill) is help applying Strunk and White's "The Elements of Style" to my writing. That is, I want help flagging phrases/words/sentence constructions that fail to be short and to the point.

The impossible problem of due process

AnnaSalamon4d40

I mean, I might be being dumb on all these points. But I personally disagree about:

There being a viable "good system for community resolution of conflicts" in larger-than-Dunbar groups (to be fair, the post author does too... except then not at the end?)
Phrasing the cause-of-enforcement as "you decide a person should 'face consequences for their actions'" (IMO, kicking people out of a community should usually be more about "they impose risks/costs we can't live with" and less about "making them face consequences")
A sort of missing mood in the third bullet point ("Holistic judgment: a person should be kicked out if they seem, on the whole, to be bad for the community. They don't need to be found guilty beyond reasonable doubt of a specific egregious crime.") I agree with the denotation of what's written. But there are two memorable-to-me cases where I was part of kicking someone out of the Bay Area rationalist community (not Brent, nothing most overseas readers would've heard about; quieter affairs); and where their lives and sanity rapidly got a lot worse, to the point where I'd put like 30% that the decision to exile them "ruined their lives". I wouldn't advise past-me against either decision, because they were people we really didn't know how to live with, with major repeated situations. But ... if someone seemed, on the whole, to be a mild force for boredom and awkwardness in the community, say, I certainly wouldn't kick them out? (And again, I assume neither would the post's author, mingyuan; but I wish the bullet point e.g. said "if they seem, on the whole, to be someone we can't live with in a healthy fashion", or else differentiated between kicking someone out of a random meetup, vs doing things that'll trigger exile from a place that includes almost all of a person's social ties, built up over years).

I think my problem with the last section is only that it is not up to the very high standard that the rest of the post seems to me to hit, in which things are made unusually clear to even a young/inexperienced reader who is happy to believe relayed events but who wants to see the why of things for themself. (And I'm not providing these 'disagreements' because I think the article would be better with my opinions instead of the author's; I don't think I'm especially correct about these matters; I'm providing them as evidence that this part of the article is less visibly-true-to-all-readers, e.g. to me.)

The impossible problem of due process

AnnaSalamon4d40Review for 2024 Review

I appreciate this post for spelling out an unsolved problem that IMO is a major reason it's hard to build good community gatherings among large groups of people, and for including enough detail/evidence that I expect many, after reading it, can see how the trouble works in their own inside views. I slightly wish the author had omitted the final section ("What would be the elements of a good system?"), as it seems less evidence-backed than the rest (and I personally agree with its claims less), and its inclusion makes it a bit harder for me to recommend the article to those needing a problem-description.

Partial value takeover without world takeover

AnnaSalamon4d20Review for 2024 Review

I love this post and suspect its content is true and underappreciated. (Though I admittedly haven't found any new ways to test it / etc since it came out.)

On Not Pulling The Ladder Up Behind You

AnnaSalamon4d40

I like it, but I wish its main point would stick better in my mind somehow. (This was true when I read it last year, and again when I re-skimmed it now.) I, too, like the ladder metaphor; I agree that it helps get people thinking about on-ramps, and that this is valuable; I like the examples and techniques about remembering how you got there, imagining a new early-you who showed up today, etc. But: I still feel there's a "whole" you're gesturing at that's not quite sticking in my head, and I wonder if a slight rewrite could get it to?

Neutrality

AnnaSalamon4d40Review for 2024 Review

I read this once when Sarah wrote it, just over a year ago, and I still think about it roughly every two weeks. It convinced me that it's possible and desirable to be neutral along some purpose-relevant axes, and that I should keep my eye on where and how this is accomplished, and what it does. (I stayed convinced.) Hoping it makes it in.

Parental Writing Selection Bias

AnnaSalamon6d82Review for 2024 Review

I appreciate the explicit, fairly clear discussion of a likely gap in what I'm reading about parenting and kids. I was aware of a gap near here, but the post added a bit of detail to my model, and I like having it in common knowledge; I also hope it may encourage other such posts. (Plus, it's short and easy to read.)

Deep and obvious points in the gap between your thoughts and your pictures of thought

AnnaSalamon6d20Review for 2024 Review

Nominating this for 2024 review. It seems like an accurate (in many cases, at least) model of a phenomenon I care about (and encounter fairly frequently, in myself and in people I end up trying to help with things) that I didn't previously have an accurate model of.

Eliezer's Unteachable Methods of Sanity

AnnaSalamon11d122

A further wrinkle / another example is that a question like "what should I think about (in particular, what to gather information about / update about)", during the design process, wants these predictions.

Yes; this (or something similar) is why I suspect that "'believing in' atoms" may involve the same cognitive structure as "'believing in' this bakery I am helping to create" or "'believing in' honesty" (and a different cognitive structure, at least for ideal minds, from predictions about outside events). The question of whether to "believe in" atoms can be a question of whether to invest in building out and maintaining/tuning an ontology that includes atoms.
