LESSWRONG
LW

1228
Duncan Sabien (Inactive)
1394747126738
Message
Dialogue
Subscribe

Sequences

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
Civilization & Cooperation
Hubris & Control
Duncan Sabien (Inactive)2d30

Possibly of interest, although it sounds like you maybe already have all of these puzzle pieces.

Reply
The Most Common Bad Argument In These Parts
Duncan Sabien (Inactive)3d4-2

Example: suppose someone says "I can imagine an atomic copy of ourselves which isn't conscious, therefore consciousness is non-physical." and I say "No, I can't imagine that."

Possibly of interest 

Or the followup by Logan Strohl, even more directly on this

Reply1
CFAR update, and New CFAR workshops
Duncan Sabien (Inactive)6d82

Just noting for the audience that the edits which Anna references in her reply to CronoDAS, as if they had substantively changed the meaning of my original comment, were to add:

  • The phrase "directly observed"
  • The parenthetical about having good epistemic hygiene with regards to people's protestations to the contrary
  • The bit about agendas often not being made explicit

It did not originally specify undisclosed conflicts of interest in any way that the new version doesn't.  Both versions contained the same core (true) claim: that multiple of the staff members common to both CFAR!2017 and CFAR!2025 often had various (i.e. not only the AI stuff) agendas which would bump participant best interests to second, third, or even lower on the priority ladder.

I've also added, just now, a clarifying edit to a higher comment: "Some of these staff members are completely blind to some centrally important axes of care."  This seemed important to add, given that Anna is below making claims of having seen, modeled, and addressed the problems (a refrain I have heard from her, directly, in multiple epochs, and taken damage from naively trusting more than once).  More (abstract, philosophical) detail on my views about this sort of dynamic here.

Reply
CFAR update, and New CFAR workshops
Duncan Sabien (Inactive)6d*40

I claim to be as-aware and as-sensitive-to of all of these considerations as you are.  I think I am being as specific as possible, given constraints (many of which I wish were not there; I have a preference for speaking more clearly than I can here).

Reply
A Conservative Vision For AI Alignment
Duncan Sabien (Inactive)8d110

I know of one parent that puts three dollars aside each time they violate the bodily sovereignty of their infant - taking something out of their mouth, or restricting where they can go

It's me, by the way.  Happy to identify myself.

(I have more agreement than disagreement with the authors on many points, here.)

Reply
CFAR update, and New CFAR workshops
Duncan Sabien (Inactive)8d*2011

I'll note that both this and my top-level comment have a lot of agree-disagree votes, and that it would be wise for people looking in from the outside to ponder what it means for e.g. the top-level comment to have 18 people voting and to end up at -2.

(It might be tempting to sum it up as "ah, Duncan claimed that there's something to be wary of here, and the hive-mind ultimately ended up in disagreement" but I think it's more like "Duncan claimed there's something to be wary of here, and close to half of the people agreed (but were drowned out by the somewhat more than half who disagreed)."  Which is precisely what you would expect if there were some system or process that was consistently harmful to certain people, but not all or even most—a lot of people who passed through unscathed would be like "what do you mean? I was well-cared-for!" and might not pause to wonder about whether they were a black raven and what evidence their experience provides about claims of the existence of white ravens.)

Reply
CFAR update, and New CFAR workshops
Duncan Sabien (Inactive)10d*20-12

I came to discover, over time, that the orientation I had toward participants (and subordinates, for that matter), and the care that I felt I owed the people under my supervision—

(and which I believed were standard and universal à la something like the hippocratic oath or the confidentiality standards of lawyers and therapists)

—were not, in fact, universal.  I directly observed that certain staff members did not reliably have the best interests of participants at heart (whatever their protestations to the contrary), but instead had various agendas (which were often not made explicit) which meant that the best interests of the participants might sometimes be second, or third, or even lower than third on the priority list.

i.e. I believe that the past ones, in hindsight, were not only not adequately responsible and careful but were in a crucial way not even trying to be, and I do not have reason to believe that this problem will be any less in an era where people like myself and Kenzi and Julia Galef and Dan Keys are not present.

(There is a rebuttal that might be made that goes something like "ah, well, those staff members have owned up to that very problem and are explicitly striving to do the other thing, now," but a) see the point above about not trusting people who have managed to fuck up X in multiple novel ways, and b) in the world where such a hypothetical rebuttal were in fact to be made, I wouldn't personally put much weight on the self-report of people who are saying that they used to be something like deceptive/manipulative (to the detriment of others) but don't worry, they aren't doing that anymore.)

Why is it necessary? Do you think that you are the last potential wizard of Light in the world? Why must you be the one to try for greatness, when I have advised you that you are riskier than average? Let some other, safer candidate try!

These issues didn't seem to be a problem for 95+% of participants.  But I think many of my own friends and family members would feel differently about choosing to be vulnerable in the following two scenarios:

  • You are under the care of professionals who have your best interests at heart, and who have never abused their position of power to manipulate, deceive, or exploit people under their care
  • You are under the care of professionals who probably (95+% by raw numbers) have your best interests at heart, except in the rare (5-%) subset of cases where they think they can make some interesting use of you, which they may or may not be up front with you about, versus trying to twitch your strings to maneuver you into some position for the fulfillment of their own values, agnostic to yours
Reply2
CFAR update, and New CFAR workshops
Duncan Sabien (Inactive)15d*175

Thinking about thinking, tinkering with your mental and emotional algorithms, shaking up your worldview, adopting new perspectives and new strategies, spending a lot of time zeroing in and ruminating on your problems and goals and values and considering them in contact with other people and with suggestions about how to see them and think of them and change them.  Setting aside your normal ways of doing things.

Becoming more mud, in other words.

This is already inherently vulnerable, but it gets moreso when you're doing it in an isolated retreat context surrounded by other people for multiple days in which there is a clear status differential between the instructors and the participants.

There are ways to do this that are more responsible and careful, and there are ways to do this that are less responsible and careful.  Separately, a person or group can have the intent to do such a thing responsibly and carefully, and this is not the same as being able to do this responsibly and carefully.  Some of these staff members are completely blind to some centrally important axes of care.

(If you've seen a person or group try for X and fail repeatedly in multiple novel ways despite multiple rounds of figuring out what went wrong and fixing it in each specific case, it's wise to be wary of their latest attempt at X. Sometimes people exhibit a curiously robust capacity to keep generating brand-new ways to get X wrong, and my desire to register a warning here is partially downstream of my belief that something like that is true, here.)

Reply
CFAR update, and New CFAR workshops
Duncan Sabien (Inactive)15d*100

I think for reasons of not needlessly rending the social fabric, I don't want to be more specific.  I feel the need to register the warning, and I'm happy with people weighting or discounting it based on how much they trust my assessments generally, including the context that I worked at CFAR for three years and overlapped with much of the present staff.

Reply
CFAR update, and New CFAR workshops
Duncan Sabien (Inactive)15d*20-16

Just noting for the audience that I would not recommend this, and would strongly discourage my friends and family members from going to it.  There's sort of nothing I can do about the fact that this is inherently rude, and clashes with the largely-positive tone of all the other discussion, but it feels rather important to represent this fact, especially as someone with slightly more context and grounded understanding than most.

(I have had zero interaction with CFAR since the end of my three years working there in 2019 and can make no confident predictions about the object-level experience, but I do not trust and indeed am substantially wary of the motives, methods, and competence of (some members of) the team creating the experience.  I would not want anyone I love to put themselves in a vulnerable state under the care of (some of) these particular people, as I have justified reason to believe that duty-of-care will not be reliably discharged.  I'll note that when plans for something like this first reached my ears circa 2022*, I directly told at least one of the people I'm concerned about that I thought they should absolutely not participate in anything like this, and that I expected they would cause harm by doing so (because they had repeatedly caused harm in the past) and should leave the project to others.)

Note also that the CFAR handbook exists and is free to all.

*edited; previously said 2021; I went back and checked the email that I sent; it was in June of 2022 and was not responded to.

Reply42
Load More
30Hubris & Control
5d
7
144Obligated to Respond
1mo
69
305Make More Grayspaces
3mo
65
273Truth or Dare
4mo
59
86Review: Conor Moreton's "Civilization & Cooperation"
1y
8
371Social Dark Matter
2y
129
211Killing Socrates
3y
146
92Exposure to Lizardman is Lethal
3y
97
40Repairing the Effort Asymmetry
3y
11
110A Way To Be Okay
3y
38
Load More
NSFW
3 years ago
A beginner's guide to explaining things
9 years ago
(+218/-83)
A beginner's guide to explaining things
9 years ago
(+87)
A beginner's guide to explaining things
9 years ago
(+4/-5)
A beginner's guide to explaining things
9 years ago
(+8/-4)
Audience-centric Explanations
9 years ago
(+42/-80)
Audience-centric Explanations
9 years ago
(+53/-15)
Audience-centric Explanations
9 years ago
(+58)
Audience-centric Explanations
9 years ago
(+18/-29)
Audience-centric Explanations
9 years ago
Load More