User Profile

star1583
description119
message5663

Recent Posts

Thoughts on AI Safety via Debate

13d
6 min read
12

Turning 30

14d
7 min read
5

My confusions with Paul's Agenda

1mo
6 min read
1

LW Migration Announcement

2mo
1 min read
14

Leaving beta: Voting on moving to LessWrong.com

2mo
1 min read
65

LW 2.0 Open Beta Live

8mo
1 min read
37

Pair Debug to Understand, not Fix

1y
1 min read
3

Don't Shoot the Messenger

1y
13

[Link] The Quaker and the Parselmouth

1y
1

Announcement: Intelligence in Literature Prize

1y
3

Recent Comments

It might help if you pointed at the groups you think the asymmetry is between, as I suspect you and SilentCal are imagining different lines here.

I think you see the asymmetry as being between "people who want to punch others" and "people who don't want to punch others," as only the first group see...(read more)

> We need to defend the need for people to physically interact with the world, and potentially have some of those interactions be unfun, without invoking patterns of behavior that really do lead to terrible things.

I notice some level of confusion here.

Suppose Alice came to me with an argument li...(read more)

> I actually would not have generated the substance of the parent comment (or been able to articulate the follow-up explanations) without the pattern-matching described in the analogy you criticized.

This is not a fully formed take yet, but something about this rubs me the wrong way. It seems to me...(read more)

> Well, I haven't seen even a blog post's worth of effort put into doing something like what I suggested.

I think blog posts are potentially weird measures of effort, here. I also think that this is something that people are interested in doing--I think it's a component of [MIRI's strategic sketch ...(read more)

> The healers may not appreciate being asked to work _so much_ harder, just so that the DPSers can work _a bit_ less hard, and “but this benefits the raid” may not suffice to persuade them.

I note also that healers are much less replaceable than DPS are--or at least, that was the way of things when...(read more)

Specifically, the salary is for being a teaching assistant or a research assistant, rather than being a student, but everything is structured under the assumption that graduate students will have a relevant part-time job that covers tuition and living expenses.

> One reason I don't like your graph is that I have no idea how to suffer both X and Y at the same time, for the same action.

Imagine an audience with non-overlapping preferences. Suppose you have control over the thermometer, and someone likes the temperature above 20 degrees C, and another likes ...(read more)

LessWrong is not the place for this sort of complaint, hence the downvotes (including mine).

Note that while the Slack channel has a similar name, it is an independent entity run by Elo, and doesn't have the same moderation team.

> The honest debater can give a whole bunch of RGB pixel values, which even if it doesn't conclusively establish a lie will make the truth telling strategy have a higher winning probability, which would be enough to make both debaters converge to telling the truth during training.

One thing that I ...(read more)

> My understanding is each debater can actually reveal many pixels to the judge. See this quote from section 3.2:

That sounds different to me--the point there is that, because you only need a single pixel to catch me in a lie, and any such demonstration of my dishonesty will result in your win, you...(read more)