NicholasKross

Theoretical AI alignment (and relevant upskilling) in my free time. My current view of the field is here (part 1) and here (part 2).

/nickai/


Comments

I think deeply understanding top tier capabilities researchers' views on how to achieve AGI is actually extremely valuable for thinking about alignment. Even if you disagree on object level views, understanding how very smart people come to their conclusions is very valuable.

I think the first sentence is true (especially for alignment strategy), but the second sentence seems sort of... broad-life-advice-ish, instead of a specific tip? It's pretty indirect help for most kinds of alignment work.

Otherwise, this comment's points really do seem like empirical things that people could put odds or ratios on. I wonder whether a more-specific version of those "AI Views Snapshots" would be warranted for these sorts of "research meta-knowledge" cruxes. Heck, it might be good to have lots of AI Views Snapshot DLC Mini-Charts, ranging from specific-research-agenda ones(?) to internal-to-organization ones(?!?!?!?).

I can't make this one, but I'd love to be at future LessOnline events when I'm less time/budget-constrained! :)

First link is broken.

[This comment is no longer endorsed by its author]

"But my ideas are likely to fail! Can I share failed ideas?": If you share a failed idea, that saves the other person time/effort they would've spent chasing that idea. This, of course, speeds up that person's progress, so don't even share failed ideas/experiments about AI, in the status quo.

"So where do I privately share such research?" — good question! There is currently no infrastructure for this. I suggest keeping your ideas/insights/research to yourself. If you think that's difficult for you to do, then I suggest not thinking about AI, and doing something else with your time, like getting into factorio 2 or something.

"But I'm impatient about the infrastructure coming to exist!": Apply for a possibly-relevant grant and build it! Or build it in your spare time. Or be ready to help out if/when someone develops this infrastructure.

"But I have AI insights and I want to convert them into money/career-capital/personal-gain/status!": With that kind of brainpower/creativity, you can get any/all of those things pretty efficiently without publishing AI research, working at a lab, advancing a given SOTA, or doing basically (or literally) anything that differentially speeds up AI capabilities. This, of course, means "work on the object-level problem, without routing that work through AI capabilities", which is often as straightforward "do it yourself".

"But I'm wasting my time if I don't get involved in something related to AGI!": "I want to try LSD, but it's only available in another country. I could spend my time traveling to that country, or looking for mushrooms, or even just staying sober. Therefore, I'm wasting my time unless I immediately inject 999999 fentanyl."

How scarce are tickets/"seats"?

I will carefully hedge my investment in this company by giving it $325823e7589245728439572380945237894273489, in exchange for a board seat so I can keep an eye on it.

I have over 5 Twitter followers, I'll take my board seat when ur ready

Giving up on transhumanism as a useful idea of what-to-aim-for or identify as, separate from how much you personally can contribute to it.

More directly: avoiding "pinning your hopes on AI" (which, depending on how I'm supposed to interpret it, could mean "avoiding solutions that ever lead to aligned AI occurring", or "avoiding near-term AI, period", or "believing that something other than AI is likely to be the most important near-future thing"; these are pretty different from each other, even if the end prescription for you personally is, or seems on first pass to be, the same), separate from how much you personally can do to positively affect AI development.

Then again, I might've misread/misinterpreted what you wrote. (I'm unlikely to reply to further object-level explanation of this, sorry. I mainly wanted to point out the pattern. It'd be nice if your reasoning did turn out to be correct, but my point is that its starting place seems/seemed to be rationalization, as per the pattern.)

Yes, I think this post / your story behind it, is likely an example of this pattern.
