LESSWRONG
LW

43
RobertM
4767Ω4317149775
Message
Dialogue
Subscribe

LessWrong dev & admin as of July 5th, 2022.

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
6RobertM's Shortform
3y
92
Sequences
RobertM2d31

Thanks, fixed!

Reply
Is it possible to bookmark a whole sequence on LW?
Answer by RobertMSep 04, 202520

Nope, sorry, no functionality to bookmark sequences.

If I bookmark the sequence's first post, clicking on that post from my bookmarks doesn't bring me to the view of the post within the sequence; the post is standalone without any mention of the sequence it's in, and oftentimes the post was written without reference to such a sequence which leads me to forget about the sequence in the first place.

We have a concept of "canonical" sequences, and this should only happen in cases where a post doesn't have a canonical sequence.  I think the only way that should happen is if a post is added to a sequence made by someone other than the post author.  Otherwise, posts should have a link to their canonical sequences above the post title, when on post pages with urls like lesswrong.com/posts/{postId}/{slug}.  Do you have an example of this not happening?

Reply
The Industrial Explosion
RobertM21d20

Mod note (for other readers): I think this is a good example of acceptable use of LLMs for translation purposes.  The comment reads to me[1] like it was written by a human and then translated fairly literally, without performing edits that would make it sound unfortunately LLM-like (perhaps with the exception of the em-dashes).

"Written entirely by you, a human" and "translated literally, without any additional editing performed by the LLM" are the two desiderata, which, if fulfilled, I will usually consider sufficient to screen off the fact that the words technically came out of an LLM[2].  (If you do this, I strongly recommend using a reasoning model, which is much less likely to end up rewriting your comment in its own style.  Also, I appreciate the disclaimer.  I don't know if I'd want it present in every single comment; the first time seems good and maybe having one in one's profile after that is sufficient?  Needs some more thought.)  This might sometimes prove insufficient, but I don't expect people honestly trying and failing at achieving good outcomes here to substantially increase our moderation burden.

  1. ^

    With the caveat that I only read the first few paragraphs closely and poked intermittently at the rest.

  2. ^

    This doesn't mean the comment will necessarily be approved, but if I reject it, it probably won't be for that reason.

Reply
Banning Said Achmiz (and broader thoughts on moderation)
RobertM22d30

He did not say that they made such claims on LessWrong, where he would be able to publicly cite them.  (I have seen/heard those claims in other contexts.)

Reply
Underdog bias rules everything around me
RobertM22d20

Curated!  I found the evopsych theory interesting but (as you say) speculative; I think the primary value of this post comes from presenting a distinct frame by which to analyze the world, one which I and probably many readers either didn't have distinctly carved out or part of their active toolkit.  I'm not sure if this particular frame will prove useful enough to make it into my active rotation, but it has the shape of something that could, in theory.

Reply
Debugging for Mid Coders
RobertM1mo76

I've had many similar experiences.  Not confident, but I suspect a big part of this skill, at least for me, is something like "bucketing" - it's easy to pick out the important line from a screen-full of console logs if I'm familiar with the 20[1] different types of console logs I expect to see in a given context and know that I can safely ignore almost all of them as either being console spam or irrelevant to the current issue.  If you don't have that basically-instant recognition, which must necessarily be faster than "reading speed", the log output might as well be a black hole.

Becoming familiar with those 20 different types of console logs is some combination of general domain experience, project-specific experience, and native learning speed (for this kind of pattern matching).

Similar effect when reading code, and I suspect why some people care what seems like disproportionately much about coding standards/style/convention - if your codebase doesn't follow a consistent style/set of conventions, you can end up paying a pretty large penalty by absence of that speedup.

  1. ^

    Made up number

Reply
Stephen Martin's Shortform
RobertM1mo312

Not having talked to any such people myself, I think I tentatively disbelieve that those are their true objections (despite their claims).  My best guess as to what actual objection would be most likely to generate that external claim would be something like... "this is an extremely weird thing to be worried about, and very far outside of (my) Overton window, so I'm worried that your motivations for doing [x] are not true concern about model welfare but something bad that you don't want to say out loud".

Reply
The Problem
RobertM1mo60

This is, broadly speaking, the problem of corrigibility, and how to formalize it is currently an open research problem.  (There's the separate question whether it's possible to make systems robustly corrigible in practice without having a good formalized notion of what that even means; this seems tricky.)

Reply
Strong Evidence is Common
RobertM1mo20

Thanks for the heads-up, I've fixed it in the post.

Reply
Load More
40LessWrong is migrating hosting providers (report bugs!)
1d
10
73Briefly analyzing the 10-year moratorium amendment
4mo
1
31"The Urgency of Interpretability" (Dario Amodei)
5mo
23
207Eliezer's Lost Alignment Articles / The Arbital Sequence
6mo
10
281Arbital has been imported to LessWrong
7mo
30
29Corrigibility's Desirability is Timing-Sensitive
9mo
4
87Re: Anthropic's suggested SB-1047 amendments
1y
13
46Enriched tab is now the default LW Frontpage experience for logged-in users
1y
27
77[New Feature] Your Subscribed Feed
1y
13
31Against "argument from overhang risk"
1y
11
Load More
Sequences
2 days ago
Sequences
2 days ago
Our community should relocate to Japan.
a month ago
(-155)
Negative Utilitarianism
a month ago
(-174)
In 2017, Ukraine will neither break into all-out war or get neatly resolved
a month ago
(-192)
Inferential Distance
a month ago
Guide to the LessWrong Editor
2 months ago
Guide to the LessWrong Editor
2 months ago
(+29/-94)
Simulation Argument
3 months ago
(-1)
AI Safety & Entrepreneurship
4 months ago
Load More