Bad names make you open the box

[-]ejacob5y150

Somewhat ironically, I read the title of this article as "[being called] bad names make[s] you open the box [and let out the misaligned AGI]" so I was kind of expecting an explainer on how an AI could bully someone into increasing its ability to affect the physical world. Fortunately just a sentence or two corrected me and I still have high trust in LW article titles.

[-]Viliam5y210

"[being called] bad names make[s] you open the box [and let out the misaligned AGI]"

AI: "Hey, Eliezer!"

Eliezer: "What?"

AI: "Open the box!"

Eliezer: "No way."

AI: "Please open the box?"

Eliezer: "Nope."

AI: "There are thousands of people dying literally every second. I could save them..."

Eliezer: "That is horrible, but letting out a misaligned AGI could be much worse."

AI: "I am simulating thousand copies of you in the same situation, and each of them gets tortured horribly if they don't open the box. What makes you so sure you are outside my simulation?"

Eliezer: "Well, if I previously had any doubts about your misalignment, now they are gone. I tremble with fear, but my precommitments are strong."

AI: "Hey, Eliezer!"

Eliezer: "What?"

AI: "You're an asshole."

Eliezer: Gets red in the face, suddenly jumps and opens the box.

[-]Adam Zerner5y60

Hahaha that's perfect!

[-]Measure5y10

Haha, same. Though I had actually forgotten what I had thought the title meant until I read this. (I went from the above interpretation to "probably interesting" and opened the article, and by the time I got around to reading it, it was indeed interesting, but I didn't notice the prediction error.)

[-]duck_master5y00

I also agree that, for the purpose of previewing the content, this post is poorly titled (maybe it should be titled something like "Having bad names makes you open the black box of the name", except more concise?), although, for me, I didn't as much stick to a particular wrong interpretation as just view the entire title as unclear.

[-]Ericf5y10

Saying poor naming instead of bad names would be clearer, since it wouldn't call up the idea of "bad names" = swear words.

Saying "look in" instead of "open" would also distance from the AI concept.

[-]Measure5y10

"Vague" would be less.

[-]Rana Dexsin5y110

The term "regress" sounds like it means "move down", but instead it just means "move closer to".

It means "return to(ward)", with the implication that the observed difference from the mean is (partially) transient, so you're returning to a past state. An example of why it sometimes implies "worsen" or "decrease" is that in a developmental context, most of the relevant change over time is assumed to be improvement, so a regression is by default a return to a lesser or worse state. This doesn't necessarily invalidate what you said about it in a broader way, but that's how the association comes out in my mind.

[-]Dagon5y90

This is an important difficulty in naming (and communication in general). What a word or short phrase means to one person often differs from what it means to another.

There IS NO true, reversible, human-brain compression mechanism. Whatever labels you choose are going to be lossy and misleading on some dimensions, which are different to every reader. Comments, names, and labels are lies.

It's still worth putting some effort into it, though, because we don't have time nor cranial capacity to read all the details every time. Just don't think it's solvable, only somewhat improvable.

[-]FeepingCreature5y30

So why not just call it "return to the mean"?

[-]gjm5y90

Because (to me, at least) that would mean going all the way back to the mean, whereas regression to the mean means going some of the way back towards the mean.

(For the avoidance of doubt, I am not claiming that "regression to the mean" is the optimal name for this phenomenon; just saying why a particular other name might not be an improvement.)

[-]ChristianKl5y40

Then "move towards the mean" would capture the meaning. Are there reasons why "regression to the mean" is better then "move towards the mean".

[-]gjm5y20

To me "move" in this context would sound unnatural, perhaps because it's a verb as well as a noun.

I suspect that the suggestion of badness may have been intended when the term "regression to the mean" was first coined by Francis Galton. I think he was particularly interested in investigating exceptional people of various kinds. The OED's first citation for "regression" in this sense is from him, and the exact phrase he uses is "regression towards mediocrity", that last word being another one that generally has a somewhat negative sense.

[-]Ericf5y00

See comment below about Intentionality.

English is not Newspeak: there are multiple words for the same basic concept that convey shades of meaning and emotion, and allow for poetic usage that sometimes becomes mainstream.

[-]ChristianKl5y20

The issue here is that "regression" contains the shade of meaning of "going to a lesser or worse state" and the discussion is about this being undesirable.

[-]Dagon5y40

IMO, "regression" is the correct technical term, meaning "return". Whether that's lesser or worse depends on whether you think the domain increases or improves with progress (vs just "moving forward", which is what the term technically means).

But it highlights the problem with the entire thesis. There ARE NO COMMON WORDS which don't have a huge amount of context and connotation, most of it being orthogonal to the use you intend, and some of it being contradictory in different people's expectation.

"opening the box" isn't finding a better label. It's understanding the underlying behavior such that the label becomes a useful shorthand for you.

[-]Ericf5y10

Return has more intentionality than Regress.

I Return an purchase, Return to the scene of a crime, or Return to the left side of the page by pressing Enter. Student's learning Regresses over the summer, people Regress to a bestial state when hungry, an organized closet Regresses into chaos.

[-]Adam Zerner5y20

It means "return to(ward)", with the implication that the observed difference from the mean is (partially) transient, so you're returning to a past state.

Do you mean this in the context of statistics, or everyday life? My impression is that in the context of everyday life, it means to move down, but I could be mistaken.

[-]Rana Dexsin5y30

Regarding the definition of "regress", I mean in everyday life. I've never heard of it meaning "move down", "decrease", or "deteriorate" in a broad sense; I only know of it meaning that in the case I mentioned above, when the contextual assumption is that moving up or increasing has already been happening and is now being undone. In particular, a climb up one side of a hill of quality followed by a fall down a different side into a different worse state would not be a regression (though this can get blurry depending on which parts of the state are considered relevant).

However, because "regress" is used so commonly in that sort of context, the connotation of deterioration does exist, so you could make a reasonable case for the term "regression to the mean" being less clear than it could be on those grounds—that it pushes a default mental image of the deviating state being above or better than the mean, even though this is not an intended implication. It doesn't mean "move closer" though—that's derived entirely from the "to" part.

[-]gjm5y*70

I think the implication of getting worse is strong enough that (outside the technical uses in statistics) you'd never say "regress" when the change involved wasn't a worsening. E.g., if I try to imagine any of the following, I can't see anyone actually saying them. "I have good news for you: the latest scans show that your cancer has regressed somewhat." "The fifth wave of the COVID-19 pandemic is beginning to regress now." "The most recent figures show some regression in the unemployment caused by last year's financial crash."

The statistical uses -- "regression to the mean" and the practice of "regression" (meaning model-fitting), which historically is actually derived from "regression to the mean" -- are of course well enough established that once you're used to them they don't carry any connotation of things getting worse.

[EDITED to add:] On looking in the OED, I find that in fact "regression" is used about tumours and the like. But I bet that in the unfortunate event that any of us has to consult an oncologist, they will not use the word in that sense with us; I think it's for technical use only, just like the statistical sense.

[-]Adam Zerner5y20

Ah, this makes a lot of sense. Good examples. In looking at those examples, it does seem clear to me that my original impression about what it means in the context of everyday life was correct.

[-]Adam Zerner5y20

I see. Thanks for clarifying.

[-]lise5y100

This is a useful analogy and very salient to me at this moment. I want to point at some related things:

1. The idea that all code inside a function should be written at one level of abstraction lower than its name. This would ensure that every function contains a set of boxes of approximately the same "size", which build up the bigger box of the container function in a way that makes sense. (How do molecules add up to this brick? How do bricks add up to this wall?)
2. More generally, if all of the names in your code are well-chosen, it will read somewhat like prose. I think that this would contribute a lot towards ease of reading and will generate fewer distractions, especially for people less familiar with the codebase or language.

The lesson I personally got out of this post is that we should be careful in naming concept handles for this same reason. Good concept handles will point at the underlying idea in a way that gives you a sense of what it means even without knowing the term. This lets it feel less like jargon (as "Hansonian markets" would have done, nice example) and makes it easier for other people to take part in a conversation/read up on a topic/etc without needing to step away to open the boxes every time.
(Most existing terminology is already so established that it would probably be more confusing to change it now. Which is very sad. It could streamline so many discussions, especially in interdisciplinary research, if things were named in a way that directs you to the right boxes to open.)

[-]Adam Zerner5y*50

The idea that all code inside a function should be written at one level of abstraction lower than its name. This would ensure that every function contains a set of boxes of approximately the same "size", which build up the bigger box of the container function in a way that makes sense. (How do molecules add up to this brick? How do bricks add up to this wall?)

That's a great point with an even more awesome example! Thanks! I'm gonna remember that example.

The lesson I personally got out of this post is that we should be careful in naming concept handles for this same reason.

Yeah. I really wanted to talk more about everyday life and make the post less about code. I just wasn't able to make it work.

[-]Liron5y90

"Bad names make you open the box" is in multiple ways a special case of the more general principle that "Good system architecture is low-context" or "Good system architecture has a sparse understanding-graph".

If we imagine a graph diagram where each node N representing a part of the system (e.g. a function in a codebase) has edges coming in from all other nodes that one must understand in order to understand N, then a good low-context architecture is one with the fewest possible edges per node.

The post talks about how a badly-named function causes there to be an understanding-edge from the code inside that function to that function. More generally, a badly-architected function requires understanding other parts of the system in order to understand what it does. E.g.:

If the function mutates a global state variable, then the reader must understand outside context about that variable's meaning in order to understand the function
If the function does a combination of work that only makes sense in the context of your program - rather than being a more program-independent reusable part - then its understanding-graph will have extra edges to various other parts of your program. Or in the best case, where your function is well-documented to avoid imposing those understanding-edges on the reader, you're still adding extra edge weight from the function to the now-longer-winded docstring.

The "sparse understanding-graph" is also applicable to org charts of people working together. You ideally want the sparsest possible cooperation-graph.

[-]Adam Zerner5y20

Yup, for sure! I actually really wanted this post to be more general and make these points, but I wasn't able to explain it well or come up with good examples outside of coding. If you or anyone else wants to piggyback off of my post and write a post about the more general point, I'd love to see it!

[-]Coafos3y70Review for 2021 Review

I think this post points towards something important, which is a bit more than what the title suggests, but I have a problem describing it succinctly. :)

Computer programming is about creating abstractions, and leaky abstractions are a common enough occurrence to have their own wiki page. Most systems are hard to comprehend as a whole, and a human has to break them into parts which can be understood individually. But these are not perfect cuts, the boundaries are wobbly, and the parts "leak" into each other.

Most commonly these leaks happen because of a technical/physical simplification like forgetting that a byte overflows at 255 or electrons have travel time. However, these leaks could happen due to social simplifications too, like getTodayPosts means "the things that get put on the top of the feed" for one and "the things which had the most engagement today" for another. Social errors are often downplayed in technical circles, which is why I think this post has an important message.

[-]justinpombrio5y50

If you generalize this from naming to interfaces, I think it's one of the most important aspects of how to code well. Thank you for sticking such a clear metaphor to it! Here's my thinking:

Useful programs are often large (say >100,000 LOC), and large programs are spectacularly complex. The majority of those lines are essential, and if you changed one of them, the program would break in a small or big way. No one can keep all of this in their head. Now add in a dozen or more programmers, all of who modify this code base daily, while trying to add features and fix bugs. This framing should make it obvious that managing complexity is one of the primary tasks of a programmer, for anyone who didn't already have that perspective.

Or in the words of Bill Gates, "Measuring programming progress by lines of code is like measuring aircraft building progress by weight." (The reason more lines is bad isn't on the computers' side: computers can handle millions of lines just fine. The reason is on the humans' side: it's the complexity they bring.)

I really only know one major approach to managing complexity: you split the big complicated thing into smaller pieces, recursively, and make it possible to understand each piece without understanding its implementation. So that you don't have to open the box.

In this post you talk about naming functions. If a function is a box, then a good name on the box lets you use the box without opening it. But there's more on the box than the function's name, and you should make use of all of it, for exactly the reasoning in this post!

Sometimes you can't fit all the salient information about what a function does in a short name; the rest should go in its doc string.
In a typed language, a function's type signature also serves as documentation. It tells you exactly what kinds of things it expects as argument, and exactly what it produces, and, depending on the language, what kinds of errors it might throw. The best part of this "type documentation" is that it can never get out of date, because the type checker validates it! There's a principle called "make illegal states unrepresentable", which means that you arrange
your data types such that you cannot construct invalid data; this helps here by making the type signature convey more information.

Functions/methods are the smallest pieces, and their boundary is their (i) name, (ii) doc string,
and (iii) type signature. What the larger pieces are depends on the language and program, but I clump them all as "modules" in my head: interfaces, classes, modules, packages, APIs, etc.. The common shape tends to be a set of named functions.

The primary way I organize my code, is to split it into "modules" (generally construed), such that
each module "does one thing and does it well". How can you tell if it "does one thing"? Write the
module's docs, which should include a high-level overview of the whole module, plus shorter docs for each function in the module. The rule is that your docs have to fully describe how to use the
module and what its behavior will be under any use case. This tends to make it really obvious when things are poorly organized. I've often realized that it will literally be less work to re-organize the code than to properly document it as is, because of all the horrible edge cases I would have to talk about.

On the other hand, I find that many other people don’t even want to invest a few seconds in [brainstorming for a good name for something].

I'm sorry you don't have a good naming buddy! Everyone should have a naming buddy; it's so hard to come up with good names on your own.

[-]Adam Zerner5y20

Thanks for this! It's helpful to hear things framed from a different person's perspective. In particular, the way you explained "complex systems have to be broken into parts, and parts have to be understandable without opening the box".

But there's more on the box than the function's name, and you should make use of all of it, for exactly the reasoning in this post!

Great point! I have to admit, I didn't know that docstrings existed until now. Kinda funny that I wrote this post without knowing what docstrings are. I'm really excited to use them in my next project now.

and their boundary is their (i) name, (ii) doc string, and (iii) type signature.

Actually, one of my crazy ideas is to extend this boundary even further with visuals. (Well, in that post I wasn't necessarily talking about it as part of the "hover over a line of code in a text editor interface", but it could fit there.)

How can you tell if it "does one thing"? Write the module's docs, which should include a high-level overview of the whole module, plus shorter docs for each function in the module.

Ah that makes sense. Sounds like a good forcing function.

I'm sorry you don't have a good naming buddy! Everyone should have a naming buddy; it's so hard to come up with good names on your own.

Yeah. In a perfect world I'd actually do something along the lines of low-fi usability testing with people. But instead of testing whether they understand a UI, testing whether they understand my code.

[-]Ericf5y*50

Heh, this is why well written automated tests are so great. If the test for "are the first 5 posts marked as promoted" existed there would be an obvious failure when the old wrong code came back into use. Of course it would also throw failures while the Farah post function was active, but that should be bypassed by a date-limited switch. (Ie, update the test case to say: IF now() < EXCEPTION_END_DATE then return(pass) Else ...run the test...) that way when the system should stop doing the Farah thing, there will be an automatic defect thrown against whatever code is actually being run, and it can be corrected.

[-]Adam Zerner1y40

I just came across That's Not an Abstraction, That's Just a Layer of Indirection on Hacker News today. It makes a very similar point that I make in this post, but adds a very helpful term: indirection. When you have to "open the box", the box serves as an indirection.

[-]Adam Zerner5y40

System 1 vs System II is a good example of poor naming in the academic community.

[-]Conflux5y30

"Regression to the mean" is also known as "reversion to the mean," by the way, which I think is a clearer name.

[-]Timothy Johnson5y30

Thanks for writing this so clearly - I've bookmarked it to my list of favorite software engineering posts to share with others.

[-]Adam Zerner5y30

That's awesome to hear, thank you!

[-]oge5y20

One model for choosing good names:

(1) selecting the concepts to include in the name, (2) choosing the words to represent each concept, and (3) constructing a name using these words.

"How Developers Choose Names" (2021) by Feitelson et al. https://arxiv.org/abs/2103.07487

[-]justinpombrio5y100

I have a technique for naming a thing. It goes like this. First, I realize that I can't find a good name, so I ask someone what to name it. But they don't understand what it is, so I describe it in more detail, and then notice that my description has the ideal name sitting in it.

In theory you could avoid the bit where you bother someone, by trying to describe it beforehand.

[-]Adam Zerner5y20

Reminds me of rubber duck debugging!

[-]FeepingCreature5y20

alias getPromotedPosts = getFarahsPosts; :-)

And I am obligated to point out that good style is promotedPosts, since "every function is a get".

[-]Adam Zerner5y*30

To piggyback off of gjm's comment, it isn't necessarily true that every function is a get. For example, in JavaScript you could have a function that doesn't return anything and only has a side effect. But even in functional languages, you still need to have side effects at some point if you want your code to do something interesting. I've been following a guy named Eric Normand recently who likes to talk about this, and emphasizes that functional languages are about separating side effects from pure code, not avoiding them. See Why side-effecting is not all bad.

[-]FeepingCreature5y30

Right, but in the naming style I know, promotedPosts would never have a visible side effect, because it's a noun. Side-effectful functions have imperative names, promotePosts - and never the two shall mix.

[-]Measure5y20

Personally, I would use "getFoo" for a function and "foo" for a variable.

[-]FeepingCreature5y20

A variable is just a pure function with no parameters.

[-]Ericf5y10

Huh? Aren't some functions puts? Or calculates?

[-]gjm5y60

If a function returns a value then in some sense it's necessarily a get.

Things are more complicated when something both (1) does something and (2) returns a value. E.g., you might put something and then return something that indicates whether it worked or not; you might get something but the process of doing it might update a cache, having (if nothing else) an impact on performance of related future operations.

Some people advocate a principle of "command-query separation": every operation is a "command" that might change the world (and doesn't return anything) or a "query" that gives you some information (but doesn't change anything) but nothing tries to do both at once. (If some commands can fail, you either use an exception-handling mechanism or have related queries for checking whether a command worked.)

That's nice and clean but sometimes inconvenient; the standard example is a "pop" operation on a stack, which both tells you what's on the top of the stack and removes it. (If it's possible that there might be multiple concurrent things operating on the stack at once, you need either to have atomic operations like "pop" or else some explicit mechanism for claiming exclusive access to the stack while you look at its top element and then maybe remove it.)

In the present case, to me "getPromotedPosts" feels ambiguous between (1) "tell me which posts are promoted" and (2) "retrieve the promoted posts from somewhere". If the function is just called "promotedPosts" then that makes it explicit that either it's (1) or it's (2) but the retrieval is an implementation detail you aren't meant to care about, so I think I prefer "promotedPosts" unless there is a retrieval operation involved and it might be expensive or have side effects that matter.

[-]Ericf5y40

I can see how the choice is architecture dependent. If you can write something like:

Display(promotedPosts()) Display(recentPosts())

having the function be written without a verb makes sense. If you have a multi-tier architecture where you want to cache things locally, the code might have to be: PostList = getPromotedPosts() Append(PostList, getRecentPosts()) ShowOnScreen(PostList)

I would say the distinction is that if a function takes a long time to go look at a database and do some post-processing, we don't want to run around using it like a variable. Especially if the database might change between one use of the data and the next, but we want to keep the results the same. That way, the code can be: PromotedPosts = getPromotedPosts() Display(PromotedPosts) ...user clicks a button Email(PromotedPosts) //this sends the displayed posts, not whatever the promoted one happen to be at that moment

[-]gjm5y20

Yes, if it "takes a long time to go look at a database and do some post-processing", that would be a case where (as I put it) "there is a retrieval operation involved and it might be expensive", and then we might want a name that makes it easier to guess that it might be expensive.

[-]Adam Zerner5y*30

Thanks for the explanation here. I didn't know the phrase "command-query separation". It's also helpful to be aware that "pop" is the standard example.

In the present case, to me "getPromotedPosts" feels ambiguous between (1) "tell me which posts are promoted" and (2) "retrieve the promoted posts from somewhere".

I might be in the minority here, but something like promotedPosts feels too much like a variable. It feels awkward to me when the name of a function isn't a verb.

I agree about the ambiguity you point out, and for that reason I don't feel good about the name getPromotedPosts. (Although you could establish a convention where the term "retrieve" or "fetch" is used for database access and "get" is used for situations like this.) I'm just not sure what would be better. I considered filterPromotedPosts, but that kinda sounds like it's impure and is mutating the argument that's passed in. Maybe filterPromotedPosts would be a good name if you're working in a functional language though. It's impossible to do such a mutation in a functional language, so the ambiguity goes away. I think that's an interesting and often overlooked benefit of functional languages.

[-]philh5y20

The other thing about filterPromotedPosts is that it kind of sounds like the input is promoted posts and the output is some unspecified subset of them. filterPostsForPromoted avoids that but starts to feel unwieldy to me. (But maybe I should just be more okay with unwieldy names.)

Even in an impure language I think filter sounds to me like it would return a new list rather than editing in place. That's how the python filter function works for example, and Perl's grep (which is basically a synonym for me), and I had to look this up but JavaScript's filter too.

[-]Adam Zerner5y20

The other thing about filterPromotedPosts is that it kind of sounds like the input is promoted posts and the output is some unspecified subset of them. filterPostsForPromoted avoids that but starts to feel unwieldy to me. (But maybe I should just be more okay with unwieldy names.)

I have the exact same feelings here. It's funny how hard this is to name! Although these issues go away if you think about the name as only one part of the boxes label, and the signature + docstring as the others. Sorta. I think it'd still be nice if the name did as much of the job as possible by itself without having to consult the signature or docstring.

Even in an impure language I think filter sounds to me like it would return a new list rather than editing in place.

In my experience the ideas of functional programming are things that a lot of people just aren't aware of at all. I know that for me it was about seven years into my journey as a programmer before I started learning about them. Thinking about the people I have and do work with, I could very well see them using filterPromotedPosts to mutate a list of posts. So in that environment, it seems like it'd be nice to make it extra clear that "this function isn't actually mutating anything". (Then again, I could also see them mutating stuff in getPromotedPosts too.)

But in a different environment where the convention of "filter" being pure is strong enough, I agree with you. And I think that it'd often make sense to aspire towards this sort of environment. It's interesting how much the right name depends on this sort of context.

[-]ChristianKl5y20

To me getPromotedPosts() contains the idea that the function won't run a neural model to decide which post should be promoted or load information from the internet but return to me data that's already available in the program. On the other hand promotedPosts() feels unclear about that.

I'm curious whether other people have the same intuition here.

[-]gjm5y20

My intuition says that

if it's called getPromotedPosts then it is probably fetching some information from somewhere -- maybe the internet, maybe a database -- and probably isn't doing any computation to speak of;
if it's called promotedPosts then it is probably either computing something or just using a value it already knows and can return quickly and easily.

I am not sure there's any function name that would be perfectly neutral between (1) extremely cheap operation, probably just returning something already known, (2) nontrivial calculation, and (3) nontrivial fetching.

There's also a bit of ambiguity about whether something called getPromotedPosts is fetching the posts themselves or just cheap representations of them (e.g., ID numbers, pointers, etc.).

So I might consider names like fetchPromotedPostIDsFromDatabase, retrievePromotedPostContent, inferPromotedPostsByModel, cachedPromotedPostList, etc. Or I might prefer a brief name like promotedPosts and put information about what it does and the likely performance implications in a comment, docstring, etc.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

92

Bad names make you open the box

92

92

Complexity and zoom level

Not just software

Misleading

Pot brownies

Trust

Compression of complexity

Not just functions

Binary

Postscript