Once upon a time . . .
This is a story from when I first met Marcello, with whom I would later work for a year on AI theory; but at this point I had not yet accepted him as my apprentice. I knew that he competed at the national level in mathematical and computing olympiads, which sufficed to attract my attention for a closer look; but I didn’t know yet if he could learn to think about AI.
I had asked Marcello to say how he thought an AI might discover how to solve a Rubik’s Cube. Not in a preprogrammed way, which is trivial, but rather how the AI itself might figure out the laws of the Rubik universe and reason out how to exploit them. How would an AI invent for itself the concept of an “operator,” or “macro,” which is the key to solving the Rubik’s Cube?
At some point in this discussion, Marcello said: “Well, I think the AI needs complexity to do X, and complexity to do Y—”
And I said, “Don’t say ‘complexity.’ ”
Marcello said, “Why not?”
I said, “Complexity should never be a goal in itself. You may need to use a particular algorithm that adds some amount of complexity, but complexity for the sake of complexity just makes things harder.” (I was thinking of all the people whom I had heard advocating that the Internet would “wake up” and become an AI when it became “sufficiently complex.”)
And Marcello said, “But there’s got to be some amount of complexity that does it.”
I closed my eyes briefly, and tried to think of how to explain it all in words. To me, saying “complexity” simply felt like the wrong move in the AI dance. No one can think fast enough to deliberate, in words, about each sentence of their stream of consciousness; for that would require an infinite recursion. We think in words, but our stream of consciousness is steered below the level of words, by the trained-in remnants of past insights and harsh experience . . .
I said, “Did you read ‘A Technical Explanation of Technical Explanation’?”1
“Yes,” said Marcello.
“Okay,” I said. “Saying ‘complexity’ doesn’t concentrate your probability mass.”
“Oh,” Marcello said, “like ‘emergence.’ Huh. So . . . now I’ve got to think about how X might actually happen . . .”
That was when I thought to myself, “Maybe this one is teachable.”
Complexity is not a useless concept. It has mathematical definitions attached to it, such as Kolmogorov complexity and Vapnik-Chervonenkis complexity. Even on an intuitive level, complexity is often worth thinking about—you have to judge the complexity of a hypothesis and decide if it’s “too complicated” given the supporting evidence, or look at a design and try to make it simpler.
But concepts are not useful or useless of themselves. Only usages are correct or incorrect. In the step Marcello was trying to take in the dance, he was trying to explain something for free, get something for nothing. It is an extremely common misstep, at least in my field. You can join a discussion on artificial general intelligence and watch people doing the same thing, left and right, over and over again—constantly skipping over things they don’t understand, without realizing that’s what they’re doing.
In an eyeblink it happens: putting a non-controlling causal node behind something mysterious, a causal node that feels like an explanation but isn't. The mistake takes place below the level of words. It requires no special character flaw; it is how human beings think by default, how they have thought since ancient times.
What you must avoid is skipping over the mysterious part; you must linger at the mystery to confront it directly. There are many words that can skip over mysteries, and some of them would be legitimate in other contexts—“complexity,” for example. But the essential mistake is that skip-over, regardless of what causal node goes behind it. The skip-over is not a thought, but a microthought. You have to pay close attention to catch yourself at it. And when you train yourself to avoid skipping, it will become a matter of instinct, not verbal reasoning. You have to feel which parts of your map are still blank, and more importantly, pay attention to that feeling.
I suspect that in academia there is a huge pressure to sweep problems under the rug so that you can present a paper with the appearance of completeness. You’ll get more kudos for a seemingly complete model that includes some “emergent phenomena,” versus an explicitly incomplete map where the label says “I got no clue how this part works” or “then a miracle occurs.” A journal may not even accept the latter paper, since who knows but that the unknown steps are really where everything interesting happens?2
And if you’re working on a revolutionary AI startup, there is an even huger pressure to sweep problems under the rug; or you will have to admit to yourself that you don’t know how to build the right kind of AI yet, and your current life plans will come crashing down in ruins around your ears. But perhaps I am over-explaining, since skip-over happens by default in humans. If you’re looking for examples, just watch people discussing religion or philosophy or spirituality or any science in which they were not professionally trained.
Marcello and I developed a convention in our AI work: when we ran into something we didn't understand, which was often, we would say "magic"—as in, "X magically does Y"—to remind ourselves that here was an unsolved problem, a gap in our understanding. It is far better to say "magic" than "complexity" or "emergence"; the latter words create an illusion of understanding. Wiser to say "magic," and leave yourself a placeholder, a reminder of work you will have to do later.
2. And yes, it sometimes happens that all the non-magical parts of your map turn out to also be non-important. That's the price you sometimes pay, for entering into terra incognita and trying to solve problems incrementally. But that makes it even more important to know when you aren't finished yet. Mostly, people don't dare to enter terra incognita at all, for the deadly fear of wasting their time.
Quote: "We think in words, "
No we don't. Apparently you do, though. No reason to believe otherwise. :)
Please keep up these postings! They are very enjoyable.
Going back to "explaining" something by naming it (from a couple of your earlier posts):
e.g. Q: Why does this block fall to the floor when I let go of it? ... A: Gravity!
I always thought that such explanations were common side-effects of thinking in words. Sort of like optical illusions are side-effects of how the visual system works. Perhaps not. One does not need to use words to think symbolically. There are, after all, other ways to do lossy compression than with symbols.
Anyway, I'll still assert that it's easier to fall for such an "explanation" if you think in words. ... An easy assertion, given how hard it is to count the times one does it!
Aren't we understating the role of labels in brevity here?
Where the labelled thing is understood well enough by the labeller and listener or of trivial importance to the problem domain, don't labels contribute to cognitive economy?
I'd have said when you need to get things done, fear of wasting time is desirable rather than deadly.
Actually, the "emergence" and "complexity" pseudo-causal explanations are much worse than Felix's "gravity" example: the answer "Gravity!" does explain the fact that the block falls to the floor by noting that it is a specific instance of a general phenomenon for which we have very precise information on how it works (the attractive force is G x m1 x m2 / d^2). We may not know why gravity exists, but that is a different (higher level?) problem.
In the case of "emergence" and "complexity", we just don't know.
P.S. I do think that "emergence" is a useful concept to describe situations where modelling is more conveniently done at a (more) aggregate level, but that's yet another story.
I don't think these parable posts convey information efficiently to the overcomingbias audience, but I like your point at the end. Specifically, I agree it's better to use placeholders that make lack of knowledge/understanding clear, rather than placeholders that seem to cover up such lack of knowledge/understanding.
"Then a miracle occurs..."
I wonder if memetics would serve as a good candidate for the category of things that satisfy without explaining or predicting anything, along with phlogiston, emergence, and complexity. The analogy to biology seems interesting and fun, but is it more useful than as just a way to re-formulate our perspective?
I don't know where you get the idea that memetics doesn't explain or predict things from.
We know a lot about what factors influence cultural virulence. Marketing and advertising folk make use of that knowledge on a daily basis. They know which jingles are catchy, which catchphrases are likely to be repeated, which images are more likely to be shared, and so on. We know which ideas play well with which others well enough to know that we should not target our condom commercials at the Catholic demographic.
Check out Dan Zarrella for some of the recent material: http://danzarrella.com/
He views his work as being memetics: http://danzarrella.com/what-is-a-meme.html
In computer science there is a saying 'You don't understand something until you can program it.' This may be because programming is not forgiving to the kind of errors Eliezer is talking about. Interestingly, programmers often use the term 'magic' (or 'automagically') in precisely the same way Eliezer and his colleague did.
Step 1: Steal Underpants Step 2: ????? Step 3: Profits!!!!
Programming is not forgiving to the kind of errors Eliezer is talking about.
But it's a lot better to be unforgiving of yourself than to wait for reality to hit you over the head with it. It's better to notice in 10 seconds that you don't understand something, than to realize this only after 20 people spend 5 years and $10 million of venture capital and the "emergent behavior" you pinned your hope on fails to materialize. It's all too easy to program "chaos", "complexity", or "emergence", so long as you tell yourself that you need to program more of it before you reach Step 3 and Profit.
"That was when I thought to myself, "Maybe this one is teachable.""
How many people have asked you about becoming an AGI designer? It sounds like you have a good deal of experience with rejection, even after weeding out the obvious crackpots.
Well, this is partly a matter of what discipline one is dealing with. So, sure, for AI or computer science more generally, Kolmogorov or Chaitin or Rissanen measures are more useful and reasonably well defined. For other disciplines, other definitions may be more suitable. Thus for economics, I have (following Richard Day) defined complexity in a dynamic way based on erratic dynamics appearing endogenously out of the system (with "erratic" defined more specifically). I laid this out in a paper in 1999 in the Journal of Economic Perspectives, and have a more recent paper up on my website ("Computational and Dynamic Perspectives on Economic Complexity") comparing the two approaches, at http://cob.jmu.edu/rosserjb.
As a current student, I can confirm your suspicions: a seemingly complete paper is preferred over one that honestly flags the gaps in its account of the topic. "I don't know" still is not an acceptable answer in many circles, and I regard it as an unfortunate phenomenon.
In my second-year uni course, I have an outline for writing lab reports that says 'include in your discussion anything you feel is out of place, or that you don't understand in this experiment. You will not be marked down for such admissions'. And I thought 'NO-ONE is going to take you up on that.' I hate having to bullshit science papers - I tend to compromise, with a hashed-together explanation that I express doubt in, and take the marks hit. Bullshitting is great fun in English courses, but in science it feels like shooting myself in the foot.
Gray Area wrote, "You don't understand something until you can program it." As somewhat of an aside, Randy MacDonnell has written about APL and J (http://facilitatedsystems.com/weblog/2007/07/if-you-can-say-it-its-done.html), "If you can say it, it's done."
On a different note, I took a pair of summer high school mathematics courses sponsored by the NSF years ago at the University of Miami. One professor, a Dr. Hermann, I seem to recall, said he often imagined himself wearing "worry beads" and fingering them as he spoke. If he fingered the beads nearer his neck, he was speaking more precisely; if he fingered those nearer his waist, he was speaking less precisely. In reality, he wore no beads, but he did, on occasion, finger imaginary beads as he was explaining certain concepts.
Perhaps the same thing can be adapted here to indicate the level of magic in claims we make. If we finger imaginary beads near our necks, we claim we know what's going on; if we finger those nearer our waists, we admit there's magic here.
Forgive me for latching onto the example, but how would an AI discover how to solve a Rubik's cube? Does anyone have a good answer?
I had the same problem.
I think it would need some genetic algorithm in order to figure out how "close" it is to the solution, then make a tree structure where it figures out what happens after every combination of however many moves, and it does the one that looks closest to the solution.
It would update the algorithm based on how close it is to the closest solution. For example, if it's five moves away from something that looks about 37 moves away from finishing, then it's about 42 moves away now.
The problem with this is that when you start it, it will have no idea how close anything is to the solution except for the solution, and there's no way it's getting to that by chance.
Essentially, you'd have to cheat and start by giving it almost solved Rubik's cubes, and slowly giving it more randomized ones. It won't learn on its own, but you can teach it pretty easily.
A less cheating-ish solution is to use some reasonable-seeming heuristic to guess how close you are to a solution. For example, you could just count the number of squares "in the right place" after a move sequence.
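The count-the-squares heuristic plus one-step greedy climbing can be sketched in a few lines. This is a hypothetical toy, not a real cube: an 8-piece permutation puzzle stands in for the cube, with two overlapping cyclic "face turns" as operators, and the heuristic simply counts pieces already in their solved positions.

```python
# Toy stand-in for a Rubik's cube: a permutation of 0..7 acted on by two
# overlapping cyclic "face turns". (Hypothetical puzzle for illustration.)
OPERATORS = {
    "A": (0, 1, 2, 3),  # rotate the pieces at positions 0-3
    "B": (2, 3, 4, 5),  # rotate positions 2-5; overlaps A, like adjacent faces
}

def apply_move(state, op):
    """Apply one cyclic rotation and return the new state."""
    s = list(state)
    cycle = OPERATORS[op]
    for i, pos in enumerate(cycle):
        s[cycle[(i + 1) % len(cycle)]] = state[pos]
    return tuple(s)

def in_place(state):
    """The proposed heuristic: count pieces already in their solved spot."""
    return sum(1 for i, v in enumerate(state) if i == v)

def greedy_step(state):
    """Take whichever single move most improves the heuristic."""
    return max((apply_move(state, op) for op in OPERATORS), key=in_place)

def greedy_solve(state, max_steps=50):
    """Repeat greedy steps; may stall on a local maximum, like a real cube."""
    solved = tuple(range(len(state)))
    for _ in range(max_steps):
        if state == solved:
            break
        state = greedy_step(state)
    return state
```

A one-step greedy climb like this solves easy positions but stalls on plateaus where no single move improves the count, which is precisely the weakness of such surface heuristics on a real cube.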
(First post, bear with me.. find the site very interesting :)
I do agree!
But actually I would model the problem with what is known in some circles as a closed-loop controller, and specifically with a POMDP. Then apply Real-Time Dynamic Programming by embedding a heuristic, without having to visit all the states in order to approximate the optimal h*.
Another way could be done by means of a graphical model, and more specifically a DAG would be quite nicely suited to the problem. Apply a simulated annealing approach (Ising model!) and when you reach "thermal equilibrium" by having minimized some energy functional you get the solution. Obviously this approach would involve learning the parameters of the model, instead of modelling the problem as in my first proposed approach.
Quite geeky, excuse me!
Exactly. The difficulty of solving a Rubik's cube is that it doesn't respond to simple heuristics. A cube can be 5 moves from solved and yet look altogether a mess, whereas a cube with all but one corner correct is still some 20 moves away from complete (by the methods I looked up, at least). In general, -humans- solve a Rubik's cube by memorizing sequences of moves with certain results, and then stringing these sub-solutions together. An AI, though, probably has the computational power to brute-force a solution much faster than it could manipulate the cube.
The more interesting question (I think) is how it figures out a model for the cube in the first place. What makes the cube a good problem is that it's designed to match human pattern intuitions (in that we prefer the colors to match, and we quickly notice the seams that we can rotate through), but an AI has no such intuitions.
I don't know the methods you used, but the only ones I know of have certain "steps" where you can easily tell what step it's on. For example, by one method, anything that's five moves away will have all but two sides complete.
Wouldn't the AI have to discover that it is something to be solved, first? Give a kid such a puzzle and she's likelier to put it in her mouth than even try.
Unless I'm being obtuse.
You're right, and I think that this is a mistake a lot of people make when thinking about AI - they assume that the fact that they're intelligent means they also know a lot. Like the child, their specific knowledge (such as the fact that there is something to solve), is something they have to learn, or be taught, over time.
Curiosity could be built-in, I don't see the problem with that.
It seems to be built-in for humans - we don't learn to be curious, though we can learn not to be.
It could be built in. I agree. But the child is more curious about its texture and taste than about how the pieces fit together. I had to show my child a puzzle and solve it in front of her to get her to understand it.
Then she took off with it. YMMV.
Good point, though.
But as you see, there was an initial curiosity there. They may not be able to make certain leaps that lead them to things they would be curious about, but once you help them make the leap they are then curious on their own.
Also, there are plenty of things some people just aren't curious about, or interested in. You can only bring someone so far, after which they are either curious or not.
It would be very interesting to do the same thing with an AI, just give it a basic curiosity about certain things, and watch how it develops.
What's "curiosity"? I don't think we can just say "just" yet, when we can't even explain this concept to a hypothetical human-minus-curiosity. (Wanting to learn more? What does it mean to actively learn about something?)
Consider how this could be tested. One would write a program that generates a virtual Rubik's cube, and passes this on to the AI to be solved (this avoids the complexity of first having to learn how to control robotic hands). It can't just randomly assign colours to sides, lest it end up with an unsolvable cube. Hence, the preparatory program starts with a solved cube, and then applies a random sequence of moves to it.
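The preparatory program described here is easy to sketch. In this hypothetical illustration, the moves are arbitrary reversible operations on a tuple, standing in for face turns; because every move is invertible, any state reached by scrambling is solvable by construction.

```python
import random

# Hypothetical stand-in moves: any set of reversible operations will do.
MOVES = {
    "rot": lambda s: s[1:] + s[:1],          # cyclic shift of all pieces
    "swap": lambda s: (s[1], s[0]) + s[2:],  # swap the first two pieces
}

def scramble(solved_state, moves, n=20, seed=0):
    """Start from the solved state and apply n random moves.

    Returns the scrambled state plus the move history; since each move is
    reversible, undoing the history in reverse order always restores the
    solved state, so the generated puzzle is guaranteed solvable.
    """
    rng = random.Random(seed)
    state, history = solved_state, []
    for _ in range(n):
        name = rng.choice(sorted(moves))
        state = moves[name](state)
        history.append(name)
    return state, history
```

This also makes the cheating worry concrete: any process that can read `history` can "solve" the puzzle by replaying it backwards, without understanding anything.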
This will almost certainly be done on the same computer as the AI is running on. A good AI, therefore, should be able to learn to inspect its own working memory, and observe other running threads on the system - it will simply observe the moves used to shuffle the cube, and can then easily reverse them if asked.
It is possible, of course, for test conditions to be altered to avoid this solution. That would, I think, be a mistake - the AI will be able to learn a lot from inspecting its own running processes (combined with the research that led to its development), and this behaviour should (in a known Friendly AI) be encouraged.
The problem with this is that the state space is so large that it cannot explore every transition, so it can't follow transitions backwards in a straightforward manner as you've proposed. It needs some kind of intuition to minimize the search space, to generalize it.
Unfortunately I'm not sure what that would look like. :(
(Wow, this was from a while back)
I wasn't suggesting that the AI might try to calculate the reverse sequence of moves. I was suggesting that, if the cube-shuffling program is running on the same computer, then the AI might learn to cheat by, in effect, looking over the shoulder of the cube-shuffler and simply writing down all the moves in a list; then it can 'solve' the cube by simply running the list backwards.
Oh I see: for that specific instance of the task.
I'd like to see someone make this AI, I want to know how it could be done.
Observe the contents of RAM as it's changing?
I'm not 100% sure of the mechanism of said observations, but I'm assuming a real AI would be able to do things on a computer that we can't - much as we can easily recognise an object in an image.
You're assuming the AI has terminal access. Just because our brains are implemented as neurons doesn't mean we can manipulate matter on a cellular scale.
"We think in words, "
Correction: We think by magic!
Update: Rubik's cube solved http://news.cnet.com/8301-17852_3-20013666-71.html
I think I just thought of an insanely over-simplified analogy.
Say I'm not invited to my best friend's sleepover and I don't understand why. I call her, and the answer she gives me is: "It's complicated."
The situation might indeed be complicated, but the word complicated is just a fake explanation... :D Amiright, guys?
That sounds to me more like a reason not to explain. If it's complicated, it will take a while.
I've been working my way through the sequences in order for the last few weeks and trying to read all of the links. I love this blog and tell people about it whenever I can.
Reading these entries has helped me realize some of the ways in which I tend to think incorrectly, and I hope I am taking it slow enough to reflect enough and make myself think better. :)
I suppose I should comment about at least one thing relevant to this article in particular. Posted at 4:22 am?! When do you sleep, Eliezer?
I think many of the commenters have misread her question; it's not "how would you make a program to solve a Rubik's cube" (which is brute-forceable, as there are a finite number of states for a Rubik's cube), but "how do you program an AI so it can work out how to do so?" The AI has to study the cube and determine the possible states of the cube (and how to write them down mathematically) and the operators available to change the state of the cube. This means it has to know what a state and an operator are (if not by name). It then has to work out how to combine the operators to change the state to a predetermined state, which we happen to call a solved cube. The AI has to be able to do this with you never having coded into the AI anything specific to do with a Rubik's cube, or any methods to solve it. The next step would be to code an AI where it has to work out for itself that it must combine operators to change the state of the cube. I don't think I could manage to code even the simpler version; for it to have been coded well, it must be able to solve any problem, not just the Rubik's cube.
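The state-and-operator framing in this comment is the classical search formulation, and the puzzle-agnostic half can be sketched directly. Nothing below mentions cubes: the puzzle lives entirely in the operators and the goal test supplied by the caller, and the tiny rotation puzzle in the last lines is a hypothetical stand-in.

```python
from collections import deque

def solve(start, operators, is_goal, max_states=100_000):
    """Breadth-first search over abstract states.

    `operators` is a list of functions from state to state; the solver
    itself knows nothing about what the states mean. Returns the list of
    operators that transforms `start` into a goal state, or None.
    """
    frontier = deque([(start, [])])
    seen = {start}
    while frontier:
        state, path = frontier.popleft()
        if is_goal(state):
            return path
        if len(seen) >= max_states:
            return None  # give up rather than search forever
        for op in operators:
            nxt = op(state)
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, path + [op]))
    return None

# Hypothetical toy puzzle: rotate a tuple until it is sorted.
rotate = lambda s: s[1:] + s[:1]
plan = solve((2, 0, 1), [rotate], lambda s: s == (0, 1, 2))
```

The hard part the comment points at, of course, is not this search loop but getting the AI to invent the state encoding and the operators for itself.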
That's a nice bit of semantic hygiene. I hope to remember it.
The solution (I posted it elsewhere also):
To solve Rubik's cube, you can just do hill climbing, with breadth-first-ish search for a higher point on the hill (i.e. you find a higher point even if it is several moves away). This discovers the sequences. Cache the sequences.
It's a very general problem-solving method, hill climbing with N-move look-ahead. You try maximizing various metrics that are maximal in the final state, and find one that works without getting you stuck in a local maximum for too long. You also try various orders of iterating the moves (e.g. one could opt for repetitive sequences).
This works for chess as well, and for pretty much all puzzles. This is how I solve puzzles when I get a puzzle for the first time, except of course I have terabytes' worth of tricks that I can try, and 10^15-ish operations per second; parallel, of course, but parallel works. Pre-generating sequences is not necessary. You arrive at them when hill climbing with breadth-first search, and cache them. You also tell them to other people whom you want to make into Rubik's-cube-solvers. The important thing that can't be stressed enough: try to figure out a good metric to climb. Some sides of the hill are smoother than others.
One could hill-climb some sort of complexity metric - evolution did that to arrive at humans, even though the bacterium is a better solution to 'reproduction'. You only need a comparator for climbing. Comparators are easy. You can make agents fight (or you can make agents cooperate). You don't need a mapping to real numbers. You can do evolutionary hill climbing with N-move look-ahead. Edit: note that you do NOT need a good ordering for hill climbing either. If sometimes a>b and b>c and c>a, that is okay, as long as you remember where you've already been and avoid looping. That may still get you to the top of the hill.
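The method this comment describes - hill climbing, but searching several moves ahead for any strictly better state, and caching the improving sequences as reusable macros - can be sketched as follows. The rotation puzzle and the pieces-in-place metric at the end are hypothetical stand-ins for a cube and its heuristic.

```python
from itertools import product

def climb(state, operators, metric, goal, depth=3, max_rounds=100):
    """Hill climbing with N-move look-ahead.

    From the current state, try every sequence of up to `depth` moves;
    commit the shortest sequence that strictly improves the metric, and
    cache it as a reusable macro. Stops when the metric reaches `goal`,
    or when no sequence within the horizon improves anything (a local
    maximum too wide to see across).
    """
    macros = []  # the discovered improving sequences ("operators"/"macros")
    for _ in range(max_rounds):
        if metric(state) >= goal:
            break
        best_seq, best_val = None, metric(state)
        for n in range(1, depth + 1):
            for seq in product(operators, repeat=n):
                s = state
                for op in seq:
                    s = op(s)
                if metric(s) > best_val:
                    best_seq, best_val = seq, metric(s)
            if best_seq:
                break  # prefer the shortest improving sequence
        if best_seq is None:
            break  # stuck: flat for `depth` moves in every direction
        for op in best_seq:
            state = op(state)
        macros.append(best_seq)
    return state, macros

# Hypothetical toy: rotate a 3-tuple; metric counts pieces in place.
rotate = lambda s: s[1:] + s[:1]
pieces_in_place = lambda s: sum(1 for i, v in enumerate(s) if i == v)
```

On the toy, `climb((1, 2, 0), [rotate], pieces_in_place, goal=3)` needs the look-ahead: every single rotation leaves zero pieces in place, so a one-step greedy climber would stall where the two-move search succeeds.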
I can't understand what you mean. Surely you don't mean that natural selection rewarded something besides inclusive genetic fitness.
It of course didn't reward anything other than fitness. And the universe is not made of anything other than quarks etc. (or yet smaller things). Hello, fake-reductionist nihilism.
It, however, so happened that rewarding it resulted in growing complexity of behaviours of the most complex organisms. You can hill-climb by pouring liquid into a valley, if all you care for is some liquid on the top of the hill; liquid behaves in a very complicated way, minimizing a very complicated metric, such that it ends up on the tops of the hills by surface tension even though most of it is in the valleys, and a single molecule would be seeking valleys. Evolution doesn't just lead to mankind. Evolution, for the most part, leads to better bacteria. Mankind is a side effect of niche-filling. Remove all bacteria and single-celled organisms, and they will re-evolve from a human (the canine infectious cancer was once a dog).
I think it would be less misleading to say that many of our complex characteristics were instrumental goals for the evolutionary process as it hill-climbed the inclusive genetic fitness metric.
It's hard to put it in a non-misleading way. If you simulate evolution as-is, you are wasting almost all of your time on bacteria. Evolution didn't so much hill-climb as just flood the entire valley. Edit: or rather, it predominantly wasn't going towards humans. If you want to optimize, you look at how it got to humans, and think about how to avoid doing the rest of it.
To clarify: are you actually suggesting that simulating just that subset of the evolutionary process that evolved humans and not the subset that evolved bacteria is a worthwhile strategy to explore towards achieving some goal? (If so, what goal?) Or do you mean this just as an illustration of a more general point?
As illustration, with a remark on practical approach. Seriously, the thing about the evolution, it doesn't "reward fitness" either.
The agents compete, some are eliminated, some are added after modification; it's a lousy sort of hill climbing, with a really lousy comparator (and no actual metric like 'fitness' - just a comparator which isn't even a proper ordering: A may beat B, B beat C, and C beat A), but it makes for variety, where the most complex-behaving agent behaves in more and more complex ways all the way until it starts inventing puzzles and solving them. When one has a goal in mind, one can tweak the comparator to get to it more efficiently. The goal can be as vague as "complex behaviour" if you know what sort of "complex" you want or have an example. Problem solving doesn't require defining stuff very precisely first.
A few things:
Agreed that given a process for achieving a goal that involves a comparator with that goal as a target, one can often start with a very fuzzy comparator (for example, "complex behavior") and keep refining it as one goes. That's especially true in cases where the costs of getting it not-quite-right the first time are low relative to the benefits of subsequently getting it righter... e.g., this strategy works a lot better for finding a good place to have dinner than it does for landing a plane. (Though given a bad enough initial comparator for the former, it can also be pretty catastrophic.)
I infer that you have a referent for 'fitness' other than whatever it is that gets selected for by evolution. I have no idea what that referent is.
I think it's misleading to refer to evolution having a comparator at all. At best it's true only metaphorically. As you say, all evolution acts on is the result of various competitions.
You seem to be implying that evolution necessarily results in extremely complex puzzle-inventing systems. If I've understood that correctly, I disagree.
'Mercury's gravitational pull has long since been destroyed by solar flares, which is why it has no atmosphere.' Something I read today - seems appropriate. Apparently they'd been watching a documentary, and I think they put the components of the explanation together incorrectly in their head.
I think when your friend was talking about "complexity" he didn't mean the word literally. He may have meant that you would have to create a complicated solution, as opposed to finding a nice and elegant solution. The difference is you try to hammer out every detail and special case, one at a time, and adding "complexity" as you go, as opposed to just thinking about a single solution which would handle every case.
This is what I think most people mean when they talk about "complexity" as a solution to their problem. They don't literally mean that adding more complexity will solve the problem. It is just a different approach to problem solving. And sometimes that approach is easier and gets things done, even if it is more messy. Sometimes it is not.
Different approaches to solving problems is an interesting subject in itself. I've seen it create huge divisions in both artificial intelligence and politics. I tend to prefer nice elegant solutions. But when the problem seems complicated, it's tempting to run to the complexity side of things. There is no guarantee you will ever find an elegant solution, but if you just handle special case after special case, you can make progress over time for sure.
I suspect that counts as a useful thinking-tool. Whenever I notice incomplete steps in my reasoning, I'll say "by magic!" and then I can worry less about fooling myself.
A thinking-tool needs a name in order to properly install it into memory. "Magic-markers" could work.
"Technical explanation of technical explanation" link is broken.
Here's a working one: http://www.yudkowsky.net/rational/technical/
This one goes down as one of the truly great essays in the sequences for me. Recognizing the gaps in my map is what has led me to understand many things, even when I was not consciously noticing those kinds of gaps. Now I'll do it consciously, and I'm happy about that.
What's more, the sequences seem to be repetitive at surface level, but they are not; they hammer in the concepts. It was this specific essay that truly conveyed to me the importance of not doing the "skip-overs"; this was the one essay that led me to think that I also might be teachable.
In my work I additionally find it useful to break "magic" itself down into categories:
magic: It's part of the tool's intended problem domain, but you can't use it this way without a thorough understanding of precisely how it functions.
black magic: This is not part of the tool's intended problem domain, but your thorough understanding of both the problem and the tool's internals lets you use it for this. (Playing music on a floppy drive, for example.)
voodoo: It's not what the tool was made for, you don't know why or how it works, you have no idea what range of inputs will produce acceptable outputs. You just know that having the clock open in the top right corner of your screen keeps your word processor from crashing...
The fact that there is no "real" magic reminds people that there is a rational explanation, and the categories convey information about how deep the pond is likely to be to anyone considering diving for answers.