Algorithms as Case Studies in Rationality

[-]Wei Dai15y70

Sometimes, though, it gives an interesting insight into what's going on-- often cases where classical logic tells us that an inference is just fine, but informal pragmatics tell us that there is something silly about it.

Can you please give an example of this?

[-]abramdemski15y90

It's an interesting experience to learn formal logic and then take a higher-level math class (any proof-intensive topic). During the process of finding a proof, we ask all sorts of questions of the form "does that imply that?". However, since we're typically proving something which we already know is a theorem, we could logically answer: "Yes: any two true statements imply one another, and both of those statements are true." This is a silly and unhelpful reply, of course. One way of seeing why is to point out that although we may already be willing to believe the theorem, we are trying to construct an argument which could increase the certainty of that belief; hence, the direction of propagation is towards the theorem, so any belief we may have in the theorem cannot be used as evidence in the argument.

What do you think, am I overstepping my bounds here? I feel like the probabilistic case gives us something more. In classical logic, we either believe the statement already or not; we don't need to worry about counting evidence twice because evidence is either totally convincing or not convincing at all.

[-]prase15y30

"does that imply that?"

Because in casual speech the question doesn't actually mean "does that imply that?", but rather "do we have a derivation of that from that, using our set of inference rules?" Not the same, but people seldom realise the distinction.

[-]Johnicholas15y50

This is the "paradox of the material conditional", which is one of the primary motivations of relevance logic - to provide a sentential connective that corresponds to how we actually use "implies", as opposed to the material (truth-functional) implication.

http://plato.stanford.edu/entries/logic-relevance/
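
For readers who want to see the truth-functional reading spelled out, here is a minimal Python sketch. The function name `implies` and the variables are illustrative only; the point is just that the material conditional looks at truth values and nothing else, so any two true statements "imply" one another.

```python
from itertools import product


def implies(p: bool, q: bool) -> bool:
    """Material implication: false only when p is true and q is false."""
    return (not p) or q


# Full truth table of the material conditional.
table = {(p, q): implies(p, q) for p, q in product([True, False], repeat=2)}

# Two unrelated statements that both happen to be true (say, two theorems).
a, b = True, True

# Under the material reading they imply each other, relevance notwithstanding.
mutual = implies(a, b) and implies(b, a)
```

A relevance logic would reject this kind of "implication" between unrelated statements; the sketch only shows why the classical connective accepts it.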

[-]abramdemski15y00

Good point! Perhaps you won't be surprised, though, if I say that my own preferred account of the conditional is the probabilistic conditional.

[-]abramdemski15y00

That is also a fair interpretation, especially for those students who just want to get the homework done with and don't really care about increasing their sureness in the theorem being re-proved.

If we additionally care about the argument and agree with all the inference rules, then I think there is a little more explaining to do.

[-]prase15y00

Not only for the students, I think. Confusion between implication and inference was widespread enough to motivate Lewis Carroll to write an essay, and not much has changed since then. I didn't properly understand the distinction even after finishing university.

[-]abramdemski15y00

Another example (perhaps a bit frivolous): when browsing Less Wrong comments and deciding which to upvote, it might be tempting to take the existing upvotes into account. However (to an extent-- it's just an analogy), this is like using the fact that my probability for some statement A is high as an argument to increase the probability of A. The direction of propagation is towards the estimated goodness of the post, so using that estimate in the argument is bad form.

[-]KrisC15y00

where classical logic tells us that an inference is just fine, but informal pragmatics tell us that there is something silly about it.

Give me a long enough lever and a place to stand...

[-]endoself15y20

Archimedes did not know that gravity was caused by the Earth's mass. His only mistake was overconfidence about the cause of gravity, which can be seen from Bayesian reasoning, not just informal pragmatics.

[-]abramdemski15y50

Algorithms I find useful that I didn't put in the article:

--Find decent solutions to packing problems by packing the largest items first, then going down in order of size

--Minimum-description-length ideas (no surprise to rationalists! Just Occam's razor)

--Binary search (just for finding a page in a book, but still, learning the algorithm actually improved my speed at that task :p)

--Exploration vs exploitation trade-off in reinforcement learning (I can't say I'm systematic about it, but learning the concept made me aware that it is sometimes rational to take actions which seem suboptimal from what I know, just to see what happens)
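
The first item in the list above (pack the largest items first) is usually called first-fit decreasing. A rough Python sketch, assuming one-dimensional items and identical bins; the function name, the `capacity` parameter, and the sample sizes are made up for illustration:

```python
def first_fit_decreasing(sizes, capacity):
    """Pack items into bins of equal capacity, placing the largest items first."""
    bins = []  # each bin is a list of the item sizes placed in it
    for size in sorted(sizes, reverse=True):
        if size > capacity:
            raise ValueError(f"item of size {size} cannot fit in any bin")
        for contents in bins:
            # Put the item in the first bin that still has room for it.
            if sum(contents) + size <= capacity:
                contents.append(size)
                break
        else:
            # No existing bin had room, so open a new one.
            bins.append([size])
    return bins


packed = first_fit_decreasing([4, 8, 1, 4, 2, 1], capacity=10)
```

This is a heuristic for "decent" solutions, as the comment says: it is not guaranteed to find the minimum number of bins.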

[-]Drahflow15y30

My classical example for algorithms applicable to real life: Merge sort for sorting stacks of paper.
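
For anyone who has not met it, merge sort looks roughly like this in Python: split the stack in half, sort each half the same way, then merge the two sorted halves by repeatedly taking the smaller top sheet. The list of numbers stands in for whatever key the papers are sorted by.

```python
def merge_sort(stack):
    """Return a new sorted list using top-down merge sort."""
    if len(stack) <= 1:
        return list(stack)

    # Split the stack into two halves and sort each half recursively.
    mid = len(stack) // 2
    left = merge_sort(stack[:mid])
    right = merge_sort(stack[mid:])

    # Merge: repeatedly take the smaller of the two front items.
    merged = []
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1

    # One half is exhausted; the rest of the other is already in order.
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged


sorted_pages = merge_sort([5, 2, 9, 1, 5, 6])
```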

[-]lukstafi15y00

Only that "Exploration vs exploitation trade-off" is not an algorithm. Reinforcement learning (RL) is pretty much "non-algorithmic" (as Pei Wang would say). ETA: there are specific algorithms in RL (and in -- related -- planning and game playing), but the "trade-off" is a concept; it sure needs to be expressed algorithmically but is it fair to give credit to "algorithmicality" in this case?

[-]abramdemski15y00

Right; when I say "I'm not systematic about it" I mean that I don't purposefully follow a specific algorithm. I would probably benefit from being a bit more systematic, but for the moment, I'm merely trying to "train my intuition".

I would hope that all these algorithms would be applied "non-algorithmically" in Pei Wang's sense-- that is, the ideas from the algorithm should interact dynamically with the rest of my thought process.

[-]wedrifid15y00

Reinforcement learning is pretty much "non-algorithmic"

I'm rather certain I could implement reinforcement learning as an algorithm. In fact, I'm rather certain I have done so already. If I can point to an algorithm and say "look, that's a damn reinforcement learning algorithm" then I'm not sure how meaningful it can be to call it "non-algorithmic".

[-]lukstafi15y00

I concede, RL is a prototypical example of an algorithmic learning problem. The exploration vs exploitation trade-off is something that needs to be addressed by RL algorithms. It is fair then to say that we gain insight into the "trade-off" by recognizing how the algorithms "solve" it.
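
As one concrete case of an algorithm addressing the trade-off, here is a sketch of epsilon-greedy action selection on a multi-armed bandit. This is only one simple strategy among many, and everything specific here (the function name, Gaussian rewards with unit variance, the default `epsilon`) is an assumption made for the example.

```python
import random


def epsilon_greedy(true_means, steps=1000, epsilon=0.1, seed=0):
    """Run an epsilon-greedy agent on a bandit with Gaussian rewards."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms       # how often each arm has been pulled
    estimates = [0.0] * n_arms  # running mean reward observed per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm, even one that looks suboptimal.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pull the arm with the best current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        # Incremental update of the sample mean for this arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return counts, estimates


counts, estimates = epsilon_greedy([0.0, 1.0, 0.5], steps=500)
```

The exploration branch is the "take actions which seem suboptimal, just to see what happens" part; setting `epsilon=0` removes it and leaves a purely greedy agent.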

[-]wedrifid15y00

It is also fair to say there is an abstract concept of 'trade off' that is not itself algorithmic.

[-]cousin_it15y50

Nice to see you here again!

The Wolfram link for K-B completion was surprisingly unhelpful; Wikipedia worked much better for me because it has a detailed example.

Symbolic integration is not actually a complete algorithm because it sometimes needs to check if an expression is equivalent to zero, which is not yet known to be decidable, so in practice people use heuristics for that part. Though of course I agree that teaching math students to find antiderivatives by tricks and guesses is dumb.

[-]abramdemski15y20

I actually cited the Wolfram article because I preferred it, but I went ahead and added a link to the Wikipedia article for those whose taste is closer to yours! Thanks.

The Risch algorithm for symbolic integration is what first gave me a hunger to learn "the actually good ways of doing things" in this respect, and a sense that I might have to search for them beyond the classroom. However, I never did learn to use the Risch algorithm itself! I don't really know whether it turns out to be good for human use.

[-]bogdanb15y20

My impression is that many if not most algorithms for computers are not quite directly usable by humans.

For example, back-tracking is a simple algorithm that works very well for some problems, but there are just too many steps for anything but the smallest problems for a human to follow, even with pen and paper. A human will need to fudge parts of it (skip steps, make decisions based on guesses instead of systematically) to be able to finish it quickly.

But knowing about real back-tracking is still useful: one has a better intuition for which steps should be fudged and which shouldn’t, a better estimate of how hard the problem is (which helps, e.g., for deciding whether you’re likely to find a solution if you spend a bit more time, or if it’s better to go to a computer), or how to pick solutions systematically when deciding it’s worth doing “by hand”. This applies to many algorithms.
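
To make "real back-tracking" concrete, here is a small sketch on the N-queens puzzle, chosen only as a familiar example (it is not mentioned above). The systematic part is the loop over columns plus the `pop()` that undoes a choice once it has led to a dead end.

```python
def solve_n_queens(n):
    """Return one solution as a list of column indices (one per row), or None."""
    cols = []  # cols[r] is the column of the queen already placed in row r

    def safe(row, col):
        # A new queen conflicts if it shares a column or a diagonal.
        for r, c in enumerate(cols):
            if c == col or abs(c - col) == row - r:
                return False
        return True

    def place(row):
        if row == n:
            return True  # every row has a queen
        for col in range(n):
            if safe(row, col):
                cols.append(col)
                if place(row + 1):
                    return True
                cols.pop()  # dead end: undo this choice and try the next column
        return False

    return list(cols) if place(0) else None


solution = solve_n_queens(6)
```

Even at this size the search visits far more partial placements than anyone would want to track by hand, which is where the guessing and step-skipping come in.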

[-]abramdemski15y00

Agreed! My intention is definitely more toward the second approach than the first.

[-]Dr_Manhattan8y20

Recommending http://algorithmstoliveby.com/ for the same reasons

[-]Sniffnoy15y20

How do we deal with cycles larger than 2?

[-]abramdemski15y10

Referring to belief propagation? The actual procedure does something really simple: ignore the problem. Experimentally, this approach has shown itself to be very good in a lot of cases. Very little is known about what determines how good an approximation this is, but if I recall correctly, it's been proven that a single loop will always converge to the correct values; it's also been proven that if all the local probability distributions are Gaussian, then the estimated means will also converge correctly, but the variances might not.

Many things can be done to improve the situation, too, but I'm not "up" on that at the moment.

[-]SilasBarta15y20

From what I recall of reading Pearl (Probabilistic Reasoning in Intelligent Systems), there are a few other ways to do it.

One is to collapse the cycle down to one multi-input, multi-output node, so you're conditioning on all inputs. This makes the node more complex, but forces you to consider the beliefs together, and prevents the information cascade problem.

Another is to make each message carry information about where it originated so that you can spot where evidence is being double-counted. (This is analogous to the "cite your sources" requirement in scholarship so that a faulty piece of evidence can be traced through multiple papers back to its source.)

Generally, to find ways around the problem, think back to the "soldier counting" problem and see what solutions would work there. The soldier counting problem is where you have some network of soldiers -- ideally, without cycles -- and want to know how you can count the size of the network by only looking at messages from soldiers connected to you. With an acyclic network, anyone can send the message "count" to all connections, and all soldiers do the same until they're connected to no one, at which point they return "1" plus the sum of any messages coming back.
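
A minimal sketch of the acyclic case in Python, assuming the network is given as an adjacency dictionary (the names `network` and `count_soldiers` are invented for the example). Each soldier forwards "count" to every neighbor except the one who asked, then reports 1 plus whatever comes back.

```python
def count_soldiers(network, start):
    """Count the soldiers in an acyclic network by local message passing."""

    def count_from(soldier, sender):
        total = 1  # count myself
        for neighbor in network[soldier]:
            if neighbor != sender:
                # Forward the "count" message and add up the replies.
                total += count_from(neighbor, soldier)
        return total

    return count_from(start, None)


# A small tree-shaped network: a is linked to b and c; b is linked to d and e.
network = {
    "a": ["b", "c"],
    "b": ["a", "d", "e"],
    "c": ["a"],
    "d": ["b"],
    "e": ["b"],
}

total = count_soldiers(network, "a")
```

On a network with a cycle this sketch never bottoms out (in Python it hits the recursion limit), because the same "count" message keeps circulating. That is the double-counting problem in miniature, and why the fixes above either collapse the cycle or tag messages with their origin.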

[-]JGWeissman15y00

EDIT: On rereading your question, it seems I associated it with the wrong section of the article, matching to a question I asked and answered for myself when reading it.

The last two steps in the simplification get combined into one step, and then the next pass combines that step with the previous step, and so on until all the steps are combined and the last pass does not combine any steps so you know that you are done.

[-]bogdanb15y10

Thanks for the post. I wasn’t quite aware of those particular algorithms, nor of the usefulness of thinking in real life in terms of algorithms.

I did have a vague feeling that “if only more people would study programming, I wouldn’t want to hit that many of them that often”, but this post raised my awareness of why it is so.

I can’t think of applying a particular algorithm in life(*), but I do notice myself thinking in programming terms. For example, more than once I noticed thinking of things like government and management, and the difficulty thereof, in terms of trees (the logical structure) and cooperative peer-to-peer algorithms and bottlenecks.

(*: The one exception that comes to mind is binary search, which I do use on occasion; for example, I never could remember how to do roots and logarithms “properly”, and when I need one I do a quick binary search with the reverse operation.)
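
The binary-search trick for roots can be written down in a few lines. This is a sketch under the assumption that we want a non-negative n-th root of a non-negative number; the function name, tolerance, and step cap are arbitrary choices for the example. The "reverse operation" is the exponentiation used to test each guess.

```python
def root_by_bisection(x, n=2, tolerance=1e-9, max_steps=200):
    """Approximate the non-negative n-th root of x by binary search."""
    if x < 0:
        raise ValueError("x must be non-negative")

    # The root lies between 0 and max(1, x).
    low, high = 0.0, max(1.0, x)
    for _ in range(max_steps):
        if high - low <= tolerance:
            break
        mid = (low + high) / 2
        # Check the guess with the reverse operation (raising to the n-th power).
        if mid ** n > x:
            high = mid  # guess too big: search the lower half
        else:
            low = mid   # guess too small (or exact): search the upper half
    return (low + high) / 2


approx_sqrt_2 = root_by_bisection(2)
approx_cube_root_27 = root_by_bisection(27, n=3)
```

The same loop works for logarithms by swapping the check to `base ** mid > x` and choosing bounds that bracket the answer.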

I’m really curious to hear of other algorithms you (all) found useful for human reasoning.

[-]abramdemski15y10

I suspect your intuition about computer programming is also based on the way it forces a certain amount of clear thinking to be done.

[-]Johnicholas15y10

DPLL and JTMS (justification-based truth maintenance system) might be good candidates for "teach this algorithm to humans to enhance human rationality".

[-]abramdemski15y20

I'll second that, and add constraint-solving algorithms; specifically, the variable-ordering heuristics seem helpful to me. Choose the variables which most constrain the problem first! Note, the constraint propagation step is another instance of the sum-product algorithm. :)

[-]darius15y20

I've wondered lately while reading The Laws of Thought if BDDs might help human reasoning too, the kind that gets formalized as boolean logic, of course.

This article reminded me of your post elsewhere about lazy partial evaluation / explanation-based learning and how both humans and machines use it.

[-]Johnicholas15y90

You do manipulate BDDs as a programmer when you deal with if- and cond-heavy code. For example, you reorder tests to make the whole cleaner. The code that you look at while refactoring is a BDD, and if you're refactoring, a sequence of snapshots of your code would be an equivalence proof.

This is the lazy partial evaluation post, cut and pasted from my livejournal:

Campbell's Heroic Cycle (very roughly) is when the hero experiences a call to adventure, and endures trials and tribulations, and then returns home, wiser for the experience, or otherwise changed for the better.

Trace-based just-in-time compilation is a technique for simultaneously interpreting and compiling a program. An interpreter interprets the program, and traces (records) its actions as it does so. When it returns to a previous state (e.g. when the program counter intersects the trace), then the interpreter has just interpreted a loop. On the presumption that loops usually occur more than once, the interpreter spends some time compiling the traced loop, and links the compiled chunk into the interpreted code (this is self-modifying code) then it continues interpreting the (modified, accelerated) program.

Explanation-based learning is an AI technique where an AI agent learns by executing a general strategy, and then when that strategy is done, succeed or fail, compressing or summarizing the execution of that strategy into a new fact or item in the agent's database.

In general, if you want to make progress, it seems (once you phrase it that way) just good sense that, any time you find yourself "back in the same spot", you should invest some effort into poring over your logs, trying to learn something - lest you be trapped in a do loop. However, nobody taught me that heuristic (or if they tried, I didn't notice) in college.

What does "back in the same spot" mean? Well, returning from a recursive call, or backjumping to the top of an iterative loop, are both examples. It doesn't mean you haven't made any progress, it's more that you can relate where you are now, to where you were in your memory.

[-]abramdemski15y00

Thanks for the analogy between those two algorithms! I think more could be done in the way of specifying when and how it is useful to go back and reflect, but deciding how to apply these algorithms to everyday thinking is really something that requires empiricism. These are habits to be perfected (or discarded) over longer periods of time.

[-]wedrifid15y00

Knuth-Bendix Completion

I think you just described how to do most undergraduate and high school maths exams!

[-]abramdemski15y00

I'm glad I conveyed my enthusiasm. :) I think that one reason I was overconfident when I got to higher math was because I had mastered this sort of simplification-based reasoning, and assumed it would always work!

[-][anonymous]15y00

I'd hope that this doesn't hold for most undergraduate maths courses, but it's definitely false at the top end.

[-]abramdemski15y00

What do you mean by "most"? It seems to work pretty well all the way up through calc 1, imho.

Correction: I was wrong; there are some basic cases which Knuth-Bendix can't handle. It looks like it wouldn't be sufficient up to calc 1 after all.

[-][anonymous]15y00

I was thinking of mathematics degree courses as a whole, rather than specific lecture courses, and in particular of the British system. The mechanics of calculus is taught at A-level in the UK, and here I'd definitely agree that following a standard recipe is most of what's required. But the key feature of a good university maths course is that it develops certain ways of thinking that enable you to tackle problems unlike any you've seen before, and this is the experience that I hope would be shared by all students of mathematics. This is a genuine hope though, not an expectation.

I liked your original post by the way.

[+]arnaw15y-120
