The Unreasonable Feasibility Of Playing Chess Under The Influence

About a 40% reduction in rating.

Does Elo have a meaningful zero? I thought it was an interval scale.

Ohhh, that's a very good point 🤔I guess that makes the comparison a bit less direct. I'll think about whether it can be fixed or if I'll rewrite that part. Thank you for pointing it out!

[-]tailcalled4y10

In psychology, these sorts of scales are often standardized relative to the standard deviation instead of the mean. Or you could pick some other range as a metric; for instance if I'm looking it up right (and I very well may not be), the human range of ability appears to span around 1000 Elo points. So the drop is about 2x that between beginner and master humans.

[-]tailcalled4y10

Wait derp, that's the range for chess, but go appears to more have a range of around 3000 Elo points.

[-]Ulisse Mini3y60

I'm a pretty bad chess player (~1500 ELO) and I can play bullet games while sleep deprived without much loss in skill. I think the system 1 pattern matching is relatively unimpaired while the system 2 calculating is very impaired. In bullet calculating doesn't help much though.

A GM playing their instant move with no calculation can crush a master-level player, and I can crush a novice playing my instant move. It's all about system 1 :)

[-]Erik Jenner4y60

Performance deteriorating implies that the prior p is not yet a fixed point of p*=D(A(p*)).

At least in the case of AlphaZero, isn't the performance deterioration from A(p*) to p*? I.e. A(p*) is full AlphaZero, while p* is the "Raw Network" in the figure. We could have converged to the fixed point of the training process (i.e. p*=D(A(p*))) and still have performance deterioration if we use the unamplified model compared to the amplified one. I don't see a fundamental reason why p* = A(p*) should hold after convergence (and I would have been surprised if it held for e.g. chess or Go and reasonably sized models for p*).

[-]Jan4y30

That... makes a lot of sense. Yep, that's probably the answer! Thank you :)

[-]Zack_M_Davis4y40

If I was Scott Alexander or Zvi I'd comb through those papers and wring out insight.

Huh? What's stopping you? How are Scott or Zvi relevant here, at all?

[-]Jan4y20

Yeah, that thought was insufficiently explained, thanks for pointing that out! For me, Scott and Zvi are examples of people who are really good at "putting together pieces of evidence while tolerating huge amounts of uncertainty". I think I don't have that talent and I know I don't have the experience (or patience) to pull that off.

But there is an interesting meta-point here: Epistemic work comes at a cost and knowing which rabbit holes not to go down is an important skill.

When Scott and Zvi are doing one of their "Much more than you'd wanted to know" or "Covid Updates", then they are operating at some kind of efficient frontier where a lot of the pieces need to be considered to provide a valuable perspective. Everybody and their dog has a perspective on lockdown effectiveness, so adding to that will require a lot of great epistemic work. But the work is worth it, because the question is important and the authors care deeply about the answer.

Drunk chess playing, in contrast, is pretty underexplored (in agreement with its striking unimportance in the grand scheme of things). So making some headway is relatively easy (low hanging fruit!) and the marginal increase in value that an extremely deep dive into the neuroscience literature provides is just less worth it.

[-]Maxwell Peterson4y30

I think you're saying that alcohol in the body mostly damages players' ability to read out variations, but not how good their knee-jerk initial impression of "here's the best move" is? I like that theory! I never thought of it before, but having played a good number of Go games while drunk, it feels right.

[-]Jan4y20

Yes, that's a good description! And a cool datapoint, I've never played (or even watched) Go, but the principle should of course translate.

[-]omegastick4y10

If I'm not mistaken (and I'm not a biologist so I might be), alcohol mainly impacts the brain's system 2, leaving system 1 relatively intact. That lines up well with this post.

[-]Charlie Steiner4y20

Hm.

The actual evaluation of positions in chess is really intensive in pattern-recognition, and seems like a great job for complicated cortical learned-pattern-recognizers. But following down game trees to amplify your own evaluation of positions is also intensive in pattern recognition, and is the sort of learned mode of behavior that also needs to be "understood" by the cortex. So how are you imagining communication with the cerebellum? Is it just keeping the cortex "on track" in executing the learned behavior over time, or is it doing something more complicated like handling short-term memory or something, or is it doing something even more complicated like using a fairly sophisticated understanding of math to tell the cortex what patterns to look for next?

[-]Pattern4y20

Is AlphaZero better than other chess playing programs though?

after four hours of training, it beat the current world champion chess-playing program, Stockfish.

I remember there was some controversy around this at the time.

[-]Jan4y30

Yeah, I remember that controversy! I think this section on Wikipedia gives a bit of insight.

The graph in this Tweet shows the elo of Stockfish as a function of time, with the introduction of the neural net nnue for Stockfish 12 highlighted.

[-]SimonM4y10

I think the controversy is mostly irrelevant at this point. Leela performed comparably to Stockfish in the latest TCEC season and is based on Alpha Zero. It has most of the "romantic" properties mentioned in the post.

[-]MikkW4y20

Not just in the latest TCEC season, they've been neck-and-neck for quite a bit now

^{^}

Much less win against a 2400 opponent.

^{^}

A term in chess that is not rigorously defined, but roughly equates to "an extremely powerful move that is usually not obvious but that almost automatically wins (or draws, if you are losing) the game. Some would call this move a “brilliant” one". source

^{^}

Who would have thought!

^{^}

At which point I'd have decent drawing chances.

^{^}

The original paper actually lists four, but they are confusing.

^{^}

Actually, it's the only tool I have. Send help. Or more tools.

^{^}

Despite being so relatively recent, it is very hard to find a source that is reasonably complete and correct. Almost everyone ignores Konrad Zuse, or those who don't ignore him ignore everything else. I spent way too much time diving into this a few years ago, but it seems super relevant to me.

^{^}

A draw is simply a disgrace and should be avoided whenever possible.

^{^}

Or "Iterated capability amplification". Terminology is not quite set in stone yet.

^{^}

To get this, we might instantiate the prior randomly and then make it perform slightly better than random by creating two copies of the system and adopting the prior through traditional reinforcement learning and self-play. This is where the grounding of the prior comes from.

^{^}

Am I being unfair to the people who came up with this using neural networks for chess in 1999? Possibly.

^{^}

In case somebody who worked on the project ever reads this: Please don't be mad. I can't imagine the amount of blood, sweat, and tears that went into this.

^{^}

Stockfish, the goliath of computer chess, has by now caught up and overtaken the record set by AlphaZero.

^{^}

There are, however, weak hand-wavy arguments for why we might be lucky sometimes. In particular, the more general an algorithm, the more robust it tends to be to perturbations. Robustness is something that evolution "cares" a lot about. Generality is something that DeepMind cares a lot about (see also MuZero). Perhaps the number of totally general algorithm that can do all the things humans do is not that large? Sounds plausible, right?

LESSWRONG
LW

LESSWRONG
LW

29

The Unreasonable Feasibility Of Playing Chess Under The Influence

29

29

A proud history of drinking and playing chess

How do they do it?

How to solve chess

Concluding thoughts