Wow, thanks for the very comprehensive response. (Also fun to see someone has compiled a modern chess engine on early-mid-90s hardware and shared their results.)
Thanks for writing this post. I have a handful of quick questions: (a) What was the reference MIPS (or the corresponding CPU) you used for the c. 2019-2020 data point? (b) What was the constant amount of RAM you used to run Stockfish? (c) Do I correctly understand that the Stockfish-to-MIPS comparison is based on the equation [edit: not sure how to best format this LaTeX...]:
So, your post piqued my interest to investigate the Intel 80486 a bit more with the question in mind: how comparable are old vs. new CPUs according not just to MIPS but also to other metrics?
It would seem that improvements in MIPS have outpaced improvements in memory-related metrics, e.g. memory latency in units of cycles has gotten worse. What I don't know is how sensitive Stockfish is to variations in memory performance. However I would conclude that I'd update the estimated SF8 ELO on older hardware upwards when additionally accounting for the effect of memory-related metrics in addition to MIPS. In other words, it seems more likely to me that the SF8 ELO curve is underestimated when you include both MIPS and memory-related effects.
(There is another direction one could follow: get Stockfish to compile on an 80486 or other old hardware that one has lying around, and report back with results.)
nostalgebraist's blog is a must-read regarding GPT-x, including GPT-3. Perhaps, start here ("the transformer... 'explained'?"), which helps to contextualize GPT-x within the history of machine learning.
(Though, I should note that nostalgebraist holds a contrarian "bearish" position on GPT-3 in particular; for the "bullish" case instead, read Gwern.)
There's a reference in the footnotes of Schelling p. 116 to a paper by Goffman, "On Face-Work" (Psychiatry 18:224, 1955). The same article was republished in Goffman's book Interaction Ritual (1967).
You might be interested to learn about some recently announced work on training agents with reinforcement learning to play "no-press" Diplomacy:
Thanks, that's a clarifying distinction.
Agree that specifics are important here. Some specifically interesting examples to me where non-profit and for-profit models overlap:
Note that in the above two examples, I've been using the terms "for-profit" and "non-profit" in a primarily regulatory sense, i.e. for-profit = corporation, LP, etc. vs. non-profit = 501(c)(3). In those examples, the terms also seem to map onto their "intentional" sense, but it's unclear what form a general rule might take to disentangle "for-profit" vs "non-profit" in their regulatory vs "intentional" senses.
One other important feedback "loop," or rather a feedback terminal, is an M&A event. The for-profit organization's owners receive a single injection of $ from a new parent organization, and then the for-profit organization (a) continues operating as a separate subsidiary of the parent, or (b) ceases its separate existence, getting liquidated into the parent. (Various outcomes in between can also occur.)
I'm curious whether there is an analog of this sort of M&A "loop" with non-profit organizations. If there is no such analog, then we have two broken feedback loops in non-profits: an indifferent product/sales loop and a nonexistent M&A "loop." How relatively important are the two broken loops in explaining the strengths and weaknesses of for-profit vs non-profit models?
One might even find it doubly fun to give a random internet forum participant $100,000.
Another manifestation of cultural-artifact co-accumulation is binary bootstrapping, e.g. as used for building compilers. In this case, the correspondence between culture and artifact is rather direct: a culturally impactful idea to introduce an addition or change to a programming language must eventually make its way into the compiler source, which itself needs to be compiled into a new binary artifact via existing binary artifacts of older versions of the compiler or of other programs. As the programming language accumulates new ideas, you require newer binary artifacts as well (even if you can store all the binary artifacts). And, as new programmers learn newer programming languages with newer ideas, the older languages gradually fall into disuse. Working with uncommonly known old binary artifacts then becomes a niche field (e.g. maintenance of legacy systems) or a fun hobby (i.e. retro-computing).