"AlphaZero took 8x less compute to get to AlphaGoZero level performance 1 year later."
This looks like a typo -- the second algorithm should be AlphaGo?
I read you as saying Stag Hunt is a multi/multi game. If I'm right, why?
I was excited to be reminded by this post of Louis Sachar's classic Wayside School series: in one section of the first puzzle book installment, Sideways Math from Wayside School, you read about students at various places on the social behavior curve for participating in a game of basketball, and are asked to determine who will play under the changing circumstances of that day's recess.
Something triggered in me by this response -- and maybe similar to part of what you were saying in the later part: sometimes preferences aren't affected much by the social context, within a given space of social contexts. People may just want to use chopsticks because they are fun, rather than caring about what other people think about them.
Also, societal preferences for a given thing might actually decrease when more and more people are interested in them. For example, demand for a thing might cause the price to rise. With orchestras: if lots of people are already playing violin, that increases the relative incentive for others to learn viola.
Once you add this condition, are current state-of-the-art Starcraft-learning ANNs still getting more training data than humans?
Are there public links to natural language and / or computer code descriptions of the funding pipeline (with donors, recommenders, and donees) that Jaan described in the conversation? I don't think I got the full structure from his description.
I can feel a pressure to try to guess the other person's worldview and conform to it. Recently I have been I think better at just trying to debate things out with others. Possibly I may get uncomfortable if consensus isn't reached. I'm getting maybe a little bit more comfortable with this possibility though. Something interesting that can come up is a strong indignant feeling: "how the hell could anyone NOT believe X!!", which can cause me to change those exclamation points into a question mark and start wondering, which could potentially take a long time (currently I am confused about God beliefs/unbeliefs, after realizing that I sort of identify as an atheist but have a hard time identifying clear reasons that I should).
Another thing that I have noticed is the possibility to give silent responses rather than essentially lying. This can be very uncomfortable and sad, but may have benefits as well. I think it can feel pretty awful if I end up having to give a lot of silent responses over a period where I ALSO am not able to give myself much space to think (e.g. in a situation where I am constantly around people for a substantial period of time and not able to find a way to give myself "sufficient seclusion").
I did identify with this. Nothing concrete to share right now.
Maybe his translator, but not him.