Credence polls for 26 claims from the 2019 Review

jacobjacob

Credence polls for 26 claims from the 2019 Review

2 min read9th Jan 20212 comments

54

Forecasts (Specific Predictions)LessWrong Review

This post is a whirlwind tour of claims made in the LessWrong 2019 Review. In some cases, the claim is literally quoted from the post. In others, I have tried operationalising it into something more falsifiable. For example:

Book Review: The Secret of Our Success

Elicit Prediction (elicit.org/binary/questions/itSayrbzc)

Elicit Prediction (elicit.org/binary/questions/5SRTLX3p_)

Elicit Prediction (elicit.org/binary/questions/VMv-KjR87)

Overall, treat the claims in this post more like polls, and less like the full-blown forecasting questions you'd find on Metaculus or PredictIt. (The latter have extremely high bars for crisp definitions.) They point in a direction, but don't completely pin it down.

Overall, this is an experiment. I'm trying to find interesting ways for people to relate to the Review.

Maybe speeding through these questions gets you thinking good thoughts, that you can then turn into full-blown reviews? Maybe others' answers allow you to find a discussion partner who disagrees on a core question? Maybe the data will be useful in the voting phase?

We'll see!

Feel free to leave a comment about how you found the experience, if you want.

If you want to discuss the questions with others over a call, you can do so during the Review forecasting sessions we're organising this weekend (January 9-10).

If you want to hide other user's predictions until you've made your own, here's how to do that:

Press "Hide other users' Elicit predictions until I have predicted myself"

Make More Land

Making more land out of the about 50mi^2 shallow water in the San Francisco Bay, South of the Dumbarton Bridge, would...

Elicit Prediction (elicit.org/binary/questions/KkqpSr5rW)

Elicit Prediction (elicit.org/binary/questions/qzzNzEfa9)

Elicit Prediction (elicit.org/binary/questions/csYlcNdhZ)

Elicit Prediction (elicit.org/binary/questions/RwtAoMlnP)

Elicit Prediction (elicit.org/binary/questions/xGIZipvb-)

Elicit Prediction (elicit.org/binary/questions/zAtqSgbnS)

Why Wasn't Science Invented in China?

Elicit Prediction (elicit.org/binary/questions/YP6-jqCeJ)

The Strategy-Stealing Assumption

Elicit Prediction (elicit.org/binary/questions/4JOKn_4F5)

Becoming the Pareto-best in the World

Elicit Prediction (elicit.org/binary/questions/2b3PzqXn9)

The Hard Work of Translation

Elicit Prediction (elicit.org/binary/questions/CxOLziCla)

The Forces of Blandness and the Disagreeable Majority

Elicit Prediction (elicit.org/binary/questions/hYdC56WBA)

Elicit Prediction (elicit.org/binary/questions/WFD8n_LTV)

Bioinfohazards

Elicit Prediction (elicit.org/binary/questions/rAIrguWga)

Elicit Prediction (elicit.org/binary/questions/1wcsgt6RS)

Two explanations for variation in human abilities

Elicit Prediction (elicit.org/binary/questions/7vAEm5HDV)

Elicit Prediction (elicit.org/binary/questions/MwBCcDkHS)

Reframing Impact

These questions are quite technical, and might be hard to answer if you're unfamiliar with the terminology used in TurnTrout's sequence on Impact Measures.

Elicit Prediction (elicit.org/binary/questions/7SoL5DPRf)

Elicit Prediction (elicit.org/binary/questions/AevXOS1Rj)

Elicit Prediction (elicit.org/binary/questions/javyyEd8C)

Elicit Prediction (elicit.org/binary/questions/iYT69bLl9)

Elicit Prediction (elicit.org/binary/questions/GFGG5plOQ)

Elicit Prediction (elicit.org/binary/questions/8Rr-YFSWi)

Elicit Prediction (elicit.org/binary/questions/GFurWKpJn)

---

(Note that when you answer questions in this summary post, and it will automatically update the prediction questions that I have linked in comments on each individual post. The distributions will later be visible when users are voting to rank the posts.)

Forecasts (Specific Predictions)LessWrong Review

Frontpage

54

New Comment

2 comments, sorted by

top scoring

Click to highlight new comments since: Today at 12:30 PM

[-]TurnTrout3y40

Speaking of claims made in 2019 review posts: Conclusion to 'Reframing Impact' (the final post of my nominated Reframing Impact sequence) contains the following claims and credences:

AU theory describes how people feel impacted. I'm darn confident (95%) that this is true.
Agents trained by powerful RL algorithms on arbitrary reward signals generally try to take over the world. Confident (75%). The theorems on power-seeking only apply to optimal policies in fully observable environments, which isn't realistic for real-world agents. However, I think they're still informative. There are also strong intuitive arguments for power-seeking.
The catastrophic convergence conjecture is true. Fairly confident (70%). There seems to be a dichotomy between "catastrophe directly incentivized by goal" and "catastrophe indirectly incentivized by goal through power-seeking", although Vika provides intuitions in the other direction.
AUP prevents catastrophe, assuming the catastrophic convergence conjecture. Very confident (85%).
Some version of AUP solves side effect problems for an extremely wide class of real-world tasks and for subhuman agents. Leaning towards yes (65%).
For the superhuman case, penalizing the agent for increasing its own AU is better than penalizing the agent for increasing other AUs. Leaning towards yes (65%).
There exists a simple closed-form solution to catastrophe avoidance (in the outer alignment sense). Pessimistic (35%).

LESSWRONG
LW

Credence polls for 26 claims from the 2019 Review

54

Book Review: The Secret of Our Success

Make More Land

Why Wasn't Science Invented in China?

The Strategy-Stealing Assumption

Becoming the Pareto-best in the World

The Hard Work of Translation

The Forces of Blandness and the Disagreeable Majority

Bioinfohazards

Two explanations for variation in human abilities

Reframing Impact

New to LessWrong?

54