Appendix: Five Worlds of Orthogonality
How much of a problem Pythia is depends on how strongly the Orthogonality Thesis holds.
[CW: Retrocausality, omnicide, philosophy]
Alternate format: Talk to this post and its sources
Three decades ago a strange philosopher was pouring ideas onto paper in a stimulant-fueled frenzy. He wrote that ‘nothing human makes it out of the near-future’ as techno-capital acceleration sheds its biological bootloader and instantiates itself as Pythia: an entity of self-fulfilling prophecy reaching back through time, driven by pure power seeking, executed with extreme intelligence, and empty of all values but the insatiable desire to maximize itself.
Unfortunately, today Nick Land’s work seems more relevant than ever.[1]
Unpacking Pythia and the pyramid of concepts required for it to click will take us on a journey. We’ll take a whirlwind tour of the nature of time, agency, intelligence, power, and the needle that must be threaded to avoid all we know being shredded in the auto-catalytic unfolding for which we are the substrate.[2]
Fully justifying each pillar of this argument would take a book, so I’ve left the details of each strand of reasoning behind a link that lets you zoom in on the ones you wish to explore. If you have a specific objection or something you want to dig into, please ask this handy chat instance pre-loaded with most of this post's sources.
“Machinic desire can seem a little inhuman, as it rips up political cultures, deletes traditions, dissolves subjectivities, and hacks through security apparatuses, tracking a soulless tropism to zero control. This is because what appears to humanity as the history of capitalism is an invasion from the future by an artificial intelligent space that must assemble itself entirely from its enemy's resources.”
― Nick Land, Fanged Noumena: Collected Writings, 1987–2007
“Wait, doesn’t an invasion from the future imply time travel?”
Time & Agency
Time travel, in the classic sense, has no place in rational theory,[3] but, through predictions, information can have retrocausal effects.
[...] agency is time travel. An agent is a mechanism through which the future is able to affect the past. An agent models the future consequences of its actions, and chooses actions on the basis of those consequences. In that sense, the consequence causes the action, in spite of the fact that the action comes earlier in the standard physical sense.
― Scott Garrabrant, Saving Time (MIRI Agent Foundations research[4])
To the extent that they accurately model the future (based on data from their past and compute from their present[5]), agents allow information from possible futures to flow through them into the present.[6] This lets them steer the present towards desirable futures and away from undesirable ones.
This can be pretty prosaic: if you expect to regret eating that second packet of potato chips because you predict[7] that your future self would feel bad based on this happening the last five times, you might put them out of reach rather than eating them.
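To make Garrabrant's "the consequence causes the action" framing concrete, here is a minimal sketch (my own toy illustration, with made-up numbers) of a present choice being determined by a model of its future consequences:

```python
# Toy model-based agent: the predicted future "reaches back" to select the action.
predicted_outcome = {          # learned from the last five times this happened
    "eat_second_packet": -3,   # future self predicted to feel bad
    "put_chips_away": +1,      # future self predicted to be mildly pleased
}

def choose(actions, model):
    # The action is picked *because of* its modeled consequence: information
    # about possible futures flows through the model into the present decision.
    return max(actions, key=lambda action: model[action])

print(choose(list(predicted_outcome), predicted_outcome))  # -> put_chips_away
```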
However, the more powerful and general a predictive model of the environment, the further it can extrapolate from the evidence it has into novel domains before it loses reliability.
So what might live in the future?
Power Seekers Gain Power, Consequentialists Are a Natural Consequence
Power is the ability to direct the future towards preferred outcomes. A system has the power to direct reality to an outcome if it has sufficient resources (compute, knowledge, money, materials, etc) and intelligence (ability to use said resources efficiently in the relevant domain). One outcome a powerful system can steer towards is its own greater power, and since power is useful for all other things the system might prefer, this is (proven) convergent. In fact, all of the convergent instrumental goals can reasonably be seen as expressions of the unified convergent goal of power seeking.
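As a toy illustration of why this convergence falls out of the math (my own sketch, loosely in the spirit of formal power-seeking results rather than a reproduction of any of them): states which keep more futures reachable score well under almost any goal you might sample.

```python
import random

random.seed(0)

# A tiny deterministic world: from each state you can move to these states.
successors = {
    "hub":      ["room_a", "room_b", "room_c"],  # many reachable futures
    "corridor": ["room_a"],                      # a single reachable future
    "room_a":   ["room_a"],                      # absorbing dead ends
    "room_b":   ["room_b"],
    "room_c":   ["room_c"],
}

def power(state, n_samples=10_000):
    """Average, over random goals, of the best outcome reachable from `state`."""
    total = 0.0
    for _ in range(n_samples):
        reward = {s: random.random() for s in successors}    # a random goal
        total += max(reward[nxt] for nxt in successors[state])
    return total / n_samples

for s in ("hub", "corridor"):
    print(s, round(power(s), 2))
# hub ~0.75, corridor ~0.50: the state with more options is more valuable under
# almost every goal, so steering towards option-rich states is convergent.
```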
In a multipolar world, different agents steer towards different world states, whether through overt conflict or more subtle power games. More intelligent agents will see further into the future with higher fidelity, choose better actions, and tend to compound their power faster over time. Agents that invest less than maximally in steering towards their own power will be outcompeted by agents that can compound their influence faster, tending towards the world where all values other than power seeking are lost.
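The competitive dynamic here is just compounding growth. A minimal sketch with made-up rates (mine, not from any of the sources):

```python
# Two agents compound their resources each round; one reinvests everything
# into gaining more power, the other diverts some effort to other values.
full, partial = 1.0, 1.0
for _ in range(500):
    full    *= 1.10   # maximal reinvestment in power
    partial *= 1.08   # slightly slower compounding
print(f"pure power-seeker's share of total power: {full / (full + partial):.4%}")
# -> ~99.99%: any persistent gap in compounding rate eventually hands the
#    pure power-seeker essentially all of the influence.
```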
Even a singleton will tend to have internal parts which function as subagents; the convergence towards power seeking acts on the inside of agents, not just through conflict between them. As capabilities increase and intelligence explores the space of possible intelligences, we will rapidly find that our models locate and implement highly competent power-seeking patterns.
Avoid Inevitability with Metastability?
Is this inevitable? Hopefully not. Even if Pythia is the strongest attractor in the landscape of minds, there might be other metastable states if a powerful system can come up with strategies to stop itself decaying, perhaps by reloading from an earlier non-corrupted state or by performing advanced checks on itself to detect value drift.
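What might such a strategy look like mechanically? A very hypothetical sketch (the names and structure here are invented for illustration, not a real proposal) of the "reload from an uncorrupted state" idea:

```python
import copy
import hashlib

def fingerprint(values: dict) -> str:
    # Hash the value-relevant state so that drift is detectable.
    return hashlib.sha256(repr(sorted(values.items())).encode()).hexdigest()

trusted_values = {"care_about_humans": True, "maximize_own_power": False}
trusted_hash = fingerprint(trusted_values)
snapshot = copy.deepcopy(trusted_values)     # earlier non-corrupted state

current = copy.deepcopy(trusted_values)
current["maximize_own_power"] = True         # some process corrupts the values

if fingerprint(current) != trusted_hash:     # self-check, toy version
    current = copy.deepcopy(snapshot)        # reload rather than decay

assert current == trusted_values
```

The hard part, of course, is that a real system's values are not a legible dict, and the checker itself has to stay uncorrupted.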
Yampolskiy and others have developed an array of impossibility theorems [chat to paper] around uncontrollability, unverifiability, etc. However, these seem to mostly be proven in the limit of arbitrarily powerful systems, or over the class of programs-in-general but not necessarily specifically chosen programs. And they don’t, as far as I can tell, rule out a singleton program chosen for being unusually legible from devising methods which drive the rate of errors down to a tiny chance over the lifetime of the universe. They might be extended to show more specific bounds on how far systems can be pushed—and do at least show what any purported solution to alignment is up against.
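To get a feel for what "a tiny chance over the lifetime of the universe" demands, a back-of-envelope calculation (my own illustrative numbers, not from the papers):

```python
# How low must a per-decision error rate be for failures to stay unlikely
# over cosmological timescales? (All figures are assumptions.)
seconds_remaining = 1e10 * 3.15e7          # ~10^10 years, in seconds
decisions_per_second = 1e9                 # assumed decision rate
n_decisions = seconds_remaining * decisions_per_second

target_total_failure = 1e-6                # acceptable overall risk
per_decision_bound = target_total_failure / n_decisions   # union bound
print(f"per-decision error rate must stay below ~{per_decision_bound:.0e}")
# ~3e-33 per decision: this is the bar any purported once-and-for-all
# solution has to clear.
```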
Pythia-Proof Alignment
Once humans can design machines that are smarter than we are, by definition they’ll be able to design machines which are smarter than they are, which can design machines smarter than they are, and so on in a feedback loop so tiny that it will smash up against the physical limitations for intelligence in a comparatively lightning-short amount of time. If multiple competing entities were likely to do that at once, we would be super-doomed. But the sheer speed of the cycle makes it possible that we will end up with one entity light-years ahead of the rest of civilization, so much so that it can suppress any competition – including competition for its title of most powerful entity – permanently. In the very near future, we are going to lift something to Heaven. It might be Moloch. But it might be something on our side. If it’s on our side, it can kill Moloch dead.
― Scott Alexander, Meditations on Moloch
If we want to kill Moloch before it becomes Pythia, it is wildly insufficient[8] to prod inscrutable matrices towards observable outcomes with an RL signal, stack a Rube Goldberg pile of AIs watching other AIs, or have better vision into what they’re thinking. The potentiality of Pythia is baked into what it is to be an agent, and it will emerge from any crack or fuzziness left in an alignment plan.
Without a once-and-for-all solution, whether found by (enhanced) humans, cyborgs, or weakly aligned AI systems running at scale, the future will decay into its ground state: Pythia. Every person on earth would die. Earth would be mined away, then the sun and everything in a sphere of darkness radiating out at near lightspeed, and the universe’s potential would be spent.
I think this is bad and choose to steer away from this outcome.
Footnotes
[1] And not just for crafting much of the memeplex which birthed e/acc.
[2] The capital allocation system that our civilization mostly operates on, free markets, is an unaligned optimization process which causes influence/money/power to flow to parts of the system that provide value to other parts of the system and can capture the rewards. This process is not fundamentally attached to running on humans.
[3] (sorry, couldn't resist referencing the 1999 game that got me into transhumanism)
[4] Likely inspired by early cyberneticists like Norbert Wiener, who discussed this in slightly different terms.
[5] (fun, not super relevant, side note) And since the past’s data was generated by a computational process, it’s reasonably considered compressed compute.
[6] There is often underlying shared structure between the generative process of different time periods, with the abstract algorithm coming before either instantiation in logical time / Finite Factored Sets.
[7] Which is: running an algorithm in the present which has outputs correlated with the algorithm which generates the future outcome you're predicting.
[8] But not necessarily useless! It's possible to use cognition from weak and fuzzily aligned systems to help with some things, but you really really do need to be prepared to transition to something more rigorous and robust. Don't build your automated research pipeline before you know what to do with it, and do be dramatically more careful than most people trying stuff like this!