You can quantize any distribution. For the random distribution, you need to use 0.000... 01% quantilization to get anything useful at all. (In non-trivial environments, the vast majority of random actions are useless junk. If you have a humanoid coffee making robot, almost all random motor inputs will result in twitching on the floor.)

However, you can also quantalize over a model trained to imitate humans. Suppose you give some people a joystick and ask them to remote control the robot to make coffee. They manage this about half the time. You train the robot to be a 25% quantalizer. The top 25% of human actions will do better than humans.

This is technically equivalent to an imitator of humans, and an ASI that can only output 2 bits (which are used as a random seed for the human)

Though these descriptions imply different internal workings, which may indicate different probabilities of the AI using rowhammer.

Reply

[-]Charlie Steiner4y40

Yeah, I think this is the key point. Quantilizers are only safe or smart relative to some base distribution.

Reply

[-]TurnTrout4y30

You mention:

If we assume the strategies have a measure biased towards short strategies or use a prefix-free encoding or a subset of the strategies are implicitly something like that

Can you discuss more your assumptions on the quantilizer's base distribution? I take it that it's uniform?

Reply

[-]itaibn04y10

Really all I need is that a strategy that takes n bits to specify will be performed by 1 in of all random strategies. Maybe a random strategy consists of a bunch of random motions that cancel each other out, and in 1 in $\sim 2^{n}$ of strategies in between these random motions are directed actions that add up to performing this n-bit strategy. Maybe 1 in $\sim 2^{n}$ strategies start off by typing this strategy to another computer and end with shutting yourself off, so that in the remaining bits of the strategy will be ignored. A prefix-free encoding is basically like the latter situation except ignoring the bits after a certain point is built into the encoding rather than being an outcome of the agent's interaction with the environment.

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

11

Quantilizer ≡ Optimizer with a Bounded Amount of Output

11

Ω 5

11

Ω 5