While I don't necessarily disagree with you on the object level, your thoughts about the source of normative assumptions itself seems to assume moral anti-realism. This may be a reasonable assumption but it's one you don't explore here and your post could equally well be explained in terms of a transcendental realist stance.

Reply

[-]Acorn6y10

Sorry for the format here, and I still try to figure out how to use markdown in the comment.

I find difficulty understanding inferences about parameters $ \alpha,\beta,\gamma $ in the "Example:regret" part.

Take the fully rational planner p for example.

Since the human will say h following s, the different between reward functions for h and -h is non-negative, which implies that: $ (\beta R(h)+\gamma R(h|s)) - (\beta R(\sim h)+\gamma R(\sim h|s)) \geq 0 $

Then it is concluded that $ \beta R(h-\sim h)+\gamma R(h-\sim h|s)\geq0$

Similarly, from the human will say $ \sim h$ following i, we have $ \beta R(h-\sim h)+\delta R(h-\sim h|i)\leq0$

It seems that more information about the reward function is need in order to arrive at the final model with $ (p,R(\alpha,\beta,\gamma,\delta)|\gamma\geq-\beta\geq\delta) $

Reply

[-]Stuart_Armstrong6y30

I saw the $R$ 's as normalised to 1 or zero, and the coefficients as giving them weights. So instead of $β R (h - \sim h) + γ (h - \sim h | s) \geq 0$ , I'd write $β + γ \geq 0$ (given the behaviour and assumptions).

But this is an old post, and is mainly superseded by new ones, so I wouldn't spend too much time on it.

Reply

[-]avturchin7y10

The case about regret could be made stronger if one will actually look in existing psychological literature, which probably explored relation between regret and values.

Also, it is possible to imagine "hyperregret disorder", where a person will regret about any of his-her choice, and in that case regret is non-informative about the preferences.

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

12

Normative assumptions: regret

12

12

Adding normative assumptions

What are they?

Example: regret

Planning and multiple attempts

Is the regret normative assumption correct?

Meta versus feeling