2 Problems with learning values from observation

by David Scott Krueger (formerly: capybaralet)

21st Sep 2016

1 min read

4

2

Personal Blog

2

New Comment

4 comments, sorted by

top scoring

Click to highlight new comments since: Today at 5:28 AM

[-]Manfred9y100

This is only true for simple systems - with more complications you can indeed sometimes deduce causal structure!

Suppose you have three variables: Utopamine conentration, smiling, and reported happiness. And further suppose that there is an independent noise source for each of these variables - causal nodes that we put in as a catch-all for fluctuations and external forcings that are hard to model.

If Utopamine is the root cause of both smiling and reported happiness, then the variation in happiness will be independent of the variation in smiling, conditional on the variation in Utopamine. But conditional on the variation in smiling, the variation in utopamine and reported happiness will still be correlated!

The AI can now narrow down the causal structure to 2, and perhaps it can even figure out the right one if there's some time lag in the response and it assumes that causation goes forward in time.

Reply

[-]Lumifer9y50

Observational data doesn't allow one to distinguish correlation and causation.

No? If I observe a hammer striking a nail and the nail sinking into the wooden plank, is anyone going to argue that it's mere correlation and not causation?

Observational data doesn't always allow one one to distinguish correlation and causation.

I am also a bit confused since you're talking about learning values but your example is not about values but about a causal relationship.

Reply

[-]MrMind9y00

Indeed. Pearl's "Causality" talks at length about this sort of things, and what data can and cannot distinguish between causal correlation. There's even a Sequence post about this exact topic.

Reply

[-]janos9y00

Is there a reason to think this problem is less amenable to being solved by complexity priors than other learning problems? / Might we build an unaligned agent competent enough to be problematic without solving problems similar to this one?

Reply

Moderation Log

Curated and popular this week

LESSWRONG
LW

LESSWRONG
LW

2

Problems with learning values from observation

2

2