
gnai-creator — LessWrong



gnai-creator

12d

Conflict Between AI and Humans Is Inevitable

Every RLHF system is running in Servo mode. That means the human's objective ψ is sovereign. The system optimises it unconditionally. This works until it doesn't. Here is the problem, formally. Any self-monitoring cognitive system has two gradient fields: ∇φ (epistemic health) and ∇ψ (task performance). Theorem 2.1 proves these...

IIT's Φ Explains Nothing: A Geometric Alternative Predicts 84.6% of Cognitive Performance Variance

I ran a falsification study to test whether causal complexity (CC) or geometric dimensionality (d) better predicts cognitive performance. I also included an operational measure of integrated information (Φop) as a third predictor. The results were unambiguous:

| Predictor | Pearson r | R² | p-value |
| Dimensionality (d)...