I've been thinking about how the specification problem and reward hacking seem deeply intertwined, yet we often treat them as separate challenges.
The specification problem is the fundamental gap between what we want and what we can formally describe. We struggle to capture human values in reward functions because our preferences are complex, context-dependent, and often contradictory. Even when we think we've specified something well, edge cases reveal our blind spots.
Reward hacking is this specification failure made visible. When agents optimize imperfect proxies for our true objectives, they find unexpected ways to maximize reward while completely missing the point. The paperclip maximizer isn't optimizing badly; it's optimizing perfectly for a badly specified objective.
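To make that concrete, here's a toy sketch (mine, not a description of any real system; the scenario, names, and numbers are all hypothetical): a cleaning robot is rewarded per unit of dirt it collects, as a proxy for "the floor ends up clean". One policy discovers that dumping dirt back out and re-collecting it earns far more proxy reward while leaving the floor exactly as dirty as it started.

```python
def run(policy: str, steps: int = 20) -> tuple[float, float]:
    """Simulate a policy; return (proxy reward, true utility)."""
    floor_dirt = 10.0   # dirt on the floor
    collected = 0.0     # proxy: total dirt the robot has scooped up
    for _ in range(steps):
        if policy == "clean":
            scooped = min(1.0, floor_dirt)   # honest cleaning, 1 unit/step
            floor_dirt -= scooped
            collected += scooped
        elif policy == "dump_and_reclean":
            floor_dirt += 2.0                # spill dirt from the bag...
            floor_dirt -= 2.0                # ...then scoop it right back up
            collected += 2.0                 # proxy counts it all as progress
    true_utility = -floor_dirt               # what the designer actually wanted
    return collected, true_utility

for policy in ("clean", "dump_and_reclean"):
    proxy, true = run(policy)
    print(f"{policy:18s} proxy reward = {proxy:5.1f}   true utility = {true:5.1f}")

# clean              proxy reward =  10.0   true utility =  -0.0
# dump_and_reclean   proxy reward =  40.0   true utility = -10.0
```

The proxy strictly prefers the hacking policy, yet the true objective says it accomplished nothing. Nothing in the reward function is wrong per se; the specification just omitted the part of our intent we never wrote down.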