I updated a bit towards thinking that incompetence-at-reasoning is a more common/influential factor than I previously thought. Thanks.
However: Where do you think that moral realism comes from? Why is it a "thorny" issue?
social-media-like interfaces for uncovering group wisdom and will at larger scales while eliciting more productive discourse
That seems like it might significantly help with raising the sanity waterline, and thus help with coordinating on AI x-risk, and thus be extremely high-EV (if it's successfully implemented, widely adopted, and humanity survives for a decade or two beyond widespread adoption).[1]
Do you think it would be practically possible with current LLMs to implement a version of social media that promoted/suggested content based on criteria like
The "widely adopted" part seems difficult to achieve, though. The hypermajority of genpop humans would probably just keep scrolling TikTok and consuming outrage porn on X, even if Civilization 2.0 Wholesome Social Media were available. ↩︎
This discourse structure associates related claims and evidence, [...]
To make it practically possible for non-experts to efficiently make sense of large, spread-out collections of data (e.g. to answer some question about the discourse on some given topic), it's probably necessary to not only rapidly summarize all that data, but also translate it into some easily-human-comprehensible form.
I wonder if it's practically possible to have LMs read a bunch of data (from papers to Twitter "discourse") on a given topic, and rapidly/on-demand produce various kinds of concise, visual, possibly interactive summaries of that topic? E.g. something like this, or a probabilistic graphical model, or some kind of data visualization (depending on what aspect of what kind of topic is in question)?
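To make this concrete, here's a rough sketch of the kind of pipeline I have in mind. It's purely hypothetical: `llm_complete` is a stand-in for whatever LM API one would actually use, and the JSON claim/relation schema is just one arbitrary design choice among many.

```python
import json

import networkx as nx  # for aggregating extracted claims into a graph


def llm_complete(prompt: str) -> str:
    """Stand-in for whatever LM API is actually available (hypothetical)."""
    raise NotImplementedError


EXTRACTION_PROMPT = """\
From the text below, extract the main claims and how they relate to each other.
Return a JSON list of objects with fields:
  "claim":    the claim, stated concisely
  "supports": list of claims this claim supports
  "attacks":  list of claims this claim disputes

TEXT:
{text}
"""


def extract_claims(document: str) -> list[dict]:
    """Ask the LM to turn one document into structured claim/relation data."""
    raw = llm_complete(EXTRACTION_PROMPT.format(text=document))
    return json.loads(raw)  # real code would have to handle malformed output


def build_claim_graph(documents: list[str]) -> nx.DiGraph:
    """Aggregate per-document extractions into one graph of a topic's discourse."""
    graph = nx.DiGraph()
    for doc in documents:
        for item in extract_claims(doc):
            claim = item["claim"]
            graph.add_node(claim)
            for target in item.get("supports", []):
                graph.add_edge(claim, target, relation="supports")
            for target in item.get("attacks", []):
                graph.add_edge(claim, target, relation="attacks")
    return graph


# The resulting graph could then be rendered as a (possibly interactive) visual
# summary, e.g. exported via nx.nx_pydot.write_dot(graph, "topic.dot").
```

Obviously all the hard parts (picking sources, deduplicating, grounding claims in actual evidence, robustness to adversarial or deceptive inputs) are hidden inside the prompt and the corpus; the sketch is only meant to suggest that the plumbing itself would be cheap.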
Ideally perhaps, raw observations are reliably recorded, [...]
Do you have ideas for how to deal with counterfeit observations or (meta)data (e.g. deepfaked videos)?
Given that the basic case for x-risks is so simple/obvious[1], I think most people arguing that there is no risk are probably doing so due to some kind of myopic/irrational subconscious motive. (It's entirely reasonable to disagree on probabilities, or on what policies would be best, etc.; but "there is practically zero risk" is just absurd.)
So I'm guessing that the deeper problem/bottleneck here is people's (emotional) unwillingness to believe in x-risks. So long as they have some strong (often subconscious) motive to disbelieve x-risks, any conversation about x-risks is liable to keep getting derailed or otherwise be very unproductive.[2]
I think some common underlying reasons for such motivated disbelief include
I'm not sure what the best approaches are for addressing the above kinds of dynamics. Trying to directly point them out seems likely to end badly (at least with most neurotypical people). Small mental exercises like Split and Commit or giving oneself a line of retreat might help with (1.), if you can somehow get people to (earnestly) do them? For (2.), maybe
If you try the above, I'd be curious to see a writeup of the results.
Building a species of superhumanly smart & fast machine aliens without understanding how they work seems very dangerous. And yet, various companies and nations are currently pouring trillions of dollars into making that happen, and appear to be making rapid progress. (Experts disagree on whether there's a 99% chance we all die, or if there's only a 10% chance we all die and a 90% chance some corporate leaders become uncontested god-emperors, or if we end up as pets to incomprehensible machine gods, or if the world will be transformed beyond human comprehension and everyone will rely on personal AI assistants to survive. Sounds good, right?) ↩︎
A bit like trying to convince a deeply religious person via rational debate. It's not really about the evidence/reasoning. ↩︎
I wouldn't be too surprised if this kind of instinct were evolved, rather than just learned. Even neurotypical humans try to hack each other all the time, and clever psychopaths have probably been around for many, many generations. ↩︎
Think of the stuff that, when you imagine it, feels really yummy.
Also worth taking into consideration: things that feel anti-yummy. Fear/disgust/hate/etc are also signals about your values.
I think the "your values" framing itself already sneaks in assumptions which are false for a lot of minds/brains. Notably: most minds are not perfectly monolithic/unified things well-modeled as a coherent "you/I/me". And other minds are quite unified/coherent, but are in the unfortunate situation of running on a brain that also contains other (more or less adversarial) mind-like programs/wetware.
Example:
It is entirely possible to have strongly-held values such as "I reject so-and-so arbitrary/disgusting parts of the reward circuitry Evolution designed into my brain; I will not become a slave to the Blind Idiot God's whims and attempts to control me". In that case, the "I" that holds those values clearly excludes at least some parts of its host brain's yumminess-circuitry.[1] (I.e., feelings of yumminess forced upon the mind are not signals about that mind's values, but rather more like attempts by a semi-adversarial brain to hack that mind.)
Another example:
Alex has some shitty experiences in childhood, and strongly internalizes a schema S like "if I do X, I will be safe", and thereafter has strong yumminess feelings about doing X. But later, upon reflection, Alex realizes that the yumminess feelings are coming from S, and that S's implicit models of reality aren't even remotely accurate now, in adulthood. Alex would like to delete S from their brain, but can't. So the strong yumminess-around-X persists. Is X one of Alex's values?
So, I object to what I perceive to be an attempt to promote a narrative/frame about what constitutes "you/I/me" or "your values" for people in general. (Though I'm guessing there was no malice involved in promoting it.) Especially when it is a frame that seems to imply that many people (as they conceive of themselves) are not really/fully persons, and/or that they should let arbitrary brain-circuits corrupt their souls (if those brain-circuits happen to have the ability to produce feelings of yumminess).
Please be more careful about deploying/rolling your own metaethics.
Maybe that "I" could be described as a learned mesaoptimizer, something that arose "unintentionally" from the perspective of some imaginable/nonexistent Evolution-aligned mind-designer. But so what? Why privilege some imaginary Evolution fairy over an actually existing person/mind? ↩︎
I think some of the central models/advice in this post [1] are in an uncanny valley of being substantially correct but also deficient, in ways that are liable to lead some users of the models/advice to harm themselves. (In ways distinct from the ones addressed in the post under admonishments to "not be an idiot".)
In particular, I'm referring to the notion that
The Yumminess You Feel When Imagining Things Measures Your Values
I agree that "yumminess" is an important signal about one's values. And something like yumminess, or built-in reward signals, is what shapes one's values to begin with. But there are some further important points to consider. Notably: Some values are more abstract than others[2]; values differ a lot in terms of
Also, we are computationally limited meat-bags, sorely lacking in the logical omniscience department.
This has some consequences:
Which in turn raises questions like
The endeavor of answering the above kinds of questions --- determining how to resolve the "shoulds" in them --- is itself value-laden, and also self-referential/recursive, since the answer depends on our meta-values, which themselves are values to which the questions apply.
Doing that properly can get pretty complicated pretty fast, not least because doing so may require Tabooing "I/me" and dissecting the various constituent parts of one's own mind down to a level where introspective access (and/or understanding of how one's own brain works) becomes a bottleneck.[7]
But in conclusion: I'm pretty sure that simply following the most straightforward interpretation of
The Yumminess You Feel When Imagining Things Measures Your Values
would probably lead to doing some kind of violence to one's own values, to gradually corrupting[8] oneself, possibly without ever realizing it or feeling bad at any point. The probable default being "might makes right" / letting the more obvious-to-S1 values eat up ever more of one's soul, at the expense of one's more abstract values.
Addendum: I'd maybe replace
The Yumminess You Feel When Imagining Things Measures Your Values
with
The Yumminess You Feel When Imagining Things is evidence about how some parts of your brain value the imagined things, to the extent that your imagination adequately captured all relevant aspects of those things.
or, the models/advice many readers might (more or less (in)correctly) construe from this post ↩︎
Examples of abstract values: "being logically consistent", "being open-minded/non-parochial", "biting philosophical bullets", "taking ideas seriously", "valuing minds independently of the substrate they're running on". ↩︎
To give one example: Acting without adequately accounting for scope insensitivity. ↩︎
Because S1 yumminess-detectors don't grok the S2 reasoning required to understand that a goal scores highly according to the abstract value, so pursuing the goal feels unrewarding. ↩︎
Example: wanting heroin, vs wanting to not want heroin. ↩︎
Depends on (i.a.) the extent to which we value "being the kind of person I would be if my brain weren't so computationally limited/stupid", I guess. ↩︎
IME. YMMV. ↩︎
as judged by a more careful, reflective, and less computationally limited extrapolation of one's current values ↩︎
So what do you do about the growing aversion to information which is unpleasant to learn? This list is incomplete, and I'd appreciate your help in expanding it.
The underlying problem seems to be something like "System 1 fails to grok that the Map is not the Territory". So the solution would likely be something that helps S1 grok that.
Possibly helpful things:
- Imagine, in as much concrete/experiential detail as possible, the four worlds corresponding to "unpleasant thing is true/false" × "I do/don't believe the thing". Or at least the world where "unpleasant thing is true but I don't believe it".
Dominic Cummings (former Chief Adviser to the UK PM) has written some things about nuclear strategy and how it's implemented in practice. IIUC, he's critical of (i.a.) how Schelling et al.'s game-theoretic models are (often naively/blindly) applied to the real world.