Capability Self-Knowledge As an Alignment Property: Measuring and Closing the Gap
I've been researching AI for the past 2 years, but I wouldn't refer to myself as a researcher. I have no PhD, have never written a formal research paper and have never been published. In fact this is my first post here on LessWrong (possibly the last thing I will...
Jan 141