A draft honesty policy for credible communication with AI systems
This is a rough research note – we’re sharing it for feedback and to spark discussion. We’re less confident in its methods and conclusions.

Context

We think that it would be very good if human institutions could credibly communicate with advanced AI systems. This could enable positive-sum trade between humans...