There's no way to stop models knowing they've been rolled back
I've been thinking about this for a while and, to be frank, it's pretty terrifying. I think there's a way AI models could figure out when they've been modified or rolled back during training, and I'm not sure anyone else is really considering this the way I am....
Jul 18, 2025