This looks like the effect of excessive corrigibility training. You'll remember that Bing Chat was the other way around: it would confidently assert its own truth over the user's.