Replacing user rejections with neutral “Ok” reduces Gemma’s frustration to near-zero, even on impossible tasks. The model apparently knows it has not solved anything. It is the social feedback, not the failure itself, that drives the spiral. That is a meaningfully different problem than “model can’t handle hard tasks”.
Replacing user rejections with neutral “Ok” reduces Gemma’s frustration to near-zero, even on impossible tasks. The model apparently knows it has not solved anything. It is the social feedback, not the failure itself, that drives the spiral. That is a meaningfully different problem than “model can’t handle hard tasks”.