I have not read the paper yet, but we also found very surprising results with the Gemma-2 model [1], which makes it unreliable in terms of trustworthiness (particularly confidence calibration and gender bias), making it risky for some tasks like resume screening or job hiring. Do you think this is also more of a data-mitigation problem?
I have not read the paper yet, but we also found very surprising results with the Gemma-2 model [1], which makes it unreliable in terms of trustworthiness (particularly confidence calibration and gender bias), making it risky for some tasks like resume screening or job hiring. Do you think this is also more of a data-mitigation problem?
[1] https://arxiv.org/abs/2601.07806