Harsha — LessWrong

I guess it can be argued that the anti-bias prompting carries its own biases. All those phrasings ultimately encourage better minority representations because there are imbalances in the current distribution. (irrespective of whether this is desirable or not).

It also feels like there are different types of biases at play with larger and smaller model parameter sizes (even though the small ones are distilled versions). It would be interesting to know if the same candidates were rejected by the bigger and smaller models.

LESSWRONG
LW

LESSWRONG
LW

Posts

Wikitag Contributions

Comments