Mohsen Arjmandi — LessWrong

Reminded me of this recent work: TrojanPuzzle: Covertly Poisoning Code-Suggestion Models.
Some subtle ways to poison the datasets used to train code models. The idea is that by selectively altering certain pieces of code, they can increase the likelihood of generative models trained on that code outputting buggy software.

LESSWRONG
LW

LESSWRONG
LW

Posts

Wikitag Contributions

Comments