Week 3: Adversarial Robustness
Authors: Ely Hahami, Lavik Jain, Emira Ibrahimović
This work was done as week 3’s experiment for Boaz Barak’s “CS 2881r: AI Safety and Alignment” at Harvard. The lecture where this work was presented can be viewed on YouTube here, and its corresponding blogpost can be found here. Code for experiment...
Nov 21, 2025