Arguments for Robustness in AI Alignment
Most popular robustness papers address short-term robustness failures in neural networks, e.g., in the context of autonomous driving. I believe that robustness is also critical for the long-term success of AI alignment, and ending the discussion with a statement along the lines of "Meh, this doesn't seem solvable;...
Jan 19, 2024