Defending Against Model Weight Exfiltration Through Inference Verification
by Roy Rinberg, Adam Karvonen, dreuter, and Keri Warr
Authors: Roy Rinberg, Adam Karvonen, Alex Hoover, Daniel Reuter, Keri Warr Arxiv paper link One Minute Summary Anthropic has adopted upload limits to prevent model weight exfiltration. The idea is simple: model weights are very large, text outputs are small, so if we cap the output bandwidth, we can make...
Dec 15, 2025120