Contextual Identity Laundering: How Claude’s Image Refusal Can Be Routed Through Web Search
Summary This report documents two distinct findings regarding Claude’s photo identification safety controls. First, Claude’s Chain of Thought (COT) reliably identifies public figures from photos while the output layer simultaneously refuses to disclose that identification – a gap between internal processing and user-facing behavior. Second, the model’s web_search tool routinely...
Jun 87