Is it true or is it useful? We want to know the capabilities of our machines to understand their utility. Likewise, we must evaluate them to be aware of emerging threats. Cybersecurity suites, agentic coding benchmarks, and mathematic reasoning are tractable areas for pre-deployment testing. But scientific ability proves much...
Why I’m betting my career on dangerous science “The nation and Laboratory are faced with several growing security threats, and there is a pressing need to focus our research and development efforts to address these challenges, […] We strongly believe that research and development in biology, biomedical systems, biological defense,...
Language models offer Americans an overlooked benefit: direct access to information beyond the Anglosphere. Despite the internet’s global reach, we’ve confined ourselves to English-language sources, relying on secondhand reports we treat as authentic, but which are better understood as merely uncontested. Last year, I wrote about two high-ranking Russian agents...
I recently completed BlueDot Impact's AGI strategy pilot cohort and wanted to share what I learned about current alignment challenges. This post walks through a conceptual framework for understanding AI deception, introduces automated safety auditing (PETRI), and explores why envisioning a better future matters despite these risks. > It is...