This is the first post in a planned series about mean field theory by Dmitry and Lauren (this post was written by Dmitry with lots of input from Lauren, and a second part should be coming soon). The posts are a combination of an explainer and some original research and experiments.
The goal of this series is to explain an approach to understanding and interpreting model internals which we informally denote "mean field theory" or MFT. In the literature, the closest matching term is "adaptive mean field theory". We will use the term loosely to denote a rich emerging literature that applies many-body thermodynamic methods to neural net interpretability. It includes work on both Bayesian learning and dynamics (SGD), and work in wider "NNFT" (neural net field theory) contexts. Dmitry's recent post on learning sparse denoising also heuristically fits into this picture (or more precisely, a small extension of it).
Our team at Principles of Intelligence (formerly PIBBSS) believes that this point of view on interpretability remains highly neglected: it should be better understood, and its ideas should be used much more widely in interpretability thinking and tools.
We hope to formulate this theory in a more user-friendly way that can be absorbed and used by interpretability researchers. This particular post is closely related to the paper "Mitigating the Curse of Detail: Scaling Arguments for Feature Learning and Sample Complexity". The experiments are new.
What do we mean by mean field theory?
Mean field theory is a vague term with many meanings, but for at least the first few posts we will focus on adaptive mean field theory (see for example this paper, written with a physicist audience in mind). It is a theory of infinite-width networks that differs from the more classical (and, as we'll explain below, less expressive) neural tangent kernel formalism and the related Gaussian process limits.
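To fix intuition, here is one standard way the difference is often written for a two-layer network of width $N$ (a generic textbook-style sketch on our part, not notation taken from the paper above): the mean field and NTK regimes correspond to different scalings of the output layer,

$$f_{\mathrm{MF}}(x) \;=\; \frac{1}{N}\sum_{i=1}^{N} a_i\,\sigma(w_i \cdot x) \qquad\text{vs.}\qquad f_{\mathrm{NTK}}(x) \;=\; \frac{1}{\sqrt{N}}\sum_{i=1}^{N} a_i\,\sigma(w_i \cdot x).$$

Under the mean field $1/N$ scaling, individual neurons $(a_i, w_i)$ move by $O(1)$ during training, so features are genuinely learned and the empirical distribution over neurons becomes the natural object to track. Under the $1/\sqrt{N}$ NTK scaling, parameters barely move at infinite width and the network behaves like a fixed kernel around its initialization, which is the sense in which that limit is less expressive.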
Ultimately mean field theory is a theory of neurons (which are treated somewhat li