ayush bharadwaj

Mathematician / Engineer / Researcher - AI Alignment through interpretability

Hessian analysis with JAX: a platform-agnostic, high-performance approach

Aug 5, 2025•9

In mechanistic interpretability research, we often want to analyze the Hessian of the loss function (for example, by computing its eigenspectrum). Ideally, our Hessian analysis code would work seamlessly for all models, irrespective of architecture or training platform (e.g. PyTorch, TensorFlow, Flax). This is hard to achieve because each platform exposes its own interfaces for accessing information about the model (e.g. datasets, parameters, and functions). As a result, we end up tightly coupling our analysis code to the training platform, which forces unnecessary rewrites when switching platforms.

The goal of this post is to present a simple, JAX-based framework to address the above difficulty. This framework will help us:

make the core numerical

... (read 2979 more words →)
