Joel Burget

Paul Christiano named as US AI Safety Institute Head of AI Safety

> U.S. Secretary of Commerce Gina Raimondo announced today additional members of the executive leadership team of the U.S. AI Safety Institute (AISI), which is housed at the National Institute of Standards and Technology (NIST). Raimondo named Paul Christiano as Head of AI Safety, Adam Russell as Chief Vision Officer, Mara Campbell as Acting Chief Operating Officer and Chief of Staff, Rob Reich as Senior Advisor, and Mark Latonero as Head of International Engagement. They will join AISI Director Elizabeth Kelly and Chief Technology Officer Elham Tabassi, who were announced in February. The AISI was established within NIST at the direction of President Biden, including to support the responsibilities assigned to the Department of Commerce under the President’s landmark Executive Order.
>
> Paul Christiano, Head of AI Safety, will design and conduct tests of frontier AI models, focusing on model evaluations for capabilities of national security concern. Christiano will also contribute guidance on conducting these evaluations, as well as on the implementation of risk mitigations to enhance frontier model safety and security. Christiano founded the Alignment Research Center, a non-profit research organization that seeks to align future machine learning systems with human interests by furthering theoretical research. He also launched a leading initiative to conduct third-party evaluations of frontier models, now housed at Model Evaluation and Threat Research (METR). He previously ran the language model alignment team at OpenAI, where he pioneered work on reinforcement learning from human feedback (RLHF), a foundational technical AI safety technique. He holds a PhD in computer science from the University of California, Berkeley, and a B.S. in mathematics from the Massachusetts Institute of Technology.

257 Apr 16, 2024

Highlights from Lex Fridman’s interview of Yann LeCun

48 Mar 13, 2024

OpenAI appoints Retired U.S. Army General Paul M. Nakasone to Board of Directors

35 Jun 13, 2024

GPT2, Five Years On

34 Jun 5, 2024

Joel Burget

I write software for Survival and Flourishing Corp. Previously MATS, Google, Khan Academy.

818

Notes on Dark Sun (The Making of the Hydrogen Bomb)

In the past couple of years it’s been popular to read Richard Rhodes’ The Making of the Atomic Bomb, especially after Situational Awareness’s prediction / promotion of a new Manhattan Project in AI. However, I think you’ll find more that applies to the current moment in Dark Sun. Consider: physicists...

Sep 2, 2025 22

Economic Post-ASI Transition

Who's done high-quality work / can tell a convincing story about managing the economic transition to a world where machines can do every job better than humans? Some common tropes and why I don't think they're good enough:

* "We've always managed in the past. Take the industrial revolution...

Jan 1, 2025 19

OpenAI appoints Retired U.S. Army General Paul M. Nakasone to Board of Directors

> Today, Retired U.S. Army General Paul M. Nakasone has joined our Board of Directors. A leading expert in cybersecurity, Nakasone’s appointment reflects OpenAI’s commitment to safety and security, and underscores the growing significance of cybersecurity as the impact of AI technology continues to grow.
>
> As a first...

Jun 13, 2024 35

GPT2, Five Years On

Jack Clark's retrospective on GPT2 is full of interesting policy thoughts; I recommend reading the whole thing. One excerpt:

> I've come to believe that in policy "a little goes a long way" - it's far better to have a couple of ideas you think are robustly good in all...

Jun 5, 2024 34

Quick Thoughts on Scaling Monosemanticity

1. How Many Features are Active at Once?

Previously I’ve seen the rule of thumb “20-100 for most models”. Anthropic says:

> For all three SAEs, the average number of features active (i.e. with nonzero activations) on a given token was fewer than 300

2. Splitting SAEs

Having multiple different-sized...

May 23, 2024 28

How is GPT-4o Related to GPT-4?

GPT-4o both has a new tokenizer and was trained directly on audio (whereas my understanding is that GPT-4 was trained only on text and images). Is there precedent for upgrading a model to a new tokenizer? It seems like it's probably better to think of it as an entirely new...

May 15, 2024 10

How to Model the Future of Open-Source LLMs?

I previously expected open-source LLMs to lag far behind the frontier because they're very expensive to train and naively it doesn't make business sense to spend on the order of $10M to (soon?) $1B to train a model only to give it away for free. But this has been repeatedly...

Apr 19, 2024 25
