TheManxLoiner

Trying to grok Anthropic's Global Workspace paper and the J-space

Introduction The aim of this post is to share a quick attempt at grokking the conceptual ideas that lie behind the notion of J-space and how it is calculated in the paper Verbalizable Representations Form a Global Workspace in Language Models. Specifically, I am trying to understand Section 2.1 of...

Jul 712

Advice for budding research managers/coaches after 6 months at MATS

Here is my advice for people interested in research management (RM). It’s an info dump, but you should at least skim all the materials if you are seriously considering this for your career. What is RM? * RM means different things in different places. Ensure you check what the work...

May 2813

AI as a Social Technology, by Henry Farell

Last week, I attended a talk ‘AI as a social technology’ by Henry Farell (HF) at the Blavatnik School of Government in Oxford. In this post, I list various thoughts or recollections I have. If I had more time, I would create a more coherent flowing narrative, but I’d rather...

May 2715

What I like about MATS and Research Management

Crossposted on my personal blog. This is post number 16 in my second attempt at doing Inkkaven in a day, i.e. to write 30 blogposts in a single day. MATS is an organization that pairs up-and-coming AI Safety researchers (who I call participants) with the world’s best (this is not...

Apr 58

A Proposal for a Better ARENA: Shifting from Teaching to Research Sprints

TLDR I propose restructuring the current ARENA program, which primarily focuses on contained exercises, into a more scalable and research-engineering-focused model consisting of four one-week research sprints preceded by a dedicated "Week Zero" of fundamental research engineering training. The primary reasons are: * The bottleneck for creating good AI safety...

Jan 1028

Quotes on OpenAI's timelines to automated research, safety research, and safety collaborations before recursive self improvement

I watched OpenAI's latest livestream from Oct 28th 2025 (after the news that OpenAI has transitioned into public benefit corporation). I found four parts of particular interest to the AI safety community. Internal timelines: AI research intern by Sep 2026 and AI researcher by Mar 2028 07:00 minutes in. >...

Oct 29, 202517

A distillation of Ajeya Cotra and Arvind Narayanan on the speed of AI progress

Introduction To help improve my own world models around AI, I am trying to understand and distill different worldviews. One worldview I am trying to understand is ‘AI as a normal technology’, by Arvind Narayanan and Sayash Kapoor. As a stepping stone to distilling that 15,000 word beast, I am...

Jul 22, 20259

TheManxLoiner

TheManxLoiner

AI as a powerful meme, via CGP Grey

A Proposal for a Better ARENA: Shifting from Teaching to Research Sprints

Distillation of 'Do language models plan for future tokens'

My experience at ML4Good AI Safety Bootcamp

TheManxLoiner

AI as a powerful meme, via CGP Grey

A Proposal for a Better ARENA: Shifting from Teaching to Research Sprints

Distillation of 'Do language models plan for future tokens'

My experience at ML4Good AI Safety Bootcamp

Trying to grok Anthropic's Global Workspace paper and the J-space

Advice for budding research managers/coaches after 6 months at MATS

AI as a Social Technology, by Henry Farell

What I like about MATS and Research Management

A Proposal for a Better ARENA: Shifting from Teaching to Research Sprints

Quotes on OpenAI's timelines to automated research, safety research, and safety collaborations before recursive self improvement

A distillation of Ajeya Cotra and Arvind Narayanan on the speed of AI progress