An AI alignment research agenda based on asymmetric debate and monitoring.
Epistemic Status: Personal research agenda exploring alignment approaches under assumptions of human coordination and bounded timelines. Not a comprehensive survey; it reflects my interests and current thinking. TL;DR: Under cooperative conditions, and assuming we have to build ASI on relatively short timelines, we might solve alignment by building slightly-superhuman...
Apr 10