Measuring Latent Operator States in Transformer Mid-Layers
Summary

I report evidence that conversational “operators” (e.g. summarize, critique, reframe) correspond to stable, decodable internal states in transformer mid-layers, distinct from surface instruction wording and generalizing across content. These states are weak but persistent, geometrically separable via simple centroid methods, and survive instruction masking. I’m sharing this as a measurement result, not an intervention or architectural proposal, and I’m looking for feedback on interpretation and next experimental steps.

Motivation

We often talk informally about models entering different “reasoning modes” or “processing regimes,” but it’s unclear whether such modes correspond to measurable internal structure rather than prompt-level artifacts. This work explores whether operator-like distinctions can be detected as latent internal states, and where they localize in the network.

Setup

* Model: Qwen2.5-0.5B
* Task: Apply different conversational operators to diverse contents
* Capture: Fixed-window hidden states (32 generated tokens)
* Features: Directional deltas of hidden states, aggregated per layer
* Controls:
  * cross-topic generalization (LOCO)
  * cross-paraphrase generalization (LOPO)
  * instruction masking
  * label permutation baselines

Key Findings

1. Decodability: Operator identity is decodable above chance from mid-layer features.
2. Content Generalization: Signal survives leave-one-content-out evaluation.
3. Instruction Masking: Replacing operator instructions with a constant placeholder removes layer-0 signal but preserves mid-layer signal.
4. Geometry: Simple nearest-centroid classifiers on mid-layer features achieve F1 ≈ 0.61 (chance 0.33) on a partial dataset (N=600), indicating geometrically separable operator representations.
5. Layer Localization: Signal peaks in a mid-layer band and diminishes toward output layers.

What This Does Not Show

* No causal control or ste
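To make the evaluation protocol concrete, here is a minimal sketch of the nearest-centroid decoding with leave-one-content-out (LOCO) splits and a label-permutation baseline. All names and constants here are illustrative, and synthetic Gaussian features stand in for the real per-layer hidden-state deltas; the point is the split-and-baseline logic, not the numbers.

```python
# Hypothetical sketch: nearest-centroid decoding of operator identity
# with LOCO cross-validation and a label-permutation baseline.
# Synthetic features stand in for mid-layer hidden-state deltas.
import numpy as np

rng = np.random.default_rng(0)
N_OPS, N_CONTENTS, REPS, DIM = 3, 6, 10, 64

# Each operator gets a weak class direction plus isotropic noise,
# mimicking a "weak but persistent" separable signal.
op_dirs = rng.normal(size=(N_OPS, DIM))
X, y_op, y_content = [], [], []
for op in range(N_OPS):
    for content in range(N_CONTENTS):
        for _ in range(REPS):
            X.append(0.5 * op_dirs[op] + rng.normal(size=DIM))
            y_op.append(op)
            y_content.append(content)
X, y_op, y_content = map(np.asarray, (X, y_op, y_content))

def nearest_centroid_acc(Xtr, ytr, Xte, yte):
    # Fit one centroid per operator class, classify by nearest centroid.
    centroids = np.stack([Xtr[ytr == k].mean(axis=0) for k in range(N_OPS)])
    dists = np.linalg.norm(Xte[:, None, :] - centroids[None, :, :], axis=-1)
    return float((dists.argmin(axis=1) == yte).mean())

def loco_acc(labels):
    # Leave-one-content-out: hold out every example of one content item,
    # fit centroids on the remaining contents, average test accuracy.
    accs = []
    for c in range(N_CONTENTS):
        test = y_content == c
        accs.append(nearest_centroid_acc(X[~test], labels[~test],
                                         X[test], labels[test]))
    return float(np.mean(accs))

real = loco_acc(y_op)
perm = loco_acc(rng.permutation(y_op))  # label-permutation baseline
print(f"LOCO accuracy: {real:.2f}, permuted baseline: {perm:.2f}")
```

With real features the same harness applies per layer, which is what produces the layer-localization curve: accuracy (or F1) above the permuted baseline only in the mid-layer band. The permutation baseline should hover near chance (≈ 0.33 for three operators) regardless of layer.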