Simulator Theory
• Applied to The case for more ambitious language model evals by Jozdien 2mo ago
• Applied to OpenAI Credit Account (2510$) by Emirhan BULUT 2mo ago
• Applied to Goodbye, Shoggoth: The Stage, its Animatronics, & the Puppeteer – a New Metaphor by RogerDearnaley 3mo ago
• Applied to Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI? by RogerDearnaley 3mo ago
• Applied to On the future of language models by RogerDearnaley 3mo ago
• Applied to How to Control an LLM's Behavior (why my P(DOOM) went down) by RogerDearnaley 4mo ago
• Applied to Introduction and current research agenda by quila 4mo ago
• Applied to Is Interpretability All We Need? by RogerDearnaley 4mo ago
• Applied to Impressions from base-GPT-4? by quila 5mo ago
• Applied to Revealing Intentionality In Language Models Through AdaVAE Guided Sampling by RiversHaveWings 5mo ago
• Applied to The utility of humans within a Super Artificial Intelligence realm. by Marc Monroy 6mo ago
• Applied to FAQ: What the heck is goal agnosticism? by porby 6mo ago
• Applied to A Mathematical Model for Simulators by lukemarks 6mo ago
• Applied to The Löbian Obstacle, And Why You Should Care by lukemarks 7mo ago
• Applied to Memetic Judo #3: The Intelligence of Stochastic Parrots v.2 by Max TK 7mo ago