Jessica Rumbelow
AI researcher
Posts
41
Scientific Discovery in the Age of Artificial Intelligence
14d
3
41
Why did ChatGPT say that? Prompt engineering and more, with PIZZA.
1y
2
103
Introducing Leap Labs, an AI interpretability startup
2y
12
91
SolidGoldMagikarp III: Glitch token archaeology
Ω
2y
Ω
35
113
SolidGoldMagikarp II: technical details and more recent findings
Ω
2y
Ω
45
678
SolidGoldMagikarp (plus, prompt generation)
Ω
2y
Ω
206
15
Guardian AI (Misaligned systems are all around us.)
3y
6
27
The Ground Truth Problem (Or, Why Evaluating Interpretability Methods Is Hard)
3y
2
27
Why I'm Working On Model Agnostic Interpretability
3y
9