This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
Tags
LW
$
Login
Redwood Research
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Redwood Research
Random Tag
Contributors
Posts tagged
Redwood Research
Most Relevant
8
205
Causal Scrubbing: a method for rigorously testing interpretability hypotheses [Redwood Research]
Ω
LawrenceC
,
Adrià Garriga-alonso
,
Nicholas Goldowsky-Dill
,
ryan_greenblatt
,
jenny
,
Ansh Radhakrishnan
,
Buck
,
Nate Thomas
2y
Ω
35
5
135
Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley
Ω
maxnadeau
,
Xander Davies
,
Buck
,
Nate Thomas
2y
Ω
14
4
143
Takeaways from our robust injury classifier project [Redwood Research]
Ω
dmz
2y
Ω
12
4
86
Benchmarks for Detecting Measurement Tampering [Redwood Research]
Ω
ryan_greenblatt
,
Fabien Roger
1y
Ω
19
3
145
Redwood Research’s current project
Ω
Buck
3y
Ω
29
3
48
Redwood's Technique-Focused Epistemic Strategy
Ω
adamShimi
3y
Ω
1
3
16
AXRP Episode 17 - Training for Very High Reliability with Daniel Ziegler
Ω
DanielFilan
2y
Ω
0
2
142
High-stakes alignment via adversarial training [Redwood Research report]
Ω
dmz
,
LawrenceC
,
Nate Thomas
3y
Ω
29
2
114
Why I'm excited about Redwood Research's current project
Ω
paulfchristiano
3y
Ω
6
2
64
Some common confusion about induction heads
Alexandre Variengien
2y
4
2
13
[Linkpost] Critiques of Redwood Research
Akash
2y
2
1
101
Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small
Ω
KevinRoWang
,
Alexandre Variengien
,
Arthur Conmy
,
Buck
,
jsteinhardt
2y
Ω
9
1
87
Practical Pitfalls of Causal Scrubbing
Ω
Jérémy Scheurer
,
Phil3
,
tony
,
jacquesthibs
,
David Lindner
2y
Ω
17
1
56
We're Redwood Research, we do applied alignment research, AMA
Ω
Nate Thomas
3y
Ω
2
1
50
Help out Redwood Research’s interpretability team by finding heuristics implemented by GPT-2 small
Haoxing Du
,
Buck
2y
11