Academic Papers
• Applied to The theory of Proximal Policy Optimisation implementations by salman.mohammadi 8d ago
• Applied to Rawls's Veil of Ignorance Doesn't Make Any Sense by Arjun Panickssery 2mo ago
• Applied to Skepticism About DeepMind's "Grandmaster-Level" Chess Without Search by Arjun Panickssery 2mo ago
• Applied to How to Control an LLM's Behavior (why my P(DOOM) went down) by RogerDearnaley 3mo ago
• Applied to Striking Implications for Learning Theory, Interpretability — and Safety? by RogerDearnaley 3mo ago
• Applied to VLM-RM: Specifying Rewards with Natural Language by ChengCheng 6mo ago
• Applied to Paper digestion: "May We Have Your Attention Please? Human-Rights NGOs and the Problem of Global Communication" by Klara Helene Nielsen 9mo ago
• Applied to Papers, Please #1: Various Papers on Employment, Wages and Productivity by Kaj_Sotala 11mo ago
• Applied to A technical note on bilinear layers for interpretability by RobertM 1y ago
• Applied to Attributes of successful professors by electroswing 1y ago
• Applied to An Overview of Sparks of Artificial General Intelligence: Early experiments with GPT-4 by Annapurna 1y ago
• Applied to How to Read Papers Efficiently: Fast-then-Slow Three pass method by the gears to ascension 1y ago
• Applied to Citability of Lesswrong and the Alignment Forum by Leon Lang 1y ago
• Applied to My Reservations about Discovering Latent Knowledge (Burns, Ye, et al) by Robert_AIZI 1y ago
• Applied to Article Review: Discovering Latent Knowledge (Burns, Ye, et al) by Robert_AIZI 1y ago
• Applied to Poster Session on AI Safety by Neil Crawford 1y ago
• Applied to Characterizing Intrinsic Compositionality in Transformers with Tree Projections by Ulisse Mini 1y ago