LESSWRONGTags
LW

Reinforcement Learning

•

Applied to (Appetitive, Consummatory) ≈ (RL, reflex) by Steven Byrnes 9h ago

•

Applied to Language for Goal Misgeneralization: Some Formalisms from my MSc Thesis by Giulio 1d ago

•

Applied to Finding the estimate of the value of a state in RL agents by Clément Dumas 2mo ago

•

Applied to Speedrun ruiner research idea by lukehmiles 2mo ago

•

Applied to The theory of Proximal Policy Optimisation implementations by salman.mohammadi 2mo ago

•

Applied to Measuring Learned Optimization in Small Transformer Models by J Bostock 2mo ago

•

Applied to [Aspiration-based designs] 2. Formal framework, basic algorithm by Jobst Heitzig 3mo ago

•

Applied to [Aspiration-based designs] 1. Informal introduction by Jobst Heitzig 3mo ago

•

Applied to Skepticism About DeepMind's "Grandmaster-Level" Chess Without Search by Arjun Panickssery 4mo ago

•

Applied to Krueger Lab AI Safety Internship 2024 by Joey Bream 5mo ago

•

Applied to Interpreting the Learning of Deceit by RogerDearnaley 6mo ago

•

Applied to Refinement of Active Inference agency ontology by Roman Leventov 6mo ago

•

Applied to Utility ≠ Reward by Oliver Sourbut 6mo ago

•

Applied to Planning in LLMs: Insights from AlphaGo by jco 7mo ago

•

Applied to Reinforcement Learning using Layered Morphology (RLLM) by MiguelDev 7mo ago

•

Applied to AISC project: SatisfIA – AI that satisfies without overdoing it by Jobst Heitzig 7mo ago