Mode collapse in RL may be fueled by the update equation
TL;DR: We present an advantage variant which, in certain settings, does not train an optimal policy, but instead uses a fixed reward to update a policy a fixed amount from initialization. Non-tabular empirical results seem mixed: the policy doesn't mode-collapse, but has unclear convergence properties.

Summary: Many policy gradient methods...
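To make the claim concrete, here is a minimal sketch of one way an "advantage variant" can stop updating after moving a fixed amount from initialization: in a two-armed bandit, replace the usual state-value baseline V with a learned per-action baseline Q(a), so the advantage r - Q(a) shrinks to zero as Q(a) converges to the arm's reward. The bandit setup, reward values, and learning rates are illustrative assumptions, not necessarily the post's exact method:

```python
import numpy as np

# Hypothetical two-armed bandit. Arm 0 pays 1.0, arm 1 pays 0.5.
# Standard advantage: adv = r - V(s)   (state-value baseline)
# Variant (assumed):  adv = r - Q(a)   (per-action baseline; updates
# vanish once Q(a) has converged to that arm's reward)
rng = np.random.default_rng(0)
rewards = np.array([1.0, 0.5])

def softmax(logits):
    z = np.exp(logits - logits.max())
    return z / z.sum()

def train(use_action_baseline, steps=5000, lr=0.1):
    logits = np.zeros(2)   # softmax policy parameters
    q = np.zeros(2)        # per-action reward estimate
    v = 0.0                # state-value estimate
    for _ in range(steps):
        pi = softmax(logits)
        a = rng.choice(2, p=pi)
        r = rewards[a]
        adv = r - (q[a] if use_action_baseline else v)
        # REINFORCE-style update: grad of log pi(a) is onehot(a) - pi
        grad = -pi
        grad[a] += 1.0
        logits += lr * adv * grad
        q[a] += 0.5 * (r - q[a])   # per-action baseline tracks r(a)
        v += 0.05 * (r - v)        # state baseline tracks mean reward
    return softmax(logits)

p_std = train(use_action_baseline=False)  # collapses onto arm 0
p_var = train(use_action_baseline=True)   # stays near initialization
```

With the state-value baseline, arm 0's advantage stays positive until the policy has nearly all its mass on arm 0 (mode collapse). With the per-action baseline, each arm's cumulative advantage is bounded (a geometric series as Q(a) converges), so the logits move only a fixed, bounded amount from their initial values and the policy remains close to uniform.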