Meta learning to gradient hack — LessWrong