Towards White Box Deep Learning — LessWrong