Constructing Neural Network Parameters with Downstream Trainability — LessWrong