Architecture-aware optimisation: train ImageNet and more without hyperparameters — LessWrong