No convincing evidence for gradient descent in activation space — LessWrong