ProLU: A Nonlinearity for Sparse Autoencoders — LessWrong