A Little Depth Goes a Long Way: the Expressive Power of Log-Depth Transformers — LessWrong