The surprising parameter efficiency of vision models — LessWrong