Searching for Modularity in Large Language Models — LessWrong