Exploring SAE features in LLMs with definition trees and token lists — LessWrong