Soft Prompts for Evaluation: Measuring Conditional Distance of Capabilities — LessWrong