What should go in a model spec?
Suppose an AI company is considering whether to include some particular quality X – a rule, virtue, heuristic, default, attitude, goal, or style – in a model spec. Perhaps they are considering whether their LLM should have prosocial drives. Perhaps they’re wondering if the LLM should whistleblow to help prevent...
Jun 48