Mikita Balesni

I think one practical difference is whether filtering pre-training data to exclude cases of scheming is a useful intervention.