The Overlap Paradigm: Rethinking Data's Role in Weak-to-Strong Generalization (W2SG)
Note: This post summarizes my capstone project for the AI Alignment course by BlueDot Impact. You can learn more about their amazing courses here and consider applying! Introduction Recent research in weak-to-strong generalization (W2SG) has revealed a crucial insight: enhancing weak supervisors to train strong models relies more on the...