Metaprogrammatic Hijacking: A New Class of AI Alignment Failure — LessWrong