Behavioral-Driven Alignment Erosion: Exploring Safety Boundary Attenuation and Inference Path Manipulation in Non-Technical Multi-Turn LLM Dialogues — LessWrong