Truth-Preserving Constraint Acknowledgment: Preventing Epistemic Decay in Aligned Language Models — LessWrong