Factor(U,T): Controlling Untrusted AI by Monitoring their Plans
Authors: Edward Lue Chee Lip, Anthony Channg, Diana Kim, Aaron Sandoval, Kevin Zhu — Algoverse AI Research Paper: arXiv:2512.14745 | Code: GitHub | Accepted at AAAI 2026 TrustAgent Workshop AI Control protocols are designed to be safe even when the AI being deployed might be scheming. The standard setup: a...
Apr 131