Geometric Safety Features v1.5.0: A Pipeline for Topological Analysis of AI Model Uncertainty
Hi people of the LessWrong community, I would like to present what I am working on. I found that unstable AI outputs don't come from high-variance regions, but from geometrically constrained 'narrow passages' in embedding space. The pipeline is built in Python, and it uses geometric features to flag model states...