wnx
AI safety, risks from global systems, philosophy, engineering physics, complex adaptive systems, values for the future, future of intelligence.
Scale, cascades and cumulative impacts from interconnected systems. AI systems' impact on human cognitive and moral autonomy. Evaluations, measurement, robustness, standards and monitoring of AI systems.
Interest in...
wnx has not written any posts yet.

Alignment approaches at different abstraction levels (e.g., macro-level interpretability, scaffolding/module-level AI system safety, systems-level theoretic process analysis for safety) are something I have been hoping to see more of. I am thrilled by this meta-level red-teaming work and excited to see the announcement of the new team.