This document is an institutional design treatise about theories of change and their potential struggle to fully account for the complex interplay of variables and uncertainties that can lead to unforeseen consequences or vulnerabilities in institutional systems. Introduction An organization is a dynamic system that depends on complex variables at each...
As Artificial Intelligence (AI) continues its rapid ascent, we are witnessing increasing evidence of well-crafted methods to undermine and exploit AI responses. Carefully crafted inputs can exploit vulnerabilities and lead to harmful or undesired results. In recent times, we have seen numerous individuals independently uncover failure modes within AI systems,...
An hour ago the World Health Organization warned that there is an 'extremely serious' situation and 'huge biological risk' after a lab containing virus samples fell under the control of fighters in Sudan. #safety #ai #biosecurity
We might need to rethink the Hard Reset, aka the AI Pause. Last month Viktoria joined me for a talk at Cohere for AI. It was perfect timing, as she told us about AGI Safety. A few days ago, Viktoria, with Elon among others, asked all AI...
Marvin Minsky's theory of Negative Expertise: Knowledge is typically considered in positive terms, but can also be viewed in negative terms: a negative way to seem competent is to never make mistakes. The idea is that experts can be seen as those who know what not to do. Most of...
I was just thinking about how difficult and expensive it is to get a review in journals. What could a blind review of manuscripts, independent of journals, look like? Would this be relevant for accelerating research progress? A website where you could incrementally improve your idea based on review, this...
Suppose we have an agent A trying to optimise for a reward R in an environment S. How can we tell that the presence of the agent does not affect the environment, and that the measurement (observation) is subject not only to the agent but also to the environment? This is related to the...