Just a research idea, not a whole damn agenda
Sparse Autoencoders (SAEs) are an unsupervised technique for decomposing the activations of a neural network into a sum of interpretable components (often referred to as features). Sparse Autoencoders may be useful interpretability and related alignment agendas. .. (read more)
Pause AI is an AI advocacy organizing with the aim of pausing AGI research due to concerns about future risks from AI. From their website:.. (read more)
A cost-benefit analysis is a method for quantitatively assigning values to different actions... (read more)
Axioms (together with definitions) forms the basis of mathematical theorems. Every mathematical theorem is only proven inside its axiom system... (read more)
User | Post Title | Tag | Pow | When | Vote |
Rational convergence towards certain norms, habits, social contracts, even though agents might have no way of communicating or affecting each other, nor even any direct evidence that the other exists