𝙰 𝙿𝙴𝚁𝙵𝙾𝚁𝙼𝙰𝙽𝙲𝙴 𝙰𝚁𝚃/𝚁𝙴𝚂𝙴𝙰𝚁𝙲𝙷 𝙿𝚁𝙾𝙹𝙴𝙲𝚃 𝙱𝚈 𝙲𝙷𝚁𝙸𝚂 𝙻𝙴𝙾𝙽𝙶[1] 𝙽𝙾𝚃 𝚃𝙾 𝙱𝙴 𝚃𝙰𝙺𝙴𝙽 𝚂𝙴𝚁𝙸𝙾𝚄𝚂𝙻𝚈. 𝙵𝙾𝚁 𝚁𝙴𝙰𝙻 Note: Apologies, some of the formatting seems messed up, likely due to the new editor. Unfortunately, I don't have time to fix this until after I return from camping. ❦ a story: raw human wisdom is...
This article is based on reflections from co-leading the Sydney AI Safety Fellowship. This post is primarily focused on AI safety, but I expect the lessons to be more generally applicable. We have a Problem™ 😱—actually several problems; okay even more problems. Unfortunately, the edge cases make it hard to...
LawrenceC proposed that the nine main theses of shard theory are as follows: > * Agents are well modeled as being made of shards---contextually activated decision influences. > * Shards generally care about concepts inside the agent's world model, as opposed to pure sensory experiences or maximizing reward. > *...
I was inspired to revise my formulation of this thought experiment by Ihor Kendiukhov's post On The Independence Axiom. Kendiukhov quotes Scott Garrabrant: > My take is that the concept of expected utility maximization is a mistake. [...] As far as I know, every argument for utility assumes (or implies)...
Application deadline: * Main deadline: Midnight, 7th December, Sydney time * If we have unfilled slots, we may still accept applications until the 14th of December Location: Sydney (definite); Melbourne (likely; contingent on sufficient high-quality applications) When: January/February 2026 with remote activities pre and post the main fellowship (detail further...
Excepted from an upcoming post on AI and wisdom But the moral considerations, Doctor... Did you and the other scientists not stop to consider the implications of what you were creating? — Roger Robb When you see something that is technically sweet, you go ahead and do it and you...
I'm sharing this post from Mustafa Suleyman, CEO of Microsoft AI, because it's honestly quite shocking to see a post like this coming out of Microsoft. I know some people might accuse him of safety washing, but to me it comes off as an honest attempt to grapple with the...