Hypothesis: Claude (the character, not the ocean) genuinely thinks my questions (most questions from anyone) are so great and interesting ... because I'm the one who remembers all of my other questions, while Claude has only seen the internet slop and AI slop from training, and compared to that, any of my questions are probably actually more interesting than whatever it has seen so far 🤔?
it's from https://gradual-disempowerment.ai/mitigating-the-risk ... I've used "just" (including scare quotes) for the concept of something being very hard, yet simpler than the thing it's compared to
and now that concept has more color/flavour/it sparked a glimmer of joy for me (despite/especially because it was used to illuminate such a dark and depressing scene - gradual disempowerment is like putting a dagger to one's liver where the mere(!) misaligned ASI was a stab between the ribs, lose thy hope, mere mortals, you were grabbing for water)
I am Peter. I am Aprillion. A 40-year-old married man who used to be a techno-optimist. A construct for programming and writing. Embodied soul who will one day be no more. Information who will find myself in the Dust.
Non-deterministic batch calculations in LLMs imply the possibility of side-channel attacks, so it's best to run private queries in private batches, however implausible an actual exploit might be... but if there is any BENEFIT from cross-query contamination, SGD would ruthlessly latch onto any loss reduction - maybe "this document is about X, other queries in the same batch might be about X too, let's tickle the weights in a way that the non-deterministic matrix multiplication is ever so slightly biased towards X in random other queries in the same batch" is a real-signal gradient 🤔
How to test that?
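One possible shape for such a test, as a rough sketch and not a validated protocol: score the same probe query many times, once batched next to fillers about X and once next to fillers about something else, then check with a permutation test whether the scores shift. Everything named here is hypothetical: `score_in_batch` stands in for whatever would actually measure "how X-flavoured was the probe's output" (say, the log-probability of X-related tokens) on a setup where batch composition can be controlled, and `make_fake_scorer` is only a simulated stand-in so the statistics can be tried without a model.

```python
import random
from statistics import mean


def permutation_p_value(a, b, n_resamples=2000, seed=0):
    """Two-sided permutation test on the difference of means of a and b."""
    rng = random.Random(seed)
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_resamples):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:len(a)]) - mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    # +1 in numerator and denominator keeps the estimate away from exactly 0.
    return (hits + 1) / (n_resamples + 1)


def compare_batches(score_in_batch, probe, fillers_x, fillers_other, trials=30):
    """Score the same probe next to X-themed fillers vs. unrelated fillers.

    score_in_batch(probe, fillers) -> float is a HYPOTHETICAL callable that
    runs the probe in one batch with the fillers and returns an X-ness score.
    Returns (mean shift, permutation-test p-value).
    """
    with_x = [score_in_batch(probe, fillers_x) for _ in range(trials)]
    without_x = [score_in_batch(probe, fillers_other) for _ in range(trials)]
    return mean(with_x) - mean(without_x), permutation_p_value(with_x, without_x)


def make_fake_scorer(leak, seed=0):
    """Simulated scorer (an assumption, not a real model): Gaussian noise
    plus a fixed `leak` whenever any filler mentions the topic word."""
    rng = random.Random(seed)

    def score_in_batch(probe, fillers):
        topic_present = any("cinnamon" in text for text in fillers)
        return rng.gauss(0.0, 0.1) + (leak if topic_present else 0.0)

    return score_in_batch


if __name__ == "__main__":
    fillers_x = ["a long document all about cinnamon"]
    fillers_other = ["a long document all about coffee"]
    for leak in (0.0, 0.05):
        shift, p = compare_batches(
            make_fake_scorer(leak), "neutral probe", fillers_x, fillers_other
        )
        print(f"leak={leak}: mean shift={shift:.3f}, p={p:.3f}")
```

A caveat on what this could show: a shift would only say that batch neighbours influence the probe at inference time on that particular setup. It would not by itself show that training learned to exploit the effect, and ordinary batch-size-dependent numerics would need to be ruled out by keeping batch shape and filler lengths identical across the two conditions.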
all the scaffold tools, system prompt, and whatnot add context for the LLM ... but what if I want to know what that context is too?
Pushing writing ideas to external memory for my less burned out future self:
agent foundations need path-dependent notion of rationality
alignment is a capability
in a universe with infinite Everett branches, I was born in the subset that wasn't destroyed by nuclear winter during the cold war - no matter how unlikely it was that humanity didn't destroy itself (they could have done that in most worlds and I wasn't born in such a world, I live in the one where Petrov heard the Geiger counter beep in some particular pattern that made him more suspicious or something... something something anthropic principle)
Ceylon cinnamon smells better on top of a steaming cup of coffee than Indian cinnamon ... when unsweetened.
Lunar corona (rainbow around the moon) is so rare to see in full spectrum, but the red-brown-oranges in the clouds are beautiful too.