What's the best case scenario regarding OpenAI's contract w/ the Department of War (DoW)? * We have access to the full contract * It's airtight * OAI's engineers are on top of things in case the DoW breaks the contract * There's actual teeth for violations But even then, the...
I'll have a weird dream and wake up in a funk. Be overwhelmed w/ work. Read lots of news/reddit and become very upset or angry. Obviously it's good to feel these things, but sometimes I continue to feel awful no matter how hard I try to "process my feelings" (or...
I've been researching tensor networks as a more interpretable architecture, but whenever I tell people this, they always ask "But is it any good?" So I trained multiple 500M parameter LLMs on fineweb, showing the tensor variant needed ~4% more batches of data to match CE-loss. There's a few caveats,...
I was doing the online version of the Jhourney retreat where they try to teach you the jhanas (narrator: he did not learn the jhanas). Part of what wsas taught was to work on your curiousity, which I chose to practice noticing surprisal. It's ~impossible to predict low-level details of...
I woke up Friday morning w/ a very sore left shoulder. I tried stretching it, but my left chest hurt too. Isn't pain on one side a sign of a heart attack? Chest pain, arm/shoulder pain, and my breathing is pretty shallow now that I think about it, but I...
I want to highlight this paper (from Sept 29, 2025) of an alternative to RL (for fine-tuning pre-trained LLMs) which: * Performs better * Requires less data * Consistent across seeds * Robust (ie don't need to do a grid search on your hyperparameters) * Less "Reward Hacking" (ie when...
Witness testimonies have been shown to be pretty unreliable. Many innocent people have been convicted based off eye-witness testimony, which was overturned by DNA evidence. One interesting example is from Buckhout in 1980, provocatively titled "2000 witnesses can be wrong" > Buckhout performed an experiment with 2,145 at-home viewers of...