What is a circuit? [in interpretability]
Probabilistic Logic <=> Oracles?
Yudhister Kumar's Shortform
Paphos
Published June 2, 2025. "The first thing you need to know about Cyprus is that everything is Hotel. School is Hotel. Restaurant is Hotel. Home is Hotel." [1] Leptos Estates owns everything on the island, as well as NUP, so naturally NUP is in a hotel. Because Leptos Estates is...
Rome
Published March 20, 2024. A post-apocalyptic fever dream. The oldest civilized metropolis. Where sons are pathetic in the eyes of their father, and both are pathetic in the eyes of their grandfathers—all while wearing blackened sunglasses and leather jackets. Grown, not made. Rome is, perhaps, the first place I recognized...
Geneva
Published on September 13, 2023. Geneva is evil. It's overpriced, loud, and dirty. Paying ten francs for a medicore street taco is no way to live life. God forbid you visit the city center during the day, and stay as far away from Geneva station as you can. I thought...
Toledo
Published September 12, 2023. > One recounts that Washington Irving, who was traveling in Spain at the time, suggested the name to his brother, a local resident; this explanation ignores the fact that Irving returned to the United States in 1832. Others award the honor to Two Stickney, son of...
What is a circuit? [in interpretability]
I'm aware of the understanding that "a circuit is a subgraph of a neural network that implements a specific computation." In practice (to my understanding) the way you identify "circuits" is by identifying components of the neural network that have high correlation with certain tasks, and doing some ablations to...
Chemical Turing Machines
Epistemic status: brief writeup of some interesting work I found in[1][2][3][4], among other places. Probably subtly wrong in places & if you have additions / comments to make, please do. Finite state automata (FSA) can be modeled with reactions of the form A+B→C+D. FSAs operate over regular languages, so our...