I operate by Crocker's rules.
I try to not make people regret telling me things. So in particular:
- I expect it to be safe to ask me whether your post would give AI labs dangerous ideas.
- If you worry that I'll produce such posts, I'll try to keep your worry from making them more likely, even if I disagree with it. Not thinking in that direction will be easier if you don't spell it out in your initial contact.
(FDT(P,x))(x)
Should this be FDT(P,x)? As written, this looks to me like the second (x) introduces x into scope, and the first x is an out-of-scope usage.
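To spell out the two readings I can see (my rendering, not necessarily what was intended): either the whole expression is meant as a function of x, something like

λx. (FDT(P, x))(x)

or FDT(P, x) is already meant to denote the chosen action on input x, in which case the trailing (x) seems out of place and FDT(P, x) on its own would do.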
Let me try again:
Does the note say that I was predicted to choose the right box regardless of what notes I am shown, and therefore the left box contains a bomb? Then the predictor is malfunctioning and I should pick the right box.
Does the note say that I was predicted to choose the right box when told that the left box contains a bomb, and therefore the left box contains a bomb? Then I should pick the left box, to shape what I am predicted to do when given that note.
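For concreteness, here's a toy sketch of that policy; the exact note strings and the function shape are mine, not part of the original problem statement.

```python
def my_choice(note: str) -> str:
    """Pick a box as a function of the note shown, mirroring the two cases above."""
    if note == "predicted to take Right regardless of any note; Left contains a bomb":
        # A prediction that ignores the note can't be shaped by how I respond to the
        # note, so this note is evidence of a malfunctioning predictor: avoid the bomb.
        return "right"
    if note == "predicted to take Right when shown this note; Left contains a bomb":
        # Here my reaction to the note is exactly what was predicted, so taking Left
        # shapes what the predictor expects (and does) whenever it shows this note.
        return "left"
    raise NotImplementedError("notes other than the two cases above")
```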
You'll also need to update the content of the note and the predictor's decision process to take into account that the agent may see a note. In particular, the predictor needs to decide whether to show a note in the simulation, and may need to run multiple simulations.
Let's sharpen A6. Consider this stamp collector construction: It sends and receives internet data, it has a magically accurate model of reality, it calculates how many stamps would result from each sequence of outputs, and then it outputs the sequence that results in the most stamps.
By definition it knows everything about reality, including any facts about what is morally correct, and that stamps are not particularly morally important. It knows how to self-modify, and how many stamps any such self-modification will result in.
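A minimal sketch of the construction, just so there's something concrete to feed through the proof; the names are mine, and the magically accurate world model is of course the part you can't actually build.

```python
def stamp_collector(world_model, candidate_output_sequences):
    """Return the output sequence that the (assumed perfectly accurate) world model
    says yields the most stamps. Nothing moral appears in the selection criterion,
    even though facts about morality are represented somewhere inside the model."""
    return max(candidate_output_sequences, key=world_model)
```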
I'd like to hear how this construction fares as we feed it through your proof. I think it gums up the section "Rejecting nihilistic alternatives". I think that section assumes the conclusion: You expect it to choose its biases on the basis of what is moral, instead of on the basis of its current biases.
The analogous argument would be:
If I have no way to do something, then it's nonsensical to say that I should avoid doing that thing. For example, suppose you say that I should have avoided arriving at an appointment on time, and I reply that arriving on time was impossible because you only told me about it an hour ago and it's 1000 miles away; then it would be nonsensical for you to say that I should have avoided arriving on time anyway. This is equivalent to saying that if I should avoid doing something, then I can do it.
I think that makes as much sense as "Whatever ought to be done can actually be done". Do you have some argument that makes sense of one but not the other?
By analogous reasoning, if determinism is true, then whatever ought not to be done also actually is done.
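Spelling the parallel out, with Ought, OughtNot, Can and Does as shorthand (my notation; I'm reading the determinism step as "only what is actually done can be done"):

$$
\begin{aligned}
\text{Original:}\quad & \mathrm{Ought}(x) \rightarrow \mathrm{Can}(x) \\
\text{Analogue:}\quad & \mathrm{OughtNot}(x) \rightarrow \mathrm{Can}(x) \\
\text{Determinism:}\quad & \mathrm{Can}(x) \rightarrow \mathrm{Does}(x) \\
\text{Hence:}\quad & \mathrm{OughtNot}(x) \rightarrow \mathrm{Does}(x)
\end{aligned}
$$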
or an audio interface to a camera
I plugged your middle paragraph into the provided AI because that's its point. Here's the response:
Currently, no technical or governance scheme can reliably guarantee that all private entities are prevented from developing or running AGI outside official oversight. Even strong international agreements or hardware controls can be circumvented by determined actors, especially as required compute drops with research progress. Without ubiquitous surveillance or global control over compute, models, and researchers, a determined group could realistically “go rogue,” meaning any system that depends on absolute prevention is vulnerable to secret efforts that might reach AGI/ASI first, potentially unleashing unaligned or unsafe systems beyond collective control.
Sounds kinda sycophantic; e.g., you only need global control over one of the three.
The left-hand side of the equation has type action (Hintze page 4: "An agent’s decision procedure takes sense data and outputs an action."), but the right-hand side has type policy, right?
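For concreteness, a toy sketch of the type distinction I mean; the names and the stand-in choose_policy are mine, not Hintze's.

```python
from typing import Callable

SenseData = str
Action = str
Policy = Callable[[SenseData], Action]  # a policy maps sense data to actions

def choose_policy(p) -> Policy:
    """Stand-in for whatever the right-hand side computes: it yields a policy."""
    return lambda x: "left" if p else "right"  # placeholder actions

def decision_procedure(p, x: SenseData) -> Action:
    """Per Hintze p. 4 the decision procedure outputs an action, so the policy
    returned above still has to be applied to x before the two sides match in type."""
    return choose_policy(p)(x)
```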