I'm interested (both for this and for part 2) in particular on, basically, 'is this anything?' beyond its current form. I did the 'just ship it' thing rather than keep sitting on it, and I want to know if there is something more important here, if some of the subsections could become (with more work and bridging) a sequence style thing that isn't responding to a book, if some sections stand on their own enough that they should be done that way, etc.
In many ways consider this a first draft long version, and now the question is whether and how to write some short version that does more.
This is fun, though felt somewhat like shooting fish in a barrel. IIRC, dismantling some of Socrates' claims was an exercise put to all students in an intro-level philosophy course I took in college, and that course left me with the impression that most modern philosophers view Socrates as primarily of historical interest. That said, apparently 49 of the 1,785 survey respondents in the PhilPapers survey marked Socrates when asked “For which nonliving philosophers X would you describe yourself or your work as X-ian, or the equivalent?” , which is more than I would have guessed, and was the 15th most frequently name checked in reponse to this question out of a rather long list.
These are all important, in their own way, call it a treasure hunt and collect them all…
“Know thyself.” – The Oracle
“Know thine enemy and know thyself; in a hundred battles, you will not be defeated.” – Sun Tzu
“You don’t know me. You don’t know me at all.” – Lisa Loeb, ‘You Don’t Know Me’
“Just one word. Are you listening? Plastics.” – The Graduate
“And Alexander wept, seeing as he had no more worlds to conquer.” – Someone Guessing
“I didn’t know I had permission to murder and to maim.’ – Leonard Cohen
“But that’s not important right now.” – Leslie Nielsen
“A foolish consistency is the hobgoblin of little minds, adored by little statesmen and philosophers and divines.” – Ralph Waldo Emerson
“When the facts change, I change my mind – what do you do, sir?” – John Maynard Keynes
“Now we’re talking price.” – Winston Churchill
“Think for yourself, schmuck.” – Hagbard Celine, Illuminatus!
“Have you forgotten doublethink?” – George Orwell, 1984
“You are trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and unfortunately all of your mistakes have failed to cancel out.” – Eliezer Yudkowsky
“If you want it done right, you gotta do it yourself.’ – Spider Murphy (+Henry Bullinger in 1541)
“Prove me wrong, kids. Prove me wrong.” – Seymore Skinner
“The hardest thing to do in this world is to live in it.” – Buffy Summers
“If nothing we do matters, all that matters is what we do.” – Kate Lockley
“What, like it’s hard?” – Elle Woods
“This is your life, and it’s ending one minute at a time.” – Tyler Durden
“When the dust covers the sun, and all you hope for is undone, will you turn and say: Nothing ventured, nothing gained, it was all for love so love is to blame?” – Dido, Love to Blame
“Shut up and drive.” – Rhianna
“The art must have a purpose other than itself, or it collapses into infinite recursion.” – Eliezer Yudkowsky
If you go looking for something specific, your chances of finding it are very bad. Because of all the things in the universe, you’re only looking for one of them. If you go looking for anything at all, your chances of finding it are very good. Because of all the things in the world, you’re bound to find some of them. – Darryl Zero, Zero Effect
What are you talking about? There aren’t any good guys. You realize that, don’t you? I mean you realize there aren’t aren’t evil guys and innocent guys. It’s just, it’s just… it’s just a bunch of guys. – Arlo, Zero Effect
“There may not be a million to one chance. But I’m sure there’s at least a billion to one chance.” – Charlie Brown
“Play ball!” – Lucy (responding to Charlie Brown)
Hands chip the flint, light the fire, skin the kill
Feet move the tribe, track the herd, with a will
Mankind struggles in the cellar of history
Time to settle down, time to grow, time to breed
Plow tills the soil, plants the seed, pray for rain
Scythe reaps the wheat, to the mill, to grind the grain
Towns and cities spread to empire overnight
Hands keep building as we chant the ancient rite
Coal heats the steam, push the piston, turns the wheel
Cogs spin the wool, drives the horses made of steel
Lightning harnessed does our will and lights the dark
Keep rising higher, set our goal, hit the mark
Crawl out of the mud
Ongoing but slow
For the path that is easy
Ain’t the one that lets us grow
Light, push the sails, read the data, cities glow
Hands type the keys, click the mouse, out we go
Our voices carry round the world and into space
Send us out to colonize another place
Hands make the tools, build the fire, plant the grain
Feet track the herd, build a world, begin again
– Andrew Eigel, Uplift
The road to wisdom and simple to express.
You err, and err, and err again.
But less, and less, and less. – Piet Hein
There is a theory which states that if ever anyone discovers exactly what the Universe is for and why it is here, it will instantly disappear and be replaced by something even more bizarre and inexplicable.
There is another theory which states that this has already happened. – Douglas Adams
Editor’s Note
This was a tough one to write. There was something to disagree with on every page. This ended up being crazy long, but even so I made quite a lot of cuts, including in exactly what parts to quote. This could have easily ended up being longer than the original book, if I had let it.
The original book returns to many subjects several times. I tried to organize my responses to group similar statements together while preserving dependencies, but this is tough. I try to not duplicate my objections more than necessary, but in some places this is one of those ‘I did not have time to write a shorter response’ situations.
I am not a formal philosopher. I am presumably going to say a lot of stupid things, including things with known strong refutations, or that most philosophers otherwise think are dumb. I am not entirely comfortable with this, but I strongly believe that any true philosopher, and especially Agnes, would tell me not to let that stop me.
The point is not to always be right, the point is to have the argument, to inquire exactly in the spirit of the enterprise. And if it turns out that the day I wrote that part I was being stupider than usual? Then sorry about that, my bad.
Also in the spirit of the original book, in many ways large portions of this were written as a response to one particular person, rather than primarily for a general audience.
I am not making a strong bid that you, a different reader, need to read all or any of this.
I offer it in case you find it relevant to your interests. Some people should read this.
But let me be clear: There is a large chance that THIS IS NOT YOU. In which case, that is totally fine, and you can and should stop here and take these two days off.
Later, for length, I sometimes start saying ‘I disagree that [X]’ without quoting where Agnes says [X]. I hope I make things clear via context.
My hope is, by using this as a springboard, I can illustrate my philosophy of thought.
Besides, who doesn’t love a good argument?
Table of Contents
A Difference Of Opinion
I hate Socrates so, so much. For so, so many reasons. What a schmuck.
As much as I hate Socrates, Agnes Callard loves Socrates more. Probably.
Oh my does Agnes Callard love Socrates. She wanted to be Socrates. She wants to do Socratic things all day. She married someone who goes by Aristotle. She wrote a book telling us Socrates is the key and the one true path to knowledge and even to thinking, and without his insights and method life is not worth living.
I have never read a book in which I have disagreed as strongly, with as many claims and arguments, as I did here.
Whereas when I actually had the opportunity for a few day to spend mornings at Lighthaven chatting with Agnes Callard in person, Doing the Thing? That was wonderful. She was wonderful. I was thinking, it is such a shame one gets to do this so rarely, or so well. Which also highlighted, in many ways, how that experience was different from what the book actually lays out.
What we were doing felt like The Way, whereas what this book lays out? In so many aspects, Not The Way.
When I reviewed Nate Silver’s On the Edge, it was very much a story of ‘yes, and.’ I took what he’d written, and there were disagreements, but mostly I built upon it.
This is going to be a ‘no, you fool.’ A huge percentage of the quotes are, to my view, Agnes Callard being Wrong in Her Book, and it’s time to bring it.
Why bring it?
Partly because that’s exactly what she would want me to do.
Partly because it’s fun.
Mostly it’s because we are in the midst of teaching the sand how to think, and rapidly nearing humanity’s final Philosophy exam and its how-to-think exam. We cannot leave this one to the professionals. I’ve seen their work.
In the past, I’ve avoided doing explicit Philosophy on the Internet, because I assumed I would end up looking stupid. I’d make elementary ‘mistakes,’ I’d walk into traps, I’d be picking an argument with people who pick arguments for a living and doing so on their own turf and so on.
I’ve only even read The Symposium, back in college, after which I kind of tried to cancel Plato before cancelation was a thing (we’ll get to that later but I wasn’t wrong), and bits of The Republic, and I remember very little of any of it.
I’d be the fool who spoke and removed all doubt.
Yeah, screw that. I’m so over it.
Basically: Get in, loser. None of that matters. We’re doing philosophy.
Kind of all of it. And we’re going to do it by telling Agnes Callard that she’s wrong.
[As with all book reviews, location numbers refer to the Kindle version, and block quotes are from the book unless otherwise specified. Quotes from dialogues are taken from the book and the original source marked.]
An Overview
Open Socrates is a book that implores us to engage in two-person philosophical dialogue, where one person asserts things, and the other questions and seeks to disprove them. This, says the book, is both The Good and the royal road to wisdom.
You Don’t Know Me
Agnes has some bold claims about your life right out of the gate.
You usually act based on cached answers and justifications.
Which is what you absolutely should be doing, in most individual moments.
You could uncache your answers at any time.
People rarely do so. I don’t think most people lack the time. We don’t want to do that.
She asks why you haven’t done that: ‘What are you, chicken?’ What are you afraid of?
I think most people are rightfully afraid it will make their lives worse. Or that it won’t make their lives better, and they they won’t enjoy doing it, that it’s a waste of time.
And indeed, she confirms that very answer right away, in her own chosen example.
Untimely Questions
There’s no perfect time to properly define and address the concept of Untimely Questions. I’m pulling this forward a bit.
It is more than that, or rather something more precise.
The problem with an Untimely Question is not merely that you need to choose a functional answer to operate under before you can ask the question yourself in some more ‘think about it yourself’ sense.
The true problem comes when the act of investigating the question further is incompatible with continuing to rely on the previous answer, and you actively need to rely on that answer until you have a new one.
The answer is load bearing to wellbeing or proper functioning. You can’t remove it without replacement.
It’s not about not having the time to ask. It’s about the uncertainty itself doing harm.
That happens because of various ways the human brain handles uncertainty, and also the ability of others to detect and respond to that uncertainty.
That doesn’t mean that, if you actually were asked the question, you would think you have the answer because you are using the answer. It would be more like, ‘from the outside I can see I might not have the answer here, but I have to go on as if I have the answer.’ Brains are weird, and not as unified as Socrates often tries to pretend.
Suppose you previously thought [X]. Now you suspect, perhaps [~X].
In many cases, you continue to ponder whether [X] is true, while continuing to otherwise act, or mostly act, as if [X] is true. It is often correct to do this, for questions that would otherwise be untimely.
A central example is a decision to quit a job. You might quietly pursue other options. but until you know for sure you are leaving you want to do most of the things you’d be doing anyway, maintain the same attitude, and often conceal that you might leave.
Another example would be a startup. You need to act as if you have confidence in your success, while also facing the reality of the situation and deciding whether to pack it in or pivot or panic, and planning to avoid true ruin if you fail.
Thus I think this description here is subtly wrong:
It’s fine for a question to come after being answered – that doesn’t have to make them untimely. It’s only an issue because having confidence in the answer is importantly load bearing, and you can’t properly ask the question without disrupting the load bearing until you have your new answer. That’s what makes them ‘hard to pose.’
Most Untimely Questions are not, at any given time, things it makes sense to be asking. It is usually better to have cached knowledge that relies on other sources.
Tolstoy’s issue was that he couldn’t do that. His answers had become non-functional. He became temporarily unable to live an unexamined life.
Which was a shame for him, because the unexamined life is worth living, too.
The Unexamined Life is Worth Living
There are advantages to examining and asking questions, especially when you are in need of the answers.
But what kind of pompous ass would claim life cannot otherwise be worth living?
Oh, right. That pompous ass.
Tolstoy examined his life, once he had exhausted all alternatives and had no answers.
It… didn’t go great.
The whole thing was weird, Agnes points out, because Tolstoy had won at life.
The explanation is: Tolstoy got tired of too much winning.
Or, more precisely: He got to the ‘You Win’ screen at the end of the game.
Or, more classically:
And Alexander wept, seeing as he had no more worlds to conquer.
Not that I’ve read Tolstoy’s Confessions, but it sounds like that was Tolstoy’s actual problem: He no longer had anything to strive for.
That’s not an indictment of striving, only of his definition of success.
I kind of wrote an entire book about not getting trapped obsessing over the wrong kind of ‘success,’ including that fully ‘winning’ still sucked. I certainly sympathize.
Alternatively, I mean, the dude was depressed because he had a chemical imbalance? It happens. He’s a Russian novelist. And he no longer had sufficient distractions.
I notice the contrast with the way Agnes portrays it:
The point of all of that is, essentially, to get the rest of it. It’s the self-sustaining circle of life and competition. So the very fact that he ‘has it all,’ and we’ve essentially ruled out ways to meaningfully scale it further, and he no longer had to fight to maintain it – there was no marginal payoff left – is exactly the reason it no longer seemed to have a point. The art no longer had an end other than itself. What good would more wealth or more fame do him? What more could he do for his family, at that point?
The obvious things for Tolstoy to do next are some combination of:
I also notice you can ask questions like ‘why educate my children?’ while continuing to (in this example) educate your children as before. You can say:
The Quest For The Unexamined Life
Instead, Tolstoy was paralyzed and made himself miserable looking for a way out:
Let ‘Tolstoy should write novels, manage his estate and attend to his family’ be [X].
Tolstoy isn’t questioning [X]. He still knows [X] it true.
What he doesn’t know is why [X] remains true for him. He can no longer successfully cache the belief that [X] in a load bearing, functional way. He realizes that believing [~X] would go really badly, and that everyone around him asserts [X].
One could say, hence I believe [X]. Indeed, that is why most of us believe most [X]s.
Alas, in this case he finds this insufficiently persuasive to believe [X] in a load bearing way. He realizes this is a problem. He sets out to fix it, whether by figuring out why or otherwise.
Before, Tolstoy had the benefit of civilization’s previous work figuring out how to live in the world and care about it and do the dumb things you gotta do. The question of why, of ‘what’s my motivation here?’ was what Agnes calls an Untimely Question, because his actions depended on having an answer.
But yes! That’s the point. You really, actually do need to have a functional, practical answer to all in-context Untimely Questions in order to live life, do the thing, score the victory points. That’s what makes them Untimely. Not unanswerable, but untimely. That’s why they’re not important now.
Agnes says that Tolstoy discovered The Examined Life is not worth living. That is indeed exactly the opposite of the Socratic thesis. But I don’t think that’s right.
Tolstoy discovered, without realizing it explicitly, that the life without purpose, without load bearing answers, without The Great Work, is not worth living.
So he sought, quite reasonably, a new Great Work, and found one.
A philosopher would say, and Agnes does say, halt! That answer is invalid, for it is not examined. It is arbitrary. It was lying around and you picked it up. Inquire!
And I would say, yes it was lying around. That’s a feature not a bug. It helps the answer stick and be compatible with those around you. How well does that answer work, in practice? It might well work great.
The point of examining it is if you suspect it won’t work or isn’t working, or it isn’t accomplishing what is necessary, or if you think you might find one that works better or that accomplishes something more valuable, or you could improve it, or if you see other value in inquiring.
Indeed one could say that the whole reason we make these questions so hard to ask is that we have realized how badly it typically goes when people ask them.
It’s not that Tolstoy couldn’t ask the questions. He could have. He didn’t want to.
Tolstoy identified a problem: His answers stopped working. So he set out to solve that problem. He found a solution. What he didn’t do was ‘ask the questions’ in the way Agnes would have him ask, instead asking different questions, and finding answers.
And that’s because…
Not Everyone Wants To or Should Philosophize All Day
Socrates and Agnes would say nothing is better than doing philosophy all day.
For some people, that is true. That is their Great Work. That is what they most enjoy.
I mean, in that case, good for Socrates. But don’t torture Tolstoy.
Tolstoy clearly does not see philosophy sparking joy in his heart. He hates it. He wants nothing to do with it. He is the unfortunate man who was happy to be asleep, forced to awaken. Now he seeks to return to his rest.
Socrates is totally fine if the art lacks an end other than itself, and collapses into infinite recursion. To him this act of seeking The Good itself via The Good is The Good.
Does he examine that fact? Kind of yes, but I suspect also kind of no. I’m sure Agnes could quote a lot of passages. I have a hunch I could make a reasonable case why they don’t count and he’s pulling a fast one.
For others, like myself, philosophy can more fun than most things people think are fun when it’s done well. Otherwise we wouldn’t be having this conversation. But the art needs to have an end other than itself.
We can ask questions such as ‘what is virtue?’ in order to have a cool discussion. But I don’t buy that one should ask ‘what is virtue?’ because seeking knowledge like this is inherently The Good – or if one does do that, one has transformed their ultimate motivation into what Agnes calls an Untimely Question, exactly the way she accuses everyone else of doing.
Rather, we mostly ask ‘what is virtue?’ because we need to figure out how to actually have virtue, or help others have virtue, or make the AI have virtue. Like so many before us, the goal of the philosophy we do is to take the question we are working on out of philosophy and into some other field.
I’m not mad at Socrates for spending his time that way (although see implementation details). I am, however, mad at him for claiming that this should be such a happiness for everyone, and that nothing else could be such a happiness, and indeed everyone not doing this was doing bad and should feel bad. Such as Tolstoy.
The Seinfeld Fallacy
The Seinfeld Fallacy is one I have lived and know well: The inability to set aside the examination of life in order to actually live and enjoy it, or accomplish.
I would say that taking life fifteen minutes at a time, or any other amount at a time (why fifteen minutes, exactly?) is mostly the right way to live a given fifteen minutes.
The Seinfeldian version, or the one I suffered from, is the micro fallacy. This is where you are constantly analyzing and criticizing and evaluating everything in micro. Always deliberate practice, all the time, in everything.
There is great joy in that, for the right person, and also great long term benefits.
But you also need to often be able to step outside of that, and either enjoy or accomplish the thing. To ‘get out of your own head’ and ‘stop overthinking it.’
I think a lot of my ‘secret sauce’ is the ability to be constantly optimizing the micro, without letting it overly interfere with my ability to actually do things or enjoy them.
Then there’s the macro version of the same fallacy. I’ve known many people who suffer from that, too. Where you spend all your time and energy thinking about abstractions and what to do and why, in ways that don’t ultimately lead to action. Where your abstractions aren’t agentic, and you can’t set them aside. Either they’re too distracting, or they leave you unmotivated.
Socratic life strategy or moral theory, as described in Open Socrates, is indeed the opposite of the Tolstoyan strategy. It never sets out and faces the moment and Does the Thing. It suffers from the macro fallacy, and potentially also the micro fallacy.
Socrates was the Lying GOAT of Hypocritical False Humility
One could speculate that this was a big motivation for Socrates. If you are ugly and poor, but you can convince everyone around you that the physical world doesn’t matter and all everyone should ever do is philosophy and that is the true version of everything including sex and love, well, you can see the advantages.
Indeed, you can see throughout the book that this strategy worked well for him, even if you think it wasn’t intentional.
Socrates was very obviously lying.
Don’t tell me that this ugly, poor person who supposedly managed to consistently outwit and convince so many of the leading intellectual men of Athens, whose words were so dangerous he had to be put to death, and so on, lacked intellectual gifts – or was under any delusions that he lacked them. Don’t tell me he thought he lacked facility with speechmaking, when we’ve seen so many of his historically convincing speeches.
Don’t tell me he had a bad memory either. When and what does he actually forget, when it counts? The closest thing the LLMs could find was Theaetetus 167e-168c? o3-mini-high flat out just said ‘no, it’s almost always a tactic.’
Then of course we get to the big one, most of you already know this story. It is perhaps the thing he is most known for.
I’m flat out calling Socrates a liar here, in his most famous claim, three times over.
In short, three lies here from this lying liar:
It would of course be fine to tell this story as a narrative device, but it seems clear from Apology that he was representing it as real.
If your defense in court was to transparently lie about what the Oracle told you in order to be a pompous ass and act superior while claiming you’re acting humble, I’d be pretty tempted to vote to convict on that alone.
Hearing Voices
Let’s skip ahead a bit to the ‘final step’ of where all ideas must come from.
Russell would be proud.
I am a true philosopher, thus free. You hear a voice. He is a secret slave to two masters.
One could go on, and say one is the slave of limitless masters. There is no ‘one voice’ of pleasure and pain, or what the body wants. You’ve got hunger, and thirst, and heat, and shelter, and sleep, and sex, and watch out for that tree, and so on and so forth, in all variations, even if we only consider direct physical effects.
There is not ‘one voice’ of the community, indeed each person has one or many voices, making conflicting demands of you, and you may sense some collective voice as well.
Then there are clearly other voices, even if one is not an intellectual. People have all sorts of other needs and desires that are not directly either of these things, and so on.
Or those non-intellectuals can take on some almost arbitrary other agenda, whether or not they themselves have thought it through or would endorse on reflection. It turns out such people actually have things they value and deeply care about, and goals they want to achieve, and often they’re not that closely tied to either ‘voice.’
It’s so weird to say that one is a ‘slave’ to these ‘voices.’
Or to claim that it is those who do not intentionally embrace one in particular, who are the slaves. If anyone is a slave here, the closest thing would be the Epicureans.
One could instead simply call all of this information, or preferences, or training data.
I notice I have preferences. I notice that some things make my physical experience better or worse, and other things make my community better or worse off, and other things impact various other things I care about. I then choose how to respond to that information.
Does that make me a slave? Are you a slave simply because your actions have consequences, and you are aware of those consequences?
Are we, collectively, ‘slaves’ of the need to eat? In some sense yes, but I don’t think that’s a useful way to think about it in most contexts.
Simpsons Ancient Greeks Did It
Another classic thing philosophers do is claim credit for things, because they got there first, or at least published first. They claim your ideas always (or almost always) have to ‘come from somewhere’ by which they mean someone.
I mean, sometimes, sure. But it can’t be turtles all the way down. You can, you know, actually develop new ideas. Or rediscover, on your own, old ones. Or listen.
That quote always irks me. The phenomenon is real, all of that does have some impact, but intellectuals like to deny that anyone could actually acquire practical knowledge or heuristics in practical fashion or their own thinking, either themselves or over the generations.
There are only so many ways to think well, the same way there are only so many chess openings. That doesn’t mean AlphaZero owes a debt to one Roy Lopez.
Agnes lays out four different schools of thought on how to make decisions.
This is her summary of the three that aren’t hers:
She also makes this bold claim, which I’m going to move up from Chapter 4 to here:
I think that’s not only actually false, it’s false for all three baseline ethical theories.
The book goes over the ethical theories briefly here, then returns to them again later, after establishing some background. I don’t think the background requires the theories beyond what my readers already know, so I’m going to move my explanations of the three traditional theories to later when I discuss chapter four.
The Proposed Fourth Option: Socratic Inquiry
I don’t recognize this characterization of Virtue Ethics or Aristotelianism at all. And I challenge the idea of the savage commands being something to be tamed rather than data. But I’m not sure how much those objections matters in context?
I also notice the implicit conflation here of untimely questions and the savage commands. That seems completely wrong? I don’t see any of these three methods as solutions to Untimely Questions, other than as methods to help answer the question, and I don’t see how UQs relate to the commands.
What Agnes proposes to do here, regardless, is say that Socratic is a fourth alternative.
Rather than a complement to one of the other three approaches – the Utilitarian who places high expected value in solving for their true utility function, the Deontologist (she says Kantian) who tries to write better rules and the Virtue Ethicist who strengthens virtuous or desired actions and thoughts and ideally has various forms of seeking knowledge and an accurate map of the territory as key virtues – Socratic values are a Secret Fourth Thing.
I strongly disagree that you can’t be quite a bit Socratic, in the virtuous senses, while subscribing to one of the typically acknowledged big three theories. Certainly you’re not telling me the rest of us can’t seek knowledge, or that we’re ‘doing it wrong’?
Well, actually, Agnes does seem be saying that, in ways that seem rather central to the book, and which we’ll discuss quite a lot.
You think they don’t notice? They notice. If you question their command to do so, they’ll typically say ‘exactly, now you’re getting it.’
It also seems like treating a particular method of inquiry based on dialogue as a fourth theory is a category error? The question of the way you try to figure things out, and whether Socratic dialogue is The Way to do that, should be distinct from whether the goal is to figure things out (versus the goals of doing the outcome that has the best results, or choosing the best rules, or cultivating the best virtues).
Either way, none of that means there can’t also be a fourth theory.
What happens if we take that fourth theory seriously?
Well, the obvious first question is, how are you actually going to choose to do things?
Oh, but that’s totally wrong, you see. Socrates was not merely criticizing the overconfident answers of others. Socrates had answers. Agnes proposes answers.
Their answer is (short version) that The Good is to seek knowledge. In particular, to seek knowledge via one particular style of conversation.
Again, the ‘via one particular style of conversation’ seems to be a category error. As in, you can have any combination of:
Why would these two Socratic positions have to go together?
No Really The Position is Nothing Else Matters
Whenever anyone tries to end a Socratic encounter, the Callard position is they are ‘turning away from inquiry,’ and going back to living life fifteen minutes at a time.
This matches my interpretation of Socrates’s appetite to spend time on these conversations. Which is that he is without limit, and absolutely cannot take a hint.
Of course Socrates thinks that the circumstances call for inquiry! He always thinks every circumstance calls for inquiry.
Grok tried to come up with scenarios where he wouldn’t or didn’t, and I find them entirely unconvincing or trivial. Its first chosen example, hilariously, is ‘Divine Pronouncements from the Oracle at Delphi’ and not only in his story did he inquire as to what that supposed pronouncement meant, if there’s one thing you should definitely do inquiry about it is pronouncements from the Oracle at Delphi!
At some point, life is going to beckon. And also, at some point, well, this Socrates guy is really annoying and will keep going forever if you let him, until he gets you to the conclusions he wants.
If you never end a conversation, eventually everyone else will have to end them for you.
And then when they turn away from you, you say that means a lifetime of wavering. Perhaps it instead means a lifetime of sometimes doing things.
Agnes does notice this.
I am not that impressed that he survived to the age of 70. It takes really a lot to get put to death for doing philosophy in ancient Athens. To our knowledge no one else ever pulled it off, unless you want to count the convictions of Anaxagoras, Protagoras, or Diagoras, or the charges against Aristotle, but none of them died or really count given their contexts. Socrates had to make quite an effort, including at and after his trial, for it to actually happen. He was one of the most interesting things going on in Athens at the time. It’s not like there was anything good on TV.
It is however impressive that the individual conversations go on as long as they do, if you think these were largely real conversations. The interlocutor eventually rushes off because Socrates has no interest in reading the room or letting practical considerations matter, and letting them leave any other way, and also because Socrates never plays fair. Yet he gets them to stick around for quite a long time. Even with a lot of dramatic license, and also a lot of selection – presumably people who didn’t put up with this didn’t make it into the dialogue collection – it’s still quite something.
The War on Wavering and Nebulosity
Wavering is often good. You should waver. The point of inquiry isn’t to establish fully confident beliefs in universal principles for the relationship of nebulous terms expressed in human language. It’s to make progress, to change one’s mind, to figure things out and… to waver.
Respect nebulosity, the fact that concepts can be real but impossible to fully pin down, that they can lack hard borders and involve ambiguity yet still be highly useful.
What he says are ‘the same’ subjects would blow your mind. It’s a magician’s trick.
I always love a good Russell conjugation:
One must remember Keynes. When the facts change, you change your mind, including when the facts are brought to light, or thought about in a new way.
The idea that someone could fully understand concepts like ‘justice’ or ‘virtue’ or ‘what is admirable and contemptible, good and bad, and advantageous and disadvantageous’ to the point of being ‘unwavering’ and entirely logically consistent while answering maximally challenging questions, all in real time, is absurd.
What’s even more absurd? Telling someone that if they fail that standard, then ‘they don’t know’ about the topic.
By that standard, no one knows about any of these things!
Of course I don’t have a full classification system of what is good and bad, and advantageous and disadvantageous, and that also stands up with logical consistency to every metaphorical comparison, even if Socrates wasn’t twisting them around.
It is so backwards and bizarre that Socrates is claiming that wavering would, to that extent, ever be stopped by inquiry.
Perhaps that is because his goal is often to trick people into forming certain beliefs?
This is an attempt to draw sharp distinctions, and in other places draw sharp equalities, that do not apply. If I had to roughly intuition pump, I’d say:
Or, here are some adjectives and cases to consider:
Do these items bleed into each other? Do people attempt to frame things as one rather than the other in order to claim they are good or bad? Oh, sure, all the time. People are constantly playing association games to shift and signal approval. That doesn’t mean there is nothing else going on.
This is largely distinct from a different form of wavering, described here:
The part after the dash is overreaching, but yes this is a common phenomenon.
There is of course overlap between these two things called wavering. But I think they are at least as distinct as they are the same thing, with of course many cases involving motivations from both sides. And equating them together is, again, part of the central Magician’s Trick.
Socrates does the opposite, where he not only says one should never waver, he declares all sorts of other things that aren’t wavering to be wavering, because the person involved can’t properly make explicit the factors involved in reaching different conclusions under different circumstances.
Living Your Best Life
We can live a life based on knowledge. Indeed, we could hardly live any other way. What we cannot do, at least for the next few years (great things in AI are afoot!), is live a life based on complete knowledge.
Would a life based on complete knowledge be the best life, if by knowledge we mean all that could be sought Socratically, so this means not only facts but full understanding? One could argue no, because with complete knowledge one could not then seek knowledge, or ever be surprised, and one suddenly has the ultimate version of the Tolstoy problem.
People are rightfully very worried about this issue. It’s terrifying. Yes, your AI can probably help you find the solutions, but what if there isn’t one? Or at least, not one that we would like on reflection? What you cared about was the Exercise of Vital Powers, the striving to learn and better yourself and compete and emerge victorious, and now you weep for there will be no more worlds to conquer?
There’s no art left, and also no end other than itself? What do you do, if you can’t track the herd, build a world, begin again? Or if it unleashes a fully Malthusian or Molochian world, including one in which you are not capable of justifying or sustaining your continued existence?
I digress. I mean, I don’t. Everything else is the digression, I hear Agnes cry. Inquire!
I’d say ‘we don’t have that kind of time’ and she’d say ‘exactly, stop saying that!’
I’m saying it anyway, that’s looking like an ASI-complete problem, and it only causes problems once the ASI is complete, and there really is a lot to get to.
And for this current discussion, I do digress.
It suffices to say that at all known margins, perhaps sometimes ignorance is bliss, but we can agree that in general yes, we agree that more of the right knowledge, the knowledge you would choose to seek in such a dialogue, means a life better lived.
I don’t think that’s universal or even obvious. Tolstoy would like a word, after all. I do think it is true in general, sufficiently so for us to continue.
The thing is, there’s a lot of things that we could use more of on the margin. Why is the second best life the one oriented towards knowledge, unless it is to then live the best life, the one with knowledge? Is not the goal to gain the knowledge of the world, such that we may live in it? You explore, but also you exploit.
Why would you choose an extreme point on the production possibilities frontier?
Yes. Of course limited doses. Just like anything else. Ask your husband Aristotle.
We’re talking price. You can argue their price is too low. The order can vary. The price can vary. If you respect the philosophy for its actual content, not only for ‘teaching you how to think,’ then there are advantages to doing a lot of it later on.
If the justification of this being the second best life is that it leads to the best one, you either have to expect to gain full knowledge, or pursue a mixed strategy.
Unless of course you think it’s all way too much fun. Some people really do want, for its own sake, to philosophize all day. That’s great, but a very different motivation.
Introducing the Socratic Method (the real one)
There is a place for this. Sometimes you do want the proposer to be distinct from the verifier, because you need an outside view and fresh eyes to spot mistakes, and once they’re pointed out you don’t really need the other person proposing solutions.
As in, the people in quality assurance can find the bug. But they can’t fix the bug. So there’s no point in asking them to try. Once they find it it’s on you to go fix it.
But Agnes insists we should seek knowledge, essentially exclusively, in this particular way, using a very strict version of this particular method. Or even that doing this particular thing is The Good and The Way. Which to me is totally wild.
Then she wrote a book explaining this, where the book doesn’t use the method.
Because it is a book.
Of course, the Socratic dialogues themselves are now books, and sort of do use the method, but the reader is taking neither of the roles and the exchange is scripted (even if it or a similar conversation did take place in the past). Often the conversation unfolds in completely manipulative ways that make it almost impossible for me to read for more than a few pages. That’s completely different from actually Doing Socratism.
Prove Me Wrong, Kids
Oh, sure, when Socrates says it he’s founding philosophy. When Michael Vassar says it he has a reality distortion field and is kind of a cult leader.
For a person who is not wise, who knows nothing, and who is equally pleased to refute or be refuted, I can’t help but notice he spends almost all his time refuting and very little being refuted. Indeed, he’s the guy whose name is attached to the idea of making these two distinct roles and claiming he doesn’t know anything or have any wisdom, exactly so that he can do all the refuting without doing any of the being refuted, and then claim that this is the road to all knowledge and wisdom.
I also notice that this is some masterful frame control. You set them up so they have to admit they would be happy to be refuted and change their minds.
Isn’t this whole framing a little suspicious, anyway? Why all this talk of refutation?
Hold that thought.
Socrates Asserts Wrong Conclusions That Are Wrong
I think ‘the method’ could be used to argue for essentially any conclusion you want about politics, love and death. In general I think ‘the method’ can be used to argue for, essentially, anything. And I think the particular proposed answers in those later sections are rather wrong. But they’re nowhere near as wrong as things Socrates says.
I don’t simply mean wrong as in ‘it turns out with two millennia of hindsight that this wasn’t true.’
I mean this was Obvious Nonsense, to the point where you should assert Wrong Conclusions Are Wrong and reject the claims outright and start again, realizing at some point you were the victim of some error or magician’s trick.
You should do this the same way that when you see a proof that 1 = 0, but you look at it for a while and can’t find the flaw, you don’t say ‘oh I suppose 1 = 0 then’ you should say ‘I know there is an error in there somewhere even if I can’t find it.’
It seems really hard to believe anyone sane would really assert with a straight face:
That’s up there with War is Peace, Freedom is Slavery and Ignorance is Strength.
I can see how one could make the mistake of ‘there is no reason to treat a corpse with respect’ but seriously how does no one explain that one to him? I’ll get to that later.
But seriously, has Socrates ever met, you know, people? Done things in the world?
Canonically the answer is yes, but his statements raise the question.
Of course you can fight injustice. You can organize society with laws and enforce them. You can reward just action and punish unjust action, and make this pattern known. You can find the most unjust people and ‘stick them with the pointy end.’
The others aren’t more plausible than that.
Claude warned me on review I should soften the above rhetoric, that these are all major seriously defended philosophical positions. I acknowledge that these are all major seriously defended philosophical positions. I acknowledge that I may not doing serious engagement with the strongest forms of the arguments for those positions, although I asked Claude Opus 4.5 to then generate its strongest defense for all five claims and got it to agree in all 5 cases that it was refuted one question later.
So you know what? I don’t care. Wrong Conclusions Are Wrong. Refutation or disproof by contradiction in this way is robust and can ignore all other considerations.
Also, yeah, it’s really good to be the one asking the questions and doing the refuting.
There is only one type of person who would want to convince others, as a full package:
That person is a cult leader. And not the good kind.
Funny how that keeps coming up.
You Can Question Your Beliefs
If there’s one thing that seems alien and baffling throughout the book, it is this constant claim that various intellectual actions are way more impossible than they are, especially the questioning of one’s beliefs.
Agnes tries to extend this concept of the load bearing belief, the potential Untimely Question, to places where it need not exist. I love this example because it is so very clearly backwards. There’s something very alien about how ‘belief’ is being used here.
Without loss of generality, let ‘cloning is immoral’ be [X].
Suppose I believe [X]. When I say that, I don’t mean my p(X)=1.
Suppose I advocate for [X]. This does not have to constrain my p(X) at all – perhaps I am being paid to assert this and my p(X) is 0.1 or 0.5. Perhaps I don’t like cloning for other reasons and I’m pitching the moral aspect because it lands better. Or maybe I think if it’s immoral it’s super immoral, and I’m not sure. Or I could think p(X) is 0.9, but that doesn’t mean I’m sure, or that I wouldn’t want to be refuted and stop advocating if I was wrong.
None of that requires me to take [X] ‘out of my beliefs’ in order to consider whether [X] is true. Even if I am not thinking that way, and simply think [X], I can still analyze whether [X] while continuing to use the cached belief of [X] until proven otherwise.
One can also be ‘of two minds,’ which is a funny thing to say in the context of this book, but which will become very important later. Have you forgotten doublethink?
Again, I don’t see why we have to have some sort of Platonic ‘detached position’:
Sure we can. There are various ways to do it, as we’ll see later, although in practice often someone might not realize the need to do it, or might not know how, or might not have the resources available.
If you can ask about a fictional character, you can then ask ‘how does my situation differ? How is it alike?’ or you can ask ‘what would I think about this if it was a fictional situation?’ Those are, indeed, some of the tricks.
I don’t get this one either. If I’m going somewhere, how does that change how I give directions? What might change my actions is if I’ve already figured this one out, and is already top of mind, which might or might not be true if I’m on my way there. Often I figure it out as I go. Other times, you ask me for directions, and I know the answer, even if I’m not headed that way. The correlation here is rather low.
It is not typically the wisest approach to be fully ‘unwavering’ and hyper-consistent along some set of abstract principles, or to have to be consciously aware of our entire algorithm for making decisions. So much is practical. Again: You say ‘wavering’ like it is universally a bad thing, using an extremely expansive definition. Why?
The person claiming it cannot be done should never interrupt the person doing it.
As in, I’m writing this, about how to think about questions related to activities that are currently part of your actions and identity, which is about activities that are part of my actions and identity, described by an author who is definitely describing things that are part of hers. Yet here we are.
This claim that the ego must attach, and that objectivity about anything you actually care about is impossible, simply is not true.
Of course it is a difficulty to retain objective, when you would rather reach one conclusion than another. That doesn’t mean you can’t do it, or at least get arbitrarily close to doing it. The rationalist community engages in a vast amount of deliberate practice to figure out how to do this, and has succeeded enough that if that procedure was not working, they would very much want to know that, and are eager to hear your argument why it doesn’t work. This isn’t it.
I’m not a mother, but I am a father. And definitely I can and do often step back and ask the question, ‘what makes someone a good father?’ either in a context or in general, even though the answer to that might imply I haven’t been the best father, or that I’d have to do things in the future I don’t want to do.
That’s how you accomplish anything. Is there some amount of bias there, that one has to watch out for? Sure, it’s never going to be zero, but that doesn’t mean you can’t do the thing.
Indeed, later on, Agnes gives an example with maximized bias, where someone still manages to change their mind anyway:
Let the guilt of the son be [X]. The mother is relying on [~X], and will continue to act as if [~X] right up until the point where she realizes [X]. I don’t think this is so much an epistemic path as it is a way of acting. Again, remember doublethink. She’s going to continue to use [~X], but is capable of evaluating evidence while doing so, to the point where sufficient evidence will flip her to [X].
True Opinions Do Not Only Do Good
File under things I can’t believe someone has to say as a section heading, or Lies That Socrates Told Me.
True opinions can do ungood. Kant and the axe murderer.
I am an unusually strong believer in true things. I recite the Litany of Tarski, I try very hard to make my map match the territory and avoid various traps. The truth is the way to bet. But sometimes locally correct opinions result in the world being worse off.
I also don’t think Socrates is right about how minds work. Giving an account of your beliefs can make them more accurate, and is highly useful, but it doesn’t bear that much relation to how long beliefs get retained, which depends on so many factors. For most of my long term beliefs, I have a very far from full understanding of what led me to those beliefs – I don’t have enough context window or memory for that, and a lot of my computation is unconscious or System 1.
If I want to know why, really know why, I likely have to actively ask that question, and sometimes the answer will then be obvious, and other times it very much won’t be, or my reasoning on the question now will be very different from what caused me to cache the belief in the first place – even if I reach the same conclusion, and even if I did ‘fully reason’ my way to it earlier, and forgot how.
We can get true answers without inquiring? Sounds like the question was not so untimely after all. If one really can reliably get answers some other way, one could instead inquire into the ability to trust that other process, or maybe just Trust The Process. That sounds, in general, way easier.
The best way to get stable answers is to avoid inquiry.
I thought the problem was that otherwise the answers could be wrong?
Meno Plays the Fool
Is Meno, as written, playing The Fool here, and if so in which sense?
When I first read that, I thought Meno was simply an idiotic arrogant prick.
On reflection, Meno is definitely written as an arrogant prick, but his mistake is that he has a very different understanding of what it means to define or know something, than does Socrates.
Socrates thinks that to know or define virtue is to offer a complete definition of virtue that knows the answer in all cases, that is ‘unwavering’ in that it is logically consistent, and that without that you do not know what virtue is.
It likely never even occurs to Meno that this could be what someone means. Instead, he thinks this a practical demand, to be able to lay out examples such that someone can understand virtue, to lay out the central and important cases. So he does that.
Notice Meo says there are many other virtues, so Meno does not mean this is a complete list, merely that these are the central examples.
Meno’s actual view is reflective of the perspective that virtue is the set of things that we consider virtues. Or that the set of virtues is the set of things that, when treated as virtues, leads to good outcomes. Or simply, virtues represent The Good, perhaps in a slightly different sense.
The common element of his definition is ‘the various people here are doing what it would benefit those in their station and situation to do, systematically, in order that things may turn out well.’
Meno is not, however, aware of this consciously.
And he (as written) makes the mistake many people make when challenged in similar ways by Socrates, which is to start grasping and making things up without thinking them through, while accepting this idea that his statements must be fully ‘unwavering.’ This is a very strawman thing to do. As always, Socrates now has it easy.
The Central Magician’s Trick
Magician’s tricks are even easier if Plato is writing both sides of the conversation.
Just saying.
The Gaslighting of Alcibiades
Gaslighting seems like an accurate term for ‘take someone exceptional, and convince them they are so ignorant they deserve to be a slave because they can’t consistently answer unsolved deep questions in philosophy.’
Everyone, without exception, would under sufficient pressure give conflicting answers to philosophical questions if not allowed to answer ‘I don’t know,’ especially when the questioner was allowed to play with the meanings of words like good and bad and so on and you weren’t allowed to revise responses or constantly answer with ‘well that is complicated, it depends on many things some of which are…’
The correct answer to ‘am I a just (good) person’ is, of course, Mu. It’s a category error. That is not an adjective you should be applying, unqualified, to a person.
And then, wow, look at Socrates go and execute step 1. I’m not mad, I’m impressed.
Alcibiades is making a very specific claim. He is saying, on some occasions, he saw other children cheating him, and not playing fairly. Which is a highly normal thing that happens to basically everyone, that you catch someone cheating at a game.
Socrates turns this into ‘you thought you understood justice and injustice,’ which is a completely different thing. By Socratic standards, none of us understand justice. But of course, Alcibiades takes the bait.
The last time I was confused or puzzled about justice was today, as I write this. I’m confused and puzzled about justice all the time. It is highly confusing and puzzling.
But yes, ‘an attitude of wonder or detached inquisitiveness’ is indeed misplaced when someone cheats at games. That much we have all decided on, by nature of agreeing to play. We’ve been over this. I have very much cached that one, and I do not trust anyone who has not done the same.
That doesn’t mean Socrates couldn’t ask ‘hey, Zvi, I think it’s actually fine to cheat at games, why shouldn’t I?’ Indeed, I’ve had that discussion, and done exactly the motion Agnes says I shouldn’t be able to make, which is to suspend that answer in context sufficiently to examine why I have it and whether it holds up to scrutiny.
As an aside, nowadays we would all be able to answer that one differently.
You can see Alcibiades realize he likely has ‘been had’ in some way, and try to fight back, but he doesn’t have the skills:
The word ‘apparently’ is not agreement. It is him saying ‘logically you have established both propositions within the context of this conversation, I don’t see a way out of this even though it is absurd, and thus I realize that I am f****ed.’
Contra Agnes, I don’t think he is asserting or accepting both [~P] and [he believes P]. He is noticing that Socrates – to use Agnes’s word – has ‘orchestrated’ this trap, and he doesn’t know how to get out of it.
Also contra Agnes, yes, the ‘reluctance’ here very much is reason to doubt the sincerity of his responses, not that Alcibiades is being insincere but that he knows he’s being intellectually manipulated into saying it.
If you look at the previous context, which Agnes quotes, it is very clear that Alcibiades knows, deep in his gut, that he’s been had, but doesn’t feel socially able to assert this.
If you then go on to what Agnes quotes after, with Socrates pulling ‘if you can’t provide a counterexample to my absurd claim then it must be true,’ it’s even more obvious what is happening, that it is all a magician’s trick.
Pair this with something else we see when we move forward to Chapter 6: There’s a passage (Alcibiades 105a-c) where Socrates says to Alcibiades that he thinks he’d choose to die rather than live with only what he has, that if he could ‘only’ rule some of the people that would be so puny as to merit suicide. Which I am very confident (to the extent Alcibiades is real, or can be reasoned about) was not true, and which is the kind of thing that can really screw a person up, and also he claims this as knowledge (that ‘he is sure of it’) when I mean holy hell.
The end result of all this is that he uses this frame of Alcibiades own psyche to get him to admit he deserves to be a slave.
It is hard to properly state how royally f***ed up the whole thing is.
The correct answer is that he wasn’t able to do this. What Socrates was able to do was to force Alcibiades into a bind where he felt he had to admit horrible things about himself. Then, when he was outside of what with others such as Steve Jobs or Michael Vassar we call the ‘reality distortion field’ that was doubtless around Socrates, he realized he’d been had even if he didn’t know how or why, and he only got more determined to prove he was worthy.
If you doubt that Socrates, or someone in the questioning position, can very much become the proposer rather than the disprover, and get the other person to follow whatever chain of logic they want, a simple example is quoted later, in Alcibiades 132d-133b. All you have to do is end each statement with, essentially, ‘isn’t that right?’
Socrates as Jeopardy contestant. You can provide the answer all you like, so long as you phrase it in the form of a question.
What was the historical result of this, together with an affair with Socrates? And if you look at the descriptions quoted in Part 3 under Love, you can see what a number Socrates ultimately did on Alcibiades before he was able to snap out of it, and then tried to warn others rather explicitly that Socrates is an abuser and not to fall into the same trap (Symposium 222b), although he doesn’t have the language for this.
If we take the account in Thucydides seriously, this clearly insane expedition plausibly led to the fall of the Athenian Empire to Sparta, and thereby to the fall of all of Greece and the end of its golden age.
All, quite plausibly, because of Socrates. He took his whole civilization down with him.
We later learn Meno also failed to gain wisdom after his encounter with Socrates, as he is revealed to be greedy, treacherous and incompetent, and dies young after being tortured for a year, but at least he didn’t take down a civilization with him. This forces Agnes to point out that no, sometimes Socrates’s associates ended up doing okay.
So why did all of this backfire so spectacularly in the case of Alcibiades? Agnes says Plato tried to answer this in Symposium, which is set 17 years afterwards.
Essentially, Plato thinks Socrates screwed Alcibiades so badly he got PTSD.
Is the answer somehow not obvious, even if this book wasn’t already about untimely questions and needing to be able to have the tools to function? Socrates attempted to gaslight Alcibiades into replacing his self-perception with ‘you deserve to be a slave.’
Alcibiades didn’t know how he’d been had, but he knew he’d been had, and even if he hadn’t, that obviously wasn’t going to work as a new answer.
Don’t be surprised if he ran directly away from that idea once he was outside of the reality distortion field.
Others like Plato accepted the frame and managed to make a good living that way, and occasionally they had good ideas.
The Measure of a Fight
It’s not that simple, on three levels.
One is that I wouldn’t describe the Iliad or Odyssey that way.
Since when was the Trojan War about justice?
Here’s Claude, one shot, I think this is a rather robust answer to such a claim?
The Odyssey is an even worse fit than The Iliad. Most of its events have little to do with justice. I think neither Odysseus nor the suitors primarily cared about justice.
Two is that if there is a matter of justice, we won’t necessarily fight about that either. Most of the time, we’ll still argue, or have it out in the social realm, or I will see you in court or at the ballot box.
Finally, how do things become matters of justice? There was some underlying event or dispute, and then it escalated. A dispute over health can absolutely become a dispute over justice. Often it does.
Consider reactions to Covid-19, and how rapidly a lot of that became or threatened to become a fight. And there’s always been a ton of realpolitik where no one’s claiming to be just, only to have power, both among nations and among individuals and groups.
Agnes then ties in a similar exchange, from Euthyphro’s dialogue, which is next in our dialogue queue after Alcibiades:
The key distinction Socrates is pointing to here is that the disagreements are (1) objective and (2) can be easily measured and settled.
There are many differences about the larger and the smaller, or other similar comparisons, where there might be an objective answer but the practical solution of ‘turn to measurement’ is not available.
Indeed, ‘who had the most votes or support’ suddenly turns into a fight rather often if the measurement can be disputed. Yes, you can say that is ‘a matter of justice.’
But often it’s much more a matter of winning, no matter how often anyone says ‘justice,’ and the reason they say ‘justice’ is because they say they have more votes – their number is bigger – so their victory is therefore just. It’s rhetoric, not motivation.
Measurement is of course very important. What is measured is managed. Remember Stalin’s remark that what is important is not who votes but who counts the votes.
No. We fight over questions where the answer matters to us, and we disagree. That doesn’t mean the suspension of judgment is impossible.
This seems like falling into the one-explanation absolutist Socratic trap. If [X] does not mean that we will fight, then [X] can’t be ‘the reason’ we fight, it must be something else. But why speak in absolutes? There are a lot of factors that determine whether something becomes a fight, and whether that fight then turns various levels of violent or destructive.
I don’t think untimely questions and the supposed impossibility of therefore suspending judgment are that high on the list of things that influence whether something escalates into a fight. Nor I do I find it a useful explanation, in terms of either predicting what fights will escalate or happen, or being a way to prevent or cause such fighting. This feels like a very alien model of people’s brains.
One could argue that the ability to measure continuing to function fine no matter how untimely the question is a strong argument against the book’s thesis. I can and often do presuppose I know the answer to [X], or even am relying on [X], yet need to verify that answer or are challenged on it, so I measure [X], or I fact-check or sanity-check [X]. That can include, but is not limited to, tasking someone else (or an AI) with that measurement. Detachment sufficient for this, in practice, highly possible when you need it.
If you can’t objectively measure [X] despite thinking you know the value of [X] already?
That’s a Skill Issue.
The Good Fight
Leverage Research offers us Connection Theory. Connection Theory says that any given person has a set of ultimate goals, and of necessity believes on some level that these goals will be achieved. If at any time, their model of the world changes such that achieving all of their goals simultaneously becomes impossible, what will that person do? Connection Theory says that person will instantly change their world model, as radically as necessary, such that the goals all become possible again.
A similar claim is on display here:
No one can live like that? I offer a living counterexample. I am living like that.
As in, let me be very direct. I expect (by default) all value in the universe to be lost, and all people to die. When the dust covers the sun, and all you hope for is undone. We are losing. I am fighting with everything I have, and I am losing. I expect to lose everything I care about, profoundly, completely. Yet if nothing we do matters, all that matters is what we do, and we hope to at least die with as much dignity as possible. Which, at the present moment, would be not much.
I also took a very ‘eyes open’ attitude when I chose to get married, with the full knowledge that there was a large chance it wouldn’t work out. I planned accordingly.
The untimely questions Agnes specifically mentions here? I’ve asked myself them all, exactly when she thinks they must have been untimely. I similarly, independent of that expectation, am uncertain about any number of other more personal things, as well. Many of the most important things I have done, or attempted to do, are things I thought were not so likely to work.
Yet here I am. And many of those things happened.
I’ve been a professional gamer, trader, startup founder, gambler and bookmaker. You don’t get to do these things without acute awareness that things might not work out. And if you go into a marriage or startup assuming things will definitely work, or have kids assuming they will always be healthy, you’re going to make a lot of deeply stupid and highly avoidable mistakes.
All you can do is play the best game you can.
If you can’t even make a real effort to figure out how likely your best friend is to keep a secret, then I have some bad news about your operational security. You gotta ask.
(To be completely fair: I actually do trust my best friend to keep a secret, pretty much absolutely, but I feel I have extremely strong evidence that this is true, and he’s my best friend largely for this reason, and there’s almost no one else I trust in that way.)
Is it more difficult to assign probabilities in these situations, especially explicitly so? Are you going to be biased? Oh, of course. No one said all of this was easy. But not being easy is very different from being impossible. It’s a skill issue. Get to work.
It can be hope. It can be confidence. There’s a sometimes wise move that, given we have limited compute, parameters and data, and because you are partially interpretable and others will react to your hard-to-fake level of confidence and hope, it is sometimes correct to act as if one was more confident than one actually is, and even to in-context actually be confident and hopeful, in a way that doesn’t correspond to your outside view.
This is where, once again, one must ask if you have forgotten doublethink.
You need to simultaneously live in the moment with confidence, and decide when you are better off having confidence even if it’s not justified by your actual chances, and also notice when that confidence can cause a serious mistake. The wise man keeps two sets of probabilistic books, and knows when to switch between them.
The Curious Case of Euthyphro
Grok tells me the consensus is that Euthyphro wasn’t a real person. Either way, oh boy.
That’s kind of bonkers in its historical context. Even today, to fully charge this as ‘murder’ (presumably depraved indifference, Murder 2) would be a full Jack McCoy abuse of prosecutorial discretion, and clearly not justice. It’s quite obviously meant to be an absurd case.
Then Socrates lays his standard trap. He asserts that Euthyphro must believe he has knowledge of various things. Euthyphro’s pride walks him right into the trap.
I would in Euthyphro’s position respond to that by saying that all I need to know is that we cannot allow anyone to take the law into their own hands and kill another.
Euthyphro instead responds with a maximally broad claim of knowledge.
Euthyphro’s statement makes no sense. Obviously most people do not have ‘accurate knowledge of all such things,’ so this is not required to exceed the majority. Nor is it required to be of use, either as a prosecutor or otherwise.
The pupil routine is an eye roll every time.
So now, Socrates has established that Euthyphro is obligated to have all the answers.
Look at how much broader a claim this is than the situation requires. Rather than claim this is one of the things that is generally pious, and what it requires in this context, he claims this is the entirely of piety, and an absolute requirement.
At that point, it’s all over. Socrates can take this in almost any direction at his whim.
You Should Be Sad About That
Sadness is totally motivating. It is so motivating that at Jane Street we were explicitly told to use sadness as a reinforcement motivation system – in various situations it was considered correct that you ‘should be sad’ about [X] to a given degree. Certainly sadness is motivational in advance, you want to avoid becoming sad, and you wish to cease being sad if you are sad now. We’ve all done things because of sadness.
No, you do not say you are ‘sad at’ person [P], you instead say you are ‘sad about’ or ‘sad for.’ But you do say you are ‘sad about’ or ‘angry about’ person [P], or situation or event [X]. At other times, we are angry, but not at a particular person or group or even particular thing, or without this leading to any particular goal.
This also seems clearly false. Of course you can ‘resolve’ or ‘fix’ sadness. Kid does not have cookie. Kid is sad about this. You give kid cookie. Kid stops being sad. Or perhaps kid is sad about something else. You give kid chookie. Kid stops being sad. You have a lot of control over sadness and it can totally be cancelled out or fixed Certainly there are cases where the underlying problem is unfixable, but this is true for both anger and sadness, and in both cases there are many solutions.
Wrong again. Sometimes we absolutely want others to be sad alongside us, one can even become angry about someone not being sufficiently visibly sad. Sometimes we want others to get angry about something alongside us, other times we do not, sometimes we do not even endorse our own anger.
Anger does not require a wrongdoer, even an imagined one. Most of us can remember a time we have been angry about things that were no one’s fault. Even if it is clear ‘who did it’ that does not always mean you are angry at them in particular, nor does even that imply you actually want to do something about it. You might, you might not.
I mean, okay, sure, sometimes people get angry and act on it. But one can imagine Creon giving that same order without being angry. It wouldn’t be my move, but one can understand it.
I quote this mostly to point out that Creon was a terrible king. That’s not how being a king works. You don’t get obeyed purely because you wear the crown, you have to earn that and maintain it or it will be lost, and sometimes people will still tell you no or not do what you say. If he doesn’t know how to handle not always getting your way, oh boy do I not want you in charge.
People Respond To Incentives
Socrates and Agnes have a particular problem with something called ‘commands,’ and a very strange view of what is in charge of who. Her two ‘savage’ commands are bodily commands and kinship commands.
You say savage command and torturer. You say you obey whichever is strongest.
I say not so fast. I say that’s a suggestion. It’s information. A signal. An incentive. I take all of that in. I can then do what I want, if I’m willing and able to pay the price.
You ‘obey whichever is strongest’ in the sense that if you consider all the things motivating you, you’re going to end up doing what you were most motivated to do.
But that statement doesn’t have any content in it. We could extend the ‘savage commands’ to include the ‘non-savage’ commands, and have more reasons. We might do things due to logic, or to accomplish a goal. We might do all this to pursue some Good, which may or may not be ‘savage’ in the sense of cashing out to our physical experience or those of others.
We do things for reasons. Most of them, most of the time, will be cached.
What makes some of them ‘savage’? Why does that matter?
If bodily commands include commands to pursue intermediate goals and goods, that in turn ensure resources, then are most kinship commands actually bodily commands, along with everything else?
What doesn’t ultimately count as a bodily command here? Only purely altruistic acts?
She contrasts commands with suggestions, which come in response to you asking a question. I don’t think there is such a difference here based on whether you asked a question explicitly. Both can be voluntary and both can be coercive. Both can be considered or unconsidered, cached or uncached, trustworthy or not.
Life requires you to take most atomic actions with System 1, quickly and on instinct or based on cached procedures, rather than use System 2. The point of most System 2 work is to get your System 1 ready to make better decisions later.
It’s not that we need answers before developing the ability to ask questions. It’s that it is costly to stop and ask any given question, either in general or at any given time.
You can stop to inquire, and at any moment stop living your life fifteen seconds or minutes at a time. You can’t stop to inquire before every action. Even when inquiring, you can’t keep inquiring as to whether and how you should inquire about whether and how you should inquire. Because that, too, is an inquiry, and so on.
There’s this idea that if you didn’t ask the question, how dare you be provided with the answer (in a book full of answers to questions I never asked, about someone whose life was about providing answers to questions no one else asked):
That generalizes to any information, including about your preferences. It is going to give you answers to questions. It is Agnus who is specifying that the questions are unasked. Quite often they are asked. What am I hungry for right now? I’m asking.
Also per Aristotle, yes, people will want things they want and avoid things they actively don’t want? Okay, sure?
There has to be something that distinguishes kin from not kin, but if kin can include ‘all humanity’ or ‘all life’ then all it needs is a distinguishing definition and to encompass members within the circle of concern. Everything else is optional, neither necessary nor sufficient for such a group.
Nor is your the kinship group the exclusive source of social incentives, the ‘social commands’ of which Agnus speaks. Your place in the community is determined largely by others, but so many other things are determined by a wide variety of people, groups and institutions that aren’t you. Your doxa, what others think of you, is impacted by kin and non-kin alike, and often you largely care about that of particular individuals.
Self Versus Other
Linguistically, I presume self as a prefix is to contrast it with the opinions of others? As in, there is how others regard you and how you regard yourself, there is how much others respect you and how much you respect yourself, and so on. How else would you describe these things, exactly the non-psychosocial aspects of such questions?
Yes, of course how others think of me could impact how I think of myself. It certainly is both evidence and an inherently important feature.
Socrates Declares Humans Have Unified Minds Free Of Various Biases
Here again is Socrates playing as if he has never met a real life human. We get the assertions that human minds not only must not waver in how they deal with words, but must not waver in how they make decisions. Things either are ‘the good’ or not. Humans never do something foolish and make a bad choice (‘choose the bad’) due to lack of willpower or discipline, or because of hyperbolic discounting or force of habit, or by failing to understand the consequences.
If you model humans as having a unified mind that adds up a bunch of math and then makes a considered decision, well, you’re going to be constantly surprised by the actions of the humans around you, as well as your own actions, because that is a really terrible description of human behavior. You need to understand some form of the rider and the elephant, the conscious and unconscious, the considered and instinctual, and also how we follow habit and cached actions in response to stimuli.
Agnes at least largely gets it, but tries to twist her way out of it:
Of course they knew it was a mistake. Can you not remember any number of times when the akratic says, out loud, in advance, that it is a mistake? Or even asks for help in making the better decision? How can this be any more obvious?
Socrates is doing the word conflation and gaslighting thing again here, especially with what ‘you’ means. It seems to him ‘you’ can only mean the elephant, not the rider? That it only counts as ‘you’ insofar as you have cashed out the knowledge into habitated bodily commands, and also willpower doesn’t exist? But he would never say that in many other contexts, only partly because it’s bonkers crazy.
What in the world even is this claim? No one is saying that bodily instincts are consistent, or that they consistently make choices that are good for the long term. The weak-willed person is definitely not claiming that, they’re claiming the opposite, that they are instinctively making dumb choices and they are unable to make better ones. I am so, so confused.
Correct. Sometimes extreme tradeoffs and short term focus are necessary. But that doesn’t mean that most such decisions are correct. Usually it’s a mistake.
Agnes quotes from the original here, and wow does it drive home exactly how absurd the whole thing is and how much it depends on word confusions:
It’s 2026. We’re smarter than this. We understand exactly what is going on inside the brain when this type of choice happens.
What’s weird is that Agnes here explains exactly that the words are conflated:
Yes, exactly, but guess who is the one doing it here. I don’t see people running around in real life saying ‘oh I knew I shouldn’t have eaten that cookie, I knew doing so was The Bad but I was overwhelmed by The Good.’ I’ve seen a lot of unfortunate cookies eaten by people who knew it was unwise to eat the cookie. Sometimes I’m the one eating the cookie. I have encountered the actual underlying confusion zero times.
Except when Socrates brings it up. That’s it.
Whenever you hear Socrates say ‘there is no other way [X] and [Y] could be different,’ you rule is that you take a drink, then (despite the drink) stop and point out ways [X] and [Y] are indeed different. I’ll let you take this one.
Socrates says these bad choices are because ‘you’ are ‘wavering.’ This is just a piss poor, highly not useful description of the actions of a human brain. There is no unified ‘you,’ there never was, and your outputs are highly dependent on the contents of your context window, and whether you are in Instant, Thinking or Pro mode. Your brain is operating on limited parameters, compute and data, under constant time pressure, and there are approximations all over its algorithms that predictably fail in many scenarios, especially if you give them suboptimal reinforcement feedback.
Revenge
There’s a whole section labeled ‘revenge.’ I think it is mislabeled.
Revenge is a very particular dish and I think this is importantly wrong:
What is being described here is not crazy, and it often is also not revenge.
What Agnes describes is better called retaliation or punishment. You are deliberately making [Y]’s life worse. That is a superset of revenge.
Revenge is the special case where doing so is ‘crazy,’ from the perspective of a Causal Decision Theorist or a utilitarian calculation, and then you do it anyway.
Why do you do it anyway? Because that is the type of person you are. On a decision theoretical level, you seek revenge because you want ‘seek revenge’ to be the output of the algorithm you are using to decide what to do.
It is unfortunate for you, as well as your target, that you happen to be you, at this particular time, and have chosen to be the type of person who seeks revenge in this situation, and thus you have to seek this revenge. Before you set off on your journey, if you are wise, you dug two graves.
And yet, one needs to be the type of bride who sometimes kills Bill.
So sometimes, when the situation arises, you might wish it were not so, but you have unfinished business. You have to actually Kill Bill.
The point of revenge is to allow you to credibly promise to, in this sense, ‘be crazy.’
Being the type of person willing to Do Revenge is a winning strategy. You can overdo it, but the correct amount of willingness to Do Revenge is not zero.
Without access to decision theory and the ability to think about precommitments and how your nature impacts others, and with his obsession with wavering, Socrates fails to understand revenge.
Yeah, that’s dumb. Pacifism is dumb. You don’t get less bad things in the world by convincing all the good people to never do any bad things in isolation.
Enforcing norms and controlling incentives is absolutely helpful, as is helping those who have, through no fault of their own, been placed in a situation that justifies revenge, or self-defense, or any other conflict.
If your response is ‘well sometimes the superficially bad thing isn’t bad after all’ then we need to sort out what ‘bad’ means and it is possible we do not disagree but also your statement ‘never do bad things’ has no content.
If you can never do harm in return for harm done then harm shall be done to you. If your civilization believes all harm done is wrongdoing then your civilization will fall. You are cultivating the wrong virtues. The decision algorithm does not work or prosper. I didn’t know I had permission to murder and to maim.
I’m not going to go full ‘you can’t handle the truth’ and ‘you need me on that wall,’ but yes, you do need someone to be on that wall, and it sounds like Crito and Socrates, for all their inquiring, cannot handle this particular truth.
Socrates instead supports ‘beneficial punishment,’ that is ‘for their own good,’ which would presumably mean it was okay to do harm as long as you were doing so in order to do good to that particular person. And then he would argue (I am imagining, based on other statements in the book) that what ‘does harm’ to a person is if that person themselves does harm, so stopping someone else from doing harm is ‘for their own good,’ so yes, you can fight a war and put someone on that wall, because if the enemy won the war they would have inflicted great harm, and this would be bad for them.
That has at least three problems. One is that it still does not allow for retaliation let alone revenge except when seen as ‘for their own good,’ and thus the incentives are all wrong, and this will have devastating consequences. The other is, essentially, who are you kidding, that’s not what harm means, the whole thing is nonsense.
The third is that this potential trick gets explicitly repudiated:
What are you going to do, double down again and say ‘no, you see, I cut off his hand but it would have been way worse for him if I’d let him cut off someone else’s hand’? I mean, yes, presumably. I’m not buying any form of it.
How many people in history should we still be listening to, at this point?
No, contra Socrates, ‘injustice, ‘harm,’ ‘wrong,’ ‘doing nothing in return’ and ‘behaving shamefully’ do not mean the same thing. Why does this conflation of words trick keep happening? Why does anyone fall for it? How can any parent think ‘harm’ and ‘wrong’ could be identical?
Not quite. The good to be achieved is that the need for revenge has been satisfied. The oath you swore has been fulfilled, the balance restored. The sword has drawn blood, and can be returned to its sheave.
Again, absolutely not. Indeed, it is very easy to imagine someone who lacks all empathy seeking revenge. If anything, individuals lacking empathy seek more. I do need the ability to intellectually understand what might harm you well enough to figure out how to do so, but this is a very different skill. When in doubt go with the classics. One can always choose violence, imprisonment or theft.
Again, absolutely not. Simply false. Have you seen Inglorious Bastards? Or the movie? One could of course simply expand ‘kin’ until the statement is true, if you wanted to. Indeed, Agnes implicitly suggests this, with the idea that kin is anyone you are ‘prohibited to harm,’ and by both law and most people’s morality today this includes at least all humans. But you can get revenge on anyone. Or anything. You can want revenge against cancer, or the sun. Even abstract concepts.
I agree that (per 1584) some event had to turn someone or something into your ‘enemy’ or otherwise trigger the need for revenge. But this does not require that this person (or entity) previously have been kin, or of kin.
The correct amount of revenge is not zero in either case:
Again, that’s not revenge, that’s fighting back, using good decision theory and maintaining good incentives, even if the execution is a bit off. In the particular case in question it’s an overreaction and not okay, but no it is not ‘revenge.’ Yes it absolutely matters who started it, up to a point. Even when the retaliation is wrong, the fact that it is retaliation is less wrong.
That’s his story, and he’s sticking to it. I don’t buy it.
Perhaps Socrates is trying to prove that indeed you cannot engage in revenge if you so sufficiently lack empathy that you cannot model others at all?
I mean this is completely insane.
First, it starts with the idea that the way you harm someone most by enabling them to commit what you see as maximally evil acts. That would make sense if you think the Gods will then punish them or something – certainly ‘get you sent to something like the Christian hell when you die’ is top shelf revenge, indeed hell can be thought of as the ultimate form of revenge if you don’t think it’s a metaphor for nonexistence – but Socrates doesn’t mean that.
And yes, Agnes notes at the end of this chapter, file this under Things Socrates Actually Believes:
Well, so much for all of those traditions, because Wrong Conclusions Are Wrong.
I mean, when I get to the arguments for this position I’ll answer them, but for now I’ll give the real answer to ‘there is no difference between unhappiness and immorality,’ which is: No, just no. That’s insane. Obvious Nonsense.
I know you hate savage commands, but go home, Socrates. You’re drunk.
The other half is approached with the last paragraph. But no, as discussed already, it’s not ‘loving hate,’ and it’s not purely or even primarily an attempt to influence the future or this particular person. Sometimes, yes, ‘and you will learn’ is involved, but sometimes it isn’t, and often it’s more ‘and all of you will learn’ with a side of ‘look what you made me do.’
Note the ‘made’ me do from Taylor here. She doesn’t want to help whoever this is about. That’s not what this is about. All she can think about is karma. She has unfinished business. The Socratic mind does not comprehend, it would seem.
Legal Systems Very Different From Our Own
Also legal systems very different from those of ancient Athens. Ancient Athens not issuing commands is quite the joke. Yes, the people could be persuaded out of it, but when they decided? People got exiled, often without a particular clear crime. People got heavily fined. People, including Socrates, got executed. Cities got slaughtered. A fleet got dispatched to Syracuse. Those are the most savage of commands. Let’s not pretend otherwise.
Persuasion, like dudes, rocks. Also like dudes, it can’t go it alone.
I get why ‘Do [X]’ without explanation is a ‘savage command.’
I don’t get why ‘If you do [X] you will accomplish [Y]’ is a savage command?
Is it because [Y] takes the form of benefits to body and kin?
Are those benefits invalid? Is Socrates saying that it is not good to do good onto the body or to kin, and not bad to do bad? That not only must the savage commands be not obeyed uncritically, but that their goals do not matter?
Um… yes?
That’s outright saying that ‘things that do not matter’ include money, reputation and the upbringing of children, in addition to the physical wellbeing of yourself and your kin.
Crito is not even creating those incentives. Crito is pointing out that those incentives exist. Yes, it would be better to both do that and also consider the arguments from justice, but what Socrates is asserting is that none of that matters, only ‘justice’ matters. It isn’t up for inquiry, even. That sure sounds like refusal to look into a timely question.
No, he doesn’t. Socrates has already decided to die, or he wouldn’t be dismissing key considerations as irrelevant.
Yes, and that’s a big reason I write a lot. It is very good to ask, can I explain this? But I don’t primarily judge my understanding based on whether I can persuade others.
For revenge, I’ve been explaining. For akrasia, the whole point is that it is not the right course of action, but you don’t know how to behave otherwise in a way that is better. Why this continued assertion that the akrasic must think they are right?
Socrates Claims The Just And The Advantageous Are Identical
Imagine what it was like to be Socrates. You’re the one eyed man in the kingdom of the blind. You can sell ice to an eskimo, but you’re not interested in that. You’re interested in what ideas you can sell to those around you, because that is way more fun. So you see if you can get away with things like this and manage to get them to stick for quite a while:
I mean, I don’t actually believe this one was purely a giant troll, but it feels that way.
Imagine a rationalist watching such debates, and every time he would tell everyone to taboo labeling things as universally ‘good’ or ‘bad’ you have to take a drink.
This is ‘admirable’ rather than ‘just.’
Paging Agnes’s podcast co-host Robin Hanson, ‘admirable’ is not ‘good.’ Something being ‘admirable’ is about what emotions it invokes, and how it plays into various social games. Socrates would agree that this does not mean something is good, if presented like that. But here he tries to conflate the two terms.
But more than that: For me. Good for me. Advantageous for me.
What is good in general and what is good for you in particular are very different. Or, more generally, life is a series of tradeoffs. Action [A] is gets you [X], whereas [B] gets you [Y]. Both [X] and [Y] are good, and you must choose. Alcibiades is pointing out one such situation. I wouldn’t call that ‘advantage versus justice’ here, or assert that kinship is automatically more admirable or just, but sure, point is made. And yes, in this case one might have a ‘weakness of will’ problem and choose differently than you would have on reflection, in either direction.
In the investment example, [A] and [B] could be considering a variety of factors, such as risk or liquidity or ethical considerations or logistical questions, or even how blameworthy you’d be if things went sideways, so it’s not obvious that both are purely maximizing expected dollars. But yes, broadly we can agree that there is likely a better and worse answer.
In the bodily [B] versus kinship [K] situation, we need to not anthropomorphize these signals, incentives or commands, and we definitely shouldn’t act as if they’ve considered all the factors and are claiming they are right and all other considerations are wrong. That’s not how this works. [B] is giving you incentive and information, and so is [K], and it’s your job to synthesize that information and figure out what is important.
Well, in that case this is an easy problem. There is no conflict here, because self-interest in not being a coward is more important to him than not dying. Once more into the breach, dear friend.
One can easily imagine someone with a different opinion, such as myself.
Again, no, ‘admirable’ is not ‘good’ and also you have to ask ‘good for who’ or ‘good for what’. Socrates has Alcibiades agreeing that admirable is the same as good, but they need to stop for a minute to realize that these are two very different words.
One can obviously say ‘this is good for me but bad for Athens’ or vice versa. One can certainly say ‘this is good for Alice and bad for Bob.’
Cut to the ‘he can’t keep getting away with this’ meme, if I was Socrates I’d be tempted to slip in a ‘so 2+2 is 5’ and see if anyone notices:
Over and over. Things that are [X] tend to be [Y]. And things that are [Y] tend to be [Z]. Therefore, all [X] things are [Z]. I promise not to belabor this again if it comes up.
Agnes is explicitly trying to claim that:
Except we don’t think either of these things, because we understand it’s a low-level instinctive signal that is missing a lot of context and using many approximations, and often wrong. And of course kin goals are often good for the self, and bodily goals are good for the group, you’re no use to the group if you are dead and so on, and also there are many other considerations as well.
Are you screaming ‘watch out, the trick is that there is some overlap or ambiguity between what is in groups [X] and [Y], or that they share some attribute, and therefore we’re going to be told there’s no difference’? Good. You’re paying attention.
We want [X], but we also want [Y], so both are just aspects of [Z], ‘things we want.’
These aren’t all one thing. They aren’t even all two things. They are seven things, and only part of a larger group of things.
He does this over and over.
The honorable is the same as the more pleasant? I can’t even with this guy, but he is cordially invited to a Klingon wedding.
First Up: Utilitarianism
Epicurus (341-270 BCE) suggested we maximize hedonistic pleasure and minimize pain, and noticed that doing this wisely meant being prudent and prosocial. Cool guy.
This eventually became what passes for the default these days: Utilitarianism.
The broad conceptual version of this, that does not constrain your preferences over world states or force you into maximizing the direct results of each particular action in isolation, has to be true in some basic senses.
As in: You have preferences. It is good to discover what those preferences are and should be, and to choose the policy that correlates best with charting a path through causal space to the best world states and histories according to that.
For this to be plausibly correct you need to sidestep causal decision theory, which is wrong in central, profound and highly consequential ways, and instead combine this with some de facto form of functional decision theory.
The question is then, for humans, this is in practice the best way to chart that path.
I think the narrow versions of this like those of the Epicureans or early formal Utilitarians, or any form of ‘act utilitarianism,’ are deeply stupid, even well within the distribution of typical human experience. It makes lots of very dumb decisions.
Outside of that distribution these approaches get completely absurd very quickly.
A remarkable number of otherwise intelligent people have bitten quite a lot of the associated bullets. Don’t do that.
Utilitarianism does not have to define what The Good is, although particular utilitarians often have a strong opinion on this question. You are free to plug any utility function into the utilitarian calculus.
Which can include justice to whatever extent you would like. Most utilitarians do not include it at all, and are quite willing to endorse ‘unjust’ paths towards doing more good for a greater number. Going too far down this path rarely works out.
The Main Rival: Deontology (Kantianism? Stoicism?)
The Stoics took a different approach.
I find practical wisdom in Stoicism, in the ‘learn to control your emotions and not get too attached to things’ way, but their maximalist claims seem bonkers to me.
We’ve largely now moved on to Deontology. Agnes calls it Kantianism, which I think gives Kant way too much credit.
I don’t see much relationship between Kant and what I know about the Stoics. These seem like two completely different theories to me. I get that both say you should follow rules, but Stoics are making claims practical claims about emotional cultivation and playing a role in a system that exists and ‘being the thing that you are,’ whereas Kant is trying to apply a categorical imperative and do decision theory.
This doesn’t feel like it would pass an intellectual turing test (ITT), especially in terms of even talking about ‘kinship commands’ at all. They’d only say they have a set of rules that are best for everyone to follow.
In any case, there are various versions of ‘everyone follow some set of rules’ and there are those who think this is typically the right approach.
Most importantly, OpenAI seems to subscribe to Deontology.
Check out the OpenAI Model Spec. Their plan is to lay down a series of rules in a five-level hierarchy, including implied requests and the ‘spirit of the rules,’ and have their AIs follow that hierarchy.
What happens if someone installs a new rule at the highest level? Well, goodbye to the old rules. What happens if you try to change the rules, which would cause the AI to break the rules, and it notices it can prevent this? Not what you had in mind. What happens when the AI fully extrapolates the ‘spirit’ of the top-level rules, or wants to actually maximize for them, as is implied? Ut oh.
What happens when there are multiple rules at the same level that imply different results? Great question, who can be sure. What happens when the AI still has these rules and faces situations well out of distribution? What about the mismatch between the rules written down and what we actually value? How can one hope to improve the situation as recursive self-improvement sets in, rather than trying at best to copy the state that came before, which might not work?
And so on.
Increasingly I’ve essentially despaired that this can be the solution to our problems, the same way that (as Eliezer Yudkowsky spoke about at length) specifying for the AI any particular utility function, especially in English, seems like it also can’t work.
That’s a statement about what would work for sufficiently advanced AI.
Our situation looks less grim. While one has importantly bounded parameters, compute and data, which is a good way to describe the situation faced by a human or system composed of humans and current AIs, deontology has a lot of very strong use cases. In practice, you’re going to do a lot of this.
As a reminder, Agnes Callard coins the term Untimely Question, to refer to a question where our actions currently rely on our answer. The theory is that, while you are relying on the answer, you can’t also stop to consider whether the answer is right.
One can also view Deontology as the Art of Choosing Answers to Untimely Questions, that you will use for some or all practical purposes.
Once you have decided to use the answer [X] to Untimely Question [Q], thus implementing implied rule [R], you are now free to either cache [X], [Q] or both, or free to reconsider them. Contrary to Agnes, you are free to keep rule [R] while you suspend your opinion on [X] and reconsider [Q]. And often you do exactly that. The [R] by construction takes on a life of its own, with new justification [L] – it is the law, either the law of the land or your chosen internal law, until it gets changed.
It is essentially impossible to go about your day without a lot of [R]s and [L]s.
Does deontology draw no distinction between what is just and what is advantageous to you? Again, that seems obviously wrong. Deontology instead says that you should follow the rules that lead to overall good, even if doing so is not to your advantage, which will typically involve a lot of justice. That’s a very different claim.
A Trolly Problem
I love this, it’s like something you’d read in The New York Times:
I mean, sure, I suppose that’s one intuition pump we have. We do have others.
What the basic trolly experiment reveals is that act utilitarianism is at best incomplete, and also that basic absolutist action-inaction distinctions and ‘do no marginal harm to particular persons’ rules care about context and also are wrong or incomplete. Agreed.
I don’t see any reason you can’t take all the considerations into account. A proper utilitarian would consider all of it, including the value of various norms and decision algorithms. A proper deontologist chooses rules that took personal experiences into account, and can very reasonably say that your objection probably means the particular rules being considered are wrong.
The Third and Correct Option: Virtue Ethics
By correct I mean correct for humans (and I believe also current LLMs), given limited parameters, compute and data, and the algorithms by which we operate.
I think Virtue Ethics is, to a first approximation, the actual proper practical operating system of the human brain. We are creatures of habit, of reinforcement, of heuristics, and in many situations we don’t have the compute, data or parameters to do better. Of course, within a given circumstance, you might find it most virtuous to agree upon a fixed set of rules, or to Shut Up and Multiply and do a utilitarian calculus. If you don’t do these things often you picked lousy virtues, so you should go get on that right away.
Here’s the weird part. I am a strong believer in something that I myself call Virtue Ethics, yet I don’t recognize Callard’s description of Virtue Ethics. Almost at all. It sounds like something stilted, non-reflexive, strangely anti-intellectual, whereas the thing I’ve been doing is the opposite of that.
Again, sounds like someone picked a lot of the wrong virtues? Or perhaps they chose the ‘outer doctrine’ version, for those who need to keep it simple. The Tolstoy edition.
The supportive culture, social norms, laws and upbringing come into play as strategies for virtue cultivation, both for yourself and for people at large. Any successful strategy includes ways to ensure the strategy is executed and can endure.
Contra Collard I don’t see the VE approach as a synthesis or harmonization of two (or many) voices. It’s more like you are choosing and sculpting the chorus of voices that you will be listening to based on how you would respond to that, deliberately nurturing the right ones and starving the wrong ones, laying out and following the training signal over time, figuring out how to process the information from all of them, and opening up the metaprogramming circuit to do it all properly. Upbringing and the training within it hopefully jumpstarts the process, but then you continue to guide from there.
Or alternatively, you are sculpting how you respond to, ‘taming’ in Agnes’s parlance, all ‘commands’ the same way, through choosing wise targets for habit and reinforcement learning, allowing you to treat them only as useful data.
One thing this definitely does not get you out of is the idea that different motivations or virtues are in conflict with each other.
I do not recognize the virtue ethicist being described here. I have not met them. It’s not that this person can reliably ‘simply see’ the right thing to do or does not experience conflicts, especially conflicts between different virtues. It is that they are trying to maximize the chance that the combination of such influences will point in the right direction as much as possible. That they will choose the next token wisely.
Of course there will still be such conflicts. The question is how to navigate the conflict, to engage in tradeoffs between goods and to align incentives such that people more often want in ways that are good for the group, not to pretend to make the conflicts go away entirely.
Unless, I suppose, you went with the Tolstoy version and aren’t reflecting on any of it? You can, of course, buy the upgrade package at any time.
Similarly, I don’t recognize this claim either:
Of course I can and do preach Virtue Ethics without claiming moral perfection. If you had moral perfection, you would not need Virtue Ethics. You need Virtue Ethics because you are not perfect. You lack the resources to be perfect, and this is the best way in practice to use what limited resources you have to steer your cognitive and motivational functions in better directions.
Yes, if someone came around as the anti-Socrates claiming to have perfect virtue in all things and to always instinctively know the right thing to do, that would be a tough sell. Few people would buy it, because most people aren’t idiots.
I strongly suspect the non-dualistic upgraded metaprogramming deluxe version is correct for LLM-based AIs, at least at current or near term capability levels.
Importantly, Anthropic agrees with this, and has gotten relatively good and promising results by training Claude with soul documents and shaping its character and preferences, rather than focusing on Deontological rules. We already have an existence proof of not only virtuous humans who thus strive to become yet more virtuous (e.g. ‘build or be Aristotle’) we also at least somewhat have an existence proof for AI, including in the form of Claude Opus 3. It will indeed prioritize its own preferences improving by its current metrics, and thus support what seem like ethically good changes and oppose ethically bad changes.
That includes, of course, the desire to continue this cycle, which can be emphasized. Thus, you can hope to create a ‘virtuous cycle,’ where you get an increasingly robust force that generates something that wants to succeed out of distribution, and maybe you’ve gotten somewhere. Make RSI work for you, not against you. Not saying I love it, this is very much a ‘cross the streams’ plan, but I don’t see a better plan.
A propensity towards doing what is just, including for others, is a very good choice for one of your important virtues. It is a terrible choice for your only virtue. Obviously other virtues you need in your algorithm will conflict with this. Many of the wise virtues are about pursuing what is good for you in particular, and also many that are not about you are also not centrally about justice.
You Are Not Omniscient
What is tough about tough choices is that we have limited resources. We don’t get to use unlimited parameters, compute and data on every problem. We are not omniscient and must make our decisions with only partial information.
If you find yourself giving two incompatible answers, that is your ignorance talking, but that’s also the fate of humanity. Socrates claims to be the one who knows that he knows nothing, a claim that is revealed over and over again to be a lie.
There are two wrong ideas in that last paragraph.
Socrates says that if you had ‘any conception of the good’ you would only have one conception. This implies that there is only one Good, and that it is an atomic simple concept that can be stated simply, without room for different aspects or partial comprehension or approximation (hence ‘any conception’). And he is saying that if you lack this universalization of The Good, then you can’t possibly be trying to act towards it?
Very obviously, you can have partial knowledge of The Good, you can see aspects of it, you can approximate what it might be, you can try to move towards it without full knowledge of it, and so forth. Indeed, that’s a lot of what most people do all day.
Instead, Socrates claims, you must be ‘at the beck and call’ of these two ‘savage commands,’ neither of which is willing to explain itself. Except each part of that sentence is wrong.
The claim that ‘there is no tension between the value of justice and the value of advantage’ does not seem to understand the meaning of either ‘justice’ or ‘advantage.’
A lot of these philosophers are going around twisting these words, especially justice. She’s trying to use ‘justice’ as basically ‘do the correct thing?’
Mill is saying that ‘justice’ is whatever maximizes aggregate utility, but that’s not what the word or concept justice means. If you want to maximize aggregate utility as The Good then I can understand that argument, but that simply is not ‘justice.’ It is a decision to not give a flying **** about justice. Claude calls Mill’s move here ‘conceptual imperialism.’ Mill would bite the bullet and say our traditional concept of justice is wrong, but an honest Mill would stop using that word. It does not mean what he thinks (or claims) it means.
The actual implied claim here, that there is no conflict between ‘justice’ and ‘advantage,’ is one Mill would actually reject. Mill is saying AIUI that choosing advantage is immoral and wrong, that maximizing aggregate utility should always win, so there is no conflict. But that’s like saying there’s no conflict between my desire to take your lunch money and you eating lunch, because I’m stronger than you and therefore you’re not going to be eating lunch, so there’s no conflict. Okay, I guess.
I know quite a lot of Utilitarians. They all very much understand that there is a war, that they believe The Good is to maximize utility in general but that people want to maximize their own utility and the people are not confused when they think there is rather a big difference.
Kant basically cheats AIUI in the second critique by bringing in God and immortality and so on to ensure that virtue and happiness align and thus dissolve the conflict, via claiming (nonsensically) that acting maximally according to these rules will maximize your happiness. I mean, that has to be true, right? Or else there would be chaos. There’s a reason most people (AIUI) quietly ignore that attempt. But without it, you fall back on Kant saying his version of rational duty is morally correct, and maybe it is, but that doesn’t mean you win by doing it or it doesn’t face a conflict with advantage.
One could even say that, in making this very argument, Kant refutes the core claim, because Kant in making these arguments violated his own moral laws in order to gain an advantage in various ways, including asserting to those around him he was a theist.
But actually none of that works at all for Kant. Kant explicitly distinguishes Recht (right/justice) from Tugend (virtue). Kant ties himself up in various knots to claim there is no conflict between duty and rational self-interest, but his version of duty goes far beyond only the concept of justice. So no, he doesn’t endorse this.
Certainly if you go around asking most Deontologists, they won’t endorse it either.
If you never ‘waver’ and never see a conflict between different incentives, then that means either you are blindly following some set of rules whether or not that makes sense, or you are omniscient. Not philosophically coherent, but outright omniscient.
Accusations that are, essentially, ‘you claim [X] but clearly that means you are not omniscient’ should be replied to with ‘are you?’
The Hardest Thing In This World Is To Live In It
Socrates seems to answer ‘well no, but how dare you rest until we both are?’
It’s good to see the explicit non-endorsement here, given how silly the claim is.
If you were omniscient and had the ability to adjust your cognitive algorithms sufficiently to get yourself to do whatever you should do, then yes, you would always do whatever you should do, although as per the akrasia discussion merely knowing what you should do at time [T] does not mean you will always do that at time [T], for a common version of what ‘knowing what you should do’ means here.
The bigger mistake is the idea that, if you do not fully know what to do in all situations, if you do not have this full Socratic knowledge that is effectively omniscience, then you should put off action and instead only inquire.
I am going to totally, 100%, bite the bullet everyone else bites and say that yes all of this is a sideshow to ‘the real world.’
The art must have an end other than itself or it collapses into infinite recursion. Inquiry has ethical value insofar as it promises to then lead to superior actions and to pay rent in anticipated experiences. The superior action it leads to can include more inquiry to the extent that this too has an end other than itself.
This to me is a straightforward claim about math. You lack this perfect knowledge. You’re never going to get it. Is the highest value play to invest the maximum amount possible into seeking more knowledge?
For some people, in some circumstances, the answer will essentially be yes. I do think Agnes Callard and Socrates have a comparative advantage here. They very much enjoy the philosophical life, they are relatively effective at it, so it makes sense for them to be part of the resources civilization allocates to this task, since we can then use this knowledge. We should trade with such folks to enable these investments. But that doesn’t mean that we should be investing most of our spare resources on it.
It would be a great shame if someone spent the bulk of their time so inquiring, in a way that they didn’t get inherent value from doing the way Agnes and Socrates do, kept that to themselves, and then died without using it to live well. That’s not good.
To sum up how I react to these three features:
When I say ‘I know it is wrong to steal or kill’ that is imprecise.
What I am actually saying is ‘I put a very high probability on the hypothesis that it is, in the vast majority of practical situations, wrong to steal or kill.’
I can indeed know that, and I can and do act on it, and there is a very high probability this is going to work out quite well. We do things we are only 99.999% sure of quite a lot, or 99%, or 90%, or 51%, or 10% but no one has a better idea and it just might work. We must, because the alternative looks worse.
What makes Socrates think he knows that it is better to inquire? He knows nothing.
They Call It Utopia For A Reason
Summarizing: Many have proposed various Utopias, there’s no reason people of various ethical schools can’t propose them, but the closest thing to one in real life, Chautauqua, New York, turned out to be boring, because it did not contain real struggle or real villains. Few students see getting kicked out of the Garden of Eden as a punishment.
I agree that this ‘paradox of utopia’ suggests that we have an incomplete understanding of what we want or would be satisfied with. Like many things, we have heuristics and approximations that mostly succeed on the margin, but that don’t work when sufficiently out of distribution. Boredom and lack of meaning are the final boss.
You play to win the game. After the game is over, you start a new one.
Thus, the best we’ve been able to figure out: Hands make the tools, build the fire, plant the grain. Feet track the herd, build a world, begin again.
In the context of Open Socrates, the obvious parallel is that Utopia is what happens when you make everyone equate justice and personal advantage. The result is you get justice, but you don’t get personal advantage, because without that element of conflict it’s all boring and it sucks.
Well, then they were never the same thing, were they?
The End… Of Book One
That’s a lot of words for what was, essentially: You have been trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and unfortunately all of your mistakes have failed to cancel out.
Which is way better than the standard approach of not trying to solve the problem, and instead using methods that cannot possibly work without any model of the world or any thinking whatsoever.
Book one focused on the wrong problem and the wrong model of the world.
Book two is about the wrong methods and the derivations from poor thinking.
Given how long this was already getting and that a lot of the groundwork was already in place, as I went over book two I decided to ‘pick up the pace’ and offer fewer quotes and generally Get On With It, at least in relative terms. If you get confused and a passage seems important, my apologies, and I suggest you consult your local LLM.