Claude, Author of the Humanitas

Linch

In the wee hours of Memorial Day, my friends and I stayed up past 4:30 AM California time to listen to the announcement of Pope Leo’s first encyclical, Magnifica Humanitas, on safeguarding the human person in the time of artificial intelligence. We were excited albeit sleepy, eagerly anticipating the event and upcoming essay by the world’s foremost religious authority on a question so central to our world. Still we were an odd audience for this presentation: none of us are practicing Catholics, and most of us didn’t really know what to expect.

I thought Pope Leo’s own speech was good, and addressed the current moment in AI with some of the seriousness it deserves. I thought the other speeches, including by Chris Olah, were less impressive. But that’s okay, I’m not the target audience!

A specific cardinal’s point struck me, however:

Cardinal Parolin made much of a specific prepositional choice in the subtitle: “sulla custodia della persona umana nel tempo dell’intelligenza artificiale,“ which the live translator translated to something like “on the safeguarding of the human person in the time of AI,” and not “sull’intelligenza artificiale“ – “on AI.”

This was supposed to be a big deal. “In the time of AI” supposedly centers the human person in the theological narrative, while a mere first papal encyclical on AI focuses too much on the technology itself and not on human and societal reactions. A fascinating position!

Though as my subsequent analysis will demonstrate, perhaps a more apt preposition here is “by.” As in, the world’s first papal encyclical written in large part by AI.

My article has the following claims, each of which I hope to convince you of:

Significant fractions of the recent papal encyclical are written by AI. I provide multiple lines of evidence for this.
We can corroborate the vibes and tonal indications with statistical evidence. Phrases and punctuation much more commonly used by AI are much more present in this papal encyclical than past encyclicals.
The best commercially available AI detector, Pangram, notes that some paragraphs are between 40% and 100% AI, while most paragraphs appear to be 0% AI.
1. This is unlikely to be a false positive:
  1. 0% of paragraphs in past encyclicals I backtested are registered as AI.
  2. Pangram in general has a very low false positive rate
This is overall very unlikely to be a translation artifact (including AI translation). We again have multiple lines of evidence:
1. All the most prominent signs of AI I observed in English are preserved verbatim in the Italian version, as well as in other translations.
2. The Italian version of the current encyclical also gets flagged as AI by Pangram (actually more so than the English version), though I’m not aware of academic research or rigorous testing of Pangram’s service when applied to Italian)
3. Backtesting AI translation of past encyclicals get 0% on Pangram
The specific AI used is most likely Claude, judging by both textual and circumstantial evidence.
Different sections of the encyclical have very different rates of apparent AI usage. This indicates to me that some ~~cardinals~~ Vatican officials used AI assistance for this encyclical and many (probably including Pope Leo himself) don’t.
Each individual piece of evidence might be explained away, but the consilience of evidence across multiple angles and sources is in my opinion very hard to dismiss collectively.

Significant fractions of the recent papal encyclical are written by AI

I was initially very excited to read Pope Leo’s first encyclical, a long treatise on maintaining humanity in the age of AI. The intersection between AI and societal response is one of my greatest intellectual and personal interests, and it’s both exciting and a relief for the world’s foremost religious authority to share a substantial interest in my personal and career obsessions.

Nonetheless – as I kept reading – certain lines jumped out at me as too smooth, too triadic, too… inhuman:

“Technology has the power to heal, connect, educate and protect our common home; but it can also divide, exclude and generate new forms of injustice. In the abstract, technology in and of itself is not a solution to humanity’s problems, just as it is not inherently evil. In practice, however, technology is never neutral, because it takes on the characteristics of those who devise, finance, regulate and use it.”
“We must, then, avoid the “Babel syndrome,” namely the idolatry of profit that sacrifices the weak, a uniformity that neutralizes differences, and the pretense that a single language — even a digital one — can translate everything, including the mystery of the person, into data and performance. The risk of dehumanization — of building a future that excludes God and reduces the other to a means — is an ancient and ever-new temptation that today takes on a technical guise.”
“A dialogue with such kinds of knowledge does not diminish the power of the Gospel. On the contrary, it makes it possible to identify with greater clarity what genuinely fosters the lives of individuals and communities.”
“give stable form to this insight at the ecclesial and international levels, while bearing in mind the growing gap between rich and poor countries and the need for policies that genuinely promote more humane living conditions for all.”
“We cannot be satisfied with merely calling for the moralization of machines — the so-called “alignment” of AI with human values — without also having the courage to insist on a further condition: the possibility of openly discussing the ethical frameworks involved and subjecting them to shared standards of social justice”

I read AI-generated text as part of my job regularly, and believe I have acquired a very good intuition for discerning AI-generated text from those by humans, including in formal writing (both academic and otherwise). Still, any individual phrase that seems AI-generated can be a false positive on my end, the result of an oversensitive nose for AI1. However, the sheer density of these phrases and overall tone in specific paragraphs seem implausibly all a random artifact.

Still, I can definitely be wrong here, and you should not believe my gut intuitions or judgments of vibes on authority (“Trust me bro”).

Intuitions, self-proclaimed expert judgments, and loose verbal reasoning can be a good starting point for an investigation, but if we want any confidence in our conclusions, we need to investigate further and more systematically.

Statistical Evidence and Tells

Three common and well-known tells in AI writing — sometimes genuinely deployed by humans but nowhere their profligate use by AI — are the regularity of em-dashes, the high frequency of specific words like “genuinely”, and the tendency to repeatedly invoke tricolons.

Let’s examine each of them in turn:

Em-dashes

The em-dash (“—”) is punctuation that’s by far most strongly associated with AI. It is also used 127 times in Magnifica Humanitas, much more than previous encyclicals.

Magnifica Humanitas: 127 times em-dash, 6 times en-dash (–), the latter all in citations.2

Dilexit Nos (2024): 0 times em-dash, 26 en-dashes, including 2 in citations. Comparatively long document.3

Laudate Deum (2023): 0 times em-dash, 12 times en-dashes. Much shorter. Also not officially an encyclical.

Fratelli Tutti (2020): 0 times em-dash, 46 en-dashes, of which maybe 5-10 are in quotes or citations. Note that this is 50% longer than Magnifica Humanitas.

Laudato Si’ (2016): 0 times em-dash, 25 times en-dash, of which maybe 10 are in citations or quotes (the piece overall appears to have many quotes). Similar length to Magnifica Humanitas

Lumen Fidei (2013): 26 times em-dash, 0 times en-dash. Some em-dashes in citations.

Note that this comparison actually understates the weirdness of the em-dashes in Magnifica Humanitas. For example, in Lumen Fidei, many of the em-dashes function similarly to speech colons in standard English. A typical use looks like

What was handed down by the apostles — as the Second Vatican Council states — “comprises everything that serves to make the people of God live their lives in holiness and increase their faith. In this way the Church, in her doctrine, life and worship, perpetuates and transmits to every generation all that she herself is, all that she believes.”

Using em-dashes as speech colon replacements is moderately common in formal (human) English writing, but essentially absent in LLM-English. I also did not notice em-dashes used this way in Magnifica Humanitas (though with 127 instances, it was annoying to check all of them!)

“Genuinely”

“Genuinely” is a phrase repeatedly used by Anthropic’s model Claude. It is extremely obvious to anybody who regularly uses it. It’s gotten so bad that in leaked system prompts, Anthropic attempted to explicitly forbid Claude to use that word!

1.4 Tone & Formatting
[...]
Claude avoids saying “genuinely”, “honestly”, or “straightforward”. 4

As far as I could tell, this injunction does not and did not work.

Indeed, Anthropic’s own “Claude Constitution”, which many people believe to be substantially AI-assisted, used the phrase “genuinely” 33 times and genuine overall 50 times (inclusive).

How often is the phrase “genuinely” used in Magnifica Humanitas?

Less so than in Anthropic documents, but substantially more than past papal writings.

Specifically “genuinely” was used 9 times and “genuine” overall (inclusive) 22 times in yesterday’s encyclical, compared to 0 and 5 times, respectively, in Dilexit Nos, which is of similar length. Across a number of other encyclicals I scanned, the highest occurrences were 3 and 10, respectively.

These tells are all statistical. Any individual instance of “genuine(ly)” is plausibly a result of normal human communicative intent that is, well, genuine. But the sheer frequency of these occurrences, vastly out of accord with prior norms and normal human speech, is strongly suggestive of synthetic origin.

Is this due to subject matter?

An obvious rejoinder you might have is that word choices in essays are naturally not independent of subject matter. And it sure seems like an encyclical on AI might meditate more about genuineness more than other encyclicals! For example, an essay on AI deepfakes might be much more concerned about what makes a video “genuinely human” than an essay on climate change.

To investigate this hypothesis, I dived specifically into each use of genuinely in this encyclical:

[Par 23] “A dialogue with such kinds of knowledge does not diminish the power of the Gospel. On the contrary, it makes it possible to identify with greater clarity what genuinely fosters the lives of individuals and communities. Following this perspective, Pope Francis [...] recognizes the importance of listening to scientific research and of encouraging a serious and honest debate among experts while welcoming a diversity of opinions.”

“Genuinely” does not seem critical here, nor specific to questions of AI and authenticity.

[Par 35] The establishment of the Pontifical Commission Iustitia et Pax should also be seen in this light as an attempt to give stable form to this insight at the ecclesial and international levels, while bearing in mind the growing gap between rich and poor countries and the need for policies that genuinely promote more humane living conditions for all.

Also not critical here.

[Par 40] In his social Encyclical Caritas in Veritate, Pope Benedict XVI sought to reassess and expand the concept of development presented in Populorum Progressio, interpreting it in light of globalization. He noted that such development should translate into “real growth, of benefit to everyone and genuinely sustainable.” [42]

Appears to be in a quote, so will give it a pass.5

[Par 57] Along with a greater awareness of the value of every human person and their rights, recognition of minority rights has also grown. Yet, there is still a long way to go to ensure that the rights of a great many, namely women, are equally and genuinely guaranteed throughout the world.

Again, does not seem like “genuinely” was endogenously related to the subject matter

[Par 100] The artificial imitation of positive human communication — words of advice, empathy, friendship and even love — can be engaging and at times genuinely helpful. However, for less discerning users, it can also be misleading, creating the illusion of a relationship with a real personal subject. When words are simulated, they do not build genuine relationships, but only their appearance. The artificial imitation of care or support can become particularly risky when it enters contexts where real relationships and emotional bonds are lacking. Here, the danger is not so much that a person may believe they are communicating with another person, but rather that they may gradually lose the very desire to form genuine human connections.

…you get the idea.6

Indeed, of all 9 instances of “genuinely” in the encyclical, only the last use (“When people come to believe that nothing is genuinely true and that principles are hollow words, then the fuse in their hearts is lit for new eruptions of intolerance and aggression.”) seem semantically critical. If we drop that and the Pope Benedict quote we’re left with 7/9 suspicious uses.

Again, to be clear any individual instance is plausibly normal, authentic, genuine. However the statistical pattern of the repeated invocations is quite suspicious!

Is this just a personality quirk of Pope Leo XIV specifically?

Another possibility you might have is that maybe this is just a personality/stylistic quirk of Pope Leo XIV specifically? Maybe he just genuinely likes the word?

Lord knows I too have odd personality quirks in writing, some of which have an unfortunate resemblance to AI.

Ultimately, I think this is plausible but unlikely. First of all, popes don’t typically draft the text of their own encyclicals that much. So it’s unlikely that stylistic quirks as specific as adverbial usage will bleed out to the final drafts as much. In contrast, I’m much more open to higher level constructs like the imagery, themes, or favorite Bible passages being much more prominent in some pope’s encyclicals than others.

Further, the specific phrases used are often next to other suspicious “AI tells” (more on that later).

Unfortunately, I don’t have easy access to many (pre-papacy) writings by Pope Leo to test against this alternative hypothesis. However, I did find Chapter 2 (“The Authority of the Local Prior”) of his 1987 PhD thesis here. In 14 pages (roughly the size of the post you’re reading), the future Pope Leo’s chapter has no uses of “genuine” or “genuinely,” and 0 em-dashes in his own prose.7

(I welcome extensions of my analysis by people with access to the full thesis in print).

Tricolon density

A common mark of LLM writing is the repeated invocation of tricolons: a series of three parallel words, phrases, or clauses used for rhetorical effect.

I noticed quite a few invocations of the tricolons in Magnifica Humanitas. It was especially notable in sections that otherwise had other tells of AI.

Unfortunately, unlike “genuinely” or em-dashes, this is harder to directly observe or baseline, as I can’t use the automatic “find” feature on chrome, it’s annoying to count by hand, and there are numerous edge-cases.

Nonetheless, I attempted to use my AI Agent Claude Code ( Claude Opus 4.7 1M XHigh) to give it a good college try, testing Magnifica Humanitas against 3 encyclicals by Francis, 2 by Benedict, 1 jointly by Benedict and Francis and 1 by Leo XIII (who wrote Rerum Novarum, which the current encyclical on AI is supposedly strongly based on).

Caption: Note the easy and natural use of em-dashes above. This is how the AIs naturally speak!

I think this is partial confirmation of my hypothesis. Strict tricolons seem noticeably more prominent in pope Leo XIV’s writings than that of past popes we tested against.

Unfortunately (for my hypothesis) there is also substantial variation in the encyclicals authored/commissioned by previous popes. In particular, tricolons are much more common in writings by Benedict than by Francis. So this simple test is suggestive but does not rule out normal human variation.

Further, the LLM scan is a rough estimate. There’s inherent subjectivity in a question of triadic markers (unlike more direct vocabulary or punctuation tells). I welcome replication of my attempts here, either via a different AI agent methodology, or (preferably) someone more patient than me willing to manually count and verify this.

Do we have other sources of evidence to look at? Yes, we can use automated AI detectors, specifically Pangram.

Pangram analysis

Pangram is by far the best commercially available AI detector. It is much better than other AI detectors, so much so that other ones are almost useless in comparison. In particular, Pangram optimizes very hard for getting a false positive rate of nearly zero, while being more okay with false negatives.

This means that if you see text online that Pangram flags as AI, you should have very high confidence that it’s AI. In contrast, if you have some text that Pangram flags as 100% human, you should still be appropriately skeptical, especially if you’re otherwise suspicious. So how did yesterday’s encyclical do?

When I pasted the first twenty paragraphs of the encyclical in8, Pangram flags 11% of it as AI.

In particular, Pangram is very suspicious of Paragraphs 7-8.

For what it’s worth, I too was rather suspicious of this section. “It was an impressive feat: a single language, a single technology, a single direction.” sounded just a bit too neat to me.

Spot-checking different sections of the encyclical, we see a repeated pattern.

Some sections register as essentially 0% AI, while others seem much more AI-y.

This indicates to me that some Vatican officials [EDITED: this used to say "cardinals"] who contributed to the encyclical used AI assistance heavily and most (probably including Pope Leo himself) did not. More on this later.

Comparison to other encyclicals

You might naturally be skeptical of Pangram’s analysis here. After all, maybe you haven’t heard of Pangram (or myself) until today. And besides, Pangram’s likely trained on internet data and its main use case is detecting AI in blog posts and social media posts and movie reviews and academic papers and so forth. What evidence do we have that it’s suited for a task as off-distribution as papal encyclicals? Maybe the detector’s just befuddled by unusually pious tokens?

To test this hypothesis, I ran Pangram on the previous 4 encyclicals. The first 20 paragraphs on all of them register as 100% human, all with high confidence:

I also tested writings by Pope Benedict and John Paul in case Pope Francis had an unusually human touch. And against encyclicals by Pope Leos XIII and XII in case Pangram is prejudiced against Leos. All 100% human, as expected.

Comparison to Pope Leo XIV’s speech

I also tested it against a transcript of Pope Leo’s speech announcing yesterday’s encyclical. 100% Human on Pangram. This is evidence that Pope Leo himself and/or his primary speechwriter does not use AI to draft his speeches.

(Incidentally, “genuine(ly)” appears 0 times in the transcript. Though em-dashes appear three times).

Sidebar: Pangram has a very low false positive rate in general

I think the encyclical evidence specifically should be quite convincing. But separately, I also strongly believe Pangram has a very low false positive rate in general. I observed this both in academic research and my own tests:

I’ve tested writings that I’m very confident is not AI (e.g. writings by myself, or from before 2021) dozens if not hundreds of times against Pangram, and repeatedly gotten 100% Human. This indicates to me that their advertised very low false positive rate is real.

In contrast, the false negative rate is much higher: Sometimes I’d ask an AI to read an outline by me and generate a draft, as a test. Pangram only catches that sometimes, though my impression is that they’ve gotten better in the last few months.

So you should generally trust that text that Pangram flags as AI is probably AI, while continuing to maintain some healthy suspicion of text that Pangram flags as not AI.

Probably not a translation artifact

One potential mitigating factor is that this is a translation artifact: (human) senior church officials wrote the original encyclical in their native language (maybe Italian) and the translators, not the writers, were lazy and used substantially AI assistance in translating to English. I cannot rule this hypothesis out but currently think it’s very unlikely.

The same signs of AI I observe in English are essentially preserved verbatim in the Italian version

If the strong AI signs I’ve observed (“tells”) are a result of lazy translators to English, we should expect them to look different in other languages, especially the source language.

To test this, I gave all the tells I noticed in the English encyclical as well as the Italian version of the encyclical to two software agents – Claude Opus 4.7 and ChatGPT 5.5 Pro, arguably the strongest commercially available language models out there – to see whether the tells were preserved in Italian. Both agents believe the tells were preserved verbatim (See Claude screenshot below):

That said, I don’t speak Italian myself, and cannot personally verify the results. I welcome replications. Suspicious readers who speak Italian, and especially Italian native speakers who are familiar enough with how AI sounds in Italian, should try to verify the work for themselves!

The Italian version of the current encyclical also gets flagged as AI by Pangram

If the English Pangram results are due to overly technophiliac or lazy translators of Italian to English, we should expect that the original Italian results will get 0% while the English translations gets flagged. We do not observe this (first twenty paragraphs again).

Instead, the Italian flagged sections appear to be a superset of the sections flagged in English. This would naively indicate that more, not less, of the Italian sections were drafted by AI.

(See further analysis by Daniel Filan)

That said, I don’t know how accurate Italian Pangram is. All the academic research I am aware of on Pangram’s accuracy (and my own experiments) are in English.

Backtesting AI translation of past encyclicals get 0% on Pangram

Another angle on the translation artifact hypothesis: Suppose you 100% organically human-write an article and then use AI to translate it. Does this even get flagged by Pangram?

As far as I can tell, the answer is no! (H/T Daniel Filan for this methodology)

I took a random excerpt of Fratelli Tutti in Italian and asked 3 leading frontier models (Gemini 3.1, ChatGPT 5.5, Claude 4.7) to translate it to English, with web search off. Each translation is different from the official Vatican translation (and from each other).

Pangram reads all three texts as 100% human. (Gemini results, Claude results, ChatGPT results)

This indicates to me further evidence that any Pangram-detected signs of AI in Magnifica Humanitas come from stylistic features that are present across languages, rather than artifacts plausibly introduced by AIs in translation.

The specific AI used is likely Claude

The AI I use the most often for work is Anthropic’s Claude. I have a decent sense of its underlying rhythm, how it argues, and its favorite vocabulary and syntactic choices.9

I believe I identified the same voice of Claude in the recent papal encyclical about safeguarding the human person in the age of artificial intelligence. Somewhat ironic, considering.

Unfortunately, my primary reasons for believing this are somewhat idiosyncratic and inscrutable, and thus more likely to be wrong. And the “Claude authorship” hypothesis is of course overall less important to nail than the “AI authorship” hypothesis, all things considered.

Still, I want to offer some textual evidence favoring Claude over other models10.

Textual

The aforementioned “genuinely” strongly bears the fingerprints of Claude. It is very much the house style of Claude/Anthropic and regularly used in both internal and external Anthropic communications about Claude’s nature and in Claude’s repeated outputs. To be honest, I see “genuinely” so much these days that the word no longer feels like a real word to me and I’ve completely lost the ability to spell the word correctly by myself, never mind appreciating it in context.

Maybe there’s a metaphor in there about authenticity in the age of AI? Nah, I’m probably overthinking it.

Regardless, the repeated uses of genuinely in Magnifica Humanitas is strongly suggestive of Claude-specific fingerprints.

Against ChatGPT

Applying the same analysis against ChatGPT fingerprints gives us a negative result. The words that I most associate with ChatGPT (“delve”, “meticulous,” “tapestry”, “goblins”) over other LLMs all show up zero times in the English encyclical.

This is moderate evidence against significant ChatGPT usage.

I don’t know enough about the tells that are specific to other models (Gemini, Grok, Mistral, DeepSeek, Kimi, etc). I welcome replications!

Different sections of the encyclical have very different rates of apparent AI usage

Significant fractions of the text in both English and Italian are flagged as AI. This varies significantly from section to section and paragraph to paragraph.

Some paragraphs in Pangram, and also by visual inspection, are clearly AI, while others seem clearly not AI.

My understanding is that popes don’t usually draft the majority of the text of their encyclicals themselves.

Between these two facts, this indicates to me that some Vatican officials heavily used AI assistance for this encyclical and many (probably including Pope Leo himself) didn’t.

My tentative hypothesis is that Pope Leo does not approve of the AI usage in encyclicals, and plausibly was not even aware of significant AI usage in his own encyclical! Quite unfortunate if true.

Conclusion

I attempted to provide a number of different angles and evidence for the conclusion that the papal encyclical on safeguarding the human person in the age of AI is actually an article in large part neither on nor in AI, but by AI. Any individual method might be flawed, but I believe the consilience of evidence is very strongly suggestive, perhaps even overwhelming.

Nonetheless, I welcome good-faith debate and corrections. Please feel free to comment with contrary evidence and replications.

We’re soon entering a time of unprecedented danger in the world. Like children with flamethrowers, or Bronze Age peasants attempting to build the Tower of Babel, humanity is messing with forces we cannot hope to understand or control.

In this new age of AI, getting provenance genuinuely right isn’t just a question of human authenticity — it’s a matter of life and death.

Though in previous attempts to validate my intuition systematically, I have a substantially higher false negative rate than false positive rate, suggesting I’m not sensitive enough.

Technically 6 times, but all 6 in citations (eg “cf Neh 2–6, [123] Cf. Dicastery for the Doctrine of the Faith – Dicastery for Culture and Education”)

Unedited AI texts use em-dashes much more often than en-dashes. The former is much more directly a sign of AI than the latter. However, it’s common for people hiding AI writing to change em-dashes to en-dashes. Additionally, the specific differences between em-dashes and en-dashes could just be random variation of specific stylistic preferences/trends between people. Thus, we should probably count em-dashes and en-dashes together in the analysis, though em-dashes over en-dashes is still some (nonzero) evidence.

I asked Claude to review this section. It says the injunction to avoid “genuinely” is no longer in its system prompt as of Opus 4.7 (and has no direct evidence of past prompts), though it believes the previous/github report is credible enough for Opus 4.6.

This is not strictly zero information. Given Claude’s demonstrated predilection for genuinely, we can imagine this specific passage was chosen by Claude to quote.

Interestingly, here the instances of “genuine” (“genuine relationships”, “genuine human connections”) seemed to carry actual semantic content, while “genuinely helpful” did not. So despite its placement in the rest of the paragraph, I don’t think “genuinely” is semantically relevant here.

If you’re very literal about it, there are 2 em-dashes in the chapter. One of them is in a large quote-block by someone else. The other is in the footnote “Robert F. Prevost, O.S.A. — Canon lawyer working in Chulucanas, Peru.” Neither invocations are semantic in the main text, and the latter likely added by an editor.

The whole encyclical is too long for Pangram.

I did not use AI to draft or write any of the text in this post, including for sentences deliberately engineered to sound like AI.

Note that I’m only moderately confident in the presence, and to a lesser degree, the substantial contribution, of Claude’s fingerprints on the recent encyclical. I can’t rule out that other AIs were also separately involved. I’m also only moderately confident in my conclusion that it’s Claude: all the AIs are kinda similar to each other, so differentiating between them is much harder than detecting the presence of AI at all.

The comments are quite dismissive unfortunately

They hated me because I told them the truth.

Have I now become your enemy by telling you the truth?
-Galatians 4:16

One of my favorite verses during this episode, alongside Amos 5:10. And Joshua 8:1a-b, of course.

The Italian version of the current encyclical also gets flagged as AI by Pangram (actually more so than the English version), though I’m not aware of academic research or rigorous testing of Pangram’s service when applied to Italian)

Here's one paper that finds it's reasonably accurate and has a very low FPR

Each of these is light enough to suggest to me less that the passages in question are AI-drafted and more that someone who was drafting them was talking to Claude for feedback, which is slightly different. IE, if you are writing it but asking for feedback, you will constantly get back phrasal suggestions with em dashes in them, tricolons will pass the "did Claude like it" test nicely, etc.

However this is sort of immaterial: I think I agree that this has significant Claude style tells. I am just prone to splitting hairs about where and how leakage happens, and in this specific case I expect someone proofing with and talking a little too much to Claude, and writing in an excessively formal style, as opposed to actually outsourcing the draft, which I would expect to be much more tell-heavy.

I think that might explain the em-dashes and the genuinely but not the triads.

Also I don't want you to take it on faith, but my own self-perception is that I can tell the difference between light AI polish and more substantial AI involvement in writing.

Normalized for length the number of triads is high but not absurd, which would be consistent with human-back-leakage. I would expect it to creep much higher if these were raw LLM-isms. It's not even the highest ranking for (A + B + C) by length, this would be consistent with "at the high end for encyclicals, and the main place it varies is that the author has been accustomed to explicit instead of implicit triads".

But this is relatively minor; if the humans have been using Claude enough to get into em dashes, towards the upper end of triad use for encyclicals, and an outlier level of explicit as opposed to implicit triads, this is not materially different from if Claude actually drafted a few of the paragraphs.

I do note that the length means that it is much more rhetoric-heavy than shorter encyclicals though. That is, much more of it is doing pure persuasion, which is where it's natural to use this construction. At least one of these is probably restating the trinity, for example, which would be bizarre in a to-the-point message. So I am not sure it should be expected that its length-normalized count should be the same as shorter encyclicals

Normalized for length the number of triads is high but not absurd, which would be consistent with human-back-leakage.

True, though it's also consistent with some of the writing being much more AI assisted than others.

You said you "ran Pangram on the previous 4 encyclicals. The first 20 paragraphs on all of them register as 100% human, all with high confidence", but one of the most obvious features you would add to a commercial AI-generated text detector would be searching the web to find similar results before 2022? (I don't know if they do this or not) Otherwise I imagine you would have absurd false positives on old texts or well known texts in general that I don't think users would like. I would be surprised if they weren't doing that. I was asking GPT5.5 about how Pangram possibly works:

If the text is old and famous enough, a detector may classify it as human partly because it has seen it, or seen close variants of it, not because it has discovered some universal essence of human writing.
I think the strongest version of the Pangram comparison would use obscure pre-2022 human religious/institutional texts unlikely to be in training data, ideally matched for genre, language, translation status, and formality. Then Pangram would have less opportunity to win by recognition and more pressure to distinguish authorship style.

Also, although there may be evidence of Claude assistance in the final wording/style, "Claude, author of the Humanitas" is much stronger than the evidence supports; unless you have a way to differentiate normal human writing polished by AI to "I'm the bishop, the pope asked me my help writing an article saying X and Y, please create a few paragraphs for me". Maybe that's exactly your point, but it's not completely clear what you mean when you say that "Claude wrote" something; do you mean "human drafters wrote the substantive content, argument, and theological framing, then used Claude or another LLM for polishing, grammar, phrasing etc, which produced em-dashes, smoother triads, 'genuinely' and things that would be flagged as AI-generated" or do you mean something else? I finished reading your post and I was still confused what you think actually happened, how exactly people used AI (assuming they used it)

I think the strongest version of the Pangram comparison would use obscure pre-2022 human religious/institutional texts unlikely to be in training data, ideally matched for genre, language, translation status, and formality. Then Pangram would have less opportunity to win by recognition and more pressure to distinguish authorship style.

You're welcome to run the test yourself. Or you can send me a Google doc with your 10 preferred documents for me to test if Pangram credits are your bottleneck. :)

Also, although there may be evidence of Claude assistance in the final wording/style, "Claude, author of the Humanitas" is much stronger than the evidence supports;

Yeah to be clear the title was a nod to Pierre Menard, Author of the Quixote. I was trying to have an appropriate balance of engagement and literariness.

but it's not completely clear what you mean when you say that "Claude wrote" something; do you mean "human drafters wrote the substantive content, argument, and theological framing, then used Claude or another LLM for polishing, grammar, phrasing etc, which produced em-dashes, smoother triads, 'genuinely' and things that would be flagged as AI-generated" or do you mean something else? I finished reading your post and I was still confused what you think actually happened, how exactly people used AI (assuming they used it)

My current guess is somewhere in between, closer to the latter end.

I obviously don't have insights or special knowledge into the process.

My current hypothesis is that I believe human drafters wrote most of the substantiative content, arguments, and theological framing. I believe that the level of AI involvement is well beyond what I'd consider "light-touch" AI involvement like fixing typos, light brushups, some assistance with translation. Stuff like the triads don't come in if your pre-AI drafts is almost ready and you just used AI for one or two final passes.

The fact that these are the things we can observe also means that there's probably a bunch of AI involvement beyond we could observe: we see the shadows but not necessarily what leads to them.

Still I think most (all?) of core theological and philosophical points and the arguments are written by humans, not AIs.

(One thing that confused a bit is the analysis in Par 8 of the Nehemiah story, which seems a bit tortured to me if I understand the standard reading correctly. But theology isn't exactly my expertise or forte).

Couldn't the explanation be that the type of writing used in encyclicals follows the same aesthetics used in the training data? I'm thinking of this article in particular:

I'm Kenyan. I Don't Write Like ChatGPT. ChatGPT Writes Like Me.

There’s a growing community (cult?) of self-proclaimed AI detectives, who have designed and detailed what they consider tells, and armed their followers with a checklist of robotic tells. Does a piece of text use words like ‘furthermore’, ‘moreover’, ‘consequently’, ‘otherwise’ or ‘thusly’? Does it build its arguments using perfectly parallel structures, such as the classic “It is not only X, but also Y”? Does it arrange its key points into neat, logical triplets for maximum rhetorical impact?
To these detectives of digital inauthenticity, I say: Friend, welcome to a typical Tuesday in a Kenyan classroom, boardroom, or intra-office Teams chat. The very things you identify as the fingerprints of the machine are, in fact, the fossil records of our education.

You know that article itself is heavily AI-assisted, right?

I know you're very confident in that claim, based on:

It gets 100% on Pangram and the tone and rhythm just seems way too AI-like.

and

I asked Claude to come up with some Kenyan essays from that time period that has this ChatGPT rhythm and Claude was unable to.

As well as what looks like your personal judgment of some unspecified other of his essays you've read.

According to Gemini 3.1 pro:

During training, Pangram feeds its system a massive dataset of publicly licensed, human-written documents. However, instead of just comparing human text to random AI text, Pangram generates a "synthetic mirror." It asks an AI (like GPT-4 or Claude) to rewrite the human example so it closely matches the original content.
By forcing the neural network to differentiate between a human essay and an AI's exact replication of that essay, the model learns the incredibly subtle stylistic choices that AI makes consistently, rather than just relying on broad topic differences.

AI is trained on a biased sample of human writing. It develops stylistic quirks that come through in "match the style" prompts relative to a separate, biased sample of human writing used to train Pangram.

If a subset of authors happen to share those stylistic quirks, as Olang' argues, then Pangram may simply not have the capacity to distinguish these "hard cases" and classify them systematically as AI. This would be similar to the problems people have noted where AIs develop racist biases due to the distribution of their training data -- perhaps being more likely to classify the same action as a crime when it's performed by a black person, for example.

Furthermore, if you then turn around and submit pre-AI texts to Pangram to determine whether it can successfully classify them, you're highly likely to be feeding in precisely its own training data. That's the only guaranteed 100% clean data Pangram could use for training. It's no surprise that it reliably classifies it as 100% human. What would be more compelling would be if you put in pre-LLM text that had never been published. But unfortunately, we have no Papal encyclicals, and likely no convenient Kenyan essays, with which to perform that experiment.

Let's accept that Pangram isn't just a random number generator and actually provides some evidence, even for hard cases. Then what do we make of it when different subsets of the original text receive 100% human or 100% AI scores? What if one bit is a subset of the other with a different classification? That's what we're seeing in the case of Olang's essay.

Let's accept that Pangram isn't just a random number generator and actually provides some evidence, even for hard cases. Then what do we make of it when different subsets of the original text receive 100% human or 100% AI scores? What if one bit is a subset of the other with a different classification? That's what we're seeing in the case of Olang's essay.

We know Pangram has much more false negatives than false positives, I think the parsimonious explanation is that the entire thing or almost the entire thing is AI and they're false negatives.

As well as what looks like your personal judgment of some unspecified other of his essays you've read.

Literally just look at his other essays from before ~2021 vs after ~2023! You don't have to take my word for it!

We know Pangram has much more false negatives than false positives, I think the parsimonious explanation is that the entire thing or almost the entire thing is AI and they're false negatives.

High profile authors who share AI's stylistic quirks are much more likely to be investigated. Therefore, the base rate at which Pangram will flag investigated text as AI generated will be much higher than the base rate for a random sample of text. In other words, the evidence provided by Pangram is already more or less accounted for by the mere fact you thought to submit the essay to it.

What do you think is the base rate of AI usage in say randomly selected substack posts with >1k views?

Why is that relevant?

Therefore, the base rate at which Pangram will flag investigated text as AI generated will be much higher than the base rate for a random sample of text. In other words, the evidence provided by Pangram is already more or less accounted for by the mere fact you thought to submit the essay to it.

I think the "much higher" claim only makes sense if your prior for random samples of 2026 internet text is much lower than mine.

Can you give toy numbers for "much higher,", "base rate at which Pangram will flag investigated text as AI generated," and "base rate for a random sample of text."

Because as stated, it sounds like you're making a technically plausible analytic point that in reality nonetheless has effectively zero relevance to the question we're talking about.

I know that Pangram flags it, but do we have evidence that rules out "the guy wrote it himself but AI is just trained on that style"?

If this were true, you'd expect pre-2020 encyclicals to sometimes be flagged as AI-written, which they never are in my experience.

Also FYI Pangram thinks the text you quoted is 100% human-written.

I also found that this chunk was flagged as 100% human-generated by Pangram.

And that one moment wasn’t an aberration. Every English class and every homework assignment for three years prior (and more, it could be argued) was specifically designed to get the teacher marking your composition to award you a mark as close as possible to the maximum of 40. Scored a 38/40? Beloved, whoever is marking your paper has deemed you worthy of breathing the same air as Malkiat Singh.

It’s a memory that’s hard to write over - the prompt, written in the looping, immaculate cursive of the teacher on the blackboard: “A holiday I will never forget.” Or perhaps it was one of those that demanded that you end the entire composition with, “…and that’s when I woke up and realised it was just a dream.” The topic was almost irrelevant. The real test was the execution.

I'm out of free credits, so can't check other chunks. Would be interesting to get a hierarchical breakdown of Pangram scores on various scales of the essay.

Can you clarify what your experience is with checking pre-2020 encyclicals for evidence of being AI-written?

I've copy-pasted various chunks of Pope Francis encyclicals into Pangram, and it's never flagged anything as AI-written. I haven't been that systematic about doing so I'm afraid, but here are some ones I'm doing right now (note that for all of these I'm using the Italian text under the assumption that that's the original, and for ease of comparison with my post on the same topic):

A big chunk of Caritas in Veritate. Written in 2009 by Benedict XVI, Pangram thinks it's 100% human-written.
A chunk of Spe Salvi. Written in 2007 by Benedict XVI, Pangram says 100% human.
A chunk of Lumen Fidei. Written in 2013 by Francis, Pangram says 100% human.
A chunk of Fratelli Tutti. Written in 2020 by Francis, Pangram says 100% human.

I'm not entirely sure why this post is "Personal Blog" btw.

The encyclical is written in Latin.

What you are looking at is one of its official translations (the English one or the Italian one).

(The preliminary work on its drafts can be done in any language, then the official Latin version is produced, then it is officially translated into various languages, or at least this is how it is supposed to go.

So what you see here is that they are now using AI to help them produce the official translations. But one might want to also analyse the Latin original in this sense.)

FWIW the level of Latin ability among Church officials is now low enough that these things are not initially drafted in Latin. Source: (a) a background knowledge of the Catholic hierarchy + the state of Latin education and (b) the fact that they didn't release the Latin version, which they presumably would have done if they had had it.

Yes, looks like they are going to polish the Latin one for a while, before it is ready to become the official canonical source…

Thanks!!!

If the original was written first in Latin, why is it released in other languages first but the Latin version not even up online?

I also provide some evidence against the translation artifacts theory above though I did not test Latin specifically (because it wasn't up there).

If that’s true, this would be highly irregular.

Normally, they are not supposed to release translations until the Latin original is officially “promulgated” (I think; I am not an authority).

But yeah, you are right. They don’t do it the old way now. Instead, they are going to polish the Latin one for a while. So weird…

No Latin version up.

My own guess is that the starting language is part Italian and part English fwiw.

Skimming this article, it looks like not long after Vatican II people started drafting the encyclicals in their native languages, and finishing up the Latin version last.

https://www.paideiainstitute.org/leo_xiv_and_latin_expect_improvement_but_not_much

Actually this is really good news! If the final official version is written last, the Church still has a chance to revise their policies and rewrite the sections touched by AI.

But I’ll let you in on a little secret. This is partially deliberate in order to allow time to gauge an encyclical’s acceptance and ascertain if any tweaks are necessary. By not having the official Latin version published right away, we can receive feedback from Vatican watchers, scientists, and theological experts and amend the Latin version as much as we want before publication.
That alone should tell you that the Latin version, though official, is not the base text anymore. Whereas a hundred years ago the Latin version was produced first and then the vernaculars were rendered as translations of it, now the Latin version is the last to be completed and is essentially a fine-tuned and carefully raked Latin translation of the Italian.

In this case if I were a Catholic, my ask for the Church would be to do substantially more than the usual amount of amendments.

Not being a Catholic, it feels presumptuous to ask, but my guess is that the median Catholic intellectual would not be happy to learn about the degree of AI involvement.

Thanks, it’s enlightening.

So the “old order” keeps deteriorating…

"[Like] Bronze Age peasants attempting to build the Tower of Babel, humanity is messing with forces we cannot hope to understand or control"

To be fair, Bronze Age peasants built some pretty impressive towers, and judging purely from the robustness and longevity of their structures compared to the robustness and longevity of modern buildings, one might suppose them to have had a better understanding of the forces involved than we do...

I'm alluding to the specific Tower of Babel story, well-known in the Western canon and alluded to multiple times in the Humanitas itself.

The comments are quite dismissive unfortunately

They hated me because I told them the truth.

Have I now become your enemy by telling you the truth?
-Galatians 4:16

One of my favorite verses during this episode, alongside Amos 5:10. And Joshua 8:1a-b, of course.

The Italian version of the current encyclical also gets flagged as AI by Pangram (actually more so than the English version), though I’m not aware of academic research or rigorous testing of Pangram’s service when applied to Italian)

Here's one paper that finds it's reasonably accurate and has a very low FPR

I think that might explain the em-dashes and the genuinely but not the triads.

Also I don't want you to take it on faith, but my own self-perception is that I can tell the difference between light AI polish and more substantial AI involvement in writing.

Normalized for length the number of triads is high but not absurd, which would be consistent with human-back-leakage.

True, though it's also consistent with some of the writing being much more AI assisted than others.

If the text is old and famous enough, a detector may classify it as human partly because it has seen it, or seen close variants of it, not because it has discovered some universal essence of human writing.
I think the strongest version of the Pangram comparison would use obscure pre-2022 human religious/institutional texts unlikely to be in training data, ideally matched for genre, language, translation status, and formality. Then Pangram would have less opportunity to win by recognition and more pressure to distinguish authorship style.

I think the strongest version of the Pangram comparison would use obscure pre-2022 human religious/institutional texts unlikely to be in training data, ideally matched for genre, language, translation status, and formality. Then Pangram would have less opportunity to win by recognition and more pressure to distinguish authorship style.

You're welcome to run the test yourself. Or you can send me a Google doc with your 10 preferred documents for me to test if Pangram credits are your bottleneck. :)

Also, although there may be evidence of Claude assistance in the final wording/style, "Claude, author of the Humanitas" is much stronger than the evidence supports;

Yeah to be clear the title was a nod to Pierre Menard, Author of the Quixote. I was trying to have an appropriate balance of engagement and literariness.

but it's not completely clear what you mean when you say that "Claude wrote" something; do you mean "human drafters wrote the substantive content, argument, and theological framing, then used Claude or another LLM for polishing, grammar, phrasing etc, which produced em-dashes, smoother triads, 'genuinely' and things that would be flagged as AI-generated" or do you mean something else? I finished reading your post and I was still confused what you think actually happened, how exactly people used AI (assuming they used it)

My current guess is somewhere in between, closer to the latter end.

I obviously don't have insights or special knowledge into the process.

The fact that these are the things we can observe also means that there's probably a bunch of AI involvement beyond we could observe: we see the shadows but not necessarily what leads to them.

Couldn't the explanation be that the type of writing used in encyclicals follows the same aesthetics used in the training data? I'm thinking of this article in particular:

I'm Kenyan. I Don't Write Like ChatGPT. ChatGPT Writes Like Me.

There’s a growing community (cult?) of self-proclaimed AI detectives, who have designed and detailed what they consider tells, and armed their followers with a checklist of robotic tells. Does a piece of text use words like ‘furthermore’, ‘moreover’, ‘consequently’, ‘otherwise’ or ‘thusly’? Does it build its arguments using perfectly parallel structures, such as the classic “It is not only X, but also Y”? Does it arrange its key points into neat, logical triplets for maximum rhetorical impact?
To these detectives of digital inauthenticity, I say: Friend, welcome to a typical Tuesday in a Kenyan classroom, boardroom, or intra-office Teams chat. The very things you identify as the fingerprints of the machine are, in fact, the fossil records of our education.

You know that article itself is heavily AI-assisted, right?

I know you're very confident in that claim, based on:

It gets 100% on Pangram and the tone and rhythm just seems way too AI-like.

and

I asked Claude to come up with some Kenyan essays from that time period that has this ChatGPT rhythm and Claude was unable to.

As well as what looks like your personal judgment of some unspecified other of his essays you've read.

According to Gemini 3.1 pro:

During training, Pangram feeds its system a massive dataset of publicly licensed, human-written documents. However, instead of just comparing human text to random AI text, Pangram generates a "synthetic mirror." It asks an AI (like GPT-4 or Claude) to rewrite the human example so it closely matches the original content.
By forcing the neural network to differentiate between a human essay and an AI's exact replication of that essay, the model learns the incredibly subtle stylistic choices that AI makes consistently, rather than just relying on broad topic differences.

Let's accept that Pangram isn't just a random number generator and actually provides some evidence, even for hard cases. Then what do we make of it when different subsets of the original text receive 100% human or 100% AI scores? What if one bit is a subset of the other with a different classification? That's what we're seeing in the case of Olang's essay.

We know Pangram has much more false negatives than false positives, I think the parsimonious explanation is that the entire thing or almost the entire thing is AI and they're false negatives.

As well as what looks like your personal judgment of some unspecified other of his essays you've read.

Literally just look at his other essays from before ~2021 vs after ~2023! You don't have to take my word for it!

We know Pangram has much more false negatives than false positives, I think the parsimonious explanation is that the entire thing or almost the entire thing is AI and they're false negatives.

What do you think is the base rate of AI usage in say randomly selected substack posts with >1k views?

Why is that relevant?

Therefore, the base rate at which Pangram will flag investigated text as AI generated will be much higher than the base rate for a random sample of text. In other words, the evidence provided by Pangram is already more or less accounted for by the mere fact you thought to submit the essay to it.

I think the "much higher" claim only makes sense if your prior for random samples of 2026 internet text is much lower than mine.

Can you give toy numbers for "much higher,", "base rate at which Pangram will flag investigated text as AI generated," and "base rate for a random sample of text."

Because as stated, it sounds like you're making a technically plausible analytic point that in reality nonetheless has effectively zero relevance to the question we're talking about.

I know that Pangram flags it, but do we have evidence that rules out "the guy wrote it himself but AI is just trained on that style"?

If this were true, you'd expect pre-2020 encyclicals to sometimes be flagged as AI-written, which they never are in my experience.

Also FYI Pangram thinks the text you quoted is 100% human-written.

I also found that this chunk was flagged as 100% human-generated by Pangram.

And that one moment wasn’t an aberration. Every English class and every homework assignment for three years prior (and more, it could be argued) was specifically designed to get the teacher marking your composition to award you a mark as close as possible to the maximum of 40. Scored a 38/40? Beloved, whoever is marking your paper has deemed you worthy of breathing the same air as Malkiat Singh.

It’s a memory that’s hard to write over - the prompt, written in the looping, immaculate cursive of the teacher on the blackboard: “A holiday I will never forget.” Or perhaps it was one of those that demanded that you end the entire composition with, “…and that’s when I woke up and realised it was just a dream.” The topic was almost irrelevant. The real test was the execution.

I'm out of free credits, so can't check other chunks. Would be interesting to get a hierarchical breakdown of Pangram scores on various scales of the essay.

Can you clarify what your experience is with checking pre-2020 encyclicals for evidence of being AI-written?

A big chunk of Caritas in Veritate. Written in 2009 by Benedict XVI, Pangram thinks it's 100% human-written.
A chunk of Spe Salvi. Written in 2007 by Benedict XVI, Pangram says 100% human.
A chunk of Lumen Fidei. Written in 2013 by Francis, Pangram says 100% human.
A chunk of Fratelli Tutti. Written in 2020 by Francis, Pangram says 100% human.

I'm not entirely sure why this post is "Personal Blog" btw.

The encyclical is written in Latin.

What you are looking at is one of its official translations (the English one or the Italian one).

So what you see here is that they are now using AI to help them produce the official translations. But one might want to also analyse the Latin original in this sense.)

Yes, looks like they are going to polish the Latin one for a while, before it is ready to become the official canonical source…

Thanks!!!

If the original was written first in Latin, why is it released in other languages first but the Latin version not even up online?

I also provide some evidence against the translation artifacts theory above though I did not test Latin specifically (because it wasn't up there).

If that’s true, this would be highly irregular.

Normally, they are not supposed to release translations until the Latin original is officially “promulgated” (I think; I am not an authority).

But yeah, you are right. They don’t do it the old way now. Instead, they are going to polish the Latin one for a while. So weird…

No Latin version up.

My own guess is that the starting language is part Italian and part English fwiw.

Skimming this article, it looks like not long after Vatican II people started drafting the encyclicals in their native languages, and finishing up the Latin version last.

https://www.paideiainstitute.org/leo_xiv_and_latin_expect_improvement_but_not_much

Actually this is really good news! If the final official version is written last, the Church still has a chance to revise their policies and rewrite the sections touched by AI.

But I’ll let you in on a little secret. This is partially deliberate in order to allow time to gauge an encyclical’s acceptance and ascertain if any tweaks are necessary. By not having the official Latin version published right away, we can receive feedback from Vatican watchers, scientists, and theological experts and amend the Latin version as much as we want before publication.
That alone should tell you that the Latin version, though official, is not the base text anymore. Whereas a hundred years ago the Latin version was produced first and then the vernaculars were rendered as translations of it, now the Latin version is the last to be completed and is essentially a fine-tuned and carefully raked Latin translation of the Italian.

In this case if I were a Catholic, my ask for the Church would be to do substantially more than the usual amount of amendments.

Not being a Catholic, it feels presumptuous to ask, but my guess is that the median Catholic intellectual would not be happy to learn about the degree of AI involvement.

Thanks, it’s enlightening.

So the “old order” keeps deteriorating…

"[Like] Bronze Age peasants attempting to build the Tower of Babel, humanity is messing with forces we cannot hope to understand or control"

I'm alluding to the specific Tower of Babel story, well-known in the Western canon and alluded to multiple times in the Humanitas itself.

118

Claude, Author of the Humanitas

118

Significant fractions of the recent papal encyclical are written by AI

Statistical Evidence and Tells

Em-dashes

“Genuinely”

How often is the phrase “genuinely” used in Magnifica Humanitas?

Is this due to subject matter?

Is this just a personality quirk of Pope Leo XIV specifically?

Tricolon density

Pangram analysis

Comparison to other encyclicals

Comparison to Pope Leo XIV’s speech

Sidebar: Pangram has a very low false positive rate in general

Probably not a translation artifact

The same signs of AI I observe in English are essentially preserved verbatim in the Italian version

The Italian version of the current encyclical also gets flagged as AI by Pangram

Backtesting AI translation of past encyclicals get 0% on Pangram

The specific AI used is likely Claude

Textual

Against ChatGPT

Different sections of the encyclical have very different rates of apparent AI usage

Conclusion

118

118