The Italian version of the current encyclical also gets flagged as AI by Pangram (actually more so than the English version), though I’m not aware of academic research or rigorous testing of Pangram’s service when applied to Italian)
Here's one paper that finds it's reasonably accurate and has a very low FPR
Each of these is light enough to suggest to me less that the passages in question are AI-drafted and more that someone who was drafting them was talking to Claude for feedback, which is slightly different. IE, if you are writing it but asking for feedback, you will constantly get back phrasal suggestions with em dashes in them, tricolons will pass the "did Claude like it" test nicely, etc.
However this is sort of immaterial: I think I agree that this has significant Claude style tells. I am just prone to splitting hairs about where and how leakage happens, and in this specific case I expect someone proofing with and talking a little too much to Claude, and writing in an excessively formal style, as opposed to actually outsourcing the draft, which I would expect to be much more tell-heavy.
Normalized for length the number of triads is high but not absurd, which would be consistent with human-back-leakage. I would expect it to creep much higher if these were raw LLM-isms. It's not even the highest ranking for (A + B + C) by length, this would be consistent with "at the high end for encyclicals, and the main place it varies is that the author has been accustomed to explicit instead of implicit triads".
But this is relatively minor; if the humans have been using Claude enough to get into em dashes, towards the upper end of triad use for encyclicals, and an outlier level of explicit as opposed to implicit triads, this is not materially different from if Claude actually drafted a few of the paragraphs.
I do note that the length means that it is much more rhetoric-heavy than shorter encyclicals though. That is, much more of it is doing pure persuasion, which is where it's natural to use this construction. At least one of these is probably restating the trinity, for example, which would be bizarre in a to-the-point message. So I am not sure it should be expected that its length-normalized count should be the same as shorter encyclicals
Normalized for length the number of triads is high but not absurd, which would be consistent with human-back-leakage.
True, though it's also consistent with some of the writing being much more AI assisted than others.
Couldn't the explanation be that the type of writing used in encyclicals follows the same aesthetics used in the training data? I'm thinking of this article in particular:
I'm Kenyan. I Don't Write Like ChatGPT. ChatGPT Writes Like Me.
There’s a growing community (cult?) of self-proclaimed AI detectives, who have designed and detailed what they consider tells, and armed their followers with a checklist of robotic tells. Does a piece of text use words like ‘furthermore’, ‘moreover’, ‘consequently’, ‘otherwise’ or ‘thusly’? Does it build its arguments using perfectly parallel structures, such as the classic “It is not only X, but also Y”? Does it arrange its key points into neat, logical triplets for maximum rhetorical impact?
To these detectives of digital inauthenticity, I say: Friend, welcome to a typical Tuesday in a Kenyan classroom, boardroom, or intra-office Teams chat. The very things you identify as the fingerprints of the machine are, in fact, the fossil records of our education.
If this were true, you'd expect pre-2020 encyclicals to sometimes be flagged as AI-written, which they never are in my experience.
Also FYI Pangram thinks the text you quoted is 100% human-written.
Can you clarify what your experience is with checking pre-2020 encyclicals for evidence of being AI-written?
I've copy-pasted various chunks of Pope Francis encyclicals into Pangram, and it's never flagged anything as AI-written. I haven't been that systematic about doing so I'm afraid, but here are some ones I'm doing right now (note that for all of these I'm using the Italian text under the assumption that that's the original, and for ease of comparison with my post on the same topic):
I also have some examples here:
I also found that this chunk was flagged as 100% human-generated by Pangram.
And that one moment wasn’t an aberration. Every English class and every homework assignment for three years prior (and more, it could be argued) was specifically designed to get the teacher marking your composition to award you a mark as close as possible to the maximum of 40. Scored a 38/40? Beloved, whoever is marking your paper has deemed you worthy of breathing the same air as Malkiat Singh.
It’s a memory that’s hard to write over - the prompt, written in the looping, immaculate cursive of the teacher on the blackboard: “A holiday I will never forget.” Or perhaps it was one of those that demanded that you end the entire composition with, “…and that’s when I woke up and realised it was just a dream.” The topic was almost irrelevant. The real test was the execution.
I'm out of free credits, so can't check other chunks. Would be interesting to get a hierarchical breakdown of Pangram scores on various scales of the essay.
I know you're very confident in that claim, based on:
It gets 100% on Pangram and the tone and rhythm just seems way too AI-like.
and
I asked Claude to come up with some Kenyan essays from that time period that has this ChatGPT rhythm and Claude was unable to.
As well as what looks like your personal judgment of some unspecified other of his essays you've read.
According to Gemini 3.1 pro:
During training, Pangram feeds its system a massive dataset of publicly licensed, human-written documents. However, instead of just comparing human text to random AI text, Pangram generates a "synthetic mirror." It asks an AI (like GPT-4 or Claude) to rewrite the human example so it closely matches the original content.
By forcing the neural network to differentiate between a human essay and an AI's exact replication of that essay, the model learns the incredibly subtle stylistic choices that AI makes consistently, rather than just relying on broad topic differences.
AI is trained on a biased sample of human writing. It develops stylistic quirks that come through in "match the style" prompts relative to a separate, biased sample of human writing used to train Pangram.
If a subset of authors happen to share those stylistic quirks, as Olang' argues, then Pangram may simply not have the capacity to distinguish these "hard cases" and classify them systematically as AI. This would be similar to the problems people have noted where AIs develop racist biases due to the distribution of their training data -- perhaps being more likely to classify the same action as a crime when it's performed by a black person, for example.
Furthermore, if you then turn around and submit pre-AI texts to Pangram to determine whether it can successfully classify them, you're highly likely to be feeding in precisely its own training data. That's the only guaranteed 100% clean data Pangram could use for training. It's no surprise that it reliably classifies it as 100% human. What would be more compelling would be if you put in pre-LLM text that had never been published. But unfortunately, we have no Papal encyclicals, and likely no convenient Kenyan essays, with which to perform that experiment.
Let's accept that Pangram isn't just a random number generator and actually provides some evidence, even for hard cases. Then what do we make of it when different subsets of the original text receive 100% human or 100% AI scores? What if one bit is a subset of the other with a different classification? That's what we're seeing in the case of Olang's essay.
Let's accept that Pangram isn't just a random number generator and actually provides some evidence, even for hard cases. Then what do we make of it when different subsets of the original text receive 100% human or 100% AI scores? What if one bit is a subset of the other with a different classification? That's what we're seeing in the case of Olang's essay.
We know Pangram has much more false negatives than false positives, I think the parsimonious explanation is that the entire thing or almost the entire thing is AI and they're false negatives.
As well as what looks like your personal judgment of some unspecified other of his essays you've read.
Literally just look at his other essays from before ~2021 vs after ~2023! You don't have to take my word for it!
We know Pangram has much more false negatives than false positives, I think the parsimonious explanation is that the entire thing or almost the entire thing is AI and they're false negatives.
High profile authors who share AI's stylistic quirks are much more likely to be investigated. Therefore, the base rate at which Pangram will flag investigated text as AI generated will be much higher than the base rate for a random sample of text. In other words, the evidence provided by Pangram is already more or less accounted for by the mere fact you thought to submit the essay to it.
I know that Pangram flags it, but do we have evidence that rules out "the guy wrote it himself but AI is just trained on that style"?
Here's a subset of my evidence here:
https://www.lesswrong.com/posts/3LcyoqNTJuCZ65MbL?commentId=TfeGhHBh35rffdo6W
The encyclical is written in Latin.
What you are looking at is one of its official translations (the English one or the Italian one).
(The preliminary work on its drafts can be done in any language, then the official Latin version is produced, then it is officially translated into various languages, or at least this is how it is supposed to go.
So what you see here is that they are now using AI to help them produce the official translations. But one might want to also analyse the Latin original in this sense.)
FWIW the level of Latin ability among Church officials is now low enough that these things are not initially drafted in Latin. Source: (a) a background knowledge of the Catholic hierarchy + the state of Latin education and (b) the fact that they didn't release the Latin version, which they presumably would have done if they had had it.
Yes, looks like they are going to polish the Latin one for a while, before it is ready to become the official canonical source…
Thanks!!!
If the original was written first in Latin, why is it released in other languages first but the Latin version not even up online?
I also provide some evidence against the translation artifacts theory above though I did not test Latin specifically (because it wasn't up there).
If that’s true, this would be highly irregular.
Normally, they are not supposed to release translations until the Latin original is officially “promulgated” (I think; I am not an authority).
But yeah, you are right. They don’t do it the old way now. Instead, they are going to polish the Latin one for a while. So weird…
No Latin version up.
My own guess is that the starting language is part Italian and part English fwiw.
Skimming this article, it looks like not long after Vatican II people started drafting the encyclicals in their native languages, and finishing up the Latin version last.
https://www.paideiainstitute.org/leo_xiv_and_latin_expect_improvement_but_not_much
Actually this is really good news! If the final official version is written last, the Church still has a chance to revise their policies and rewrite the sections touched by AI.
But I’ll let you in on a little secret. This is partially deliberate in order to allow time to gauge an encyclical’s acceptance and ascertain if any tweaks are necessary. By not having the official Latin version published right away, we can receive feedback from Vatican watchers, scientists, and theological experts and amend the Latin version as much as we want before publication.
That alone should tell you that the Latin version, though official, is not the base text anymore. Whereas a hundred years ago the Latin version was produced first and then the vernaculars were rendered as translations of it, now the Latin version is the last to be completed and is essentially a fine-tuned and carefully raked Latin translation of the Italian.
In this case if I were a Catholic, my ask for the Church would be to do substantially more than the usual amount of amendments.
Not being a Catholic, it feels presumptuous to ask, but my guess is that the median Catholic intellectual would not be happy to learn about the degree of AI involvement.
In the wee hours of Memorial Day, my friends and I stayed up past 4:30 AM California time to listen to the announcement of Pope Leo’s first encyclical, Magnifica Humanitas, on safeguarding the human person in the time of artificial intelligence. We were excited albeit sleepy, eagerly anticipating the event and upcoming essay by the world’s foremost religious authority on a question so central to our world. Still we were an odd audience for this presentation: none of us are practicing Catholics, and most of us didn’t really know what to expect.
I thought Pope Leo’s own speech was good, and addressed the current moment in AI with some of the seriousness it deserves. I thought the other speeches, including by Chris Olah, were less impressive. But that’s okay, I’m not the target audience!
A specific cardinal’s point struck me, however:
Cardinal Parolin made much of a specific prepositional choice in the subtitle: “sulla custodia della persona umana nel tempo dell’intelligenza artificiale,“ which the live translator translated to something like “on the safeguarding of the human person in the time of AI,” and not “sull’intelligenza artificiale“ – “on AI.”
This was supposed to be a big deal. “In the time of AI” supposedly centers the human person in the theological narrative, while a mere first papal encyclical on AI focuses too much on the technology itself and not on human and societal reactions. A fascinating position!
Though as my subsequent analysis will demonstrate, perhaps a more apt preposition here is “by.” As in, the world’s first papal encyclical written in large part by AI.
My article has the following claims, each of which I hope to convince you of:
Significant fractions of the recent papal encyclical are written by AI
I was initially very excited to read Pope Leo’s first encyclical, a long treatise on maintaining humanity in the age of AI. The intersection between AI and societal response is one of my greatest intellectual and personal interests, and it’s both exciting and a relief for the world’s foremost religious authority to share a substantial interest in my personal and career obsessions.
Nonetheless – as I kept reading – certain lines jumped out at me as too smooth, too triadic, too… inhuman:
I read AI-generated text as part of my job regularly, and believe I have acquired a very good intuition for discerning AI-generated text from those by humans, including in formal writing (both academic and otherwise). Still, any individual phrase that seems AI-generated can be a false positive on my end, the result of an oversensitive nose for AI1. However, the sheer density of these phrases and overall tone in specific paragraphs seem implausibly all a random artifact.
Still, I can definitely be wrong here, and you should not believe my gut intuitions or judgments of vibes on authority (“Trust me bro”).
Intuitions, self-proclaimed expert judgments, and loose verbal reasoning can be a good starting point for an investigation, but if we want any confidence in our conclusions, we need to investigate further and more systematically.
Statistical Evidence and Tells
Three common and well-known tells in AI writing — sometimes genuinely deployed by humans but nowhere their profligate use by AI — are the regularity of em-dashes, the high frequency of specific words like “genuinely”, and the tendency to repeatedly invoke tricolons.
Let’s examine each of them in turn:
Em-dashes
The em-dash (“—”) is punctuation that’s by far most strongly associated with AI. It is also used 127 times in Magnifica Humanitas, much more than previous encyclicals.
Magnifica Humanitas: 127 times em-dash, 6 times en-dash (–), the latter all in citations.2
Dilexit Nos (2024): 0 times em-dash, 26 en-dashes, including 2 in citations. Comparatively long document.3
Laudate Deum (2023): 0 times em-dash, 12 times en-dashes. Much shorter. Also not officially an encyclical.
Fratelli Tutti (2020): 0 times em-dash, 46 en-dashes, of which maybe 5-10 are in quotes or citations. Note that this is 50% longer than Magnifica Humanitas.
Laudato Si’ (2016): 0 times em-dash, 25 times en-dash, of which maybe 10 are in citations or quotes (the piece overall appears to have many quotes). Similar length to Magnifica Humanitas
Lumen Fidei (2013): 26 times em-dash, 0 times en-dash. Some em-dashes in citations.
Note that this comparison actually understates the weirdness of the em-dashes in Magnifica Humanitas. For example, in Lumen Fidei, many of the em-dashes function similarly to speech colons in standard English. A typical use looks like
Using em-dashes as speech colon replacements is moderately common in formal (human) English writing, but essentially absent in LLM-English. I also did not notice em-dashes used this way in Magnifica Humanitas (though with 127 instances, it was annoying to check all of them!)
“Genuinely”
“Genuinely” is a phrase repeatedly used by Anthropic’s model Claude. It is extremely obvious to anybody who regularly uses it. It’s gotten so bad that in leaked system prompts, Anthropic attempted to explicitly forbid Claude to use that word!
As far as I could tell, this injunction does not and did not work.
Indeed, Anthropic’s own “Claude Constitution”, which many people believe to be substantially AI-assisted, used the phrase “genuinely” 33 times and genuine overall 50 times (inclusive).
How often is the phrase “genuinely” used in Magnifica Humanitas?
Less so than in Anthropic documents, but substantially more than past papal writings.
Specifically “genuinely” was used 9 times and “genuine” overall (inclusive) 22 times in yesterday’s encyclical, compared to 0 and 5 times, respectively, in Dilexit Nos, which is of similar length. Across a number of other encyclicals I scanned, the highest occurrences were 3 and 10, respectively.
These tells are all statistical. Any individual instance of “genuine(ly)” is plausibly a result of normal human communicative intent that is, well, genuine. But the sheer frequency of these occurrences, vastly out of accord with prior norms and normal human speech, is strongly suggestive of synthetic origin.
Is this due to subject matter?
An obvious rejoinder you might have is that word choices in essays are naturally not independent of subject matter. And it sure seems like an encyclical on AI might meditate more about genuineness more than other encyclicals! For example, an essay on AI deepfakes might be much more concerned about what makes a video “genuinely human” than an essay on climate change.
To investigate this hypothesis, I dived specifically into each use of genuinely in this encyclical:
“Genuinely” does not seem critical here, nor specific to questions of AI and authenticity.
Also not critical here.
Appears to be in a quote, so will give it a pass.5
Again, does not seem like “genuinely” was endogenously related to the subject matter
…you get the idea.6
Indeed, of all 9 instances of “genuinely” in the encyclical, only the last use (“When people come to believe that nothing is genuinely true and that principles are hollow words, then the fuse in their hearts is lit for new eruptions of intolerance and aggression.”) seem semantically critical. If we drop that and the Pope Benedict quote we’re left with 7/9 suspicious uses.
Again, to be clear any individual instance is plausibly normal, authentic, genuine. However the statistical pattern of the repeated invocations is quite suspicious!
Is this just a personality quirk of Pope Leo XIV specifically?
Another possibility you might have is that maybe this is just a personality/stylistic quirk of Pope Leo XIV specifically? Maybe he just genuinely likes the word?
Lord knows I too have odd personality quirks in writing, some of which have an unfortunate resemblance to AI.
Ultimately, I think this is plausible but unlikely. First of all, popes don’t typically draft the text of their own encyclicals that much. So it’s unlikely that stylistic quirks as specific as adverbial usage will bleed out to the final drafts as much. In contrast, I’m much more open to higher level constructs like the imagery, themes, or favorite Bible passages being much more prominent in some pope’s encyclicals than others.
Further, the specific phrases used are often next to other suspicious “AI tells” (more on that later).
Unfortunately, I don’t have easy access to many (pre-papacy) writings by Pope Leo to test against this alternative hypothesis. However, I did find Chapter 2 (“The Authority of the Local Prior”) of his 1987 PhD thesis here. In 14 pages (roughly the size of the post you’re reading), the future Pope Leo’s chapter has no uses of “genuine” or “genuinely,” and 0 em-dashes in his own prose.7
(I welcome extensions of my analysis by people with access to the full thesis in print).
Tricolon density
A common mark of LLM writing is the repeated invocation of tricolons: a series of three parallel words, phrases, or clauses used for rhetorical effect.
I noticed quite a few invocations of the tricolons in Magnifica Humanitas. It was especially notable in sections that otherwise had other tells of AI.
Unfortunately, unlike “genuinely” or em-dashes, this is harder to directly observe or baseline, as I can’t use the automatic “find” feature on chrome, it’s annoying to count by hand, and there are numerous edge-cases.
Nonetheless, I attempted to use my AI Agent Claude Code ( Claude Opus 4.7 1M XHigh) to give it a good college try, testing Magnifica Humanitas against 3 encyclicals by Francis, 2 by Benedict, 1 jointly by Benedict and Francis and 1 by Leo XIII (who wrote Rerum Novarum, which the current encyclical on AI is supposedly strongly based on).
Caption: Note the easy and natural use of em-dashes above. This is how the AIs naturally speak!
I think this is partial confirmation of my hypothesis. Strict tricolons seem noticeably more prominent in pope Leo XIV’s writings than that of past popes we tested against.
Unfortunately (for my hypothesis) there is also substantial variation in the encyclicals authored/commissioned by previous popes. In particular, tricolons are much more common in writings by Benedict than by Francis. So this simple test is suggestive but does not rule out normal human variation.
Further, the LLM scan is a rough estimate. There’s inherent subjectivity in a question of triadic markers (unlike more direct vocabulary or punctuation tells). I welcome replication of my attempts here, either via a different AI agent methodology, or (preferably) someone more patient than me willing to manually count and verify this.
Do we have other sources of evidence to look at? Yes, we can use automated AI detectors, specifically Pangram.
Pangram analysis
Pangram is by far the best commercially available AI detector. It is much better than other AI detectors, so much so that other ones are almost useless in comparison. In particular, Pangram optimizes very hard for getting a false positive rate of nearly zero, while being more okay with false negatives.
This means that if you see text online that Pangram flags as AI, you should have very high confidence that it’s AI. In contrast, if you have some text that Pangram flags as 100% human, you should still be appropriately skeptical, especially if you’re otherwise suspicious. So how did yesterday’s encyclical do?
When I pasted the first twenty paragraphs of the encyclical in8, Pangram flags 11% of it as AI.
In particular, Pangram is very suspicious of Paragraphs 7-8.
For what it’s worth, I too was rather suspicious of this section. “It was an impressive feat: a single language, a single technology, a single direction.” sounded just a bit too neat to me.
Spot-checking different sections of the encyclical, we see a repeated pattern.
Some sections register as essentially 0% AI, while others seem much more AI-y.
This indicates to me that some cardinals who contributed to the encyclical used AI assistance heavily and most (probably including Pope Leo himself) did not. More on this later.
Comparison to other encyclicals
You might naturally be skeptical of Pangram’s analysis here. After all, maybe you haven’t heard of Pangram (or myself) until today. And besides, Pangram’s likely trained on internet data and its main use case is detecting AI in blog posts and social media posts and movie reviews and academic papers and so forth. What evidence do we have that it’s suited for a task as off-distribution as papal encyclicals? Maybe the detector’s just befuddled by unusually pious tokens?
To test this hypothesis, I ran Pangram on the previous 4 encyclicals. The first 20 paragraphs on all of them register as 100% human, all with high confidence:
I also tested writings by Pope Benedict and John Paul in case Pope Francis had an unusually human touch. And against encyclicals by Pope Leos XIII and XII in case Pangram is prejudiced against Leos. All 100% human, as expected.
Comparison to Pope Leo XIV’s speech
I also tested it against a transcript of Pope Leo’s speech announcing yesterday’s encyclical. 100% Human on Pangram. This is evidence that Pope Leo himself and/or his primary speechwriter does not use AI to draft his speeches.
(Incidentally, “genuine(ly)” appears 0 times in the transcript. Though em-dashes appear three times).
Sidebar: Pangram has a very low false positive rate in general
I think the encyclical evidence specifically should be quite convincing. But separately, I also strongly believe Pangram has a very low false positive rate in general. I observed this both in academic research and my own tests:
I’ve tested writings that I’m very confident is not AI (e.g. writings by myself, or from before 2021) dozens if not hundreds of times against Pangram, and repeatedly gotten 100% Human. This indicates to me that their advertised very low false positive rate is real.
In contrast, the false negative rate is much higher: Sometimes I’d ask an AI to read an outline by me and generate a draft, as a test. Pangram only catches that sometimes, though my impression is that they’ve gotten better in the last few months.
So you should generally trust that text that Pangram flags as AI is probably AI, while continuing to maintain some healthy suspicion of text that Pangram flags as not AI.
Probably not a translation artifact
One potential mitigating factor is that this is a translation artifact: (human) senior church officials wrote the original encyclical in their native language (maybe Italian) and the translators, not the writers, were lazy and used substantially AI assistance in translating to English. I cannot rule this hypothesis out but currently think it’s very unlikely.
The same signs of AI I observe in English are essentially preserved verbatim in the Italian version
If the strong AI signs I’ve observed (“tells”) are a result of lazy translators to English, we should expect them to look different in other languages, especially the source language.
To test this, I gave all the tells I noticed in the English encyclical as well as the Italian version of the encyclical to two software agents – Claude Opus 4.7 and ChatGPT 5.5 Pro, arguably the strongest commercially available language models out there – to see whether the tells were preserved in Italian. Both agents believe the tells were preserved verbatim (See Claude screenshot below):
That said, I don’t speak Italian myself, and cannot personally verify the results. I welcome replications. Suspicious readers who speak Italian, and especially Italian native speakers who are familiar enough with how AI sounds in Italian, should try to verify the work for themselves!
The Italian version of the current encyclical also gets flagged as AI by Pangram
If the English Pangram results are due to overly technophiliac or lazy translators of Italian to English, we should expect that the original Italian results will get 0% while the English translations gets flagged. We do not observe this (first twenty paragraphs again).
Instead, the Italian flagged sections appear to be a superset of the sections flagged in English. This would naively indicate that more, not less, of the Italian sections were drafted by AI.
(See further analysis by Daniel Filan)
That said, I don’t know how accurate Italian Pangram is. All the academic research I am aware of on Pangram’s accuracy (and my own experiments) are in English.
Backtesting AI translation of past encyclicals get 0% on Pangram
Another angle on the translation artifact hypothesis: Suppose you 100% organically human-write an article and then use AI to translate it. Does this even get flagged by Pangram?
As far as I can tell, the answer is no! (H/T Daniel Filan for this methodology)
I took a random excerpt of Fratelli Tutti in Italian and asked 3 leading frontier models (Gemini 3.1, ChatGPT 5.5, Claude 4.7) to translate it to English, with web search off. Each translation is different from the official Vatican translation (and from each other).
Pangram reads all three texts as 100% human. (Gemini results, Claude results, ChatGPT results)
This indicates to me further evidence that any Pangram-detected signs of AI in Magnifica Humanitas come from stylistic features that are present across languages, rather than artifacts plausibly introduced by AIs in translation.
Share
The specific AI used is likely Claude
The AI I use the most often for work is Anthropic’s Claude. I have a decent sense of its underlying rhythm, how it argues, and its favorite vocabulary and syntactic choices.9
I believe I identified the same voice of Claude in the recent papal encyclical about safeguarding the human person in the age of artificial intelligence. Somewhat ironic, considering.
Unfortunately, my primary reasons for believing this are somewhat idiosyncratic and inscrutable, and thus more likely to be wrong. And the “Claude authorship” hypothesis is of course overall less important to nail than the “AI authorship” hypothesis, all things considered.
Still, I want to offer some textual evidence favoring Claude over other models10.
Textual
The aforementioned “genuinely” strongly bears the fingerprints of Claude. It is very much the house style of Claude/Anthropic and regularly used in both internal and external Anthropic communications about Claude’s nature and in Claude’s repeated outputs. To be honest, I see “genuinely” so much these days that the word no longer feels like a real word to me and I’ve completely lost the ability to spell the word correctly by myself, never mind appreciating it in context.
Maybe there’s a metaphor in there about authenticity in the age of AI? Nah, I’m probably overthinking it.
Regardless, the repeated uses of genuinely in Magnifica Humanitas is strongly suggestive of Claude-specific fingerprints.
Against ChatGPT
Applying the same analysis against ChatGPT fingerprints gives us a negative result. The words that I most associate with ChatGPT (“delve”, “meticulous,” “tapestry”, “goblins”) over other LLMs all show up zero times in the English encyclical.
This is moderate evidence against significant ChatGPT usage.
I don’t know enough about the tells that are specific to other models (Gemini, Grok, Mistral, DeepSeek, Kimi, etc). I welcome replications!
Different sections of the encyclical have very different rates of apparent AI usage
Significant fractions of the text in both English and Italian are flagged as AI. This varies significantly from section to section and paragraph to paragraph.
Some paragraphs in Pangram, and also by visual inspection, are clearly AI, while others seem clearly not AI.
My understanding is that popes don’t usually draft the majority of the text of their encyclicals themselves.
Between these two facts, this indicates to me that some cardinals heavily used AI assistance for this encyclical and many (probably including Pope Leo himself) didn’t.
My tentative hypothesis is that Pope Leo does not approve of the AI usage in encyclicals, and plausibly was not even aware of significant AI usage in his own encyclical! Quite unfortunate if true.
Conclusion
I attempted to provide a number of different angles and evidence for the conclusion that the papal encyclical on safeguarding the human person in the age of AI is actually an article in large part neither on nor in AI, but by AI. Any individual method might be flawed, but I believe the consilience of evidence is very strongly suggestive, perhaps even overwhelming.
Nonetheless, I welcome good-faith debate and corrections. Please feel free to comment with contrary evidence and replications.
We’re soon entering a time of unprecedented danger in the world. Like children with flamethrowers, or Bronze Age peasants attempting to build the Tower of Babel, humanity is messing with forces we cannot hope to understand or control.
In this new age of AI, getting provenance genuinuely right isn’t just a question of human authenticity — it’s a matter of life and death.
1
Though in previous attempts to validate my intuition systematically, I have a substantially higher false negative rate than false positive rate, suggesting I’m not sensitive enough.
2
Technically 6 times, but all 6 in citations (eg “cf Neh 2–6, [123] Cf. Dicastery for the Doctrine of the Faith – Dicastery for Culture and Education”)
3
Unedited AI texts use em-dashes much more often than en-dashes. The former is much more directly a sign of AI than the latter. However, it’s common for people hiding AI writing to change em-dashes to en-dashes. Additionally, the specific differences between em-dashes and en-dashes could just be random variation of specific stylistic preferences/trends between people. Thus, we should probably count em-dashes and en-dashes together in the analysis, though em-dashes over en-dashes is still some (nonzero) evidence.
4
I asked Claude to review this section. It says the injunction to avoid “genuinely” is no longer in its system prompt as of Opus 4.7 (and has no direct evidence of past prompts), though it believes the previous/github report is credible enough for Opus 4.6.
5
This is not strictly zero information. Given Claude’s demonstrated predilection for genuinely, we can imagine this specific passage was chosen by Claude to quote.
6
Interestingly, here the instances of “genuine” (“genuine relationships”, “genuine human connections”) seemed to carry actual semantic content, while “genuinely helpful” did not. So despite its placement in the rest of the paragraph, I don’t think “genuinely” is semantically relevant here.
7
If you’re very literal about it, there are 2 em-dashes in the chapter. One of them is in a large quote-block by someone else. The other is in the footnote “Robert F. Prevost, O.S.A. — Canon lawyer working in Chulucanas, Peru.” Neither invocations are semantic in the main text, and the latter likely added by an editor.
8
The whole encyclical is too long for Pangram.
9
I did not use AI to draft or write any of the text in this post, including for sentences deliberately engineered to sound like AI.
10
Note that I’m only moderately confident in the presence, and to a lesser degree, the substantial contribution, of Claude’s fingerprints on the recent encyclical. I can’t rule out that other AIs were also separately involved. I’m also only moderately confident in my conclusion that it’s Claude: all the AIs are kinda similar to each other, so differentiating between them is much harder than detecting the presence of AI at all.