We should not expect the Vatican to want or need to use AI to write documents meant for wide public consumption. Yes, the liturgical writing style they tend to use can be awkward, but reading and writing such documents is kinda what Catholic priests do. Perhaps it's hard to believe that unassisted humans — in this age where people's attention spans have been fried by phones and social media — can write something 2% as long as Zvi's AI newsletter series, but an organization with the resources of the Catholic Church actually can. Also, the encyclical was presumably written and reviewed by several people working collaboratively; it's not like it was just one guy who can secretly use ChatGPT on his own.
I just don't see why you think we should be so confident about this. Indeed, the encyclical was presumably worked on collaboratively by multiple people. Lots of people use AI to help them write, it wouldn't be so shocking to think that this includes some Vatican officials.
Re: whether it "reads as AI", my understanding is that the whole reason Linch thought to put this into Pangram was that it read as AI. Most people don't put everything they read into AI! You can also read someone on reddit saying the same thing here.
The rest of your post seems like it's just batting away the evidence. Every time people have tested Pangram rigorously, it's been shown to have a very low false negative [EDIT: I actually meant 'positive'] rate. Furthermore, I think the evidence of old papal encyclicals not being flagged is also relevant here. It's possible that that's just because Pangram trained on those, but most people aren't super interested in encyclicals, and Pangram Labs is a small enough company that I doubt they've trained on the whole internet. I think this would be legit if we had very very strong reasons to think there was no way that anyone at the Vatican would ever use AI in their writing, but I just don't think you've offered those reasons.
I fed a couple of my very recent blog posts to Pangram, because I do love em-dashes and bulleted lists. And in an unscientific sample, Pangram scored them as 100% human.
This has updated my prior on Pangram: It does not appear to trivially fail on the kind of writing that I do, given a totally inadequate sample size.
Also as a model size/compressibility issue, it's actually really hard to "cheat" by memorizing the entire internet! Claude Opus couldn't do it and Claude Opus's almost certainly bigger and had more training time than Pangram Labs's classifier.
Also, if Pangram were trained on previous encyclicals, you'd think that it would have learned that encyclical-like style is a tell for being human-generated.
So, the previous posts here led to this article in The Verge. Christopher Hale said in response:
I can confirm that this is 100% false. Not only was it written by hand, but the first drafts were literally written on paper with a pen.
He's a popular Catholic writer focused on the Pope, with some Vatican connections.
An author of one of the previous LessWrong posts then responded by accusing Hale of using AI for his past work. But with 33k likes I guess Hale is winning in terms of popular opinion.
Anyway, for a bunch of people this is now how they heard of LessWrong, so they know it as a group of anti-Christianity pro-AI cultists running a smear campaign, lol.
Note that cardinals are probably not the ones doing the actual writing of these things - they're too busy doing cardinal stuff. My guess is that it's staff at the Dicastery for the Doctrine of the Faith and other departments of the Vatican.
I recently saw two posts here saying the Pope's recent encyclical was largely written by AI. Two posts, on the front page at the same time.
Since I've sometimes crossposted blog posts here, I thought I'd write a post to at least make it clear that I have a different position.
priors
We should not expect the Vatican to want or need to use AI to write documents meant for wide public consumption. Yes, the liturgical writing style they tend to use can be awkward, but reading and writing such documents is kinda what Catholic priests do. Perhaps it's hard to believe that unassisted humans — in this age where people's attention spans have been fried by phones and social media — can write something 2% as long as Zvi's AI newsletter series, but an organization with the resources of the Catholic Church actually can. Also, the encyclical was presumably written and reviewed by several people working collaboratively; it's not like it was just one guy who can secretly use ChatGPT on his own.
Now, the US government has been using AI for important documents recently, like tariffs and health stuff and Epstein censorship, but that's a very different situation: the US government has plenty of workers, but the leadership didn't want to involve too many people because they didn't trust existing government employees to agree with their agenda. (Also, Trump's cabinet members are probably dumber than the average Catholic bishop.)
human judgement
Humans familiar with AI writing (and preferably with non-AI writing of the relevant type) are more reliable than AI-based AI detectors. This should be obvious to people with experience about this, but here's a citation I guess.
So, do we see people reading Magnifica Humanitas and saying "hey that's AI"? Not really. I read it, and I don't think it's AI. I don't think other forums are buying the "it's AI" argument either; here's r/ChatGPT being less credulous than LessWrong, and they seem to be familiar with what AI can do these days.
Yes, it has em-dashes, and some "not X but Y". But you can't analyze writing on such a simplistic level when it's a kind of thing that tends to use those. Is the quote below AI writing because it's using em-dashes and "not X but Y"?
Pangram reliability
"Pangram says previous encyclicals are 0% AI!"
Yeah, they're in the training set. Not only is that an obvious thing for Pangram to do, it's the only plausible way they could get 0% consistently. If we wait long enough, maybe Pangram will update their model's training and this new encyclical will magically become 0% AI too!
"this study said Pangram is reliable!"
Any study on AI detector reliability is using specific models, specific writing styles, and specific number thresholds. An encyclical is a different type of writing than such studies have used, so those results tend to not apply well. More generally, LLMs tend to copy some patterns used in formal writing, and most writing is not that. If you're checking some kind of writing that normally never uses em-dashes, you can just count em-dashes and get good accuracy results by such metrics.
a suggestion
If there are people around who paid for Pangram and just want to use it on something, and also have too much free time, I actually have a suggestion for something to do instead: a website that rates AI-ness of everything on the front page of Hacker News, and lets people filter links by AI-ness. A lot of the posts there are pretty obviously AI these days, and they're types of writing that automated AI detection should work better on.
conclusion
Blindly trusting software like Pangram is, uh, one of the things the encyclical was warning people about. So good job making its point, but also, maybe stop that.