I think many people experiment with creating different digital personas but with low effort, like "You are Elon Musk".

I personally often ask LLM to comment on my drafts as Yudkowsky and other well known LWers. What such answers lack is extreme unique insight which is often for real EY.

The essence of human genius is missed and this is exactly why we still don't have AGI.

Also, for really good EY model we may need more data about his internal thought stream and biographical details which only he can collect. It seems that he is not interested and even if he would, it would be time consuming (but he write quickly). One thousand pages of unedited thought stream may significantly improve the model.

3 comments, sorted by

top scoring

Click to highlight new comments since: Today at 9:41 PM

[-]dirk9h43

IMO that's because it's not relatively easy to create a good replica of a person; LLMs fine-tuned to speak like a particular target retain LLM-standard confabulation, distractibility, inability to learn from experience, etc which will make them similarly ineffective at alignment research. I'd suggest looking into the AI Village for a better sense of how LLMs do at long-horizon tasks. (Also, I want to point out that inference is costly. The AI village, which only has four agents and only runs them for two hours a day, costs $3700 per month; hundreds of always-on agents would likely cost hundreds of times that much. This could be a good tradeoff if they were superhumanly effective alignment researchers, but I think current frontier LLMs are capable only of subhuman performance).

Reply

[-]RomanS9h20

I agree with you on most points.

BTW, I'm running a digital replica of myself. The setup is as follows:

Gemini 2.5 as the model
The script splits the text corpus (8 MB) into small-enough chunks for Gemini to digest (1M tokens), and then (with some scaffolding) returns a unified answer.

The answers are surprisingly good at times, reflecting non-trivial aspects of my mind.

From many experiments with the digital-me, I conclude that a similar setup for Yudkowksy could be useful even with today's models (assuming large-enough budgets).

There will be no genius-level insights in 2025, but he could automate a lot of routine alignment work, like evaluating models.

Given that models may become dramatically smarter in 2026-2027, the digital Yudkowksy may become dramatically more useful too.

I open-sourced the code:

https://github.com/Sideloading-Research/telegram_sideload

Reply

[-]avturchin9h51

Yes, if MIRI spends a year on building as good model of Yudkowsky as possible, it can help in alignment and its measurable and doable thing. They can later ask that model about failure modes of other AIs and it will cry "Misaligned!"

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

2

[ Question ]

Why there is still one instance of Eliezer Yudkowsky?

2

2

1 Answers sorted by
top scoring

Oct 30, 2025

2

[ Question ]

Why there is still one instance of Eliezer Yudkowsky?

2

2

1 Answers sorted by top scoring

Oct 30, 2025

1 Answers sorted by
top scoring