Say that we have absolutely no idea how to implement any algorithms which aren’t scientifically replicated as of mid-2026. ... We now train the model to predict what text this person will write and speak in a few seconds given their current activations
Is this an algorithm which is scientifically replicated as of mid-2026, translating what will be spoken seconds in the future? That does not match my understanding of the field and I would be interested in a link to such research.
Thought-sharing is easy given appropriate hardware. The main risks are psychosis and dissociative symptoms from identity disruption.
Speech and text are extremely inefficient. For example, math textbooks are routinely more than one page long.
This sucks! I want the entirety of human hard-science results to pass through my mind at least once. Someone learned each of those concepts, but they can't just copy their Understanding to me.[1]
Or perhaps they can?
If we can read and write enough neural state, then communication is a unusually friendly target for cognitive augmentation. Unlike most enhancements, it doesn't need (non-hardware) neuroscience breakthroughs.
Humans are already exceptionally skilled at communication despite terrible bandwidth. By speaking while learning neuralese, we can use spoken language and feature engineering as training wheels to bootstrap telepathy.
(To be clear, I'm talking about hardware and software to pass carefully-translated brain activity between people. It's not spooky.)
Groups of experts could then share deep understanding in minutes-to-days; I'd wager that, with help from a mathematician, I could understand most of modern algebraic topology in a week instead of a year.
This could go a few ways. We'll start with the most pessimistic.
Say that we have absolutely no idea how to implement any algorithms which aren't scientifically replicated as of mid-2026.
Neurotech labs already translate low-dimensional data for speech, movement, and audio-visual stimuli. So we take thousands of these decoders running at much higher resolution across brain surface, and start by training a model on stimuli from a VR headset and haptic suit.
Left: computational graph for feature-engineered bootstrapping of telepathy model write component. The system learns to convert stimuli into neural activations. Right: same, for reading states.
We have a basis. This can decode and re-encode simple stimuli. We now train the model to predict what text this person will write and speak in a few seconds given their current activations; this takes a good bit longer, probably a few months.
Same idea as earlier, but now the model must learn anticipatory signals.
And now, we connect two people using a shared translator model[2]. They've learned explicit "macros" so it's a light application of will to send thoughts to the other person.
It's pretty terrible at first. Very imprecise. We keep the signal gain quite low to reduce weird effects (particularly psychosis, which I'll get to later).
The pair simply talk about interesting things together. As humans do, they begin to build stronger models of each other; neuralese becomes increasingly useful for refining communication.
Just like how people normally understand each others' minds, the group discusses stuff. It's just that, now, their neurons can choose to share data more directly. Inasmuch as the neuralese channel helps people model each other, it's learned as a more efficient language.
After ~4 months of this, the pair now has much better bandwidth than unaided speech. It's more efficient to share learned insights than to learn independently.
And after another few months, they're better thought of as one entity than two. As typical brains split computation between hemispheres, so too the minds delegate thoughts fluently. Big improvements continue for a few years.
Scaling the number of people gives nearly linear returns[3]; we'd need router minds, but beyond that, scaling doesn't have a hard limit.
Alright, what if we know the brain's local learning algorithm and can do whatever extra cortical mass would do?
We could then train the translator in a much more efficient way than CLIP; after pretraining to convert to a blurry common language, we run the translator at much higher learning rate to reduce local error.
As in, we make the translator convert messages into whatever each mind is asking for.
Thus we needn't wait for the two humans to become fluent in neuralese; the translator can adapt much quicker than human minds. Bottlenecks here are mostly psychological.
In the case where we can dramatically improve memory consolidation?
Here as well, we can probably accelerate translator convergence. Unlike most cognition, I strongly suspect that cross-human neuralese benefits (accounting for resources used) from strategically written replay code;
seems like it could be pretty effectively scaffolded with some custom-built tools. And unlike most research / memetically useful tasks, neuralese-conditioned communication has pretty legible feedback mechanisms.
Alright, but beaming stimuli into my mind sounds a lot like hallucinations! I don't have agency over what I'm "thinking".
This is a misnomer; in humanlike intelligences, "control" is the result of lots of local computations with no central deciding entity. But the process which calls itself a me will still be disrupted by this change, and we don't want a crazy superintelligence pointed at human values.
So, at minimum, each person has control over sending and receiving neuralese.
Frequency-coded working memory gives a good inductive bias for message-passing. "Person X is thinking Y" goes on one channel, where "person X" and "Y" are flexibly-bound preexisting circuits.[4]
We'd probably also include a loss term in the translator for raw sensory and motor signals, since these cause the worst subjective loss-of-agency feelings (sensory / movement data is mostly irrelevant to communication anyway).
I'm around 75% confident that these combined approaches would prevent first-order hallucinatory and psychotic effects, and around 80% conditional on non-acute psychosis that we'd avoid second-order (learned, more chronic) psychosis.
To restate:
Humans are pretty damn good at communication for having such trash bandwidth; so the augmentees get better at communicating much faster than we'd expect from performance on other tasks. There's a tight feedback loop of "what's the person actually saying?" which accelerates this much better than it would if they just worked on challenges together without speaking.
Out of the four approaches I've covered, I'm most confident that neuralese/telepathy is tractable with sufficient hardware.
Which brings us to hardware!
This is one reason why bureaucracies aren't even vaguely superintelligent entities, despite often being composed of many individually very smart people.
This architecture (CLIP) is used in multimodal embedding for some tasks like text-conditioned image diffusion and AI-guided molecular search.
By the time linearity is saturated, the group is decidedly a superintelligence.
Also note that, at group sizes where routing becomes a bottleneck, working memory items are probably the most interesting things to broadcast; they've been selected by the augmentee's cognition to be most relevant to whatever's happening.
For example, broadcast storms.