It feels unnatural for LLMs/transformers to be intelligent when they can only generate one token at a time. The Idea-Gated Transformer is about letting the transformer think in terms of ideas rather than individual words. It still generates one token at a time, but a separate auxiliary head, the thinking head, plans the next ~20 tokens together (i.e., plans the next idea). As expected, this helps the model stay on topic and on track.
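For anyone who wants a concrete picture before opening the paper, here is a minimal sketch of how an auxiliary planning head could gate next-token logits. This is a simplified illustration under my own assumptions (the class/variable names, the bag-of-words plan, and the sigmoid gating are hypothetical choices for exposition, not necessarily the paper's exact formulation):

```python
import torch
import torch.nn as nn

class IdeaGatedHead(nn.Module):
    """Sketch of a 'thinking head': from the current hidden state it
    predicts a soft bag-of-tokens plan for the next K positions (the
    'idea'), then uses that plan to gate the ordinary next-token logits.
    Hypothetical formulation, assumed for illustration only."""

    def __init__(self, d_model: int, vocab_size: int, horizon: int = 20):
        super().__init__()
        self.horizon = horizon
        # Projects the hidden state to a score per vocabulary item:
        # "does this token belong to the next idea?"
        self.plan_proj = nn.Linear(d_model, vocab_size)

    def forward(self, hidden: torch.Tensor, token_logits: torch.Tensor):
        # hidden:       (batch, d_model) last hidden state of the base model
        # token_logits: (batch, vocab)   ordinary next-token logits
        plan_logits = self.plan_proj(hidden)       # plan over the next K tokens
        gate = torch.sigmoid(plan_logits)          # soft membership in the idea
        # Down-weight tokens the plan considers off-topic.
        gated_logits = token_logits + torch.log(gate + 1e-9)
        return gated_logits, plan_logits
```

A plausible training signal for such a head would be a binary cross-entropy loss between `plan_logits` and a multi-hot vector of the tokens that actually occur in the next K positions, added to the usual next-token loss.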
Please take a read and share your feedback. Would love to hear your views.
Paper Link