LESSWRONG
LW

1467
lukaemon
0010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Actually, Othello-GPT Has A Linear Emergent World Representation
lukaemon1yΩ010

In hindsight, I should have trained on layer 6, which is the point where the board state is fully computed and starts to really be used.

You mean layer 4? 

Reply
No posts to display.