Mechanistically interpreting time in GPT-2 small
This work was performed by Rhys Gould, Elizabeth Ho, Will Harpur-Davies, and Andy Zhou, and supervised by Arthur Conmy. It is crossposted from Medium and has been shortened here for brevity. TLDR: we reverse engineer how GPT-2 small models temporal relations, that is, how it completes tasks like “If today is Monday,...
Apr 16, 2023

