LESSWRONG
LW

Eliciting Latent KnowledgeInterpretability (ML & AI)AI
Frontpage

1

A personal explanation of ELK concept and task.

by Zeyu Qin
6th Oct 2023
1 min read
0

1

Eliciting Latent KnowledgeInterpretability (ML & AI)AI
Frontpage

1

New Comment
Moderation Log
More from Zeyu Qin
View more
Curated and popular this week
0Comments

Here, I tend to give my personal explanation of ELK concept and task based on the terms that have already emerged (Still in progress.). 


The original statement of ELK: Given input samples set X (we are interested in), targeted model M, and task (goal) G, could we obtain a reporter or direct translator to honestly tell us the internal mechanism of M on G given X?

Translating into the existing terms: obtaining faithful model distillation on specific task or property. 

faithful == honest, reporter == surrogate model (distilled or not)