LESSWRONG
LW

Tiberius
6200
Message
Dialogue
Subscribe

I am a researcher in AI Interpretability at the Zuse Institute in Berlin.

Mostly interested in building theoretical foundation for Interpretability that works even if the agents have an incentive to hide their interpretations.

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No Comments Found
No wikitag contributions to display.
3PhD Position: AI Interpretability in Berlin, Germany
2y
0
5Is it allowed to post job postings here? I am looking for a new PhD student to work on AI Interpretability. Can I advertise my position?
Q
2y
Q
4