Hi, I'm Amina. I'm new to LessWrong, and my career has mostly been in Machine Learning Science and Engineering. I am fascinated by the field of AI interpretability and believe it is very useful for AI safety, which I care about a lot. I recently participated in Neel Nanda's training phase, where I learnt a lot about current research directions and efficient techniques, and I recently wrote my very first post, about understanding alignment faking. I'm looking forward to engaging in discussions, attending events, and sharing what I learn.