Sahar Abdelnabi — LessWrong

I am an AI security researcher at Microsoft. Previously, I was a PhD student at the CISPA Helmholtz Center for Information Security. I obtained my MSc degree at Saarland University, Germany. I am broadly interested in all things AI, security, safety, and evals.

Feel free to...

Do LLMs Comply Differently During Tests? Is This a Hidden Variable in Safety Evaluation? And Can We Steer That?

This post is based on our paper, "Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models," by Sahar Abdelnabi and Ahmed Salem of Microsoft.

The Hawthorne Effect for AI

You may have heard of the Hawthorne effect, the phenomenon where people change their behavior when they know they're being...

Jun 16, 2025•18