x

LESSWRONG

LW

Viktor Moskvoretskii

Viktor Moskvoretskii

Message

PhD @ EPFL DLab with Robert West

I do research on LLM Personas - where to they come from, how to apply them, how to make models safer.

112

2

2mo

Viktor Moskvoretskii

PhD @ EPFL DLab with Robert West

I do research on LLM Personas - where to they come from, how to apply them, how to make models safer.

Viktor Moskvoretskii — LessWrong

Synthetic Persona Pretraining: Alignment from Token Zero

by Julian Minder, Raghav Singhal, Viktor Moskvoretskii, Stefan Krsteski, ashtonanderson, rolandaydin, and Robert West

Julian Minder, Viktor Moskvoretskii, Raghav Singhal, Difan Jiao, Kartik Bali, Yiderigun Borjigin, Shaobo Cui, Stefan Krsteski, Ashton Anderson, Roland Aydin, Robert West (equal contribution) > These are early results, but we wanted to share them with the community now. We will release all artifacts (scaled-up runs, models, code, data, intermediate...