gabrielrecc

Message

261

10y

Automated Sandwiching & Quantifying Human-LLM Cooperation: ScaleOversight hackathon results

We ran a hackathon on scalable oversight with Gabriel Recchia as keynote speaker (watch the talk) and Ruiqi Zhong as co-judge. Here, we share the top projects and results. In summary: * We can automate the “sandwiching” paradigm from Cotra [1] by having a smaller model ask structured questions to...

Feb 23, 20238

Still possible to change username?

I could swear there used to be an option for changing one's username (I've done it before). Has this option been removed? Am I just too daft to find where to click? Or is it auto-disabled after you've done it once?

May 13, 20227

PSA for academics in Ukraine (or anywhere else) who want to come to the United Kingdom

If you are an academic with a PhD, or someone who has achieved some level of recognition in the arts or digital technology, and you would like to come to the United Kingdom, the UK's "Global Talent" visa is worth taking a look at. Although the website makes it sound...

Mar 8, 20223

LESSWRONG
LW

LESSWRONG
LW

gabrielrecc

gabrielrecc

gabrielrecc

gabrielrecc

Automated Sandwiching & Quantifying Human-LLM Cooperation: ScaleOversight hackathon results

Still possible to change username?

PSA for academics in Ukraine (or anywhere else) who want to come to the United Kingdom

Automated Sandwiching & Quantifying Human-LLM Cooperation: ScaleOversight hackathon results

Still possible to change username?

PSA for academics in Ukraine (or anywhere else) who want to come to the United Kingdom