ChengCheng

Posts

Sorted by New

20VLM-RM: Specifying Rewards with Natural Language

Ω

6mo

Ω

2

32Uncovering Latent Human Wellbeing in LLM Embeddings

Ω

7mo

Ω

7

Wiki Contributions

Comments

Speed running everyone through the bad alignment bingo. $5k bounty for a LW conversational agent

ChengCheng1y40

First of all, thank you @ArthurB for offering this bounty and raising the awareness of the need for quality AI alignment educational resources! We are particularly grateful to those who mentioned the Stampy project and also to people who have reached out offering to help in our efforts. Our submission https://chat.stampy.ai/ is a very early prototype focused primarily on summarizing and synthesizing information from our own database of FAQs along with selected documents collected from the alignment research dataset. The conversational feature still requires considerable work. Nevertheless, we would love to get input and feedback to further develop this tool for anyone seeking to better understand or contribute to AI safety. This would not have been possible without the support of our volunteers and collaborators. We welcome all who are interested in using AI to advance alignment.

Reply