Taylor Sorensen — LessWrong

The Intrinsic Interplay of Human Values and Artificial Intelligence: Navigating the Optimization Challenge

Taylor Sorensen 3y 10

Fascinating post, Joe! We just published a research paper on modeling pluralistic human values, and I thought it might be relevant. Working with philosophers and cognitive scientists, we made a first attempt at concretely modeling pluralistic human values using language models. It is obviously imperfect, and it assumes human values are fixed at a single point in time, but it is a computational approach that, to our knowledge, no one has tried before.

Please let me know if you have any thoughts on our work and how it may relate to these ideas, or if y...