LESSWRONG
LW

262
ZZ Si
0010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No posts to display.
No wikitag contributions to display.
On the Importance of Open Sourcing Reward Models
ZZ Si3y10

I think this is a fair point that an open reward function is subject to "SEO" efforts to game it. But, how about a  "training" reward function that is open, and a "test" reward function that is hidden?

I would love to know what are some other OSS efforts on reward function (I do follow Carper's development on RF), and love to contribute.

Reply