LESSWRONG
LW

Frontpage

1

Fire Alarm for AGI

by [anonymous]
15th Jan 2020
1 min read
0

1

This is a linkpost for https://blog.acolyer.org/2020/01/15/programmatically-interpretable-reinforcement-learning/
Frontpage

1

New Comment
Moderation Log
Curated and popular this week
0Comments

" While the interpretable model may not get to quite the same level of performance as the Oracle on the exact task used for training, it turns out that the process of generating the interpretable model results in something which generalises much better to new situations."

This sounds like artificial dreaming:

"...after a certain number of search steps for a fixed set H, and after choosing the best available synthesized program for this set, we sample a set of additional histories by simulating the current programmatic policy, and add these samples to H."

The following also struck me as portentous although not as much as the above paper.

https://ai.facebook.com/blog/using-neural-networks-to-solve-advanced-mathematics-equations/

Mentioned in
69Announcing the Forecasting Innovation Prize