danield

Comments

Debate update: Obfuscated arguments problem

danield3y10

Thanks!

AlphaStar: Impressive for RL progress, not for AGI progress

danield4y70

New paper relevant to this discussion: https://arxiv.org/abs/1911.08265

Constructing agents with planning capabilities has long been one of the main challenges in the pursuit of artificial intelligence. Tree-based planning methods have enjoyed huge success in challenging domains, such as chess and Go, where a perfect simulator is available. However, in real-world problems the dynamics governing the environment are often complex and unknown. In this work we present the MuZero algorithm which, by combining a tree-based search with a learned model, achieves superhuman performance in a range of challenging and visually complex domains, without any knowledge of their underlying dynamics. MuZero learns a model that, when applied iteratively, predicts the quantities most directly relevant to planning: the reward, the action-selection policy, and the value function. When evaluated on 57 different Atari games - the canonical video game environment for testing AI techniques, in which model-based planning approaches have historically struggled - our new algorithm achieved a new state of the art. When evaluated on Go, chess and shogi, without any knowledge of the game rules, MuZero matched the superhuman performance of the AlphaZero algorithm that was supplied with the game rules.
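To make "a model that, when applied iteratively, predicts the reward, the policy, and the value" a bit more concrete, here is a rough sketch of the unrolling loop. This is only an illustration, not the paper's implementation: the `LearnedModel` container, the `unroll` helper, and the toy functions are all hypothetical names, and hand-written arithmetic stands in for the neural networks. The three-way split into representation, dynamics, and prediction functions follows the paper's description rather than anything stated in the abstract.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

State = Tuple[float, ...]


@dataclass
class LearnedModel:
    """Hypothetical container for the three learned functions."""
    represent: Callable[[Sequence[float]], State]          # observation -> hidden state
    dynamics: Callable[[State, int], Tuple[State, float]]  # (state, action) -> (next state, reward)
    predict: Callable[[State], Tuple[List[float], float]]  # state -> (policy, value)


def unroll(model: LearnedModel, observation: Sequence[float],
           actions: Sequence[int]) -> List[Tuple[float, List[float], float]]:
    """Apply the model iteratively along a sequence of hypothetical actions.

    Only the first step touches a real observation; every later step runs
    purely inside the learned hidden-state space, with no simulator or rules.
    """
    state = model.represent(observation)
    outputs = []
    for action in actions:
        state, reward = model.dynamics(state, action)
        policy, value = model.predict(state)
        outputs.append((reward, policy, value))
    return outputs


# Toy stand-ins for the learned networks (assumptions for illustration only).
toy = LearnedModel(
    represent=lambda obs: tuple(float(x) for x in obs),
    dynamics=lambda s, a: (tuple(x + a for x in s), float(a)),
    predict=lambda s: ([0.5, 0.5], sum(s)),
)

steps = unroll(toy, [1.0, 2.0], [1, 0])
```

In the real algorithm a tree search would call something like `dynamics` and `predict` many times per move to evaluate candidate action sequences; the loop above only follows a single fixed sequence.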

[AN #69] Stuart Russell's new book on why we need to replace the standard model of AI

danield5y90

Thanks for this summary / commentary, Rohin -- I found it helpful!

Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More

danield5y80

Thanks for transcribing this, Ben!

Machine Learning Projects on IDA

danield5y70

I'm excited about this. If you get any substantive feedback from people who take on these projects or decide not to, I'd be very interested to see a follow-up post.

An Equilibrium of No Free Energy

danield6y60

I think this article / concept is incredibly useful, and singlehandedly justifies the existence of LW2. Thank you!

I want to go reread "You and Your Research" and see how the free energy concept could apply there -- if anyone else does, I'd love to hear thoughts.
