tl; dr: We can better understand common objective functions (reward, prediction, fitness, control) as all being related to a singular, overarching objective. Reward? Prediction? Fitness? In their 2021 paper Reward is enough, DeepMind researchers argue that "intelligence, and its associated abilities, can be understood as subserving the maximization of reward."...
Epistemic status: Speculative The title is a tongue-in-cheek reference to Google AI's latest showcase: Multitask Unified Model, or MUM for short. Further details can be found in their arXiv paper; Rethinking Search: Making Experts out of Dilettantes. Let's say hi to MUM. Multitask Unified Model (MUM) Google presents its new...
Alignment researcher Paul F. Christiano has written several posts on what he refers to as Iterated Distillation and Amplification (IDA). In this post, I will argue that IDA is a general method of adaptation and that it can be found in various different guises in a wide range of contexts....
Biologist Michael Levin was recently featured in an article by The New Yorker with a captivating headline: Is Bioelectricity the Key to Limb Regeneration? The thought struck me that Levin's work is a great example of what you might call a novel paradigm in biology that people here might not...