
Prelude: sharks, aliens, and AI

If you go back far enough, the ancestors of sharks and dolphins look really different:

An acanthodian, ancestor to modern sharks[1]
Pakicetus, ancestor to modern dolphins[2]

But modern day sharks and dolphins have very similar body shapes:

Bodies of a shark, an ichthyosaur and a dolphin. Generated in Midjourney.

This is a case of convergent evolution: the process by which organisms with different origins develop similar features. Both sharks and dolphins needed speed and energy efficiency when moving in an environment governed by the laws of hydrodynamics, and so they converged on a pretty similar body shape.

For us, this isn’t very surprising, and doesn’t require much knowledge of evolution: we have a good intuitive understanding of how water works, and humans knew a lot of the underlying maths for the laws of hydrodynamics before they understood anything about evolution. Starting from these laws, it isn’t very surprising that sharks and dolphins ended up looking similar.

But what if instead of starting with knowledge of hydrodynamics and then using that to explain the body shape of sharks and dolphins, we started with only knowledge of sharks’ and dolphins’ body shape, and tried to use that to explain underlying laws? 

Let’s pretend we’re alien scientists from an alternative universe, and for some weird reason we only have access to simplified 3D digital models of animals and some evolutionary history, but nothing about the laws of physics in the human/shark/dolphin universe. My guess is that these alien scientists would probably be able to uncover a decent amount of physics and a fair bit about the earth’s environment, just by looking at cases of convergent evolution.

If I’m right about this guess, then this could be pretty good news for alignment research. When it comes to thinking about AI, we’re much closer to the epistemic position of the alien scientist: we either don't know the ‘physics’ of life and intelligence at all, or are only just in the process of uncovering it. 

But cases of convergent evolution might help us to deduce deep selection pressures which apply to AI systems as well as biological ones. And if they do, we might be able to say more about what future AI systems might look like, or, if we are lucky, even use some of the selection pressures to shape what systems we get.

Introduction 

This post argues that we should use cases of convergent evolution to look for deep selection pressures which extend to advanced AI systems.

Convergent evolution is a potentially big deal for AI alignment work:

  • Finding deep selection pressures could help us predict what advanced AI systems will be like.
  • It seems plausible that some of the properties people in the alignment space assume are convergent don’t actually extend to advanced AI.

In this post, I’ll:

  • Share some basics of convergent evolution,
  • Argue that this is a big deal for alignment work, and then 
  • Respond to the objection that biology is super different from AI.

The basics of convergent evolution

The body shape of sharks and dolphins is just one of very many examples of convergent evolution in biology.

We can think about convergent evolution in terms of:[3]

  • a basin of convergent evolution,
  • an attractor state(s), and 
  • selection pressure(s).

The basin of convergent evolution is the region of the abstract space in which, once an organism enters the basin, the pull of the selection pressure brings the organism closer to the attractor state.[4]

Note that low-dimensional projections of high-dimensional spaces often fail to convey correct intuitions.[5]
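To make these three pieces concrete, here is a minimal toy simulation in Python (every number and functional form is invented for illustration, not taken from any biological model): organisms are points in a two-dimensional trait space, and a selection pressure pulls any point inside the basin toward the attractor state.

```python
import numpy as np

np.random.seed(0)

ATTRACTOR = np.array([1.0, 0.5])   # e.g. an idealised body shape
BASIN_RADIUS = 2.0                 # how far out the selection pressure applies
STEP = 0.1                         # strength of the pull per generation

def evolve(traits, generations=100):
    """Pull trait vectors that are inside the basin toward the attractor."""
    traits = traits.copy()
    for _ in range(generations):
        inside = np.linalg.norm(traits - ATTRACTOR, axis=1) < BASIN_RADIUS
        traits[inside] += STEP * (ATTRACTOR - traits[inside])
        traits += np.random.normal(scale=0.01, size=traits.shape)  # mutation noise
    return traits

population = np.random.uniform(-3, 3, size=(200, 2))  # many different "origins"
final = evolve(population)
# Lineages that entered the basin end up near the attractor despite starting
# from very different places; lineages that never entered it stay scattered.
```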

In the case of sharks and dolphins:

  • The basin of convergent evolution is hunting fish in water in a certain way.
  • The attractor state is the rough body shape which sharks and dolphins share.
  • The selection pressures are the laws of hydrodynamics, and the need for speed and energy efficiency when moving in an environment governed by those laws.

There are some important nuances here. 

Firstly, if you back out far enough, cases of convergent evolution are always contingent on something.

Contingent evolution is the process by which organisms develop different traits under the same conditions, because of contingent factors (like random mutations or interspecies encounters). At first, convergent and contingent evolution sound like opposites, but actually they are fractal: every instance of convergent evolution is contingent on some higher level thing. To take our shark and dolphin example, their body shape is contingent on them both being vertebrates. Invertebrates under the same conditions don’t develop that sort of body shape.

Another way of putting this point would be that organisms have to enter the basin for the selection pressures to apply. Different factors determine entry, including both features of the environment and features of the organism. Entry into the basin of convergent evolution which dolphins and sharks both fell into seems to require vertebrae, among other things.

Secondly, similarity/generality do not necessarily imply convergence. Many animals have hooves, but they all share a common ancestor. This is a case of homology, not convergent evolution. The fact that hooved animals are quite widespread shows us that hooves are not maladaptive - but we don’t get the kind of strong signal we would from convergent evolution that hooves are uniquely adaptive. To say that X is convergent, you need to be able to point to multiple different origins converging to points close to X. It’s not enough to just observe that there’s a lot of X around.

Both of these nuances limit and clarify the concept of convergent evolution. Convergent evolution is limited in that there are many common phenomena which it can’t explain (like hooves). But it’s also unusually predictive: provided you understand the scope of the basin of convergent evolution (or in other words, can back out accurately what the convergent evolution is contingent on), then within that basin there’s not much room for things to go otherwise than fall towards the attractor state.

That’s a substantive proviso though: it can be very tricky to back out the contingencies, so often there will be uncertainty about exactly where the selection pressures apply.

This is a potentially big deal for AI alignment work

Convergent evolution might point to deep selection pressures

Firstly, cases of convergent evolution might point to deep selection pressures which help us predict what advanced AI will be like.

There is some work of this type already in the alignment space, but we think it’s a promising area for further exploration. 

There are at least a few different ways of exploring this idea, and probably others we haven’t thought of yet:

  • You can look for attractor states which seem convergent across many domains. This post has mostly focused on biological evolution, but convergent evolution can also be expanded beyond biology, to things like culture, technology and software. The further out you have to go to find the contingency, the more general the case of convergent evolution is, and the more likely it is that advanced AI systems will fall into the basin of convergence too. Searching for properties which are convergent across many kinds of systems (biological, cultural, economic, technological…) might point us towards convergences which hold for advanced AI systems.
  • You can start with a guess about a selection pressure, and then try to figure out what basin of convergence it should apply to. Then you can check whether in reality it does apply or not.
    • If it does, that’s some evidence that you’re onto something.
    • If it doesn’t, that’s some evidence that you’re missing something.

Here are some examples where I think that convergent biological evolution points to some deep selection pressures, which are likely to also be relevant for understanding advanced AI systems. We will go into more detail and unpack implications in follow-up posts.

Multicellularity

  • In biological organisms, multicellularity probably evolved independently at least 11 times. Intuitively, multicellular organisms unlocked new levels of complexity and power in living things.
  • There’s a possible analogy between multicellularity and the division of labour in human economies, which contributed to the rise of "civilization" and the transition to a period of faster growth in human power.
  • A candidate for the selection pressure here is ‘specialise and trade’:
    • In economics this is represented by Ricardo’s law (comparative advantage), which explains economic specialisation.
    • You can make a similar argument for multicellularity. An advantage for a simple multicellular organism might be that some of its cells specialise in movement (developing flagella, say) while others specialise in food processing. A numerical sketch of this ‘specialise and trade’ logic follows this list.
  • Candidates for the boundaries of the basin of convergence include how easy it is to scale the capacity of an individual, and how easy it is to solve coordination problems between individuals.
    • For example, if it were easy for cultural evolution to scale the capacity of individual humans arbitrarily, there would be less pull towards specialisation. If coordination between individual humans were extremely difficult, there would also be less pull towards specialisation. 
  • Goal-directed systems tend to be made out of parts which are themselves also goal-directed.[6] Something like this might also be the case for advanced AI systems.
  • It’s possible that collectives or "swarms" of somewhat intelligent AI systems (such as LLMs) might form larger emergent systems in response to selection pressures similar to those which caused multicellularity.[7]
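As a concrete illustration of ‘specialise and trade’, here is a small numerical sketch of Ricardo-style comparative advantage applied to two hypothetical cells (the productivity numbers are invented):

```python
# Hypothetical output per hour for two cells, each of which can spend its
# time either propelling the organism ("movement") or digesting ("food").
#               (movement, food)
OUTPUT = {"cell_a": (3.0, 6.0),   # better at both tasks in absolute terms
          "cell_b": (1.0, 4.0)}

def production(hours_on_movement):
    """Total (movement, food) output given each cell's split of one hour."""
    movement = sum(OUTPUT[c][0] * h for c, h in hours_on_movement.items())
    food = sum(OUTPUT[c][1] * (1 - h) for c, h in hours_on_movement.items())
    return movement, food

# No specialisation: each cell splits its time evenly.
print(production({"cell_a": 0.5, "cell_b": 0.5}))    # (2.0, 5.0)

# Specialise by comparative advantage: cell_a gives up only 2 food per unit
# of movement versus cell_b's 4, so cell_a leans into movement.
print(production({"cell_a": 5 / 6, "cell_b": 0.0}))  # (2.5, 5.0)
```

Same food, strictly more movement: the gains only require relative (not absolute) differences in productivity, which is part of why this pressure is so general.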

Agency

  • Agency (in the sense of Dennett’s intentional stance[8]) seems somewhat convergent: it arises in many different animals.
  • But most animals don’t seem strongly agentic. To understand animal behaviour, you usually need not just the ‘goal’ of the animal but also quite a lot of information about the ways in which the animal is bounded, its instincts and habits, etc. 
    • In other words, the attractor state doesn’t seem very pointy.
  • Understanding why seems like it might help us think about agency in advanced AI systems.
  • A preliminary guess: information processing is costly; some forms of coherence and agency require more information processing, all else equal; and it’s often not worth the additional costs. The toy model after this list illustrates how this could make the attractor shallow.
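Here is a toy cost-benefit model of that guess (the curves and constants are illustrative assumptions, nothing more): if returns to coherence saturate while information-processing costs keep growing, the fitness optimum sits well below maximal agency, and the peak is broad rather than pointy.

```python
import numpy as np

# Toy model: coherence/agency buys fitness with diminishing returns,
# while the information processing it requires has an ever-growing cost.
coherence = np.linspace(0, 10, 1000)
benefit = 5 * (1 - np.exp(-coherence / 2))  # saturating returns to agency
cost = 0.3 * coherence ** 1.5               # growing processing cost
net_fitness = benefit - cost

best = coherence[np.argmax(net_fitness)]
print(f"fitness-optimal coherence: {best:.2f} out of a possible 10")
# The optimum sits well below maximal coherence, and the peak is broad,
# which would make the attractor state shallow rather than pointy.
```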

"Intelligence" 

  • Some form of higher-level intelligence evolved multiple times: in primates (including apes), corvids, cetaceans and elephants, and possibly in many other cases, depending on thresholds and definitions.
  • Understanding of the selection pressures here is an active area of research in biology, and it's not clear what the best explanation is.[9]
  • One hypothesis is that runaway selection for social skills leads to intelligence.[10]
    • (Primates, corvids, cetaceans and elephants are all social; so are humans.)
  • This intuitively makes sense: in most natural environments, there may be sharply diminishing returns to spending more energy on energy-hungry brains that model a fixed-complexity environment better. However, if the really important part of the environment is other, similarly complex minds, this can lead to an arms race in intelligence (sketched in the toy simulation after this list).
  • If selection pressure towards modelling other minds leads to intelligence, this would have important implications for AI development and AI risk.
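Here is a minimal simulation sketch of that runaway dynamic (every parameter and functional form is an illustrative assumption, not a biological estimate). The key ingredient is that the payoff from modelling the fixed environment saturates, while the payoff from outwitting conspecifics is purely relative:

```python
import numpy as np

np.random.seed(1)

ENV_COMPLEXITY = 3.0    # payoff from modelling the physical world saturates here
BRAIN_COST = 0.05       # metabolic cost per unit of brain
SOCIAL_WEIGHT = 1.0     # how much fitness depends on outwitting conspecifics

def fitness(brains):
    env_payoff = np.minimum(brains, ENV_COMPLEXITY)           # sharply diminishing
    social_payoff = SOCIAL_WEIGHT * (brains - brains.mean())  # purely relative
    return env_payoff + social_payoff - BRAIN_COST * brains

brains = np.random.uniform(0.5, 1.5, size=500)
for generation in range(200):
    f = fitness(brains)
    probs = np.exp(f - f.max()); probs /= probs.sum()  # softmax selection
    parents = np.random.choice(len(brains), size=len(brains), p=probs)
    brains = brains[parents] + np.random.normal(scale=0.02, size=len(brains))
    brains = np.clip(brains, 0, None)

print(f"mean brain size after 200 generations: {brains.mean():.2f}")
# With SOCIAL_WEIGHT = 0 the population stalls near ENV_COMPLEXITY; with the
# relative social term switched on, mean brain size keeps ratcheting upward.
```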

"Sentience" 

Epistemic status: this subsection is highly speculative, more than the others.

  • This is possibly the most controversial example, and biological literature often shies away from the topic, with some recent exceptions.[11] On the other hand, the topic is of central importance to moral philosophy. 
  • There is some literature exploring how functional theories of consciousness such as global workspace theory could be related to properties of machine learning architectures.[12]
  • Understanding the possible convergence of whatever the morally relevant properties of systems are could be important for avoiding mind crimes. 

There are multiple other possibly relevant examples we decided not to include in this post, but we recommend thinking about this for yourself and posting further examples as comments.

The limits of convergent evolution may challenge some existing ideas in AI alignment

Secondly, in my view, lots of existing alignment research implicitly or explicitly relies on convergence.

  • Often there has been an implicit or explicit assumption in alignment research that something like VNM rationality or goal coherence is convergent for sufficiently intelligent systems.
  • Many arguments about AI risk stem from the idea of instrumental convergence - that specific goals such as self-preservation and power-seeking are likely to be convergent goals for any rational agent.
  • The natural abstraction hypothesis is the hypothesis that there are some selection pressures and some basin of attraction such that certain concepts are natural abstractions, i.e. attractor states.
  • Selection theorems are an abstracted and simplified way of looking at convergent evolution, applied specifically to agency.

It seems plausible that for some of the properties people in the alignment space assume are convergent, the relevant basin actually doesn’t extend to advanced AI, or the specific selection pressures are just one of many, making the attractor states not too deep.

Thinking in terms of convergent evolution makes it clearer why these cases of convergence may be relevant. At the same time, the interplay between convergence and contingency, and the limited extent to which some of these pressures seem to shape living things, may point to some of the basins of convergence not being as universal as assumed, or the selection pressures not being that strong. It would be good to have a more explicit discussion of what these cases of convergence are contingent upon, and how clear it is that advanced AI systems will meet those conditions.

But biology is super different from AI, no?

Yes, biology is super different from AI.

Evolution is not ‘smart’ - but over the past few billion years, it has had a lot of compute and has explored a lot.[13]

And evolution didn’t just explore spaces like ‘body shapes made of flesh’, which aren’t very relevant to AI systems. It also explored spaces like ‘control theory algorithms implementable by biological circuits’ and ‘information processing architectures’. Looking at the properties which were converged upon in spaces like that can hopefully tell us something about the underlying selection pressures.

While details of what biological evolution found are contingent, it seems likely that vast convergences across very different species, or even across very different systems like culture and technology, point to deeper selection pressures which apply to AI systems too.

[Image: a stylized submarine]

The ideas in this post are mostly Jan’s. Special thanks to Clem who made substantial contributions especially on the parts about contingency, and plans to write a follow up post on the relevance of contingency to AI alignment research. Thanks also to TJ, Petr Tureček and John Wentworth for comments on a draft. Rose did most of the writing.
 

  1. ^

    Nobu Tamura (http://spinops.blogspot.com), CC BY 3.0, via Wikimedia Commons.

  2. ^
  3. ^

    This way of thinking about convergent evolution is used by evolutionary biologists, e.g. here. There are also other ways of approaching it, most commonly in terms of a fitness landscape, where instead of individuals falling down into attractor states, selection pressures push individuals uphill. Conventions depend on the subfield.

  4. ^

    Note that the attractor state applies to some feature or features of the organism, but is irrelevant to most others. In the shark and dolphin case, the attractor relates to body shape, but does not affect other features like type of immune cells.

  5. ^

    https://commons.wikimedia.org/wiki/File:Local_search_attraction_basins.png , CC BY-SA 3.0 <http://creativecommons.org/licenses/by-sa/3.0/>, via Wikimedia Commons.

  6. ^
  7. ^
  8. ^

    Roughly:

    - You can look at any system as an agent

    - A system is more agentic the more that describing it using the intentional stance is useful, relative to other stances.

    See https://en.wikipedia.org/wiki/Intentional_stance

  9. ^
  10. ^

     This paper argues that different pressures operated in different taxa, and that for some taxa social learning was a key selection pressure.

  11. ^
  12. ^
  13. ^

     Stochastic gradient descent also isn’t the smartest designer, but with enough compute it’s been able to find the smartest AI systems we have.

Comments (9)

In addition to looking at biology, I'd look at human organizations (corporations, governments, organized religions, militaries, etc.) Under what conditions do they evolve towards something like "agency?" What about "intelligence?" Under what conditions do they evolve away from those things?

On the topic of thinking about it for yourself and posting further examples as comments...

This is GPT4 thinking about convergent properties, using the post as a prompt and generating 20 plausibly relevant convergences. 

  • Modularity: Biological systems, like the human brain, display modularity in their structure, allowing for functional specialization and adaptability. Modularity is also found in industries and companies, where teams and departments are organized to handle specific tasks.
  • Hierarchical organization: In biological systems, hierarchical organization is common, with higher-level structures built from lower-level components. Hierarchies are also observed in companies and organizations.
  • Recurrent connections: Neural networks in the brain have recurrent connections, which allow for feedback and information processing over time. Recurrent structures are also seen in supply chains and communication networks.
  • Redundancy: Redundancy in biological systems provides robustness and fault tolerance. This concept is also utilized in industries for backup systems and fail-safe mechanisms.
  • Adaptation: Biological systems adapt to changing environments to survive. Adaptation is also a crucial aspect for businesses and industries, where they must adjust to market trends and demands.
  • Error correction: Biological systems have mechanisms to detect and correct errors (e.g., DNA repair mechanisms). Error correction is an essential aspect of modern communication systems and data storage.
  • Network robustness: Biological networks (e.g., metabolic networks) exhibit robustness against perturbations. Similar robustness is desirable in communication and transportation networks.
  • Small-world networks: Biological networks often display small-world properties, with short path lengths and high clustering. These properties are found in social networks and the internet.
  • Scale-free networks: Biological networks often exhibit scale-free properties, with a few highly connected nodes (hubs) and many less connected nodes. Scale-free networks are also found in the internet, social networks, and citation networks.
  • Sparsity: Neural networks in the brain are sparse, with many fewer connections than theoretically possible. Sparsity is also utilized in machine learning algorithms and data compression techniques.
  • Decentralization: Biological systems often rely on decentralized control mechanisms. Decentralization can also be seen in blockchain technology and peer-to-peer networks.
  • Homeostasis: Biological systems maintain internal stability through feedback mechanisms. Homeostasis is also relevant to industries, where maintaining stable operating conditions is essential.
  • Oscillations: Oscillatory behavior is common in biological systems, such as circadian rhythms. Oscillations can also be observed in economic cycles and traffic patterns.
  • Synchronization: Synchronization occurs in biological systems, such as the firing of neurons. Synchronization is also essential in distributed computing and communication systems.
  • Division of labor: Division of labor is observed in biological systems (e.g., cells within multicellular organisms) and is a fundamental principle in industries and organizations.
  • Cooperation and competition: Biological systems display a balance of cooperation and competition. These dynamics are also observed in economic systems, business strategies, and social interactions.
  • Plasticity: Plasticity in biological systems allows for learning and adaptation. In industries, plasticity is important for innovation and adaptation to changing market conditions.
  • Evolvability: Biological systems can evolve through mutation and selection. Evolvability is also relevant in industries, where companies must be able to innovate and adapt to survive.
  • Self-organization: Self-organization occurs in biological systems, such as pattern formation in developing organisms. Self-organization is also observed in swarm intelligence and decentralized control systems.
  • Energy efficiency: Biological systems are optimized for energy efficiency, as seen in metabolic pathways. Energy efficiency is also a crucial consideration in industries and technology development.

In my view:
a) it broadly got the idea, and
b) the results are in better taste for understanding agents than, e.g., what you get from karma-ranked LW frontpage posts about AIs on an average day.
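
One of the properties on the list above, the small-world signature, is easy to check numerically. Here is a minimal networkx sketch (parameters are arbitrary):

```python
import networkx as nx

# Watts-Strogatz graphs interpolate between a regular ring lattice (p=0)
# and a random graph (p=1). In between they show the small-world signature:
# high clustering combined with short average path lengths.
for p in (0.0, 0.1, 1.0):  # rewiring probability
    g = nx.connected_watts_strogatz_graph(n=1000, k=10, p=p, seed=0)
    clustering = nx.average_clustering(g)
    path_length = nx.average_shortest_path_length(g)
    print(f"p={p:.1f}  clustering={clustering:.3f}  avg path={path_length:.2f}")

# At p=0.1 clustering stays high while path lengths collapse -- the same
# combination reported for brains, social networks and the internet.
```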

Really enjoyed this post, both aesthetically (I like evolution and palaeontology, and obviously AI things!) and as a motivator for some lines of research and thought.

I had a go at one point connecting natural selection with gradient descent which you might find useful depending on your aims.

I also collected some cases of what I think are potentially convergent properties of 'deliberating systems', many of them natural, and others artificial. Maybe you'll find those useful, and I'd love to know to what extent you agree or disagree with the concepts there.

Tentative GPT4's summary. This is part of an experiment. 
Up/Downvote "Overall" if the summary is useful/harmful.
Up/Downvote "Agreement" if the summary is correct/wrong.
If so, please let me know why you think this is harmful. 
(OpenAI doesn't use customers' data anymore for training, and this API account previously opted out of data retention)

TLDR: Convergent evolution, where organisms with different origins develop similar features, can provide insights into deep selection pressures that may extend to advanced AI systems, potentially informing AI alignment work and predicting future AI system properties.

Arguments: The article provides several examples of convergent evolution, including the body shapes of sharks and dolphins, multicellularity, agency, intelligence, and sentience. The article discusses that these convergent properties might provide valuable insights into selection pressures relevant to AI alignment research.

Takeaways:
1. Cases of convergent evolution might point to deep selection pressures, which may help predict advanced AI systems' properties.
2. Convergent evolution may challenge existing assumptions in AI alignment, which often rely on convergence.
3. Learning from convergent evolution can help AI alignment work by understanding the properties that may extend to advanced AI systems.

Strengths:
1. The article presents strong examples of convergent evolution that can potentially extend to AI systems.
2. Convergent evolution as a concept provides a powerful framework for searching for deep selection pressures relevant to AI alignment.
3. The article explores the relevance of convergent evolution to AI alignment work and suggests fruitful areas of future research.

Weaknesses:
1. The article acknowledges that biology is significantly different from AI, which might limit the direct applicability of convergent evolution insights to AI alignment.
2. Due to the complex interactions of selection pressures and contingencies, it may be challenging to predict which properties will extend to advanced AI systems.

Interactions: The exploration of convergent evolution interacts with AI safety topics like instrumental convergence, natural abstraction hypothesis, and selection theorems. Understanding these interactions can help refine alignment work and predictions about AI systems.

Factual mistakes: The summary accurately represents the content of the article and does not contain factual mistakes or hallucinations.

Missing arguments: The main missing argument in the earlier sections is the importance of explicitly discussing convergence and contingency in AI alignment. This discussion can help refine our understanding of the properties that may extend to advanced AI systems and the selection pressures that shape their development.

Standardized communication protocols

Language is the most obvious example, but there are plenty of others. E.g., if you take different parts of the body as subsystems communicating with each other, one neurotransmitter/hormone often has very similar effects in many of them.

In software, different processes can communicate with each other by passing messages in some well-defined format. When you're sending an API request, you usually have a good idea of what shape the response is going to take, and if the request fails, it should fail in a predictable way that can be harmlessly handled. This makes it easier to build reliable software.
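A minimal sketch of this in Python (the message shape here is hypothetical, not any particular API):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Response:
    """A fixed message schema: failure arrives in a predictable shape too."""
    ok: bool
    payload: Optional[dict] = None
    error: Optional[str] = None

def handle(response: Response) -> dict:
    if response.ok:
        return response.payload
    # The failure mode is part of the protocol, so the caller can
    # handle it harmlessly instead of crashing on a surprise.
    return {"fallback": True, "reason": response.error}

print(handle(Response(ok=True, payload={"temp": 21.5})))   # normal case
print(handle(Response(ok=False, error="timeout")))         # predictable failure
```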

Some cases of standardization are spontaneous/bottom-up, whereas others are engineered top-down. Human language is both. Languages with a greater number of users seem to evolve simpler, more standardized grammars, e.g. compare Russian to Czech or English to Icelandic (though syncretism and promiscuous borrowing may also have had an impact in the second case). I don't know if something like that occurs at all in programming languages, but one factor that makes it much less likely is the need to maintain backward-compatibility, which is important for programming languages but much weaker for human languages.

One hypothesis is that runaway selection for social skills leads to intelligence.

 

I realise you're explicitly not claiming that this has been the only route to intelligence, but I wanted to insert a counterexample here: cephalopods (octopus, squid, ...) are generally regarded as highly intelligent, but as far as I know there are few or no social species. They don't even interact socially with their own young, unlike orangutans, another example of an otherwise usually solitary intelligent species.

"Social" is slightly too coarse-grained a tag. The thing we're actually interested in is "whether successfully predicting the behaviour of other members of its own species is a strong selection pressure". Social collaboration is one way this happens - another seems to be "deception" arms races (such as corvids stealing and hiding things from each other), or specific kinds of mating rituals. It also depends on the relative strength of other selection pressures - in most cases highly intelligent creatures also seem to have developed a "slack" in resources they can devote to intelligence (e.g. humans cooking food). 

This does seem to hold for cephalopods; a strong datapoint for this is their highly sophisticated forms of communication (e.g. the video below).

To be clear, we are explicitly claiming it's likely not the only pressure - check footnotes 9 and 10 for refs.