Why rationalists get depressed

Pjain

How high learning rate can lead to depression

Thanks @Ariel Cheng for helping a lot in refining the idea, with her thorough understanding of FEP

Epistemic status: An attempt at a synthesis of the cholinergic theory of depression and the role of acetylcholine in the Active Inference theory of the brain, by a neuroscience layperson. My understanding of the math behind FEP is also incomplete, but it seems to me that it's worth writing out a potentially mathematically mistaken idea, rather than delaying shipping by continually getting sidetracked by all the existing FEP literature.

I am not claiming to explain depression fully by the theory, it is a probably wrong mechanistic model explaining maybe just a tiny fraction of depression etiology, there are many more biological explanations that may apply better to many cases.

Intro:

Depression is often (usually implicitly) conceived of as "fixed priors" on the state of oneself and the world, with an overly pessimistic bias. Depressed people's views are considered to be a mere product of a "chemical imbalance" (which chemical? serotonin almost certainly not^[1]). The standard psychotherapeutic treatment of depression, CBT, is based on this idea; Your problems are cognitive distortions, and by getting into a better epistemic state about them, they diminish.

However, depressive realism seems to hold for at least some cognitive tasks, and increased activity of the same neurotransmitter appears to mediate both the effects of many "cognitive enhancers" (nootropics) and depression. This may be explained by depression being an attractor state achieved by pathologically increased learning rate.

In this text, I propose a theory of the mechanism behind this connection, using mostly an Active Inference model of the mind.

TL;DR (by GPT-5.1):

Claim. Major depression is not a state of fixed priors, but a miscalibrated learning regime: high precision on ascending prediction errors (↑ACh) and relatively low precision on deep priors/policies (↓catecholamines). In short: a pathologically high learning rate.
Mechanism. Acetylcholine (ACh) up-weights sensory (or internally generated) prediction errors, forcing aggressive bottom‑up updating; monoamines (esp. dopamine) down, high‑level priors (goals, self‑model) lose precision. The agent becomes exquisitely sensitive to surprises and revises beliefs rapidly—especially in the negative direction.
Function (when adaptive). In volatile or failed environments, this allostatic shift enables radical model revision—analytic rumination—until a better strategy is found [Gwern’s summary].
Dysfunction (when excessive). Excess ζ (ACh‑driven sensory precision) + low γ (policy precision) yields helplessness, anhedonia, and rumination: overfitting to local negative evidence; nightly REM (high ACh, low monoamines) further destabilizes high‑level priors. REM deprivation and anticholinergics (e.g., scopolamine) can be acutely antidepressant.
Converse. Mania ≈ low ζ / high γ: down‑weight errors, over‑precise priors/policies (↑DA) → grandiosity, risk‑taking.

Short FEP background

basic idea

FEP posits that any self-organizing system (like a human) must act to resist increasing entropy to preserve itself. In information theory terms, this means the agent must continually minimize Suprisal (the negative log evidence of its sensory observations).

Computing suprisal () is intractable for a brain, since it can't know the summation of all possible causes for a sensation. So, the brain minimizes a tractable proxy: the Variational Free Energy (VFE).
The VFE ( $F$ ) is an upper bound on surprisal. Mathematically, it decomposes like this:
$F = D_{K L} [Q (s) | | P (s | o)]      Divergence - ln P (o)      Log Evidence$
This equation gives the brain two mechanism to stay alive:
1. Perception (Minimising Divergence): The first term is the KL divergence between your internal beliefs $Q (s)$ and the true posterior $P (s | o)$ . By updating one's internal beliefs ( $Q$ ) to match the probabilistic reality of the world, one decreases this term toward zero. i.e. Perceptual inference
2. Action (Maximising Log Evidence): Even if your beliefs perfectly match reality ( $D_{K L} = 0$ ), you still have high VFE if the second term ( $- ln P (o)$ ) is high. If you correctly perceive that you are freezing to death, your inference is perfect, but your surprise is high. To reduce this, you must change the world so that $o$ (observations) fall within your preferred range.
But you cannot minimize VFE directly through action, because you cannot control the present instant. You can only control the future.
This requires Expected Free Energy (EFE):
To minimize Surprisal over time, the agent "rolls out" its generative model into the future and selects policies ( $π$ ) that minimize the VFE expected to occur.
$G (π) = \sum_{τ} E_{Q (o, τ | π)} [F (π, τ)]$
When you unpack this, the EFE drives two competing behaviors:
- Epistemic Value (Ambiguity Reduction): Going to states that resolve uncertainty ( $p (s)$ ).
- Extrinsic Value (Risk Minimization): Going to states that match your priors ( $p (o)$ ).

Standard RL agent theory usually separates the world-state (is) from the reward function (ought). Active Inference reduces this distinction by using the same "currency" for utility and epistemic value- prediction error (PE). In this framework, desires are just probability distributions- specifically, priors over observations ( $p (o)$ ).

In standard RL, the agent has a state space and a separate reward function. The agent checks the state, consults the reward function, and computes a policy.

The brain (in the FEP framework) just has a generative model of what it expects to happen.

The cost function is simply the probability of the observation: $C = p (o)$

If you "want" to be warm, your brain implies a generative model where the prior probability of observing a body temperature of (around) $37^{\circ} C$ is extremely high. which is the basic mechanism behind life-preserving homeostasis.

In a standard Bayesian update, if you observe you are cold, you should update your prior to expect coldness. The reason why this doesn't happen, is that the deep, homeostatic priors (temperature, oxygen) are not standard beliefs.

Mathematically, this means that the parameters of these innate prior distributions – encoding the agent’s expectations as part of its generative model – have hyperpriors that are infinitely precise (e.g., a Dirac delta distribution) and thus cannot be updated in an experience dependent fashion.

from Active inference on discrete state-spaces: A synthesis

Because the hyperprior is a Dirac delta, the agent cannot update its expectation of what its temperature "should" be based on experience. No matter how long you stand in the snow, you will never "learn" that hypothermia is your natural state. The prediction error between the fixed prior ( $37^{\circ} C$ ) and the sensory reality ( $35^{\circ} C$ ) remains essentially infinite, forcing the system to minimise VFE the only way left: by acting to heat the body up.

While $p (o)$ generally encodes these fixed preferences, beliefs about hidden states, $p (s)$ , often encode epistemic beliefs. The deeper you go in the hierarchy, further from immediate sensory input, the more these p(s) distributions begin to resemble stubborn preferences or identity/self-concepts, and the slower they are to update.

In this post, when I talk about priors/beliefs/desires, it means this hierarchy of expectations, where the deepest layers act as the immovable "oughts" that the agent strives to fulfill.

For example, an agent with an abnormally high learning rate might have the $p (s)$ prior of "I am worthy/competent", but a single failed exam might update it to "I am incompetent/dumb/worthless". This depressed state becomes an attractor, because the brain, aiming to minimize prediction error, subsequently filters and discounts positive data to confirm the new, negative self-belief.

ACh background + evidence

The neurotransmitter acetylcholine (ACh) is present both in all parts of the CNS and in the PNS. In the brain, there are two classes of receptors for acetylcholine; the nicotinic receptors (the target of nicotine), and muscarinic receptors, both of which are known for having central roles in memory-formation and cognition, as well as (indirectly) being the targets of common Alzheimer's disease medication.

In the 1950s, the correlation of increased central ACh and depression was discovered, and in the 70s it was formalised as the cholinergic-adrenergic hypothesis of mania and depression^[2]. Later, experimental increase in central acetylcholine has been shown to induce analogues of depression in animal models, such as "learned helplessness"^[3].

The cholinergic (affecting ACh receptors) system is also the target of many "cognitive enhancers", such as the first explicitly labelled "nootropic" piracetam, as well as nicotine. The mechanism of these cholinergic nootropics has been proposed by Scott Alexander, Steven Byrnes, and firstly Karl Friston, to be an increase in something called "learning rate" in ML, and "precision" (of bottom-up transmission) in the Free Energy Principle approach to neuroscience^[4]. In essence, this parameter, encoded by ACh, determines how "significant" the currently perceived signals are, and thus how significantly they may "override" prior models of the perceived object/situation. In ActInf terms, the prediction error in bottom-up signal is made more significant^[5], independent of the actual significance of the "wrongness" in one's prior understanding of the given sensed situation. Since prediction error may be perceived as suffering/discomfort, this seems relevant to the dysphoria^[6], that is part of depression.

This is similar to the concept of Direction of Fit, where the parameter is [mind-to-world/world-to-mind]. In other words, how strongly one imposes their will to change the world when perceived data conflicts with their desires (~low sensory precision), as opposed to “The signals I perceive differ significantly from my prior beliefs, so I must change my beliefs” (~high sensory precision).

In another model, ACh can be viewed as strengthening the arrows from the blue area, causing the "planning" part to be relatively less influenced by the green (goal) nodes 32 and 33, whereas dopamine is doing the opposite (which suggests the proposed tradeoff between the "simulation" being more "epistemically accurate" vs. "will-imposing").

from: Idealized Agents Are Approximate Causal Mirrors (+ Radical Optimism on Agent Foundations) by Thane Ruthenis

More handwavily, if agency is time travel, ACh makes this time travel less efficient, for the benefit of better simulation of the current state of the world.

The post assumes that desires and epistemic priors are encoded in a similar way in the brain (explained in the previous section), and a state of high acetylcholine signalling is thus able to "override" not only prior beliefs, but also desires about how the world (including the agent) should be, leading to loss of motivation and goals (long-term and immediate, even physiological needs in severe cases), compromising a part of the symptomatics of depression.

There is also some evidence for ACh modulating updating on aversive stimuli specifically^[7]^[8], as well as acetylcholineesterase (the enzyme breaking down ACh) increasing in the recovery period after stress^[9] (suggesting the role of ACh as a positive stress modulator). However, it seems too unclear, so I'll assume for the rest of the post that ACh modulates precision on ascending information (PEs), in general.

Dopamine

Dopamine is a neurotransmitter belonging to the class of catecholamines (together with (nor)-epinephrine) and, more broadly, to the monoamines (with serotonin).

Dopamine (DA) seems to be the reward signal created by the human's learned "reward function", coming from the striatum. In Big picture of phasic dopamine, Steve Byrnes proposes this idea in more detail. In short, there is a tonic level of dopaminergic neuron activation, and depending on whether the current situation is evaluated as positive or negative by the brain, more or less dopaminergic firing will occur than at baseline. At the same time, this reward mechanism applies to internally generated thoughts and ideas on potential actions. This is why dopamine-depleted rats will not put in any effort into finding food (but will consume it if placed into their mouth).

In this theory, dopamine is (very roughly and possibly completely incorrectly) the "top-down" signal enforcer; the mechanism for enforcing priors about oneself (which, according to FEP theory, are all downstream of the prior on one's continued existence). In ActInf literature, dopamine has the role of increasing policy precision $(γ)$ , balancing bottom-up information precision.^[10]

Overactivity of dopaminergic signalling (in certain pathways, certain receptors) leads to mania^[11], and in different pathways, to psychosis^[12]. Both seem somewhat intuitive; mania seems like the inverse of depression, as a failure to update realistically based on reality and instead enforcing grandiose ideas propagating top-down. Psychosis seems like the more "epistemic" counterpart to this - internally generated priors on the state of reality are enforced on perception, while bottom-up, epistemically-correcting signalling is deficient. If a psychotic person has a specific delusion or specific pattern/symbol that they are convinced is ever-present, pattern-matching will be extremely biased towards these patterns, enforcing the prior.

Then, should we just give dopamine agonists or amphetamines to depressed people?

maybe, but it does not always work.
- This can be explained by the fundamental prediction error creating suffering in depression; The mismatch of internally generated sense of value/goals/self-esteem/... and the amplified bottom-up signalling showing that these goals are not achieved at this moment, that the world does not exactly fit what one likes/values, and that one's own current state is, well.. depressed.
- As shown in the diagram in the section Relevance to lesswrong, having relatively high ACh signalling, even while dopamine is higher, may not feel great either. While it would be different from depression, irritability can easily develop in such a state (e.g. when on amphetamines, but without addressing the high ACh state) - the prediction error comes from reality not conforming to the now high-precision probability on $γ$ .
Strangely, dopaminergic antagonists are also quite effective treatments for severe depression.
- Specifically antipsychotics, which block certain dopamine receptors, are effective against psychoses, mania, and depression. This seems to contrast with what I have proposed - namely that mania and psychosis are the exact inverse of depression, and so balancing depression by some mania-inducing substance might alleviate it.
  - However; The problem in depression is not high learning rate per se, but rather its combination with relatively normal dopamine signalling, which creates the expected priors clashing with perceived signals. In fact, I (tentatively) believe that if both are "turned up" significantly, a very dysphoric state, such as psychotic depression might emerge, so in such a situation, stimulants might rather be harmful.
  - Therefore, the approach of "decreasing the intensity of the reward function" (which, importantly, is both positively and negatively valenced) also leads to lower prediction error, or "clash", that creates the phenotype of depression. On the downside, this approach might lead to a dulled and less motivated personality, as especially typical (older) antipsychotics are often reported to act. (Anecdotally, I found the atypical antipsychotic Aripiprazole to be an effective antidepressant, possibly due to its partial-agonist activity at dopamine receptors.)

The attractor state

Depression usually begins after, or during, some unpleasant life situation. This then leads to the adaptive increase in Acetylcholine and rumination, often reinforced by inflammatory cytokines, causing one to prefer to spend time withdrawn, passive. This adaptation has the role of enabling intense reevaluation and mistake-focused analysis to isolate what exactly one might have done wrong, causing this large clash of one's value function with reality.

In the modern environment, these unpleasant states can often be chronic stress, success anxiety, feeling left out of fully living, etc. If this is the case, enough and/or intense enough situations of failure (in career, socially, ..) can lead to this adaptive hypervigilance to mistakes and rumination, mediated by ACh, as well as expectation of uncertainty.^[13]

This increases one's focus on past mistakes, but also on repeated situations where mistakes have occurred in the past. Since (as described before) this high-ACh state erodes confidence in top-down processing (such as values/goals/self-concepts), the observed situation, such as an exam, or a socially stressful situation, is already objectively perceived as being "out of one's control", as the human is less confident in their ability to impose their will on the situation, as opposed to the situation imposing its implications on the human's beliefs/aliefs.

This leads to a positive feedback cycle leading to withdrawal, passivity, pessimism about one's own abilities, etc.

This state seems consistent with the later evolutionary explanation, but usually leads to an inflexible and hard-to-escape attractor, making recovery quite hard. This may plausibly be explained by the fact, that in modern times, the specific "mistakes" leading to this cycle tend to be less tractable, or amplified by contrast to a global set of humans to compare oneself with.

In addition, the depressed state may in part be an adaptation to reduce dysphoria caused by constant prediction error. Specifically, as the world becomes perceived as unpredictable and uncontrollable, it is a simple fallback strategy to predict constant failure. While depression is often seen as a condition of intense suffering, dysphoria (the opposite of euphoria) is not a central symptom (as opposed to e.g. OCD or BPD). This may be because once one is already in a depressed state, the depression can become a sort of "comforting", predictable state, where at least the prediction "it will not get better" is getting confirmed by reality.

The lack of things (success, action, happiness, executive function) is easier to predict than their presence (including their presence to a normal degree - functioning existence is still more variable than a severely depressed state).

How might this be escaped?

Increasing temperature
- psychedelics
- unexpected adventures
- lucid dreaming (?)
- moving countries (or similarly drastic life changes)
Giving strong, positive data to update on
- most obviously, emotional support
- "winning the lottery"
- getting opportunities to develop..

Formalisation:

[Minimizing prediction error] can be achieved in any of three ways: first, by propagating the error back along cortical connections to modify the prediction; second, by moving the body to generate the predicted sensations; and third, by changing how the brain attends to or samples incoming sensory input.

from Interoceptive predictions in the brain

Using notation from Mirza et al. (2019)^[14]

Used notation:
εt = prediction error
Π(o) = sensory precision (inverse variance)
Π(μ) = prior precision
ζ = log-precision; ACh increases ζ → Π(o) = exp(ζ)
γ = policy precision (dopaminergic inverse temperature)
η_eff = effective learning rate induced by precision
G(π) = expected free energy of policy π

Variational free energy for a generative model $(p_{θ} (s, o))$ , approximated as a density $(q_{ϕ} (s))$ is:

$F (q, θ) = E_{q (s)} [ln q (s) - ln p_{θ} (o, s)] .$

Under a Gaussian predictive-coding formulation, and with sensory prediction errors
$ε_{t} = o_{t} - g (μ_{t})$ , free energy can be locally approximated as:

$F (q, θ) = E_{q} [ln q (s) - ln p_{θ} (o, s)]      negative ELBO \approx \sum_{t} \frac{1}{2} ε_{t}^{⊤} Π_{t}^{(o)} ε_{t}      precision‑weighted prediction error + complexity terms$

where $Π_{t}^{(o)}$ is the sensory precision (inverse covariance) at time $t$ . “Complexity” collects the KL terms over states and any higher-level priors.

Gradient descent on $F$ yields the canonical update of sufficient statistics $μ_{t}$ :

$Δ μ_{t} \propto Π_{t}^{(o)}    sensory precision {\frac{\partial g (μ_{t})}{\partial μ_{t}}}^{⊤} ε_{t} - Π_{t}^{(μ)}    prior precision (μ_{t} - {^μ}_{t})$

Increasing $Π_{t}^{(o)}$ steepens the contribution of sensory prediction errors and thus
increases the effective learning rate $η_{eff}$ , while increasing
$Π_{t}^{(μ)}$ stabilises $μ_{t}$ by tightening priors $(Π^{(μ)})$ .

Claim:

Acetylcholine primarily modulates the log-precision $ζ$ on ascending prediction errors,
so that
$Π_{t}^{(o)} = exp (ζ_{t})$
and high ACh corresponds to high sensory precision $Π^{(o)}$ , producing a high effective learning rate $η_{eff}$ .

Catecholamines (especially dopamine) encode policy precision $γ$ and contribute to
the stability of higher-level priors (increasing $Π^{(μ)}$ ). Policies are inferred via
$q (π) \propto exp (- γ G (π))$ , where $G (π)$ ) is expected free energy.

Thus:

Depression is characterised by
$Π^{(o)}$ high (ζ↑ via ACh),
$Π^{(μ)}$ low,
$γ$ low (DA↓) (but not extremely low, that would probably cause DDMs like Athymhomia^[15])

This regime overweights bottom-up errors, underweights stabilising priors, and flattens
the posterior over policies $q (π)$ . Small mismatches produce large belief-updates,
leading to unstable self-models, helplessness, anhedonia, and rumination.

Mania is characterised by
$Π^{(o)}$ low (ζ↓),
$Π^{(μ)}$ high,
$γ$ high (DA↑).

Prediction errors are underweighted, priors and policies become over-precise, and
$q (π)$ becomes sharply peaked. This suppresses corrective evidence and produces
grandiosity, overconfidence, and reckless goal pursuit.

[source].

Possible evopsych explanation: Rumination and Sickness behaviour

On ancestral timescales, encountering a persistent, catastrophic model failure (social defeat, resource collapse) justifies switching into a high‑ACh, high‑learning regime that suspends costly goal pursuit and reallocates compute to problem solving (analytic rumination), until a better policy emerges. The cost of false negatives (missing the hidden cause of a disaster) exceeded the cost of prolonged withdrawal; hence a design that forces extended search even when the cause is exogenous.

Hollon et al 2021 justifies long depressive episodes as evolutionarily adaptive because they force rumination & re-examination of the past for mistakes.
One might object that such rumination is merely harmful in many cases, like bereavement from a spouse dying of old age—but from the blackbox perspective, the agent may well be mistaken in believing there was no mistake! After all, an extremely bad thing did happen. So better to force lengthy rumination, just on the chance that a mistake will be discovered after all. (This brings us back to RL’s distinction between high-variance evolutionary/Monte Carlo learning vs smarter lower-variance but potentially biased learning using models or bootstraps, and the “deadly triad”.)

from 'Evolution as a backstop for Reinforcement Learning' by Gwern^[16]

This is related to the model of depression as sickness behaviour; an adaptive behaviour caused by an increase in inflammatory cytokines (which are also implicated in depression)^[17], causing social withdrawal, physical inactivity and excessive sleep.

This might serve a dual role - giving the immune system the opportunity to focus on combating the pathogen in case of infection, and when combined with increased ACh, allowing the mind to focus on ruminating about how one might have done things differently to avoid failures/mistakes committed.

REM sleep and depression:

Depressed patients' sleep tends to have a higher proportion of REM sleep and REM deprivation (REM-D) has been found to be an effective treatment for depression.^[18] The standard medications for depression (SSRIs, SNRIs, DNRIs, MAOis,...) increase REM latency and shorten its duration (by preventing the decrease of monoamines necessary for REM sleep to occur), effectively creating REM sleep deprivation, which may be a possible mechanism of their effectiveness.^[19] (Interestingly, it doesn't seem like the significantly reduced amount of REM sleep due to SSRI usage causes any severe cognitive side effects.)

How this relates to the theory:

REM sleep (when most dreaming occurs) is characterized by high ACh and relative monoaminergic silence (NE/5‑HT/DA strongly reduced). If ACh scales precision on ascending signals, what does it do in REM when there is no external sensory stream? It amplifies the precision of internally generated activity, treating spontaneous (often related to that day's memories) cortical/hippocampal patterns as high‑fidelity “data,” while weakened monoaminergic tone relaxes top‑down priors. Acetylcholine in REM sleep is theorized to function as following;

"Cholinergic suppression of feedback connections prevents hippocampal retrieval from distorting representations of sensory stimuli in the association neocortex".

This seems to suggest that REM sleep functions essentially as the stage of sleep in which most new/prior memories are not consolidated (as happens in slow-wave-sleep), but rather the space is given for "learning" of new (synthetic) information, without interference from existing models. This happens during waking life when ACh is high, but during dreaming this process is radically upregulated, while there is an absence of external stimuli. (Karl Friston explains this as REM sleep portraying the basal "theatre of perception", which in waking life updates based on sensory information, but during dreaming, the generative "virtual reality model" exists by itself, to be refined for the next time it's used for waking perception).^[20]

In Active Inference terms, REM is a regime where precision on ascending (internally generated) errors is high and priors are pliable; the model explores and re‑parameterizes itself by fitting narratives to internally sampled data. If the waking baseline is already ACh‑biased and monoamine‑depleted (the depressed phenotype), REM further erodes stabilizing priors about one's values and self. If REM sleeps dominates compared to slow-wave-sleep, more space is given to increasing uncertainty related to dream subjects (which may be related to the previous day's experiences), rather than consolidating existing priors.^[21]

Two forms of neuroplasticity:

Why is ACh causing plasticity that leads to depression, and BDNF (e.g. through psychedelics) is inversely correlated with it?^[22]

Acetylcholine causes updating based on prediction errors - the learning occurs in uncertain situations, when the agent needs to be hyperaware of possible mistakes that are expected to happen (or have happened, as in the case of rumination).^[23] Long-term potentiation (LTM) or long-term depression (LTD) are more likely to occur, in existing synaptic connections.^[24]

BDNF, on the other hand, stimulates the creation of entirely new synapses and maintains the survival of existing neurons, such as in the hippocampus. BDNF expression tends to be decreased in depressed individuals, and hippocampal volume usually seems to be lowered.

This enables the emergence of "local plasticity" leading into the depression-phenotype attractor state, while "global plasticity" is lowered. Synapses in the hippocampus die, while a small subset gets continually amplified.

In FEP, the type of learning that's facilitated by BDNF might be structure learning, specifically bayesian model expansion^[25], though I have not read much about this.

The role of serotonin

While this theory is focused on a non-serotonin mechanism of depression, there is an interesting connection to be made to the role of serotonin. Serotonin famously has many types of receptors, many of which seem to have completely different effects, spanning from nausea to cognition to control of neurotransmitter release, sometimes even having seemingly opposite effects.
Several are relevant to depression, mostly by being their antagonism having antidepressant effects - this is true of 5-HT2A (the psychedelic receptor), 5-HT3, 5-HT7 (both related to nausea), and 5-HT2C (inhibitory of dopamine release).
However, the pair of receptors most often targeted by antidepressants are the 5-HT2A and 5-HT1A receptors. The strange thing is that both antagonism (e.g. Mirtazapine) and agonism (e.g. Psilocybin) of the 5-HT2A receptor have antidepressant effects. The 5-HT1A receptor seems to have effects opposing that of 5-HT2A, e.g. , it's agonism reduces effects of psychedelics.^[26]
Carhart-Harris and Nutt argue that serotonin signalling has the role of dealing with adverse situations, with the 1A and 2A receptors playing the role of passive coping and active coping, respectively. The 1A receptor is posited to have a stress-moderating, calming, "stoic" mechanism, such that adversity is passively tolerated more easily. This is said to be the default pathway. Meanwhile, if serotonergic signalling takes on a higher level, the active coping pathway mediated by the 2A receptor begins, causing an increase in neuroplasticity, destabilising the person's beliefs and models, to completely reevaluate the solutions to the current adverse situation, for which passive coping doesn't suffice anymore.
This 2A-mediated increase in plasticity may be analogous to the ACh-induced plasticity seen in depression - both may be part of a unified mechanism by which the brain deals with unexpectedly severe adverse situations, forcing the brain into reevaluation mode.

Relevance to LessWrong

It seems like Sequences-style epistemic rationality favours a state similar to the high-ACh state described above. There appears to be a divide between the Rationalist and the Bay-area-startup-founder archetypes, the former of which is notably identified with the "doomer", while the other wishes to "accelerate", not worrying about risks.
In addition, it seems like many of the ones closest to the former camp tend to either become disillusioned with their work (such as MIRI dissolving its research team) or switch into the other camp, starting work in AI capabilities research (thus moving right on the diagram).

While I don't have actual data, it anecdotally seems to me like depression is quite common among lesswrongers and is to some extent connected to the emphasis on careful epistemic rationality (through the relative downregulation of policy precision and upregulation of ascending information precision).

It would be foolish to propose taking anticholinergics and dopaminergics because of this; rather, it seems good to be aware of the potential emotional fragility of a highly cautious, high-learning-rate state and the tradeoff that might exist between motivation (limbic dopamine activity driving up policy precision) and learning rate - amphetamines may not necessarily make you smarter/wiser.

Applications of the idea for treating depression:

Most importantly: avoid nootropics such as acetylcholinesterase inhibitors (huperzine, galantamine), piracetam, Alpha-GPC, CDP choline, ... (anything cholinergic) when depressed.

Potentially effective alternative nootropics:

with some anticholinergic effect:
- Bromantane, Amantadine, Memantine
non-anticholinergic, but upregulate BDNF significantly:
- Semax, Selank

Targeting ACh receptos- Anticholinergics:

The older (tricyclic) antidepressants, such as amitryptiline^[26], may have been effective in part due to their anticholinergic effects. It's probably worth trying these if conventional antidepressants fail. Bupropion also has anticholinergic effects, but only on nicotinic receptors, which seem less relevant to the depression-inducing effect of acetylcholine.
A pure anticholinergic - Scopolamine, seems to be an effective, fast acting antidepressant^[27]

Targeting sleep - REM deprivation:

If this is a significant component of SSRI (and others') effectiveness, maybe something with a shorter duration, taken at night, would be as effective and have fewer side effects.
Polyphasic sleep as explained in this comment, might be a useful DIY method of REM-D for depression.
As long as one doesn't use supplements for this (usually ACh-esterase-inhibitors), lucid dreaming might potentially enable one to shape the signal one is updating on, into a positive one (e.g. by performing activities successfully in the dream).

Targeting sickness behaviour - antiiflamatory drugs:

NSAIDs, such as ibuprufen or aspirin may have some effect in some cases.^[28]
Psilocybin seems to be an effective antidepressant^[29] and it is also a strong antiinflamatory drug.^[30]

Targeting dopamine:

bromantane
amphetamines
bupropion (slightly)
selegiline

Targeting BDNF:

psychedelics (DMT, Psilocybin)^[31]
Russian peptides Semax^[32] and Selank^[33]
traditional antidepressants (SSRIs, MAOis, others)^[34]
Amitriptyline (again); directly binds to TrkB, the BDNF receptor^[35]
Aerobic exercise^[36]
Sleep deprivation^[37]
maybe Lion's Mane mushrooms^[38]
many many others

Addendum - other things

The topic of why SSRIs and other serotonergics work is very vast; it may involve indirect increase of a neuroplastogen, increase in the GABAergic neurosteroid allopregnanolone, decrease in inflammatory cytokines, 1A serotonin receptor activation decreasing substance P release, sigmaergic agonism, nitric oxide inhibition, REM sleep suppression (mentioned in text), and many more.

Ketamine seems to act by redirecting glutamate from NMDA receptors (which it blocks) to AMPA receptors (which upregulate neuroplasticity). Glutamate in general is quite important in depression.

The HPA axis, specifically overproduction of CRF, which promotes cortisol release, is really important to depression too.^[39]
The body also has it's endogenous "dysphoriant" (unpleasant-qualia-inducer), called dynorphin, which is quite understandably linked to depression.^[40]

The trace amine phenethylamine (PEA) seems to also be implicated in depression^[41], it acts as a sort of endogenous amphetamine and is increased in schizophrenia^[42], so maybe it plays a large part in what I attribute to dopamine in the post. Selegiline, mentioned above, inhibits its breakdown.

That is to say, acetylcholine and dopamine are far from being the sole factors in depression, and targeting them may mean not targeting the central factor in some people's depression. Nonetheless, it seems useful not to ignore this model when thinking about depression, as some high-impact interventions that are otherwise ignored depend on this model (or the underlying True Mechanism).

^{^}
SSRIs increase intersynaptic serotonin acutely, but take weeks to have an effect - there must be something other than serotonin increase going on.
^{^}
Janowsky et al. (1972)
^{^}
more detail in: The Role of Acetylcholine Mechanisms in Affective Disorders
^{^}
https://www.sciencedirect.com/science/article/pii/S0022249621000973
^{^}
Acetylcholine modulates the precision of prediction error in the auditory cortex - PMC
^{^}
https://pmc.ncbi.nlm.nih.gov/articles/PMC5390700/
^{^}
https://pmc.ncbi.nlm.nih.gov/articles/PMC5241223/#S1
^{^}
https://pubmed.ncbi.nlm.nih.gov/34782794/
^{^}
https://pubmed.ncbi.nlm.nih.gov/21198638/
^{^}
https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002327
^{^}
https://pubmed.ncbi.nlm.nih.gov/970489/
^{^}
https://en.wikipedia.org/wiki/Dopamine_hypothesis_of_schizophrenia
^{^}
https://www.gatsby.ucl.ac.uk/~dayan/papers/ydnips02.pdf
^{^}
https://www.nature.com/articles/s41598-019-50138-8/figures/1
^{^}
https://en.wikipedia.org/wiki/Disorders_of_diminished_motivation, Athymhormia being a severe variant, where motivation is so low it destroys even motivation to move (https://en.wikipedia.org/wiki/Athymhormia)
^{^}
Evolution as Backstop for Reinforcement Learning · Gwern.net
^{^}
https://pmc.ncbi.nlm.nih.gov/articles/PMC3741070/
^{^}
https://pmc.ncbi.nlm.nih.gov/articles/PMC9960519/#sec4-jpm-13-00306
^{^}
https://en.wikipedia.org/wiki/Rapid_eye_movement_sleep#Effects_of_SSRIs
^{^}
https://pubmed.ncbi.nlm.nih.gov/25346710/
^{^}
An interesting anecdotal report of @Emrik using these facts about REM sleep to increase their sleep efficiency.
^{^}
https://en.wikipedia.org/wiki/Epigenetics_of_depression#Brain-derived_neurotrophic_factor
^{^}
Uncertainty, neuromodulation, and attention - PubMed
^{^}
https://www.nature.com/articles/ncomms3760
^{^}
https://www.sciencedirect.com/science/article/pii/S0022249620300857#sec9.2
^{^}
https://en.wikipedia.org/wiki/Amitriptyline#Pharmacology
^{^}
https://www.sciencedirect.com/science/article/abs/pii/S1876201823000382
^{^}
https://bmcmedicine.biomedcentral.com/articles/10.1186/1741-7015-11-74
^{^}
unlikely the reader hasn't already heard of this research...
^{^}
https://www.sciencedirect.com/science/article/pii/S0889159123002684
^{^}
https://pubmed.ncbi.nlm.nih.gov/38385351/
^{^}
https://pubmed.ncbi.nlm.nih.gov/16635254/
^{^}
https://www.researchgate.net/publication/23306196_Intranasal_administration_of_the_peptide_Selank_regulates_BDNF_expression_in_the_rat_hippocampus_in_vivo
^{^}
https://pmc.ncbi.nlm.nih.gov/articles/PMC8346988/
^{^}
https://en.wikipedia.org/wiki/Amitriptyline#Pharmacology
^{^}
https://pubmed.ncbi.nlm.nih.gov/21722657/
^{^}
https://pubmed.ncbi.nlm.nih.gov/26758201/
^{^}
https://pmc.ncbi.nlm.nih.gov/articles/PMC10952766/
^{^}
https://pubmed.ncbi.nlm.nih.gov/9662725/
^{^}
https://arxiv.org/html/2408.06763v1
^{^}
https://pubmed.ncbi.nlm.nih.gov/9081552/
^{^}
https://pubmed.ncbi.nlm.nih.gov/7906896/

LESSWRONG
LW