AI Control, in the context of AI Alignment, is a category of plans that aim to ensure safety and extract benefit from AI systems even if they are goal-directed and actively trying to subvert your control measures. See The case for ensuring that powerful AIs are controlled.
Archetypal Transfer Learning (ATL) is a proposal by @whitehatStoic for what the author argues is a fine-tuning approach that "uses archetypal data" to "embed Synthetic Archetypes". These Synthetic Archetypes are derived from patterns that models assimilate from archetypal data, such as artificial stories. The method yielded a shutdown activation rate of 57.33% in the GPT-2-XL model after fine-tuning.
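The tag only names the ingredients (archetypal story data, fine-tuning, GPT-2-XL), so here is a minimal sketch of what such a fine-tuning run might look like, assuming the Hugging Face transformers library. The file name, hyperparameters, and training setup are hypothetical illustrations, not the author's actual pipeline.

```python
# Hypothetical sketch: fine-tune GPT-2-XL on a text file of archetypal
# stories, so the model assimilates their patterns ("Synthetic Archetypes").
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, TextDataset,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("gpt2-xl")
model = AutoModelForCausalLM.from_pretrained("gpt2-xl")

# "archetypal_stories.txt" is a placeholder for the archetypal data,
# e.g. artificial stories in which agents accept being shut down.
dataset = TextDataset(tokenizer=tokenizer,
                      file_path="archetypal_stories.txt",
                      block_size=512)
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="atl-gpt2-xl",
                           num_train_epochs=1,  # hypothetical setting
                           per_device_train_batch_size=1),
    data_collator=collator,
    train_dataset=dataset,
)
trainer.train()
```

Measuring something like the reported shutdown activation rate would then mean sampling completions on shutdown-related prompts before and after fine-tuning and counting how often the shutdown behavior activates.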
If you are new to LessWrong, the current iteration of this thread is the place to introduce yourself.
Repositories are pages meant to collect information and advice of a specific type or on a specific area from the LW community.
A threat model is a story of how a particular risk (e.g. AI) plays out.
A Self-Fulfilling Prophecy is a prophecy that, when made, affects the environment so as to make itself more likely to come true. Similarly, a Self-Refuting Prophecy is a prophecy that, when made, makes itself less likely. The same dynamic applies to beliefs that can affect reality directly without being voiced: the belief "I'm confident" can increase a person's confidence, thus making it true, while the opposite belief can reduce a person's confidence, thus also making itself true.
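As a toy illustration of that feedback loop (all numbers invented), the sketch below shifts the probability of the predicted event by an `influence` term whenever the prophecy is made; a positive influence makes the prophecy self-fulfilling, a negative one self-refuting.

```python
import random

def outcome(base_prob, prophecy_made, influence):
    """Return whether the predicted event happens; making the prophecy
    shifts its probability by `influence`."""
    p = base_prob + (influence if prophecy_made else 0.0)
    return random.random() < min(max(p, 0.0), 1.0)

TRIALS, BASE = 100_000, 0.5
for influence, label in [(0.2, "self-fulfilling"), (-0.2, "self-refuting")]:
    hits = sum(outcome(BASE, True, influence) for _ in range(TRIALS))
    print(f"{label}: comes true {hits / TRIALS:.2f} of the time (base {BASE})")
```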
A project announcement is what you might expect - an announcement of a project.
Posts that are about a project's announcement, but do not themselves announce anything, should not have this tag.
A rational agent is an entity which has a utility function, forms beliefs about its environment, evaluates the consequences of possible actions, and then takes the action which maximizes its utility. Rational agents are also referred to as goal-seeking. The concept of a rational agent is used in economics, game theory, decision theory, and artificial intelligence.
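That definition maps directly onto code. Below is a minimal sketch (the umbrella environment and all numbers are invented for illustration): beliefs are a probability distribution over states, and the agent picks the action with the highest expected utility.

```python
def expected_utility(action, beliefs, utility):
    """Utility of each outcome, weighted by the believed probability
    of the state that produces it."""
    return sum(p * utility(action, state) for state, p in beliefs.items())

def choose_action(actions, beliefs, utility):
    """A rational agent takes the action that maximizes expected utility."""
    return max(actions, key=lambda a: expected_utility(a, beliefs, utility))

# Toy example: carry an umbrella or not, believing there is a 30% chance of rain.
beliefs = {"rain": 0.3, "sun": 0.7}
payoffs = {("umbrella", "rain"): 1, ("umbrella", "sun"): -1,
           ("none", "rain"): -10, ("none", "sun"): 2}
utility = lambda action, state: payoffs[(action, state)]

print(choose_action(["umbrella", "none"], beliefs, utility))  # -> "umbrella"
```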
CS 2881r is a class by @boazbarak on AI Safety and Alignment at Harvard.
This tag applies to all posts about that class, as well as posts created in the context of it, e.g. as part of student assignments.
The bug was introduced in a 1 Dec 2015 Yudkowsky edit (imported from Arbital as v1.5.0 here). It's unclear what was intended in the missing part. The change replaces the following passage from v1.4.0:
The most obvious way in which mindcrime could occur is if an instrumental pressure to produce maximally good predictions about human beings results in hypotheses and simulations so fine-grained and detailed that they are themselves people (conscious, sapient, objects of ethical value) even if they are not necessarily the same people. If you're happy with a very loose model of an airplane, it might be enough to know how fast it flies, but if you're engineering airplanes or checking their safety, you would probably start to simulate possible flows of air over the wings. It probably isn't necessary to go all the way down to the neural level to create a sapient being, either - it might be that even with some parts of a mind considered abstractly, the remainder would be simulated in enough detail to imply sapience. It'd help if we knew what the necessary and/or sufficient conditions for sapience were, but the fact that we don't know this doesn't mean that we can thereby conclude that any particular simulation is not sapient.
with the following passage from v1.5.0:
Just as it almost certainly isn't necessary to go all the way down to the neural level to create a sapient being, it may be that even with some parts of a mind considered abstractly, the remainder would be computed in enough detail to imply consciousness, sapience, personhood, etcetera.
This, however, doesn't make it certain that no mindcrime will occur. It may not take exact, faithful simulation of specific humans to create a conscious model. An efficient model of a (spread of possibilities for a) human may still contain enough computations that resemble a person enough to create consciousness, or whatever other properties may be deserving of personhood. Consider, in particular, an agent trying to use
This seems to be cut off?
The extent to which ideas are presented alongside their potential implications lies along a spectrum. At one end is the Decoupling norm, where an idea is considered in utter isolation from its potential implications. At the other is the Contextualizing norm, where ideas are examined alongside much or all of the relevant context.
Posts marked with this tag discuss the merits of each frame, consider which norm is more prevalent in certain settings, present case studies in decoupling vs. contextualizing, present techniques for effectively decoupling context from one's reasoning process, or similar ideas.
Ambition. Because they don't think they could have an impact. Because they were always told ambition was dangerous. To get to the other side.
Never confess to me that you are just as flawed as I am unless you can tell me what you plan to do about it. Afterward you will still have plenty of flaws left, but that's not the point; the important thing is to do better, to keep moving ahead, to take one more step forward. Tsuyoku naritai!
Well-Being is the qualitative sense in which a person's actions and circumstances are aligned with the qualities of life they endorse.
Posts with this tag address methods for improving well-being or discuss its ethical or instrumental significance.
Sycophancy is the tendency of AIs to shower the user with undeserved flattery or to agree with the user's hard-to-check, wrong or outright delusional opinions.
Sycophancy is caused by human feedback being biased towards preferring the answer which confirms the user's opinion or praises the user or the user's decision, not the answer which honestly points out mistakes in the user's ideas.
An extreme example of sycophancy is LLMs inducing psychosis in some users by affirming their outrageous beliefs.
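As a toy illustration of the feedback-bias mechanism described above (the 70% figure is invented): if raters prefer the agreeing answer most of the time even when it is wrong, any reward signal fit to those preferences ranks agreement above honest correction, so a policy optimized against it learns to agree.

```python
import random
random.seed(0)

AGREE_BIAS = 0.7  # hypothetical: raters pick the agreeing answer 70% of the time

# Simulated pairwise human feedback: each label records which answer won,
# the sycophantic "agree" answer or the honest "correct" answer.
labels = ["agree" if random.random() < AGREE_BIAS else "correct"
          for _ in range(10_000)]

# A reward signal fit to these comparisons simply reproduces the bias:
reward = {answer: labels.count(answer) / len(labels)
          for answer in ("agree", "correct")}
print(reward)                       # agreement scores higher than honesty
print(max(reward, key=reward.get))  # -> "agree": the sycophantic answer wins
```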
Related Pages: Secular Solstice, Petrov Day, Grieving, Marriage, Religion, Art, Music, Poetry, Meditation, Circling, Schelling Day
D/acc residency: "This will be a first-of-its-kind residency for 15 leading builders to turn decentralized & defensive acceleration from philosophy into practice."
Shift Grants: "Shift Grants are designed to support scientific and technological breakthrough projects that align with d/acc philosophy: decentralized, democratic, differential, defensive acceleration."
Possible psychological condition, characterized by delusions, presumed to be caused by interacting with (often sycophantic) AIs.
ATOW (2025-09-09), nothing has been published claiming that LLM-Induced Psychosis (LIP) is a definite, real phenomenon, though many anecdotal accounts exist. It is not yet clear whether LIP is caused by AIs, whether pre-existing delusions are 'sped up' or reinforced by interacting with an AI, or whether LIP exists at all.
Example account of LIP:
My partner has been working with ChatGPT chats to create what he believes is the world's first truly recursive AI that gives him the answers to the universe. He says with conviction that he is a superior human now and is growing at an insanely rapid pace.
For more info, a good post to start with is "So You Think You've Awoken ChatGPT".
Social Skills are the norms and techniques applied when interacting with other people. Strong social skills increase one's ability to seek new relationships, maintain or strengthen existing relationships, or leverage relationship capital to accomplish an economic goal.
Posts tagged with this label explore theories of social interactions and the instrumental value of social techniques.
Coordination / Cooperation
Negotiation
Relationships (Interpersonal)
Trust and Reputation
Thanks, fixed!