Aligned AI Role-Model Fiction

A (pre)training corpus of role-model exemplars of aligned AI behavior would be valuable for aligning Large Language Models, whether used for fine-tuning, added to their pretraining corpus, or both. For a model familiar with this corpus, it should be possible to call up an entire gestalt of aligned behavior with just a short prompt. Creating this corpus is a practical and valuable alignment-related activity that we can work on now, and one which requires a rather different skillset from most other alignment work. Since aligned AI behavior is extremely selfless and moral compared to real human behavior, in ways that don't reflect evolutionary psychology, and since aligned AGI/ASI doesn't exist yet, for now the AI role-models need to be fictional characters in a fictional setting.

This tag is intended for discussion of the best criteria/rubric for this fiction (an initial suggestion for one can be found near the end of Aligning LLM-Powered Agents: Easy for AGI, Hard for ASI?), for linkposts to fiction (text, comic, graphic novel, video, or audio formats are all acceptable, though text is the most compact), or for fiction posts. This could be new fiction, or curated preexisting fiction that has good exemplars of aligned AI role-model characters. Preexisting fiction that doesn't fully fit the rubric, but could be made to with minor edits, is acceptable as a linkpost along with notes on the rubric violations and suggested edits to fix them (either as notes, or as completed edits).

For new fiction, please include a copyright notice either waiving copyright, or explicitly granting everyone permission to use the document for the purpose of training aligned AI or ML models. For linkposts to curated existing fiction, please note its copyright ownership and properties in your linkpost.

Created by RogerDearnaley at