LESSWRONGTags
LW

Language Models

EditHistorySubscribe

Help improve this page

EditHistorySubscribe

Help improve this page

Language Models

Contributors

Language models are computer programs made to estimate the likelihood of a piece of text. "Hello, how are you?" is likely. "Hello, fnarg horses" is unlikely.

Language models can answer questions by estimating the likelihood of possible question-and-answer pairs, selecting the most likely question-and-answer pair. "Q: How are You? A: Very well, thank you" is a likely question-and-answer pair. "Q: How are You? A: Correct horse battery staple" is an unlikely question-and-answer pair.

The language models most relevant to AI safety are language models based on "deep learning". Deep-learning-based language models can be "trained" to understand language better, by exposing them to text written by humans. There is a lot of human-written text on the internet, providing loads of training material....

Posts tagged Language Models

9

93Inverse Scaling Prize: Round 1 Winners

Ethan Perez, Ian McKenzie

2y

16

8

2y

161

7

122How LLMs are and are not myopic

9mo

14

6

64How to Control an LLM's Behavior (why my P(DOOM) went down)

5mo

30

5

144Transformer Circuits

2y

4

5

105On the future of language models

4mo

17

5

46Goodbye, Shoggoth: The Stage, its Animatronics, & the Puppeteer – a New Metaphor

4mo

8

5

35Striking Implications for Learning Theory, Interpretability — and Safety?

4mo

4

5

22Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI?

4mo

4

5

18A Chinese Room Containing a Stack of Stochastic Parrots

4mo

2

5

4Alignment has a Basin of Attraction: Beyond the Orthogonality Thesis

3mo

15

4

3Language Model Memorization, Copyright Law, and Conditional Pretraining Alignment

5mo

0

3

183Large Language Models will be Great for Censorship

8mo

14

3

103Testing PaLM prompts on GPT3

2y

14

3

98LLM Modularity: The Separability of Capabilities in Large Language Models

1y

3