LESSWRONG
AI
Personal Blog


[ Question ]

What are some good language models to experiment with?

by tailcalled
10th Sep 2023

2 Answers

LawrenceC

Sep 10, 2023


If you care about having both the instruction-finetuned variant and the base model, I think I'd go with one of the smaller LLaMAs (7B/13B). Importantly, they fit comfortably on a single A100 (40 GB or 80 GB), which saves a lot of hassle. There are also a bajillion fine-tuned versions of them if you want to experiment.
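As a rough check on the "fits on one A100" point, here is a back-of-the-envelope sketch of the memory needed for the weights alone. The nominal parameter counts (7e9, 13e9) and the helper name `weight_memory_gb` are assumptions for illustration; real checkpoints differ slightly, and activations, the KV cache, and any optimizer state come on top of this.

```python
# Approximate memory for model weights only.
# Assumption: nominal parameter counts; activations, KV cache, and optimizer state are ignored.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(n_params: float, dtype: str = "fp16") -> float:
    """Return the weight footprint in GB (1 GB = 1e9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


for name, n_params in [("7B", 7e9), ("13B", 13e9)]:
    sizes = {dtype: weight_memory_gb(n_params, dtype) for dtype in BYTES_PER_PARAM}
    print(name, sizes)
```

In fp16 this gives about 14 GB for a 7B model and about 26 GB for a 13B model, so both leave headroom on a 40 GB card for inference-style experiments. Full fine-tuning with an optimizer like Adam needs several times more.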


Tao Lin

Sep 10, 2023


Pythia is meant for this

LawrenceC, 2y ago

Aren't the larger Pythias pretty undertrained?
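One way to make "undertrained" concrete is training tokens per parameter. The sketch below is only illustrative: the token counts (roughly 300B for every Pythia size, roughly 1T for the original LLaMA 7B/13B) are figures as I recall them from the respective papers and should be treated as assumptions, and the parameter counts are nominal. The helper name `tokens_per_param` is made up for this example.

```python
# Tokens-per-parameter comparison.
# Assumption: token and parameter counts below are approximate, recalled from the model papers.
def tokens_per_param(train_tokens: float, n_params: float) -> float:
    """Ratio of training tokens to parameters; higher means more training per weight."""
    return train_tokens / n_params


RUNS = {
    "Pythia-6.9B": (300e9, 6.9e9),
    "Pythia-12B": (300e9, 12e9),
    "LLaMA-7B": (1.0e12, 7e9),
    "LLaMA-13B": (1.0e12, 13e9),
}

for name, (tokens, params) in RUNS.items():
    print(f"{name}: ~{tokens_per_param(tokens, params):.0f} tokens per parameter")
```

Under these assumed figures, Pythia-12B sees about 25 tokens per parameter versus roughly 77 for LLaMA-13B, which is one reading of the concern raised here. Whether that matters depends on the experiment: Pythia's consistent setup across sizes can be worth more than raw capability.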

Like, if I want to experiment with a steering technique, it would be useful to have a language model that is small and capable, but not so finetuned that it becomes inflexible. (Or maybe ideally, a model that has both a finetuned and a non-finetuned variant.)

I've seen some people use GPT-2. Is that recommended? Are there any alternatives?