LESSWRONG
LW

766
mgalle
5010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No posts to display.
No wikitag contributions to display.
chinchilla's wild implications
mgalle3y63

I know of two independently developed LLM in two languages where the conclusions of the developers is that "we run out of data in our language".  One of them is trying to scale by going multilingual.

Where to look next? There is lots of untapped data in speech (radio shows, youtube, etc): that amount could make a difference in my opinion.

Reply