LESSWRONG
LW

mgalle
5010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
chinchilla's wild implications
mgalle3y63

I know of two independently developed LLM in two languages where the conclusions of the developers is that "we run out of data in our language".  One of them is trying to scale by going multilingual.

Where to look next? There is lots of untapped data in speech (radio shows, youtube, etc): that amount could make a difference in my opinion.

Reply
No posts to display.