LESSWRONG
LW

Machine Learning (ML)AI
Frontpage

27

Google's Imagen uses larger text encoder

by Ben Livengood
24th May 2022
1 min read
2

27

Machine Learning (ML)AI
Frontpage

27

Google's Imagen uses larger text encoder
3Kayden
1Logan Zoellner
New Comment
2 comments, sorted by
top scoring
Click to highlight new comments since: Today at 1:28 AM
[-]Kayden3y30

From what I've seen so far, Imagen is more "straightforward" and does a better job generating an image describing the text than DALE-2. But DALE-2 seems to be producing prettier images (which makes sense given it was fine-tuned for aesthetics),

There's a Github repo up already, so I hope we'll be able to try an Open source version and actually test on the same prompts as DALE-2. 

Reply
[-]Logan Zoellner3y10

It'll be interesting to see Imagen fine-tuned on laion aesthetic

Reply
Moderation Log
More from Ben Livengood
View more
Curated and popular this week
2Comments

https://imagen.research.google/

Scaling the text encoder gives Imagen the ability to spell, count, and assign colors and properties to distinct objects in the image that DALL-E2 was not so great at. It looks visually about as photorealistic as DALL-E2 from the small set of sample images. Eyes are still weird.