ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Bridging Language Models and Diffusion: A Concise Review of ELLA Background and Motivation At first glance the problem feels straightforward: modern text-to-image diffusion models often struggle with rich, multi-part prompts. What struck me immediate...
paperium.hashnode.dev4 min read