Tag

Text To Image Synthesis

All articles tagged with #text to image synthesis

ai-research, 2 years ago

UC Berkeley Researchers Use Large Language Models to Enhance Text-to-Image Synthesis

UC Berkeley and UCSF researchers have proposed LLM-grounded Diffusion (LMD), an approach that enhances prompt understanding in text-to-image generation. LMD integrates off-the-shelf frozen LLMs into diffusion models, yielding a two-stage generation process with enhanced spatial and common-sense reasoning capabilities. Beyond improved prompt understanding, LMD supports dialog-based multi-round scene specification and can handle prompts in unsupported languages. The research team's work opens new possibilities for improving the accuracy and diversity of synthesized images by integrating off-the-shelf frozen models.

artificial-intelligence, 2 years ago

"AI Generates Realistic Bird Images from Text Using Common Sense and Surprising Learning Techniques"

Researchers in China have developed a neural network called CD-GAN that generates high-quality bird images from textual descriptions. The network uses common-sense knowledge to enhance the generated image at three different levels of resolution, and it achieves scores competitive with other neural network methods. It was trained on a dataset of bird images and text descriptions. The authors believe that introducing common sense can greatly promote the development of text-to-image synthesis.