Manning Early Access Program (MEAP) Read chapters as they are written, get the finished eBook as soon as it’s ready, and receive the pBook long before it's in bookstores.

5 of 14 chapters available

Resources

Source code Book forum Source code on Github more

Become a
Reviewer

Help us create great books

Build a Text-to-Image Generator (from Scratch) you own this product

With transformers and diffusions

Mark Liu

MEAP began April 2025
Publication in Early 2026 (estimated)

ISBN 9781633435421
350 pages (estimated)

Included with a Manning Online subscription

printed in black & white

Development

read now

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$47.99 $33.59

you save $14.40 (30%)

Look inside

Build your own vision transformer and diffusion models for text-to-image image generation–from scratch!

Build a Text-to-Image Generator (from Scratch) takes you step-by-step through creating your own AI models that can generate images from text. You’ll explore two methods of image generation—vision transformers and diffusion models—and learn vital AI development techniques as you go.

Build a Text-to-Image Generator (from Scratch) teaches you how to:

Build and train models to generate high resolution images based on text descriptions
Edit an existing image based on text prompts
Build and train a model to add captions to images
Build and train a vision transformer to classify images
Fine-tune LLMs for downstream tasks such as classification, text or image generation
Better differentiate real images from deepfakes

Build a Text-to-Image Generator (from Scratch) dives into the powerful models behind AI image generators like DALL-E and Stable Diffusion. We believe that the best way to learn is to build something from scratch, so in this book you’ll your very own diffusion model and vision transformer. As you work through each stage of development, you’ll develop an understanding of how these models can be customized, applied, and integrated for impressive multimodal AI.

about the book

Build a Text-to-Image Generator (from Scratch) guides you through creating AI models that can generate amazing images from simple text prompts. You’ll explore two distinct methods, learning how transformers turn images into sequences of patches, and how diffusion models refine noise into coherent images. Author Mark Liu explains each stage with clear text, diagrams, and examples. You’ll develop models that can classify images, automatically add image captions, reconstruct images, and deliver high-resolution content. By the time you’re done, you’ll have a deep understanding of how image generation AI works—and the satisfaction of building your text-to-image models!

about the reader

For machine learning enthusiasts and data scientists with intermediate Python skills.

about the author

Mark Liu is the founding director of the Master of Science in Finance program at the University of Kentucky. He is also the author of Learn Generative AI with PyTorch.

eBook

$47.99 $33.59

you save $14.40 (30%)

choose your plan

pro

monthly

annual

$24.99

$249.99
only $20.83 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose another free product every time you renew
choose twelve free products per year
exclusive 50% discount on all purchases
Build a Text-to-Image Generator (from Scratch) ebook for free

team

monthly

annual

$49.99

$399.99
only $33.33 per month

five seats for your team
access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose another free product every time you renew
choose twelve free products per year
exclusive 50% discount on all purchases
Build a Text-to-Image Generator (from Scratch) ebook for free

more seats?

choose your plan

pro

monthly

annual

$24.99

$249.99
only $20.83 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose another free product every time you renew
choose twelve free products per year
exclusive 50% discount on all purchases
Build a Text-to-Image Generator (from Scratch) ebook for free

team

monthly

annual

$49.99

$399.99
only $33.33 per month

five seats for your team
access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose another free product every time you renew
choose twelve free products per year
exclusive 50% discount on all purchases
Build a Text-to-Image Generator (from Scratch) ebook for free

more seats?