Manning Early Access Program (MEAP) Read chapters as they are written, get the finished eBook as soon as it’s ready, and receive the pBook long before it's in bookstores.

all chapters available

ch 1 audio

video summary first chapter summary

Resources

Source code Book forum Source code on GitHub more

Become a
Reviewer

Help us create great books

Build a Large Language Model (From Scratch) you own this product

Sebastian Raschka

MEAP began December 2023
Publication in August 2024 (estimated)

ISBN 9781633437166
400 pages (estimated)

Included with a Manning Online subscription

printed in black & white

Data

read now

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$47.99 $31.19

you save $16.80 (35%)

include audio $24.99 $16.24

Look inside

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up!

In Build a Large Language Model (from Scratch), you’ll discover how LLMs work from the inside out. In this insightful book, bestselling author Sebastian Raschka guides you step by step through creating your own LLM, explaining each stage with clear text, diagrams, and examples. You’ll go from the initial design and creation to pretraining on a general corpus, all the way to finetuning for specific tasks.

Build a Large Language Model (from Scratch) teaches you how to:

Plan and code all the parts of an LLM
Prepare a dataset suitable for LLM training
Finetune LLMs for text classification and with your own data
Apply instruction tuning techniques to ensure your LLM follows instructions
Load pretrained weights into an LLM

The large language models (LLMs) that power cutting-edge AI tools like ChatGPT, Bard, and Copilot seem like a miracle, but they’re not magic. This book demystifies LLMs by helping you build your own from scratch. You’ll get a unique and valuable insight into how LLMs work, learn how to evaluate their quality, and pick up concrete techniques to finetune and improve them.

The process you use to train and develop your own small-but-functional model in this book follows the same steps used to deliver huge-scale foundation models like GPT-4. Your small-scale LLM can be developed on an ordinary laptop, and you’ll be able to use it as your own personal assistant.

about the book

Build a Large Language Model (from Scratch) is a one-of-a-kind guide to building your own working LLM. In it, machine learning expert and author Sebastian Raschka reveals how LLMs work under the hood, tearing the lid off the Generative AI black box. The book is filled with practical insights into constructing LLMs, including building a data loading pipeline, assembling their internal building blocks, and finetuning techniques. As you go, you’ll gradually turn your base model into a text classifier tool, and a chatbot that follows your conversational instructions.

about the reader

For readers who know Python. Experience developing machine learning models is useful but not essential.

about the author

Sebastian Raschka has been working on machine learning and AI for more than a decade. Sebastian joined Lightning AI in 2022, where he now focuses on AI and LLM research, developing open-source software, and creating educational material. Prior to that, Sebastian worked at the University of Wisconsin-Madison as an assistant professor in the Department of Statistics, focusing on deep learning and machine learning research. He has a strong passion for education and is best known for his bestselling books on machine learning using open-source software.

eBook

$47.99 $31.19

you save $16.80 (35%)

include audio $24.99 $16.24

This is a fantastic resource for getting up to speed on LLMs fast.

Walter Reade, Staff Developer Relations Engineer, Kaggle/Google

This book, simply, sets the new standard for a detailed, practical guide on building and fine-tuning LLMs. This comprehensive, no-nonsense, and hands-on resource is a must-read for readers trying to understand the technical details or implement the processes on their own from scratch.

Dibyendu Roy Chowdhury, Data Scientist, Care Daily

The book is just what it should be.

Paul Silisteanu

pro $24.99 per month

lite $19.99 per month

team

about the book

about the reader

about the author

related titles

related titles

choose your plan

pro

team

choose your plan

pro

team

choose your plan

pro

team