What are the key considerations when deciding whether to build or buy an LLM solution?

Choosing between building or buying an LLM solution involves weighing the strengths of open source models and frameworks against the convenience and speed of third-party APIs.

How does data engineering impact the success of LLM projects?

High-quality data collection, cleaning, and preparation are foundational for effective LLM training and fine-tuning, directly influencing model performance.

What is prompt engineering and why is it important for LLM applications?

Prompt engineering is the process of designing and refining prompts to guide LLMs toward consistent, meaningful outputs without retraining the model.

What are some best practices for crafting effective prompts for LLMs?

Effective prompts often include clear instructions, relevant context, and examples, and may leverage tools like LangChain or Guidance for structured outputs.

How can LLM agents use external tools to enhance their capabilities?

LLM agents can be built to interact with external tools such as search engines or APIs, enabling them to perform complex tasks autonomously.

What are the challenges and opportunities of deploying LLMs on edge devices like Raspberry Pi?

Deploying LLMs on edge devices requires memory-saving techniques and quantization, offering practical insights for running AI in resource-constrained environments.

How are knowledge graphs changing the landscape of retrieval-augmented generation (RAG) compared to vector databases?

Knowledge graphs are emerging as a powerful alternative to vector databases for RAG, offering long-term benefits for organizing and retrieving proprietary data.

What is knowledge editing in LLMs and why is it significant?

Knowledge editing allows direct updates, insertions, or deletions of facts in LLMs without retraining, using tools like EasyEdit and techniques such as ROME and MEND.

What are the latest trends in scaling LLMs and expanding their context windows?

Recent advances have enabled million-token context windows and larger models, driven by innovations like RoPE, YaRN, and Hyena.

How are government regulations and legal challenges shaping the future of LLMs?

Evolving regulations, legal liability, and copyright lawsuits are increasingly influencing how LLM products are developed and deployed.

click to
look inside

Look inside

ch 1 audio

video summary first chapter summary

Resources

Source code Book forum Source code on GitHub Register your pBook for a free eBook more

Become a
Reviewer

Help us create great books

LLMs in Production

you own this product

From language models to successful products

Christopher Brousseau and Matthew Sharp
Foreword by Joe Reis

December 2024
ISBN 9781633437203
456 pages

Included with a Manning Online subscription

printed in black & white

Available translations: Japanese, Korean, Russian

catalog / Data Science / Machine Learning / Large Language Models

Python
Data

read now

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$47.99 $35.99

you save $12.00 (25%)

include audio $24.99 $18.74

Look inside

Learn how to put Large Language Model-based applications into production safely and efficiently.

This practical book offers clear, example-rich explanations of how LLMs work, how you can interact with them, and how to integrate LLMs into your own applications. Find out what makes LLMs so different from traditional software and ML, discover best practices for working with them out of the lab, and dodge common pitfalls with experienced advice.

In LLMs in Production you will:

Grasp the fundamentals of LLMs and the technology behind them
Evaluate when to use a premade LLM and when to build your own
Efficiently scale up an ML platform to handle the needs of LLMs
Train LLM foundation models and finetune an existing LLM
Deploy LLMs to the cloud and edge devices using complex architectures like PEFT and LoRA
Build applications leveraging the strengths of LLMs while mitigating their weaknesses

LLMs in Production delivers vital insights into delivering MLOps so you can easily and seamlessly guide one to production usage. Inside, you’ll find practical insights into everything from acquiring an LLM-suitable training dataset, building a platform, and compensating for their immense size. Plus, tips and tricks for prompt engineering, retraining and load testing, handling costs, and ensuring security.

about the technology

Most business software is developed and improved iteratively, and can change significantly even after deployment. By contrast, because LLMs are expensive to create and difficult to modify, they require meticulous upfront planning, exacting data standards, and carefully-executed technical implementation. Integrating LLMs into production products impacts every aspect of your operations plan, including the application lifecycle, data pipeline, compute cost, security, and more. Get it wrong, and you may have a costly failure on your hands.

about the book

LLMs in Production teaches you how to develop an LLMOps plan that can take an AI app smoothly from design to delivery. You’ll learn techniques for preparing an LLM dataset, cost-efficient training hacks like LORA and RLHF, and industry benchmarks for model evaluation. Along the way, you’ll put your new skills to use in three exciting example projects: creating and training a custom LLM, building a VSCode AI coding extension, and deploying a small model to a Raspberry Pi.

what's inside

Balancing cost and performance
Retraining and load testing
Optimizing models for commodity hardware
Deploying on a Kubernetes cluster

about the reader

For data scientists and ML engineers who know Python and the basics of cloud deployment.

about the authors

Christopher Brousseau and Matt Sharp are experienced engineers who have led numerous successful large scale LLM deployments.

eBook

$47.99 $35.99

you save $12.00 (25%)

include audio $24.99 $18.74

Covers all the essential aspects of how to build and deploy LLMs. It goes into the deep and fascinating areas that most other books gloss over.

Andrew Carr, Cartwheel

A must-read for anyone looking to harness the potential of LLMs in production environments.

Jepson Taylor, VEOX Inc.

An exceptional guide that simplifies the building and deployment of complex LLMs.

Arunkumar Gopalan, Microsoft UK

A thorough and practical guide for running LLMs in production.

Dinesh Chitlangia, AMD

LLMs in Production

pro $24.99 per month

lite $19.99 per month

team

about the technology

about the book

Frequently Asked Questions

what's inside

about the reader

about the authors

related titles

related titles

pro

team

pro

team

pro

team