2 Getting Started
This chapter is a hands-on primer for getting productive with Hugging Face, focusing on a clean, reproducible Python setup and the tools you’ll use throughout the book. It emphasizes working in Jupyter Notebook for interactive exploration, managing dependencies with Anaconda and conda, and taking advantage of hardware acceleration when available. You also learn how to connect to the Hugging Face Hub so you can fetch models and assets programmatically and work both online and offline.
You begin by installing Anaconda, creating and activating an isolated conda environment, and launching Jupyter from a dedicated project folder. With the environment ready, you install the transformers library and, if you have a compatible GPU, set up PyTorch with CUDA (or use Apple’s MPS on Apple Silicon) to accelerate inference and training. The chapter shows how to verify GPU availability with PyTorch and inspect device details with GPUtil, then demonstrates directing transformers pipelines to the right device and even auto-selecting between CUDA, MPS, and CPU for seamless performance.
Finally, the chapter introduces the huggingface_hub package for interacting with the Hugging Face Hub from code and the command line. You learn to download specific model files (including pinned revisions), manage repositories, and authenticate securely using access tokens via the CLI or directly in Python. With these environment, acceleration, and Hub workflows in place, you’re equipped to run examples efficiently, manage dependencies safely, and integrate models and datasets into your projects with minimal friction.
Downloading Anaconda for the three major platforms – Windows, Mac, and Linux
Launching the Anaconda Prompt in Windows
Creating a new virtual environment and installing all the required packages
The virtual environment name will prefix the prompt
The web browser will now display the Jupyter Notebook’s main page
Creating a new notebook
Selecting a kernel for your notebook
The notebook is now ready to use
Renaming your notebook
The Hugging Face Hub
Downloading a file directly from a model’s page
Selecting the file you want to download
Viewing the historical commits for a project
Copying the commit hash for a file
Signing up for a new Hugging Face account
Logging in to Hugging Face Hub from Jupyter Notebook
Summary
- The Anaconda package comes with the conda package manager, which simplifies package management and environment creation. It also comes with Jupyter Notebook.
- Creating virtual environments allows you to install and manage Python packages separately from your system-wide Python installation. It's a useful tool for isolating dependencies and managing different project requirements.
- The easiest way to start Jupyter Notebook is to launch it from Terminal (or Anaconda Prompt).
- The Transformers library is primarily built on top of PyTorch, a popular deep learning framework developed by Facebook's AI Research Lab (FAIR).
- PyTorch supports GPU acceleration through smooth integration with Nvidia's CUDA (Compute Unified Device Architecture), a parallel computing platform and programming model designed for GPUs.
- The Hugging Face Hub package (huggingface_hub) allows you to download files, upload files, and perform authentication, either from Python or using the CLI (a brief upload sketch follows below).
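As a quick illustration of the upload side, here is a minimal sketch using huggingface_hub's upload_file. The repository name your-username/my-test-repo is a placeholder rather than a repo from the book, and you must be logged in (or pass a token) for it to work.

from huggingface_hub import create_repo, upload_file

# Create a (placeholder) repository under your account; exist_ok avoids an error if it already exists.
create_repo("your-username/my-test-repo", exist_ok=True)

# Upload a local file into the repository.
upload_file(
    path_or_fileobj="config.json",        # local file to upload
    path_in_repo="config.json",           # destination path inside the repo
    repo_id="your-username/my-test-repo",
)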
FAQ
What’s the easiest way to install Jupyter Notebook for this book?
Download and install Anaconda (free for personal use) from https://www.anaconda.com/download/success. Anaconda includes Jupyter Notebook plus many commonly used data-science packages and the conda package/environment manager.
Why create a virtual environment and how do I make one with conda?
A virtual environment isolates dependencies per project. Create one for this book with:
conda create -n HuggingFaceBook python=3.11 anaconda
When prompted, type Y to proceed. This sets up Python 3.11 with the Anaconda distribution inside the environment.
How do I activate the environment and know it’s active?
Activate it with:
conda activate HuggingFaceBook
Your terminal prompt will be prefixed by the environment name (e.g., (HuggingFaceBook)), indicating it’s active.
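Two related conda commands (standard conda, added here for convenience rather than taken from the chapter) are handy at this point:
conda env list      # list all environments; the active one is marked with *
conda deactivate    # leave the current environment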
On Windows, why use Anaconda Prompt instead of Command Prompt?
Anaconda Prompt is the same shell but with Anaconda’s paths and environment variables preconfigured, so Python, conda, and related tools are ready to use without extra setup.
How do I start Jupyter Notebook in a project folder and create a notebook?
1) Create and enter a folder:
mkdir HF_Projects
cd HF_Projects
2) Launch Jupyter:
jupyter notebook
3) In the browser, click New → Notebook, choose “Python 3 (ipykernel)”, and rename the notebook as desired.
How do I install the Hugging Face Transformers library?
In a notebook:
!pip install transformers
Or in Terminal/Anaconda Prompt:
pip install transformers
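To confirm the installation, a quick sanity check (my addition, not a step from the chapter) is to print the installed version from Python:

import transformers
print(transformers.__version__)  # e.g., a 4.x release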
How do I install a CUDA-enabled PyTorch build and verify GPU availability?
Install GPU wheels (example for CUDA 12.1):
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121 -U
Verify CUDA:
import torch
print(torch.cuda.is_available())  # True means CUDA is available
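On Apple Silicon there is no CUDA; an analogous check (my addition, using PyTorch's MPS backend available in recent releases) verifies that the Apple GPU is usable:

import torch
print(torch.backends.mps.is_available())  # True means the Apple GPU (MPS) can be used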
How can I inspect GPU details (name, memory, utilization) from Python?
With PyTorch:
import torch
if torch.cuda.is_available():
    print("__CUDNN VERSION:", torch.backends.cudnn.version())
    print("__Number CUDA Devices:", torch.cuda.device_count())
    print("__CUDA Device Name:", torch.cuda.get_device_name(0))
    print("__CUDA Device Total Memory [GB]:",
          torch.cuda.get_device_properties(0).total_memory/1e9)
Optionally, install GPUtil for more stats:
pip install GPUtil
import GPUtil
print(GPUtil.getAvailable())  # e.g., [0]
for gpu in GPUtil.getGPUs():
    print(gpu.id, gpu.name, gpu.load, gpu.memoryUtil, gpu.temperature, gpu.memoryTotal)
How do I run a Transformers pipeline on GPU or Apple Silicon GPU (MPS)?
Pass the device parameter to pipeline():
- First GPU by index:
from transformers import pipeline
clf = pipeline("text-classification",
               model="huaen/question_detection",
               device=0)  # or device="cuda:0"
- Apple Silicon (MPS):
clf = pipeline("text-classification",
               model="huaen/question_detection",
               device="mps:0")
Auto-detect best device:
import torch
device = "cuda" if torch.cuda.is_available() else \
("mps" if torch.backends.mps.is_available() else "cpu")
clf = pipeline("text-classification", model="huaen/question_detection", device=device)
Check what it’s using:
print(clf.device.type) # "cuda", "mps", or "cpu"
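Whichever device is selected, calling the pipeline looks the same; the sample sentence below is my own, not from the chapter:

result = clf("Is this repository available for commercial use?")
print(result)  # a list with one dict containing a label and a score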
How do I install and use Hugging Face Hub tools (download files, authenticate)?
- Install the Python package:
pip install huggingface_hub
- Download a file (latest main) from a model repo:
from huggingface_hub import hf_hub_download
hf_hub_download(repo_id="google/pegasus-xsum", filename="config.json")
- Download a specific revision (commit hash/branch/tag):
hf_hub_download("google/pegasus-xsum", "config.json",
revision="a0aa5531c00f59a32a167b75130805098b046f9c")
- Authenticate via CLI (requires a token from https://huggingface.co/settings/tokens):
huggingface-cli login
huggingface-cli whoami
- Or authenticate in Python:
from huggingface_hub import login
login()
If a widget error appears in Jupyter, update:
pip install -U ipywidgets
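If you want to cache an entire model repository up front (for example, to keep working offline later, as mentioned at the start of the chapter), huggingface_hub's snapshot_download covers that. This is a sketch of standard usage rather than a step from the book:

from huggingface_hub import snapshot_download

# Download every file in the repo to the local cache and return the local folder path.
local_dir = snapshot_download(repo_id="google/pegasus-xsum")
print(local_dir)

Once the files are cached, setting the environment variables HF_HUB_OFFLINE=1 and TRANSFORMERS_OFFLINE=1 tells huggingface_hub and Transformers to rely on the cache instead of the network.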