Overview

3 Connecting AI models with the Vercel AI SDK

This chapter explains how to evolve a basic AI app into a robust, scalable web experience by adopting the Vercel AI SDK. It begins with the core challenges developers face—vendor lock-in from direct provider APIs, the engineering complexity of real-time streaming, and growing pains around state management—and positions the SDK as a unifying solution. The guidance emphasizes sound integration principles: separation of concerns between intents and actions, abstraction layers to decouple external dependencies, incremental adoption to reduce risk, and continuous testing and documentation to preserve reliability as features expand.

Practically, the chapter shows how to replace direct model calls with SDK utilities that abstract providers and streamline UX. It introduces generateText for non-streaming scenarios and then upgrades to streamText for real-time responses via async iterables, while the useChat and useCompletion React hooks handle buffering, partial updates, and error states for a smooth UI. The Astra AI app is incrementally migrated: first swapping the route handler to use SDK functions, then enabling streaming on the backend and adopting useChat on the frontend so messages arrive and render progressively, improving perceived performance and conversational flow.

Beyond streaming, the chapter demonstrates portable, multi-provider support through a Language Model Specification that applies an Abstract Factory approach. A centralized selector (e.g., getSupportedModel) validates provider/model availability and keys, enabling easy switching between OpenAI, Google, and others from a simple UI control that passes choices through the hook body. Finally, it extends the chat to multimodal interactions by allowing users to upload images alongside text; the backend reformats the last user message into text and image parts for vision-capable models, while the UI handles file selection and optional preview. With notes on limits and quality considerations for images, the result is a more natural, flexible conversational interface that’s provider-agnostic, stream-enabled, and ready for richer media.

Separation of concerns between intents and actions. The intent (left) represents the high-level feature or functionality, such as "generateText". The action (right) represents the specific implementation or concrete steps to fulfill the intent, such as making a "chatCompletion" request. The arrow signifies the connection point that bridges the intent with its corresponding action, facilitating communication and interaction between the two components.
Speed comparison of GPT-4 models served from OpenAI vs. Azure. The maximum output token throughputs differ by only a couple of dozen tokens per second. Source: https://artificialanalysis.ai.
Without streaming, the client sends a request and waits for the server to generate and send the full response before displaying it.
With streaming, the client sends a request and the server returns the response in small chunks that are streamed back to the client and processed as they arrive.
Implementation of the Abstract Factory pattern in the Vercel AI SDK for generating text using different language model providers
Screenshot of the current application showcasing the usage of multiple providers in the same chat session.
The image illustrates the process of sending an image alongside a text prompt to the OpenAI API, and how the language model utilizes computer vision techniques to understand the image and generate a relevant text response.
Uploading an image and getting an accurate description from the AI model. This functionality shows how the LLM can generate text descriptions from other media like images.

Summary

  • The Vercel AI SDK simplifies AI integration into web applications.
  • It also offers features such as provider abstraction, streaming responses, state management, and support for React Server Components.
  • The SDK allows developers to break down complex AI tasks into smaller, more manageable components.
  • Guidelines for integrating the SDK include separation of concerns, abstraction layers, incremental integration, testing and validation, and documentation.
  • The SDK provides functions like generateText and streamText for text generation.
  • React hooks like useChat and useCompletion are available for creating conversational UI and text completion capabilities.
  • Implementing streaming responses with the SDK has challenges like asynchronous processing, connection management, data buffering, and error handling.
  • The SDK abstracts away many of these low-level details to simplify handling streaming responses in web applications.
  • The SDK leverages the Language Model Specification to simplify working with different AI providers and models.
  • The integration of the SDK enhances functionality and user experience by enabling streaming chat, multiple AI provider support, and integration of OpenAI’s vision capabilities.

FAQ

What problems does the Vercel AI SDK help solve in AI web apps?
The SDK tackles three main challenges: vendor lock-in (by abstracting providers), real-time streaming (with a consistent streaming API), and growing state complexity (via utilities and hooks that separate AI and UI concerns). It lets you switch models/providers with minimal code changes, stream partial outputs to the UI, and keep client/server state in sync.

What is “provider abstraction” and why is it useful?
Provider abstraction gives you a unified interface to multiple AI providers (e.g., OpenAI, Anthropic, Google). Instead of hard-coding a single vendor’s API, you pass a provider-specific model to SDK utilities, so switching providers becomes a configuration change, not an architectural rewrite.

When should I use generateText vs. streamText?
Use generateText when you need a full, non-streaming result (e.g., summaries, one-off completions). Use streamText when you want the response streamed in chunks for better UX, such as chat UIs where displaying partial results improves perceived speed and engagement.

How does streaming work in practice with the SDK?
The server generates output incrementally and sends chunks to the client over an open connection; the client renders them as they arrive. The SDK abstracts async iteration, buffering, connection handling, and errors, so you focus on rendering updates rather than low-level streaming details.

Which React hooks does the SDK provide for conversational UIs?
The SDK offers useChat and useCompletion. useChat manages a multi-message conversation and streams assistant replies into your UI, while useCompletion handles single-prompt completions; both reduce boilerplate for input state, submission, and incremental updates.

How do I incrementally integrate the Vercel AI SDK into an existing app?
Start small: install the core package (ai) and the provider package you need (e.g., @ai-sdk/google or @ai-sdk/openai), then replace a single route’s direct API call with generateText. Verify behavior, add streamText for streaming, and finally update the UI to use useChat or useCompletion—testing after each step.

What is the Language Model Specification and how does it enable multi-provider support?
It’s a common interface that providers implement so the SDK’s utilities (generateText/streamText) can work uniformly across models. Conceptually similar to the Abstract Factory pattern, it decouples your app from vendor-specific clients and lets you swap or add providers without changing core logic.

How can users choose between providers and models at runtime?
Create a helper (e.g., getSupportedModel) that validates provider/model pairs, checks for the proper API key, and returns the configured model instance. On the client, pass the selected provider and model in the useChat “body” option; the backend uses them to pick the model dynamically.

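The validation half of such a helper can be sketched without any provider imports (registry contents and environment-variable names here are illustrative; in the real helper the final step would return a configured model instance, e.g. openai(model) or google(model), instead of the validated pair):

```typescript
// Illustrative registry of supported provider/model pairs and API-key env vars.
const SUPPORTED_MODELS: Record<string, string[]> = {
  openai: ['gpt-4o', 'gpt-4o-mini'],
  google: ['gemini-1.5-pro', 'gemini-1.5-flash'],
};

const API_KEY_ENV: Record<string, string> = {
  openai: 'OPENAI_API_KEY',
  google: 'GOOGLE_GENERATIVE_AI_API_KEY',
};

function getSupportedModel(
  provider: string,
  model: string,
  env: Record<string, string | undefined>, // e.g. process.env in a route handler
) {
  const models = SUPPORTED_MODELS[provider];
  if (!models || !models.includes(model)) {
    throw new Error(`Unsupported provider/model: ${provider}/${model}`);
  }
  if (!env[API_KEY_ENV[provider]]) {
    throw new Error(`Missing API key: set ${API_KEY_ENV[provider]}`);
  }
  // In the real helper: return the provider's configured model instance here.
  return { provider, model };
}
```

Centralizing validation this way means the route handler never constructs a model from unchecked client input.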
How do I add image (vision) prompts with Gemini or GPT-4o using the SDK?
Extend the last user message to include both text and an image “part” (e.g., { type: "text", ... } and { type: "image", image: imageUrl }) and send it to streamText. On the frontend, add a file uploader, preview if desired, and include the image (often base64 or URL) in the request body alongside the text prompt.

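The reshaping step can be sketched as a small pure helper (type and function names here are illustrative; the part shapes mirror the text/image parts described above):

```typescript
type MessagePart =
  | { type: 'text'; text: string }
  | { type: 'image'; image: string };

interface ChatMessage {
  role: 'user' | 'assistant' | 'system';
  content: string | MessagePart[];
}

// Rebuild the last user message as [text part, image part] so a
// vision-capable model receives both modalities together.
function attachImageToLastUserMessage(
  messages: ChatMessage[],
  imageUrl: string, // data URL or https URL supplied by the uploader
): ChatMessage[] {
  const last = messages[messages.length - 1];
  if (!last || last.role !== 'user' || typeof last.content !== 'string') {
    return messages; // nothing to reshape
  }
  const reshaped: ChatMessage = {
    role: 'user',
    content: [
      { type: 'text', text: last.content },
      { type: 'image', image: imageUrl },
    ],
  };
  return [...messages.slice(0, -1), reshaped];
}
```

The earlier messages stay untouched, so conversation history still streams through the same handler.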
What caveats should I consider when sending multimedia with prompts?
Check provider limits (e.g., max image size), prefer clear, well-lit images, and avoid bundling many images in a single prompt. Not all models support vision—consult provider capability tables and handle fallback behavior gracefully.
