1 Using generative AI in web apps
Generative AI web apps weave advanced models—especially large language models—into modern interfaces to create original text, images, audio, and video on demand. By generating content dynamically, they enable conversational experiences, intelligent automation, and personalized interactions that go beyond static logic. This chapter introduces what these apps can do, how they differ from traditional AI, and the core concepts needed to design and deploy them. It sets expectations for the tech stack used throughout the book—React, Next.js, and the Vercel AI SDK with models from providers like Google AI and OpenAI—and previews secure access to external tools and data via the Model Context Protocol, assuming readers have basic JavaScript and React familiarity.
At a high level, these apps combine user-facing UIs and conversational agents with robust backends that preprocess inputs, select and call models, post-process outputs, and deliver results—often with a feedback loop. The architecture spans caching, serverless functions, containers and orchestration, and model-serving frameworks, alongside data pipelines and third-party API integrations, all built to deploy and scale reliably. The chapter outlines a typical interaction flow from user input to response delivery, and surveys model choices and trade-offs—transformer-based LLMs, autoregressive models, GANs, VAEs, and RNNs; pre-trained services versus self-hosting; plus performance considerations like latency and resource costs. The practical tooling centers on React and Next.js for UI and backend, the Vercel AI SDK for provider-agnostic integration, and LangChain.js for capabilities such as Retrieval-Augmented Generation.
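To make that flow concrete, here is a minimal server-side sketch, assuming a Next.js App Router route handler and the Vercel AI SDK's streamText helper with the Google provider; the file path, model ID, and system prompt are illustrative assumptions, and helper names vary between SDK versions.

```ts
// Hypothetical route handler, e.g. app/api/chat/route.ts (AI SDK 4.x-style API).
import { streamText } from 'ai';
import { google } from '@ai-sdk/google';

export async function POST(req: Request) {
  // 1. User input arrives from the UI as a list of chat messages.
  const { messages } = await req.json();

  // 2. The backend selects a model and forwards the (optionally pre-processed) input.
  const result = streamText({
    model: google('gemini-1.5-flash'), // illustrative model ID
    system: 'You are a helpful assistant for our web app.',
    messages,
  });

  // 3. Tokens stream back to the client as they are generated.
  return result.toDataStreamResponse();
}
```

Swapping providers typically changes only the model line, which is the kind of provider-agnostic integration the Vercel AI SDK is chosen for here.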
Because these systems are powerful, the chapter emphasizes responsibility and pragmatism: managing quality and hallucinations, containing costs, guarding against misuse, and complying with privacy and data-protection rules when handling sensitive information. It recommends concrete validation tactics (clear objectives, careful prompting, parameter tuning, cross-checks), approaches to mitigate bias (curated knowledge bases, diverse training sources, auditing), and UX practices that improve trust and satisfaction (fast, accessible, personalized, and multimodal interactions with features like streaming). Acknowledging the impact on developer workflows, it frames AI as an accelerant for higher-value work. The book reinforces these principles through hands-on projects, including a voice-driven interview assistant with AI feedback and a RAG-powered knowledge system, to help readers build production-ready generative AI applications.
Figure: The flow of information and interactions between the key components of a generative AI web application.
Figure: How an AI web app works: users input data, the app processes it, selects a model, generates content, delivers the result, and optionally collects feedback.
Figure: Simplified architecture of a web application ecosystem. Clients (web browsers and mobile devices) interact with the core application service, which handles user requests and business logic, stores and manages data in a database, and calls external APIs and services for additional functionality.
Leveraging key technologies to create generative AI web applications
Figure: How AI can be used to detect whether a picture shows a cat or not: it accepts an image as input and responds with yes or no (or 0 and 1).
Summary
- Generative AI can produce not only text but also other media such as images, video clips, and audio. This greatly expands its potential in web applications, where real-world uses range from digital marketing and customer experience management to mock interview applications.
- Generative AI web apps center on powerful models like large language models (LLMs) to create content from user input. The apps require a full supporting ecosystem to integrate with the model, including UI and conversational AI components, backend infrastructure, data processing pipelines, API integration, and deployment and scaling mechanisms.
- The apps we build in this book will use JavaScript and React to render the UI components, along with Next.js and the Vercel AI SDK to manage the backend and interact with external AI service providers.
- Choosing the right model for an app is a key architectural decision and depends on the task at hand. Different model types (such as LLMs, GANs, autoregressive models, transformers, VAEs, and RNNs) excel at different kinds of problems. But model architecture is just one consideration; developers also need to weigh the quality and type of data the model was trained on.
- Software engineers were using AI long before generative AI came into existence; common applications include machine learning, search recommendations, chatbots, and computer vision.
- Foundational research such as Google's "Attention Is All You Need" paper laid the groundwork for the transformer architecture, which uses attention mechanisms to simplify natural language processing tasks. Transformers revolutionized language modeling by improving efficiency and accuracy in understanding text, addressing long-standing challenges faced by traditional AI models.
- Limitations of generative AI include quality control issues, resource intensiveness, security concerns, and regulatory compliance requirements. Broader challenges include its potential impact on jobs, the reliability of outputs, handling bias, and enhancing the user experience.
FAQ
What is a generative AI web app and what can it do?
It’s a web application that integrates advanced AI models—often large language models (LLMs)—to generate text, images, audio, or video dynamically. This enables conversational interfaces, personalized content, intelligent automation, and new interactive experiences like code assistants and creative tools.
How does generative AI differ from traditional AI?
Traditional AI typically classifies or predicts (e.g., “cat vs not cat”). Generative AI learns patterns well enough to produce new content that resembles its training data. Modern capabilities are powered by transformers and self-attention, which capture long-range context to generate coherent, contextually relevant outputs.
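The contrast is easiest to see in the shape of the interfaces involved; the following type signatures are purely hypothetical and exist only to illustrate the difference.

```ts
// Discriminative (traditional) AI: maps an input to a label from a fixed set,
// like the "cat vs not cat" classifier mentioned above.
type CatClassifier = (image: Uint8Array) => Promise<0 | 1>;

// Generative AI: maps a prompt (plus optional context) to newly produced, open-ended content.
type TextGenerator = (prompt: string, context?: string) => Promise<string>;
```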
What are the core components of a generative AI web app?
- LLMs and AI models for content generation
- UIs and conversational components (chatbots, agents)
- Backend infrastructure (caching, containers/orchestration, serverless, model serving)
- Data processing (pre- and post-processing, feature extraction)
- API integrations to external services
- Deployment and scaling for reliability and spikes
How does the user-to-model interaction flow work?
- User input through the UI (text, images, selections)
- Backend processing (cleaning, feature extraction, model selection)
- Content generation via selected model and APIs (optionally with RAG)
- Response delivery with an optional feedback loop to refine future results (a minimal client-side sketch follows this list)
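Here is the client-side sketch referenced in the last bullet, assuming an AI SDK 4.x-style useChat hook pointed at a hypothetical /api/chat route; the module path and returned fields differ between SDK versions.

```tsx
'use client';

// Hypothetical chat component wired to a /api/chat route that streams model output.
import { useChat } from '@ai-sdk/react';

export default function Chat() {
  const { messages, input, handleInputChange, handleSubmit } = useChat({
    api: '/api/chat', // backend route that selects the model and streams tokens back
  });

  return (
    <div>
      {messages.map((m) => (
        <p key={m.id}>
          <strong>{m.role}:</strong> {m.content}
        </p>
      ))}
      {/* Submitting the form sends the input to the backend; the reply streams into messages. */}
      <form onSubmit={handleSubmit}>
        <input value={input} onChange={handleInputChange} placeholder="Ask something..." />
      </form>
    </div>
  );
}
```

A feedback loop could be layered on top, for example by posting a rating with each message ID, but that is omitted here.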
Which tools and frameworks does the chapter recommend, and why?
- React for UI components and accessibility
- Next.js for backend integration and data fetching
- Vercel AI SDK for multi-provider AI abstractions, streaming, and state
- LangChain.js for RAG, agents, and composable chains (a retrieval sketch follows this answer)
- Models/providers: Google Gemini (default), OpenAI as needed
These choices offer seamless integration, good developer ergonomics, and production-ready patterns.
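The book's RAG work builds on LangChain.js; purely to illustrate the idea, here is a minimal retrieval sketch that instead uses the AI SDK's embedding helpers with an in-memory search. The embedding and chat model IDs, the sample documents, and the answerWithContext function are assumptions for this sketch.

```ts
// Minimal RAG-style grounding: embed a tiny knowledge base, retrieve the closest
// document for a question, and ask the model to answer from that context only.
import { embed, embedMany, generateText, cosineSimilarity } from 'ai';
import { google } from '@ai-sdk/google';

const embeddingModel = google.textEmbeddingModel('text-embedding-004'); // illustrative

const documents = [
  'Our refund policy allows returns within 30 days of purchase.',
  'Support is available Monday to Friday, 9am to 5pm CET.',
];

export async function answerWithContext(question: string) {
  // 1. Embed the knowledge base and the question.
  const { embeddings } = await embedMany({ model: embeddingModel, values: documents });
  const { embedding: queryEmbedding } = await embed({ model: embeddingModel, value: question });

  // 2. Pick the most similar document (a real app would use a vector database).
  const scored = documents.map((doc, i) => ({
    doc,
    score: cosineSimilarity(queryEmbedding, embeddings[i]),
  }));
  scored.sort((a, b) => b.score - a.score);
  const context = scored[0].doc;

  // 3. Generate an answer grounded in the retrieved context.
  const { text } = await generateText({
    model: google('gemini-1.5-flash'),
    prompt: `Answer using only this context:\n${context}\n\nQuestion: ${question}`,
  });
  return text;
}
```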
How should I choose the right model and provider?
- Match model type to task (text, image, code, multimodal); see the provider-selection sketch after this list
- Assess training data alignment with your domain
- Evaluate latency, cost, and resource needs
- Plan UI, pre-processing, and post-processing
- Consider RAG or fine-tuning for domain specificity
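As noted in the first bullet above, one way to keep this decision flexible is a small task-to-model mapping behind the AI SDK's provider-agnostic interface; the task names and model IDs below are illustrative assumptions rather than recommendations.

```ts
// Route each task to a model without coupling the rest of the app to a provider.
import { generateText } from 'ai';
import { google } from '@ai-sdk/google';
import { openai } from '@ai-sdk/openai';

type Task = 'chat' | 'code' | 'vision';

function pickModel(task: Task) {
  switch (task) {
    case 'code':
      return openai('gpt-4o'); // assumption: a stronger model for code-heavy prompts
    case 'vision':
      return google('gemini-1.5-pro'); // assumption: multimodal input support
    default:
      return google('gemini-1.5-flash'); // assumption: low-latency, low-cost default
  }
}

export async function run(task: Task, prompt: string) {
  const { text } = await generateText({ model: pickModel(task), prompt });
  return text;
}
```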
Should I use pre-trained APIs or host my own models?
Pre-trained APIs (e.g., Gemini, OpenAI) are fastest to adopt and reduce operational burden. Self-hosting offers maximum control and customization but requires significant ML expertise and infrastructure and carries significant cost. The chapter’s projects use pre-trained APIs for practicality.
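The trade-off also shows up in code. With a self-hosted server that speaks the OpenAI wire format (a local Ollama instance, for example), the application code can stay nearly identical while the serving infrastructure changes; the endpoint URL and model IDs below are assumptions.

```ts
// Same call shape, two very different operational models.
import { generateText } from 'ai';
import { createOpenAI, openai } from '@ai-sdk/openai';

// Hosted, pre-trained API: minimal setup, the provider runs the infrastructure.
const hosted = openai('gpt-4o-mini'); // illustrative model ID

// Self-hosted, OpenAI-compatible endpoint: full control, but you run and scale it yourself.
const selfHosted = createOpenAI({
  baseURL: 'http://localhost:11434/v1', // assumed local inference server
  apiKey: 'not-needed-locally',
})('llama3.1'); // assumed locally available model

export async function compare(prompt: string) {
  const a = await generateText({ model: hosted, prompt });
  const b = await generateText({ model: selfHosted, prompt });
  return { hosted: a.text, selfHosted: b.text };
}
```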
What real-world use cases does the chapter highlight?
- Digital marketing: image generation, image-to-image, and copywriting
- Customer experience: chatbots, sentiment analysis, dynamic replies
- Mock interviews: AI interviewer agents, speech-to-text, adaptive difficulty
What are the main risks and limitations to plan for?
- Quality control and hallucinations affecting accuracy
- Resource intensity and cost of training/inference
- Security risks and potential misuse (impersonation, misinformation)
- Regulatory compliance and PII handling (GDPR, CCPA); a naive redaction sketch follows this list
- Bias in outputs; mitigate via diverse data, scope control, and bias audits
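For the PII bullet above, one small and deliberately naive building block is scrubbing obvious identifiers before a prompt leaves your backend; real GDPR or CCPA compliance involves far more (data inventories, consent, retention, audited redaction services), so treat this only as a sketch.

```ts
// Naive PII scrubbing with regular expressions; the patterns are illustrative and incomplete.
const EMAIL = /[\w.+-]+@[\w-]+\.[\w.]+/g;
const PHONE = /\+?\d[\d\s().-]{7,}\d/g;

export function redactPII(input: string): string {
  return input.replace(EMAIL, '[redacted email]').replace(PHONE, '[redacted phone]');
}

// Example: redactPII('Email me at jane@example.com')
// returns 'Email me at [redacted email]'
```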
How can I improve reliability and user experience?
- Validate outputs: clear objectives, context framing, parameter tuning, cross-checking (a structured-output sketch follows this list)
- Enhance UX: streaming responses, multimodal inputs, personalization
- Strengthen systems: caching, scalable deployment, feedback loops
- Use RAG to ground answers in verified knowledge sources
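For the validation bullet above, one concrete tactic is forcing model output into a typed schema so malformed responses fail loudly instead of reaching users; this sketch assumes the AI SDK's generateObject helper with a Zod schema, and the schema fields, model ID, and prompt are illustrative.

```ts
// Validate generated content by requiring it to match a schema.
import { generateObject } from 'ai';
import { google } from '@ai-sdk/google';
import { z } from 'zod';

const feedbackSchema = z.object({
  score: z.number().min(0).max(10),
  strengths: z.array(z.string()),
  improvements: z.array(z.string()),
});

export async function reviewAnswer(answer: string) {
  const { object } = await generateObject({
    model: google('gemini-1.5-flash'),
    schema: feedbackSchema,
    prompt: `Review this interview answer and score it from 0 to 10:\n${answer}`,
  });
  // object is typed and validated; schema violations throw instead of leaking to the UI.
  return object;
}
```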