Overview

1 What is deep learning?

Amid intense media attention and soaring expectations, this chapter separates signal from noise and lays out what deep learning really is. It clarifies the relationship between artificial intelligence, machine learning, and deep learning: AI aims to automate intellectual tasks; machine learning learns rules from data rather than relying on hand-crafted logic; and deep learning focuses on learning successive, increasingly abstract representations. Neural networks implement these layered transformations, and training adjusts their weights to minimize a loss function through backpropagation—an iterative process that turns simple building blocks into powerful, task-relevant representations.

The chapter explains why deep learning has become the dominant approach: it automates feature engineering (simplifying workflows), scales efficiently on modern hardware, and supports continuous learning and reuse through pretraining and adaptation. It highlights the rise of generative AI fueled by self-supervised learning and foundation models, which leverage massive datasets to produce text, code, images, and more—often controllable via prompting rather than task-specific programming. This capability underpins recent breakthroughs, from fluent chatbots and coding assistants to photorealistic image synthesis, human-level perception tasks, improved translation and speech systems, and robust real-world applications like autonomous driving and advanced recommender systems.

At the same time, the chapter urges caution about short-term hype. It distinguishes today’s systems as cognitive automation rather than general intelligence, noting that adaptability to the truly unknown remains a defining gap between automation and human-like intelligence. Drawing on past boom-and-bust cycles (AI winters), it warns that inflated expectations can prompt retrenchment if results don’t match promises—though any future slowdown would likely be mild given demonstrated value. The balanced outlook: deep learning’s core ideas are here to stay, its impact is already meaningful and growing, but progress should be judged by measurable capabilities rather than speculative timelines for AGI.

Artificial intelligence, machine learning, and deep learning
ai ml dl
Machine learning: a new programming paradigm
image
Some sample data
ch01 example data points
Coordinate change
ch01 learning representations
A deep neural network for digit classification
ch01 a deep network
Deep representations learned by a digit-classification model
ch01 mnist representations
A neural network is parameterized by its weights.
ch01 deep learning in 3 figures 1
A loss function measures the quality of the network’s output.
ch01 deep learning in 3 figures 2
The loss score is used as a feedback signal to adjust the weights.
ch01 deep learning in 3 figures 3

The promise of AI

Although we may have unrealistic short-term expectations for AI, the long-term picture is looking bright. We’re only getting started in applying deep learning to many important problems for which it could prove transformative, from medical diagnoses to digital assistants.

In 2017, in this very book, I wrote:

Right now, it may seem hard to believe that AI could have a large impact on our world, because it isn’t yet widely deployed – much as, back in 1995, it would have been difficult to believe in the future impact of the internet. Back then, most people didn’t see how the internet was relevant to them and how it was going to change their lives. The same is true for deep learning and AI today. But make no mistake: AI is coming. In a not-so-distant future, AI will be your assistant, even your friend; it will answer your questions, help educate your kids, and watch over your health. It will deliver your groceries to your door and drive you from point A to point B. It will be your interface to an increasingly complex and information-intensive world. And, even more important, AI will help humanity as a whole move forward, by assisting human scientists in new breakthrough discoveries across all scientific fields, from genomics to mathematics.

Fast-forward to 2025, most of these things have either come true or are on the verge of coming true – and this is just the beginning.

  • Tens of millions of people are using AI chatbots like ChatGPT, Gemini, or Claude as assistants on a daily basis. In fact, question-answering and “educating your kids” (homework assistance) have turned out to be the top applications of these chatbots! For many people, AI is already the go-to interface to the world’s information.
  • Hundreds of thousands of people interact with AI “friends” in applications such as Character.ai
  • Fully autonomous driving is already deployed at scale in cities like Phoenix, San Francisco, Los Angeles, and Austin.
  • AI is making major strides towards helping accelerate science. The AlphaFold model from DeepMind is helping biologists predict protein structures with unprecedented accuracy. Renowned mathematician Terence Tao believes that by around 2026, AI could become a reliable co-author in mathematical research and other fields when used appropriately.

The AI revolution, once a distant vision, is now rapidly unfolding before our eyes. On the way, we may face a few setbacks – in much the same way the internet industry was overhyped in 1998–1999 and suffered from a crash that dried up investment throughout the early 2000s. But we’ll get there eventually. AI will end up being applied to nearly every process that makes up our society and our daily lives, much like the internet is today.

Don’t believe the short-term hype, but do believe in the long-term vision. It may take a while for AI to be deployed to its true potential – a potential the full extent of which no one has yet dared to dream – but AI is coming, and it will transform our world in a fantastic way.

[1] A. M. Turing, “Computing Machinery and Intelligence,” Mind 59, no. 236 (1950): 433-460.

[2] Although the Turing test has sometimes been interpreted as a literal test – a goal the field of AI should set out to reach – Turing merely meant it as a conceptual device in a philosophical discussion about the nature of cognition.

FAQ

How do AI, machine learning, and deep learning relate to each other?AI is the broad effort to automate intellectual tasks typically performed by humans. Machine learning is a subfield of AI where systems learn rules from data instead of being explicitly programmed. Deep learning is a subfield of machine learning that learns many successive layers of representations using neural networks.
How is machine learning different from traditional programming?Traditional programming encodes explicit rules that map inputs to outputs. Machine learning flips this: given many examples of inputs and correct outputs, the system learns the rules automatically by finding statistical regularities in the data.
What does “learning representations” mean, and why does it matter?Representations are alternative ways to encode data so that a task becomes easier. Machine learning searches for transformations of the input that expose the structure needed to solve the task with simple rules, reducing the need for hand-crafted features.
What does the “deep” in deep learning refer to?“Deep” refers to models with many successive layers that each learn increasingly abstract representations. Depth (the number of layers) enables the model to build complex features from simple ones, learned end to end from data.
How do neural networks learn in practice?Each layer has weights (parameters) that define its transformation. A loss function measures the gap between predictions and targets; an optimizer adjusts the weights to reduce this loss using backpropagation. Repeating this over many examples (the training loop) gradually improves performance.
What makes deep learning different from earlier approaches?Deep learning automates feature engineering by learning all features jointly, simplifying pipelines. It scales well on modern hardware (GPUs/ML accelerators) and can train on very large datasets. Trained models are versatile and reusable, enabling continuous learning and efficient adaptation (e.g., foundation models).
What is generative AI, and how does self-supervised learning fit in?Generative AI models produce text, images, or other content by learning to reconstruct or predict parts of their input (e.g., next-word prediction). Because targets come from the input itself, this “self-supervised” setup leverages vast unlabeled data, enabling large foundation models that can be steered via prompting.
What are some major achievements of deep learning so far?Breakthroughs include fluent chatbot assistants, coding copilots, photorealistic image generation, human-level perception tasks (image/speech recognition, transcription), improved translation and text-to-speech, autonomous driving in select cities, enhanced recommender systems, and superhuman game play.
Why should we be cautious about AI hype and near-term AGI claims?Current AI excels at cognitive automation—tasks with clear goals and abundant examples—but lacks true cognitive autonomy and adaptability. Hype can outpace reality, leading to unrealistic timelines for economic transformation or AGI and obscuring genuine, steady progress.
What are AI “winters,” and could there be another?AI winters are periods when inflated expectations lead to disappointment and reduced funding, slowing progress (notably in the 1970s and early 1990s). While a full retreat seems unlikely today given AI’s proven value, some correction of recent overinvestment and expectations is plausible.

pro $24.99 per month

  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose one free eBook per month to keep
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime

lite $19.99 per month

  • access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more


choose your plan

team

monthly
annual
$49.99
$399.99
only $33.33 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Deep Learning with Python, Third Edition ebook for free
choose your plan

team

monthly
annual
$49.99
$399.99
only $33.33 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Deep Learning with Python, Third Edition ebook for free