Overview

1 What is deep learning?

Artificial intelligence has surged into mainstream attention, but separating durable progress from hype requires clear definitions. AI broadly aims to automate intellectual tasks; early systems relied on symbolic, hand-crafted rules that proved brittle beyond narrow domains. Machine learning reframed the problem as learning rules from data using feedback, and deep learning advanced this further by automatically discovering layered, increasingly abstract representations that make complex tasks tractable.

Deep learning centers on neural networks: stacks of layers parameterized by weights that are tuned to minimize a loss via backpropagation and iterative optimization. This end-to-end training learns useful representations directly from examples, replacing much of the manual feature engineering of earlier approaches. Its impact stems from simplicity (single pipelines over bespoke feature stacks), scalability (parallel training on modern hardware and vast datasets), and versatility (models that can be adapted and reused). The recent wave of generative AI is powered by large “foundation models” trained with self-supervised learning, enabling broad capabilities—from dialogue to image generation—often accessible via prompting rather than task-specific programming.

Results have been transformative across perception, language, code assistance, recommendation, autonomous driving, and even superhuman game play, with growing real-world deployments in science, medicine, and industry. Yet the chapter counsels skepticism toward short-term hype—especially near-term AGI claims and sweeping economic predictions—and frames today’s systems as powerful cognitive automation rather than general intelligence. History shows cycles of optimism and retrenchment; while a severe “AI winter” is unlikely given proven value, a correction from inflated expectations is plausible. Sustained progress will hinge on realistic goals, measured evaluation, and continued investment grounded in demonstrated utility.

Artificial intelligence, machine learning, and deep learning
ai ml dl
Machine learning: a new programming paradigm
image
Some sample data
ch01 example data points
Coordinate change
ch01 learning representations
A deep neural network for digit classification
ch01 a deep network
Deep representations learned by a digit-classification model
ch01 mnist representations
A neural network is parameterized by its weights.
ch01 deep learning in 3 figures 1
A loss function measures the quality of the network’s output.
ch01 deep learning in 3 figures 2
The loss score is used as a feedback signal to adjust the weights.
ch01 deep learning in 3 figures 3

The promise of AI

Although we may have unrealistic short-term expectations for AI, the long-term picture is looking bright. We’re only getting started in applying deep learning to many important problems for which it could prove transformative, from medical diagnoses to digital assistants.

In 2017, in this very book, I wrote:

Right now, it may seem hard to believe that AI could have a large impact on our world, because it isn’t yet widely deployed – much as, back in 1995, it would have been difficult to believe in the future impact of the internet. Back then, most people didn’t see how the internet was relevant to them and how it was going to change their lives. The same is true for deep learning and AI today. But make no mistake: AI is coming. In a not-so-distant future, AI will be your assistant, even your friend; it will answer your questions, help educate your kids, and watch over your health. It will deliver your groceries to your door and drive you from point A to point B. It will be your interface to an increasingly complex and information-intensive world. And, even more important, AI will help humanity as a whole move forward, by assisting human scientists in new breakthrough discoveries across all scientific fields, from genomics to mathematics.

Fast-forward to 2025, most of these things have either come true or are on the verge of coming true – and this is just the beginning.

  • Tens of millions of people are using AI chatbots like ChatGPT, Gemini, or Claude as assistants on a daily basis. In fact, question-answering and “educating your kids” (homework assistance) have turned out to be the top applications of these chatbots! For many people, AI is already the go-to interface to the world’s information.
  • Hundreds of thousands of people interact with AI “friends” in applications such as Character.ai
  • Fully autonomous driving is already deployed at scale in cities like Phoenix, San Francisco, Los Angeles, and Austin.
  • AI is making major strides towards helping accelerate science. The AlphaFold model from DeepMind is helping biologists predict protein structures with unprecedented accuracy. Renowned mathematician Terence Tao believes that by around 2026, AI could become a reliable co-author in mathematical research and other fields when used appropriately.

The AI revolution, once a distant vision, is now rapidly unfolding before our eyes. On the way, we may face a few setbacks – in much the same way the internet industry was overhyped in 1998–1999 and suffered from a crash that dried up investment throughout the early 2000s. But we’ll get there eventually. AI will end up being applied to nearly every process that makes up our society and our daily lives, much like the internet is today.

Don’t believe the short-term hype, but do believe in the long-term vision. It may take a while for AI to be deployed to its true potential – a potential the full extent of which no one has yet dared to dream – but AI is coming, and it will transform our world in a fantastic way.

FAQ

How do AI, machine learning, and deep learning relate to each other?AI is the broad effort to automate intellectual tasks. Machine learning is a subfield of AI where systems learn rules from data instead of being explicitly programmed. Deep learning is a subfield of machine learning that learns many layers of increasingly abstract representations to solve tasks.
What is artificial intelligence, and how did symbolic AI differ from modern approaches?Artificial intelligence aims to automate tasks that typically require human intellect. Early “symbolic AI” relied on hand-crafted rules and explicit knowledge bases, which worked for well-defined logic tasks but broke down on fuzzy, high-variability problems like vision or speech. Modern ML/DL learns the rules from data instead of manually encoding them.
What makes machine learning a new programming paradigm?Rather than coding rules, you provide examples of inputs and desired outputs. The system searches for patterns that map inputs to targets and uses a feedback signal to improve. In practice, you train on many examples and let the algorithm discover the rules that generalize.
What is a “representation,” and why is it central to ML and DL?A representation is a way of encoding data so that a task becomes easier—like changing coordinates to make classes linearly separable. Machine learning is fundamentally about discovering useful representations that enable simple decision rules. Deep learning automates this discovery through stacked transformations.
What does the “deep” in deep learning mean?“Deep” refers to models with many successive layers, each learning a more informative representation than the last. Depth lets models distill raw data into features that are directly useful for the task. Despite the name “neural networks,” these systems are mathematical constructs, not biological brain models.
How does a neural network learn in practice?Each layer has weights (parameters) that define its transformation. A loss function measures how far predictions are from targets, and an optimizer uses backpropagation to adjust weights to reduce that loss. Starting from random weights, repeating this loop over many examples gradually yields a trained model.
What makes deep learning different from earlier machine-learning methods?It simplifies workflows by learning features end-to-end, removing most manual feature engineering. It scales well on GPUs and specialized hardware and trains on mini-batches, enabling very large datasets. It’s versatile and reusable: models can be updated continuously and adapted to new tasks, underpinning today’s “foundation models.”
What is generative AI, and how does self-supervised learning power it?Generative AI models produce text, images, or audio from prompts. They’re trained with self-supervised learning, where targets are derived from the inputs themselves (e.g., predicting the next word or denoising images), allowing training on vast unlabeled datasets. The resulting foundation models can be reused across many tasks via prompting or light fine-tuning.
What has deep learning achieved so far?Breakthroughs include fluent chatbots, coding assistants, photorealistic image generation, human-level image and speech recognition, improved machine translation and text-to-speech, autonomous driving deployments, stronger recommender systems, and superhuman play in games like Go and Chess. It’s also transforming domains from medical imaging to agriculture and disaster prediction.
Why be cautious about AI hype, and what are “AI winters”?Short-term claims about imminent AGI or explosive productivity gains often overreach current capabilities, which are best viewed as powerful cognitive automation rather than general intelligence. Historically, inflated expectations led to “AI winters” when funding dried up after disappointment. While a full retreat seems unlikely today, a cooling of the current investment bubble is plausible.

pro $24.99 per month

  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose one free eBook per month to keep
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime

lite $19.99 per month

  • access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more


choose your plan

team

monthly
annual
$49.99
$399.99
only $33.33 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Deep Learning with Python, Third Edition ebook for free
choose your plan

team

monthly
annual
$49.99
$399.99
only $33.33 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Deep Learning with Python, Third Edition ebook for free
choose your plan

team

monthly
annual
$49.99
$399.99
only $33.33 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Deep Learning with Python, Third Edition ebook for free