Why are probability distributions fundamental to machine learning?

Probability distributions allow us to model uncertainty, analyze high-dimensional data, and form the basis for clustering, recommendation systems, and generative models.

How does principal component analysis (PCA) help in understanding data?

PCA identifies the directions of maximum variance in data, enabling dimensionality reduction and noise elimination while preserving the underlying patterns.

What is latent semantic analysis (LSA) and how does it improve document retrieval?

LSA uses topic modeling by projecting document vectors onto topic axes, revealing latent similarities between documents that may not share explicit terms.

How do Bayesian methods enhance model parameter estimation?

Bayesian tools incorporate prior knowledge and uncertainty, enabling robust parameter estimation through concepts like conditional probability, entropy, and maximum likelihood.

What role do convolutions play in neural networks?

Convolutions extract spatial features from data, forming the backbone of image, video, and signal processing in deep learning models.

How are neural networks trained using forward propagation and backpropagation?

Training involves computing outputs via forward propagation, then minimizing loss by updating weights through backpropagation and gradient descent.

What makes convolutional neural networks (CNNs) effective for image classification and object detection?

CNNs leverage hierarchical feature extraction and deep architectures to achieve state-of-the-art performance in vision tasks like classification and detection.

How do generative models like autoencoders and variational autoencoders (VAEs) use latent space?

Generative models learn compact latent representations that enable data generation, noise elimination, and streamlined data encoding.

What is the evidence lower bound (ELBO) and why is it important in VAEs?

ELBO provides a tractable objective for training VAEs by approximating the true posterior and minimizing KL divergence between distributions.

How do manifolds and homeomorphism relate to neural networks?

Neural networks can learn mappings between complex data manifolds, offering a geometric perspective on how deep learning models transform data.

click to
look inside

Look inside

Understanding the Math Behind the Algorithms read this article now in
Manning's Free Content Center

ch 1 audio

video summary first chapter summary

Resources

Source code Book Forum Source code on GitHub Understanding the Math Behind the Algorithms 🎙️ Krishnendu Chaudhury interviewed 🎙️ Essential Tools For Deep Learning and Data Science Register your pBook for a free eBook more

Become a
Reviewer

Help us create great books

Math and Architectures of Deep Learning

you own this product

Krishnendu Chaudhury
with Ananya H. Ashok, Sujay Narumanchi, Devashish Shankar
Foreword by Prith Banerjee

April 2024
ISBN 9781617296482
552 pages

Included with a Manning Online subscription

printed in black & white

Available translations: Russian, Simplified Chinese

catalog / Data Science / Deep Learning

read now

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$55.99 $41.99

you save $14.00 (25%)

include audio $24.99 $18.74

Look inside

Shine a spotlight into the deep learning “black box”. This comprehensive and detailed guide reveals the mathematical and architectural concepts behind deep learning models, so you can customize, maintain, and explain them more effectively.

Inside Math and Architectures of Deep Learning you will find:

Math, theory, and programming principles side by side
Linear algebra, vector calculus and multivariate statistics for deep learning
The structure of neural networks
Implementing deep learning architectures with Python and PyTorch
Troubleshooting underperforming models
Working code samples in downloadable Jupyter notebooks

The mathematical paradigms behind deep learning models typically begin as hard-to-read academic papers that leave engineers in the dark about how those models actually function. Math and Architectures of Deep Learning bridges the gap between theory and practice, laying out the math of deep learning side by side with practical implementations in Python and PyTorch. Written by deep learning expert Krishnendu Chaudhury, you’ll peer inside the “black box” to understand how your code is working, and learn to comprehend cutting-edge research you can turn into practical applications.

about the technology

Discover what’s going on inside the black box! To work with deep learning you’ll have to choose the right model, train it, preprocess your data, evaluate performance and accuracy, and deal with uncertainty and variability in the outputs of a deployed solution. This book takes you systematically through the core mathematical concepts you’ll need as a working data scientist: vector calculus, linear algebra, and Bayesian inference, all from a deep learning perspective.

about the book

Math and Architectures of Deep Learning teaches the math, theory, and programming principles of deep learning models laid out side by side, and then puts them into practice with well-annotated Python code. You’ll progress from algebra, calculus, and statistics all the way to state-of-the-art DL architectures taken from the latest research.

what's inside

The core design principles of neural networks
Implementing deep learning with Python and PyTorch
Regularizing and optimizing underperforming models

about the reader

Readers need to know Python and the basics of algebra and calculus.

about the author

Krishnendu Chaudhury is co-founder and CTO of the AI startup Drishti Technologies. He previously spent a decade each at Google and Adobe.

eBook

$55.99 $41.99

you save $14.00 (25%)

include audio $24.99 $18.74

Machine learning uses a cocktail of linear algebra, vector calculus, statistical analysis, and topology to represent, visualize, and manipulate points in high dimensional spaces. This book builds that foundation in an intuitive way–along with the PyTorch code you need to be a successful deep learning practitioner.

Vineet Gupta, Google Research

A thorough explanation of the mathematics behind deep learning!

Grigory Sapunov, Intento

Deep learning in its full glory, with all its mathematical details. This is the book!

Atul Saurav, Genworth Financial

Math and Architectures of Deep Learning

pro $24.99 per month

lite $19.99 per month

team

about the technology

about the book

Frequently Asked Questions

what's inside

about the reader

about the author

related titles

related titles

pro

team

pro

team

pro

team