Human Pose Estimation with Deep Neural Networks you own this product

intermediate Python • intermediate PyTorch • intermediate NumPy • basics of deep learning
skills learned
object detection and keypoint detection algorithms • transfer learning • training CNNs on images • deploy human pose estimator using webcam
Armin Kappeler and Aadit Patel
8 weeks · 8-10 hours per week · ADVANCED

placing your order...

Don't refresh or navigate away from the page.
liveProject liveProjects give you the opportunity to learn new skills by completing real-world challenges in your local development environment. Solve practical problems, write working code, and analyze real data—with liveProject, you learn by doing. These self-paced projects also come with full liveBook access to select books for 90 days plus permanent access to other select Manning products. $53.99 $59.99 you save $6 (10%)
Human Pose Estimation with Deep Neural Networks (liveProject) added to cart
continue shopping
adding to cart

choose your plan


only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free eBook every time you renew
  • choose twelve free eBooks per year
  • exclusive 50% discount on all purchases
  • Human Pose Estimation with Deep Neural Networks eBook for free
Look inside
In this liveProject, you’ll take on the role of a machine learning engineer working for a company developing augmented reality apps. These apps include games, virtual shopping assistants, and fitness coaches that need to be able to reliably recognize the shape of a human body. Your challenge is to create an application for human pose estimation: detecting a human body in an image and estimating its key points such as knees and elbows. To do this, you’ll build a convolutional neural network from scratch, training your model using Google Colab and your GPU. At the end of this liveProject, you’ll have completed an interactive demo application that uses a simple webcam to detect and predict human keypoints.
This project is designed for learning purposes and is not a complete, production-ready application or solution.

book resources

When you start your liveProject, you get full access to the following books for 90 days.

project authors

Armin Kappeler
Armin Kappeler received his MS and PhD degrees in electrical engineering from Northwestern University in 2016 where he worked on different deep neural network applications for Image and video classification and recovery. For his thesis, he applied deep neural networks to video super-resolution. In 2015, he joined Verizon Media Group (Yahoo Research) as a research engineer, where he is working on image captioning, active learning, human pose estimation, and other related computer vision applications.
Aadit Patel
Aadit Patel received a B.S. in aerospace engineering and an M.S. in computer science, both from the University of California, Los Angeles (UCLA), where his primary focuses included autonomous systems and reinforcement learning. He has several years of industry experience as a research engineer and data scientist, including at Boeing, Yahoo, and Flyr Labs. Currently, Aadit is a lead data scientist at The Not Company, where he works on custom deep learning architectures to generate novel plant-based foods.


This liveProject is for intermediate Python programmers who are familiar with machine learning. Knowledge of PyTorch and NumPy will be helpful. To begin this liveProject, you will need to be familiar with:

  • Basics of PIL
  • Basics of JSON
  • Basics of Matplotlib
  • Intermediate PyTorch
  • Intermediate NumPy
  • Intermediate machine learning concepts such as classification and regression
  • Basics of matrix and vector operations

you will learn

In this liveProject, you’ll learn how to build deep neural networks, and how to utilize them for computer vision. The building blocks of this project are easily transferable to other deep learning and computer vision challenges, such as face recognition, optical character recognition, and self-driving vehicles.

  • Object detection algorithms
  • Keypoint detection algorithms
  • Transfer Learning
  • Fully convolutional networks
  • Learning and using the main building blocks and layers of deep learning models
  • Training deep learning models on images
  • Deploying deep learning models


You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.