Life Expectancy Analysis Using Regression

prerequisites
intermediate Python • basics of statistics (regression analysis) • basics of NumPy and pandas
skills learned
organize a data-mining project following the CRISP-DM model • perform data cleaning and exploration • perform regression modeling and diagnostics • perform model selection
Monica Guimaraes and Alexey Grigoriev
5 weeks · 5-7 hours per week · INTERMEDIATE
This title has been retired and is no longer for sale.
Look inside
In this liveProject, you’ll take on the role of a data analyst working for the Jones Family philanthropic foundation. The board of directors is interested in learning about the life expectancy of Americans so that they can better target their charitable spending. To help them in their research, they’ve turned to you. Your challenge in this liveProject is to run a regression analysis on demographic data to find factors related to life expectancy, and answer data-mining questions about the distribution of these demographic variables. To do this, you’ll plan your data-mining and regression analysis following the CRISP-DM model, clean and model your data, assess the accuracy of your findings, and present your results—all with open source tools from the Python ecosystem.
This project is designed for learning purposes and is not a complete, production-ready application or solution.

book and video resources

When you start your liveProject, you get full access to the following books and videos for 90 days.

project authors

Monica Guimaraes
Monica Guimaraes has more than fifteen years of experience organizing and teaching undergraduate courses in automata theory, numerical analysis, programming languages, and statistics. Her passion for artificial intelligence goes back to her college years, having done her thesis in automatic-theorem-proving and working later as an AI researcher. She also has extensive experience in software development in corporations.
Alexey Grigoriev
Alexey Grigorev has more than ten years of experience as a software engineer, and has spent the last six years focused on machine learning. Currently, he works as a lead data scientist at the OLX Group, where he deals with content moderation and image models. He is the author of two other books on using Java for data science and TensorFlow for deep learning.

prerequisites

This liveProject is for intermediate Python programmers who know the basics of regression analysis. To begin with this liveProject, you will need to be familiar with:

TOOLS
  • Basics of NumPy and pandas
  • Basics of Matplotlib
  • Basics of Jupyter notebook
TECHNIQUES
  • Basics of statistics and regression analysis

you will learn

In this liveProject, you’ll learn vital skills for planning and orchestrating a data analysis project. These foundational skills are easy to transfer to almost any data undertaking.

  • Collecting, cleaning and exploring data
  • Regression modelling and diagnostics
  • Evaluating model appropriateness
  • Presenting findings as Jupyter notebooks

features

Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.

choose your plan

team

monthly
annual
$49.99
$499.99
only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • Life Expectancy Analysis Using Regression project for free