intermediate Python • basics of statistics (regression analysis) • basics of NumPy and pandas
skills learned
organize a data-mining project following the CRISP-DM model • perform data cleaning and exploration • perform regression modeling and diagnostics • perform model selection
In this liveProject, you’ll take on the role of a data analyst working for the Jones Family philanthropic foundation. The board of directors is interested in learning about the life expectancy of Americans so that they can better target their charitable spending. To help them in their research, they’ve turned to you. Your challenge in this liveProject is to run a regression analysis on demographic data to find factors related to life expectancy, and answer data-mining questions about the distribution of these demographic variables. To do this, you’ll plan your data-mining and regression analysis following the CRISP-DM model, clean and model your data, assess the accuracy of your findings, and present your results—all with open source tools from the Python ecosystem.
This project is designed for learning purposes and is not a complete, production-ready application or solution.
project authors
Monica Guimaraes
Monica Guimaraes has more than fifteen years of experience organizing and teaching undergraduate courses in automata theory, numerical analysis, programming languages, and statistics. Her passion for artificial intelligence goes back to her college years, having done her thesis in automatic-theorem-proving and working later as an AI researcher. She also has extensive experience in software development in corporations.
Alexey Grigoriev
Alexey Grigorev has more than ten years of experience as a software engineer, and has spent the last six years focused on machine learning. Currently, he works as a lead data scientist at the OLX Group, where he deals with content moderation and image models. He is the author of two other books on using Java for data science and TensorFlow for deep learning.
prerequisites
This liveProject is for intermediate Python programmers who know the basics of regression analysis. To begin with this liveProject, you will need to be familiar with:
TOOLS
Basics of NumPy and pandas
Basics of Matplotlib
Basics of Jupyter notebook
TECHNIQUES
Basics of statistics and regression analysis
features
Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from fellow participants and even more help with paid sessions with our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.
choose your plan
pro
monthly
annual
$24.99
$249.99
only $20.83 per month
access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose another free product every time you renew
choose twelve free products per year
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime
renews annually, pause or cancel renewal anytime
Life Expectancy Analysis Using Regression project for free
team
monthly
annual
$49.99
$399.99
only $33.33 per month
five seats for your team
access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose another free product every time you renew
choose twelve free products per year
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime
renews annually, pause or cancel renewal anytime
Life Expectancy Analysis Using Regression project for free