Analyzing GitHub Data with GCP Services

prerequisites
UNIX shell/terminal • basic Git & SQL • basic summary statistics
skills learned
serverless data extraction • storage and presentation within Google Cloud
Mike Fowler
5 weeks · 5-7 hours per week · INTERMEDIATE

GitHub is stuffed full of free and open-source development tools, all with no guarantee that they’re stable, up-to-date, or even still maintained! Companies need a way to ensure that the GitHub projects they’re adopting come with a clean bill of health — and for that, they turn to you!

You’re the CTO of GitHub Health, a unique startup that provides reports and analysis of GitHub software projects. Your findings aren’t based on gossip; they’re based on data. In this liveProject, you’ll build a serverless data system that can extract meaningful data from GitHub, store it in a database, and display the statistics using Google Cloud. Once you’re done, you’ll generate a report for a new client on which of three open-source projects is the best choice for its new DevOps team.
This project is designed for learning purposes and is not a complete, production-ready application or solution.

book resources

When you start your liveProject, you get full access to the following books for 90 days.

project author

Mike Fowler
Mike Fowler is Chief Technology Officer at Synalogik with over 16 years of experience building secure & reliable data-intensive distributed systems. He is a Certified Data Management Professional (CDMP) with 15 public cloud certifications including Google's Professional Cloud Architect & Data Engineer.


This liveProject is for early-career data analysts and programmers looking to experiment with serverless data platforms. The project is language-agnostic, and skeleton solutions for Python, Go, Node.js and SQL are available for those who need them. To begin this liveProject, you will need to be familiar with:

  • UNIX shell/terminal
  • Basic Git
  • Basic SQL
  • HTTP request/response handling
  • Basic summary statistics

you will learn

In this liveProject, you’ll learn how to extract insights from data without having to provision and manage data processing infrastructure. The serverless mindset you develop will help boost your productivity and prove a valuable asset to your career in data science. While this liveProject uses Google Cloud for its solution, the techniques developed are equally applicable to Amazon Web Services and Microsoft Azure.

  • Querying GitHub to extract data
  • Executing the code and storing the results of the queries
  • Loading query results into a database
  • Transforming data using BigQuery SQL functions
  • Using Data Studio to draw up reports
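
As a rough illustration of the first three steps above, here is a minimal Python sketch. It is not taken from the project's skeleton solutions. It assumes the public GitHub REST API endpoint for repository metadata, and the helper names (fetch_repo, to_row, to_ndjson) and the row schema are hypothetical choices made for this example. The output is newline-delimited JSON, one of the formats BigQuery can load.

```python
import json
from urllib.request import Request, urlopen

GITHUB_API = "https://api.github.com"


def fetch_repo(owner, repo):
    """Fetch repository metadata from the GitHub REST API (unauthenticated)."""
    request = Request(
        f"{GITHUB_API}/repos/{owner}/{repo}",
        headers={"Accept": "application/vnd.github+json"},
    )
    with urlopen(request) as response:
        return json.load(response)


def to_row(repo_json):
    """Keep only the fields of interest (hypothetical schema for this sketch)."""
    return {
        "full_name": repo_json["full_name"],
        "stars": repo_json["stargazers_count"],
        "forks": repo_json["forks_count"],
        "open_issues": repo_json["open_issues_count"],
        "pushed_at": repo_json["pushed_at"],
    }


def to_ndjson(rows):
    """Serialize rows as newline-delimited JSON, one object per line."""
    return "\n".join(json.dumps(row, sort_keys=True) for row in rows)
```

Calling to_ndjson([to_row(fetch_repo(owner, repo))]) for each candidate project would produce a file ready to load into a table. Unauthenticated requests to GitHub are rate limited, so a real extractor would likely authenticate with a token.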


You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products is also included, as well as references to other resources.
