Analyzing GitHub Data with GCP Services

prerequisites
UNIX shell/terminal • basic git & SQL • basic summary statistics
skills learned
serverless data extraction • storage and presentation within Google Cloud
Mike Fowler
5 weeks · 5-7 hours per week · INTERMEDIATE

liveProjects give you the opportunity to learn new skills by completing real-world challenges in your local development environment. These self-paced projects also come with full liveBook access to select books for 90 days plus permanent access to other select Manning products. Price: $34.99 (regular $49.99; you save $15, 30%).

GitHub is stuffed full of free and open-source development tools, all with no guarantee that they’re stable, up-to-date, or even still maintained! Companies need a way to ensure that the GitHub projects they’re adopting come with a clean bill of health — and for that, they turn to you!

You’re the CTO of GitHub Health, a unique startup that provides reports and analysis of GitHub software projects. Your findings aren’t based on gossip; they’re based on data. In this liveProject, you’ll build a serverless data system that can extract meaningful data from GitHub, store it in a database, and display the statistics using Google Cloud. Once you’re done, you’ll generate a report for a new client on which of three open-source projects is the best choice for its new DevOps team.
This project is designed for learning purposes and is not a complete, production-ready application or solution.

book resources

When you start your liveProject, you get full access to the following books for 90 days.

project author

Mike Fowler
Mike Fowler is Chief Technology Officer at Synalogik with over 16 years of experience building secure & reliable data-intensive distributed systems. He is a Certified Data Management Professional (CDMP) with 15 public cloud certifications including Google's Professional Cloud Architect & Data Engineer.

prerequisites

This liveProject is for early-career data analysts and programmers looking to experiment with serverless data platforms. The project is language-agnostic, and skeleton solutions for Python, Go, Node.js and SQL are available for those who need them. To begin this liveProject, you will need to be familiar with:

TOOLS
  • UNIX shell/terminal
  • Basic Git
  • Basic SQL
TECHNIQUES
  • HTTP request/response handling
  • Basic summary statistics

you will learn

In this liveProject, you’ll learn how to extract insights from data without having to provision and manage data processing infrastructure. The serverless mindset you develop will help boost your productivity and prove a valuable asset to your career in data science. While this liveProject uses Google Cloud for its solution, the techniques developed are equally applicable to Amazon Web Services and Microsoft Azure.

  • Querying GitHub to extract data
  • Executing the code and storing the results of the queries
  • Loading query results into a database
  • Transforming data using BigQuery SQL functions
  • Using Data Studio to draw up reports
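
As a rough illustration of the first three steps above (querying GitHub, storing the results, and preparing them for a database load), here is a minimal Python sketch. It is not the project's skeleton solution. It assumes the public GitHub REST API repository endpoint and its `stargazers_count`, `forks_count`, `open_issues_count` and `pushed_at` fields; the function names and the output column names (`stars`, `forks`, `open_issues`, `fetched_at`) are hypothetical choices made for this example. The output is newline-delimited JSON, a format BigQuery can load.

```python
import json
import urllib.request
from datetime import datetime, timezone

# Public GitHub REST API endpoint for repository metadata.
GITHUB_API = "https://api.github.com/repos/{owner}/{repo}"


def fetch_repo(owner, repo):
    """Fetch repository metadata (unauthenticated, so rate limits apply)."""
    request = urllib.request.Request(
        GITHUB_API.format(owner=owner, repo=repo),
        headers={
            "Accept": "application/vnd.github+json",
            # Hypothetical client name; GitHub expects some User-Agent header.
            "User-Agent": "github-health-sketch",
        },
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)


def to_row(payload, fetched_at=None):
    """Reduce the API payload to a flat row; column names are illustrative."""
    fetched_at = fetched_at or datetime.now(timezone.utc)
    return {
        "full_name": payload["full_name"],
        "stars": payload.get("stargazers_count", 0),
        "forks": payload.get("forks_count", 0),
        "open_issues": payload.get("open_issues_count", 0),
        "pushed_at": payload.get("pushed_at"),
        "fetched_at": fetched_at.isoformat(),
    }


def to_ndjson(rows):
    """Serialize rows as newline-delimited JSON, one object per line."""
    return "\n".join(json.dumps(row, sort_keys=True) for row in rows) + "\n"
```

A caller could run `to_ndjson([to_row(fetch_repo(owner, repo))])` for each project under review and write the result to a file or storage bucket before loading it into a table. Keeping the network call separate from the row-shaping step makes the transformation easy to test without hitting the API.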

features

Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products is also included, as well as references to other resources.