GitHub is stuffed full of free and open-source development tools, all with no guarantee that they’re stable, up-to-date, or even still maintained! Companies need a way to ensure that the GitHub projects they’re adopting come with a clean bill of health — and for that, they turn to you!
You’re the CTO of GitHub Health, a unique startup that provides reports and analysis of GitHub software projects. Your findings aren’t based on gossip; they’re based on data. In this liveProject, you’ll build a serverless data system that can extract meaningful data from GitHub, store it in a database, and display the statistics using Google Cloud. Once you’re done, you’ll generate a report for a new client on which of three open-source projects is the best choice for its new DevOps team.
This project is designed for learning purposes and is not a complete, production-ready application or solution.
How to get your FREE Certificate of Completion
Finish all the projects in this liveProject series
Take a short online test
Answer questions from the liveProject mentor
That's it!
book resources
When you start your liveProject, you get full access to the following books for 90 days.
project author
Mike Fowler
Mike Fowler is Chief Technology Officer at Synalogik with over 16 years of experience building secure & reliable data-intensive distributed systems. He is a Certified Data Management Professional (CDMP) with 15 public cloud certifications including Google's Professional Cloud Architect & Data Engineer.
prerequisites
This liveProject is for early-career data analysts and programmers looking to experiment with serverless data platforms. The project is language-agnostic, and skeleton solutions for Python, Go, Node.js and SQL are available for those who need them. To begin this liveProject, you will need to be familiar with:
TOOLS
UNIX shell/terminal
Basic Git
Basic SQL
TECHNIQUES
HTTP request/response handling
Basic summary statistics
you will learn
In this liveProject, you’ll learn how to extract insights from data without having to provision and manage data processing infrastructure. The serverless mindset you develop will help boost your productivity and prove a valuable asset to your career in data science. While this liveProject uses Google Cloud for its solution, the techniques developed are equally applicable to Amazon Web Services and Microsoft Azure.
Querying GitHub to extract data
Executing the code and storing the results of the queries
Loading query results into a database
Transforming data using BigQuery SQL functions
Using Data Studio to draw up reports
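As a rough illustration of the extract-and-load steps listed above, the sketch below flattens a repository payload into a row and serializes rows as newline-delimited JSON, a format BigQuery can load. It is not the project's solution. The field names are ones found in the GitHub REST API repository response; the helper names `to_row` and `to_ndjson`, the `fetched_at` column, and the sample values are assumptions made up for this example.

```python
import json
from datetime import datetime, timezone

# Fields present in the GitHub REST API repository payload.
FIELDS = (
    "full_name",
    "stargazers_count",
    "forks_count",
    "open_issues_count",
    "pushed_at",
)


def to_row(repo: dict, fetched_at: datetime) -> dict:
    """Keep only the fields of interest and stamp the row with a UTC fetch time."""
    row = {name: repo.get(name) for name in FIELDS}
    # Hypothetical extra column so repeated snapshots can be told apart later.
    row["fetched_at"] = fetched_at.astimezone(timezone.utc).isoformat()
    return row


def to_ndjson(rows) -> str:
    """Serialize rows as newline-delimited JSON, one object per line."""
    return "\n".join(json.dumps(row, sort_keys=True) for row in rows)


if __name__ == "__main__":
    # Placeholder payload with invented values, standing in for a real API response.
    sample = {
        "full_name": "example/project",
        "stargazers_count": 10,
        "forks_count": 2,
        "open_issues_count": 1,
        "pushed_at": "2024-01-01T00:00:00Z",
        "description": "ignored by to_row",
    }
    print(to_ndjson([to_row(sample, datetime.now(timezone.utc))]))
```

Keeping the transformation as a pure function, separate from the HTTP call and the load job, makes it easy to run unchanged inside a serverless function on any of the cloud providers mentioned.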
features
Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
Certificate of Completion
Earn a certificate of completion, including a badge to display on your resume, LinkedIn page, and other social media, after you complete this series.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products is also included, as well as references to other resources.