basic Python • basic SQL • familiarity with tables as CSV files • basic data science and machine learning
skills learned
defining graphs • visualizing graphs • working with LynxKite for efficient graph data science • applying linear regression with graph invariants as input features • using graph neural networks
Andras Nemeth
4 weeks · 6-8 hours per week average · INTERMEDIATE
In this series of liveProjects, you’ll learn to apply insightful graph data science techniques to real-world data problems. Making use of Python and the LynxKite graph data science platform, you’ll explore how graph data structuring can reveal new insights from highly interlinked data. Each liveProject in this series stands alone, so you can pick and choose the skills that are most relevant to you.
These projects are designed for learning purposes and are not complete, production-ready applications or solutions.
Sergii Zakharov, lead data scientist at Lynx Analytics, shares what he likes about the Manning liveProject platform.
here's what's included
Project 1 Analyze the Graph Structure of Soccer
Project 1 Analyze the Graph Structure of Soccer
In this liveProject, you’ll construct event sequence graphs to reveal interesting information about soccer games. You’ll work to find longest pass sequences, most important players, and to understand the spatial structure of the game. You’ll define and visualize these graphs, and use connected components to find interesting event subsequences. You’ll quickly be able to uncover insights such as the most important players and the spatial structuring of the playing pitch.
In this liveProject, you’ll use LynxKite and graph data techniques to identify some of the most important geographic points in the city of Bruges. You’ll start by downloading and processing map data, and then use a simple Python program to convert it into a graph. You’ll use graph centrality metrics to quantify the importance of vertices in your graph, and determine some of Bruges’ important locations. You’ll then use the same structure to figure out the main areas of the city without using actual district data.
In this liveProject, you’ll use graph data optimization to determine improvements that could be made to the water infrastructure of Bruges. You’ll incorporate business data into your visualized Bruges street graph, and translate your graph into a prize collecting Steiner tree problem. You’ll then use the PCST solver in LynxKite to get an exact answer to your business problem.
In this liveProject, you’ll build a model that can predict the age of customers for a telecom company. You’ll take basic profile data and call records for your customers, and build and improve a linear regression model using graph features. Finally, you’ll use a powerful graph neural network that can combine data from both profile features and graph structure to reliably reveal customer age range.
András Németh is the CTO of Lynx Analytics, with significant experience (as the top technology executive on client projects) in using graphs in real-world consultancy projects. He also led the development of LynxKite, a Graph Data Science Platform in the past six years. He has an MsC in Software Engineering and an MsC in Mathematics. Before Lynx Analytics, he worked for Google where he was the tech lead on a project aiming to annotate web pages with entities from Google's Knowledge Graph.
Prerequisites
This liveProject is for data scientists interested in the basic techniques of graph data science. This project is suitable for a large range of expertise levels from beginners to experienced practitioners. To begin this liveProject, you will need to be familiar with the following:
Basic SQL
Tables, their representation as CSV files
Basic data science and machine learning
you will learn
In this liveProject, you’ll learn the basics of graph data structures and how to define relationships between data.
Defining graphs based on event sequences
Defining a pass graph among players
Defining a graph on areas of the pitch
Various ways of visualizing these graphs
Using connected components to find interesting event subsequences
Applying pagerank and reverse pagerank to gain different importance metrics for players
Working with LynxKite for efficient graph data science
features
Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.
how to play
guess the geekle in 5-, 6-, 7- tries.
each guess must be a valid 4-6 letter tech word. hit enter to submit.
after each guess, the color of the tiles will change to show how close your guess was to the word.