Jonathan Rioux

Jonathan Rioux uses PySpark inside and out on a daily basis. He also teaches large-scale data analysis to data scientists, engineers, and data-savvy business analysts.

Jonathan spent a decade in various analytical positions in the insurance industry before venturing into the consulting industry as a machine learning and data analysis expert. He currently works as the director of machine learning for Laivly, a company that equips friendly humans with intelligent automations and machine learning to create the best customer experiences on the planet.

books by Jonathan Rioux

Data Analysis with Python and PySpark

  • February 2022
  • ISBN 9781617297205
  • 456 pages
  • printed in black & white
  • Available translations: Russian, Simplified Chinese

Data Analysis with Python and PySpark helps you solve the daily challenges of data science with PySpark. You’ll learn how to scale your processing capabilities across multiple machines while ingesting data from any source—whether that’s Hadoop clusters, cloud data storage, or local data files. Once you’ve covered the fundamentals, you’ll explore the full versatility of PySpark by building machine learning pipelines, and blending Python, pandas, and PySpark code.