ML Pipelines are one of the best ways to organize your machine learning code. Learn how to extend PySpark's ML Pipelines with your own components.
A quick overview of creating a simple machine-learning model using Spark's MLlib.
A primer to get you started blending Python with Spark.