MLflow is an open source library by the Databricks team designed for managing the machine learning lifecycle. It allows for the creation of projects, tracking of metrics, and model versioning.
pip install mlflow
MLflow can be used in any Spark environmnet, but the automated tracking and UI of MLflow is Databricks-Specific Functionality.
Track metrics and parameters
## Log Parameters and Metrics from your normal MLlib run
# Log a parameter (key-value pair)
# Log a metric; metrics can be updated throughout the run