Easy to get started collection of Machine Learning Examples in Azure Databricks
Example Notebooks: HTML format, GitHub
- Built for enterprise with security, reliability, and scalability
- End-to-end integration from data access (ADLS, SQL DW, Event Hubs, Kafka, etc.) through data prep, feature engineering, single-node or distributed model building, MLOps with MLflow, and integration with AzureML, Synapse, & other Azure services.
- Delta Lake as the data foundation, providing higher data quality, reliability, and performance for downstream ML & AI use cases
- ML Runtime Optimizations
- Reliable and secure distribution of open source ML frameworks
- Packages and optimizes the most common ML frameworks
- Built-in optimization for distributed deep learning
- Built-in AutoML and Experiment tracking
- Customized environments using conda for reproducibility
- Distributed Machine Learning
- Spark MLlib
- Migrate single-node code to distributed with just a few lines of changes (minimal sketches follow this list):
- Distributed hyperparameter search (Hyperopt, grid search)
- Pandas UDFs to distribute model training over different subsets of data or hyperparameters
- Koalas: Pandas DataFrame API on Spark
- Distributed Deep Learning training with Horovod
- Use your own tools
- Multiple languages in the same Databricks notebook (Python, R, Scala, SQL)
- Databricks Connect: connect external tools to Azure Databricks (IDEs, RStudio, Jupyter, ...); see the sketch below
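For example, the distributed hyperparameter search above usually amounts to swapping Hyperopt's default `Trials` for `SparkTrials`. A minimal sketch, assuming a Databricks ML Runtime cluster (the scikit-learn model and search space here are illustrative, not taken from the example notebooks):

```python
from hyperopt import fmin, tpe, hp, SparkTrials, STATUS_OK
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

def objective(params):
    # With SparkTrials, each evaluation of this function runs as a Spark task on a worker.
    clf = RandomForestClassifier(n_estimators=int(params["n_estimators"]),
                                 max_depth=int(params["max_depth"]))
    return {"loss": -cross_val_score(clf, X, y, cv=3).mean(), "status": STATUS_OK}

search_space = {
    "n_estimators": hp.quniform("n_estimators", 20, 200, 10),
    "max_depth": hp.quniform("max_depth", 2, 10, 1),
}

# Trials() would evaluate serially on the driver; SparkTrials fans the trials out to the cluster.
best = fmin(fn=objective, space=search_space, algo=tpe.suggest,
            max_evals=32, trials=SparkTrials(parallelism=8))
```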
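The Pandas UDF pattern distributes training over groups of a Spark DataFrame, one model per group. A sketch using `applyInPandas` (Spark 3.x; on older runtimes the equivalent is a `GROUPED_MAP` pandas UDF with `groupBy().apply()`). The `store_id`/`price`/`promo`/`sales` columns and toy data are placeholders:

```python
import pandas as pd
from pyspark.sql.types import StructType, StructField, StringType, DoubleType
from sklearn.linear_model import LinearRegression

# Toy data; `spark` is the SparkSession provided in a Databricks notebook.
spark_df = spark.createDataFrame(
    [("s1", 1.0, 0, 10.0), ("s1", 1.2, 1, 14.0), ("s1", 0.9, 0, 9.0),
     ("s2", 0.8, 0, 7.0), ("s2", 0.9, 1, 9.5), ("s2", 1.1, 1, 11.0)],
    ["store_id", "price", "promo", "sales"])

result_schema = StructType([StructField("store_id", StringType()),
                            StructField("r2", DoubleType())])

def train_per_store(pdf: pd.DataFrame) -> pd.DataFrame:
    # Runs on a worker, receiving all rows for one store as a pandas DataFrame.
    X, y = pdf[["price", "promo"]], pdf["sales"]
    model = LinearRegression().fit(X, y)
    return pd.DataFrame({"store_id": [pdf["store_id"].iloc[0]], "r2": [model.score(X, y)]})

# One model per store, trained in parallel across the cluster.
results = spark_df.groupBy("store_id").applyInPandas(train_per_store, schema=result_schema)
results.show()
```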
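Koalas keeps the pandas DataFrame API while executing on Spark, so much pandas code only needs its import changed. A small self-contained sketch with made-up data:

```python
import databricks.koalas as ks

# Same calls as pandas, but executed on Spark.
kdf = ks.DataFrame({"country": ["US", "US", "DE", "DE", "FR"],
                    "amount": [10.0, 20.0, 5.0, 7.5, 3.0]})
kdf["amount_usd"] = kdf["amount"] * 1.1
summary = kdf.groupby("country")["amount_usd"].sum().sort_values(ascending=False)
print(summary)
```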
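For distributed deep learning, Databricks exposes Horovod through `HorovodRunner`: the single-node training function is kept and launched on `np` worker processes. A skeletal sketch (the data loading and training loop are deliberately elided; the MNIST notebooks below show the full version):

```python
def train_hvd(learning_rate=0.01):
    import torch
    import horovod.torch as hvd

    hvd.init()  # one Horovod process per launched task
    model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(784, 10))
    optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate * hvd.size())
    # Average gradients across workers and start everyone from the same weights.
    optimizer = hvd.DistributedOptimizer(optimizer, named_parameters=model.named_parameters())
    hvd.broadcast_parameters(model.state_dict(), root_rank=0)
    # ... load this worker's shard of MNIST and run the usual PyTorch training loop ...

from sparkdl import HorovodRunner

# np=2 launches two worker processes on the cluster; a negative np runs locally on the driver.
hr = HorovodRunner(np=2)
hr.run(train_hvd, learning_rate=0.01)
```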
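Databricks Connect itself is mostly configuration: after `pip install databricks-connect` and `databricks-connect configure` on a local machine (workspace URL, token, cluster ID), ordinary PySpark code in an IDE runs against the remote cluster. A minimal check:

```python
from pyspark.sql import SparkSession

# With databricks-connect configured, this session is backed by the remote Azure Databricks cluster.
spark = SparkSession.builder.getOrCreate()
print(spark.range(100).count())  # executed on the cluster, result returned locally
```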
To review example notebooks below in HTML format: https://joelcthomas.github.io/ml-azuredatabricks/
To reproduce in a notebook, see instructions below.
- PyTorch on a single node with the MNIST dataset
- PyTorch distributed with Horovod on the MNIST dataset
- Using MLflow to track hyperparameters and metrics and to log models/artifacts in AzureML (see the sketch after this list)
- Using MLflow to deploy a scoring server (REST endpoint) with ACI
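The MLflow/AzureML examples hinge on pointing MLflow's tracking URI at an Azure ML workspace; the usual logging calls then work unchanged. A hedged sketch, assuming the `azureml-mlflow` package is installed on the cluster (workspace details and logged values are placeholders):

```python
import mlflow
from azureml.core import Workspace

# Placeholders -- substitute your own workspace name, subscription, and resource group.
ws = Workspace.get(name="my-azureml-workspace",
                   subscription_id="<subscription-id>",
                   resource_group="<resource-group>")

# Route MLflow runs to the Azure ML workspace instead of the default tracking server.
mlflow.set_tracking_uri(ws.get_mlflow_tracking_uri())
mlflow.set_experiment("pytorch-mnist")

with mlflow.start_run():
    mlflow.log_param("lr", 0.01)              # hyperparameters
    mlflow.log_metric("test_accuracy", 0.97)  # metrics (illustrative value)
    # mlflow.pytorch.log_model(model, "model")  # models/artifacts, once a model is trained
```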
Adding soon:
- Single-node scikit-learn to distributed hyperparameter search using Hyperopt
- Single-node pandas to distributed using Koalas
- Pandas UDFs to distribute model training over different subsets of data or hyperparameters
- Using databricks automl-toolkit in Azure Databricks
- Using automl from AzureML in Azure Databricks
Other:
Overview of MLflow and its features
To reproduce the examples provided here, import the ml-azuredatabricks.dbc
file from the git root directory into your Databricks workspace.
Instructions on how to import notebooks in Databricks
Create a cluster - https://docs.microsoft.com/en-us/azure/databricks/clusters/create
GPU enabled Clusters - https://docs.microsoft.com/en-us/azure/databricks/clusters/gpu
Install a library/package - https://docs.microsoft.com/en-us/azure/databricks/libraries
Machine Learning Runtime - https://docs.microsoft.com/en-us/azure/databricks/runtime/mlruntime
To see the list of packages already available in each runtime - https://docs.microsoft.com/en-us/azure/databricks/release-notes/runtime/releases
For more information on using Azure Databricks:
https://docs.microsoft.com/en-us/azure/azure-databricks/