In Databricks, running code on an all-purpose cluster costs over 3x as much as running the same code on a job cluster. All Purpose Bypass provides a convenient way to quickly convert your notebook into a job, saving you time and money.
It is perfect for when you have a long-running command block or plan to leave a notebook running overnight.
Databricks
- This tool is meant to be used in Databricks workspaces
pip install in your Databricks Notebook
```
%pip install all_purpose_bypass
```
```python
from all_purpose_bypass import Bypass

# Databricks API Token (Found in User Settings)
api_token = '###############################'

bypass = Bypass(api_token)

# Create (or update) the job for the current notebook, then run it
job_id = bypass.create_job()
bypass.run_job(job_id)
```

```
>>> Job located at: https://my-workspace.cloud.databricks.com/?#job/571474934623337
>>> Job Running: run_id is 1535015
```
By default, Bypass creates a job named after the current notebook and assigns ownership to the current user. The job cluster it creates is a clone of the attached active all-purpose cluster. To make jobs easier to discover, a tag of all-purpose-bypass is assigned to every job. If the job already exists, its parameters/options are updated instead.
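Because every job it creates carries that tag, you can list them through the Databricks Jobs API. Below is a minimal sketch using requests against the standard jobs/list endpoint (the workspace URL is a placeholder, the helper name is hypothetical, and this is not part of all_purpose_bypass):

```python
import requests

# Hypothetical helper, not part of all_purpose_bypass: list jobs that
# carry the all-purpose-bypass tag. Pagination is omitted for brevity.
def list_bypass_jobs(host, api_token):
    resp = requests.get(
        f"{host}/api/2.1/jobs/list",
        headers={"Authorization": f"Bearer {api_token}"},
    )
    resp.raise_for_status()
    jobs = resp.json().get("jobs", [])
    # Keep only jobs tagged by All Purpose Bypass
    return [
        job for job in jobs
        if "all-purpose-bypass" in job.get("settings", {}).get("tags", {})
    ]

for job in list_bypass_jobs("https://my-workspace.cloud.databricks.com", api_token):
    print(job["job_id"], job["settings"]["name"])
```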
Note: It is possible to create cluster compatibility issues. Please check the Databricks create-cluster page to make sure your chosen options are compatible with each other.
There are a number of arguments you can pass to Bypass to modify the default behavior.
Parameters:
- new_cluster: pass in your own JSON-like dictionary with cluster configurations (see the sketch after this list). For example:

  ```json
  {
    "cluster_name": "autoscaling-cluster",
    "spark_version": "7.3.x-scala2.12",
    "node_type_id": "i3.xlarge",
    "autoscale": {"min_workers": 2, "max_workers": 50},
    "aws_attributes": {"availability": "SPOT", "zone_id": "us-west-2a"}
  }
  ```

  More examples: https://docs.databricks.com/dev-tools/api/latest/clusters.html#examples
- spark_version: override the spark_version inherited from the currently attached all-purpose cluster
- node_type_id: override the node_type_id inherited from the currently attached all-purpose cluster
- aws_attributes: override the aws_attributes inherited from the currently attached all-purpose cluster
- autoscale: override the autoscale settings inherited from the currently attached all-purpose cluster
  - if this parameter is set, do not use num_workers
- num_workers: override the num_workers inherited from the currently attached all-purpose cluster
  - if this parameter is set, do not use autoscale
- libraries: override the libraries inherited from the currently attached all-purpose cluster
- clusterId: clone a different existing all-purpose cluster instead of the currently attached one
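For instance, here is a sketch that passes a full new_cluster spec; the configuration values are the illustrative ones from above, not recommendations:

```python
from all_purpose_bypass import Bypass

# Databricks API Token (Found in User Settings)
api_token = '###############################'

# Illustrative cluster spec; tune these values for your own workload
custom_cluster = {
    "cluster_name": "autoscaling-cluster",
    "spark_version": "7.3.x-scala2.12",
    "node_type_id": "i3.xlarge",
    "autoscale": {"min_workers": 2, "max_workers": 50},
    "aws_attributes": {"availability": "SPOT", "zone_id": "us-west-2a"},
}

# The custom spec replaces the cloned all-purpose cluster configuration
bypass = Bypass(api_token, new_cluster=custom_cluster)
job_id = bypass.create_job()
bypass.run_job(job_id)
```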
Example:
```python
from all_purpose_bypass import Bypass

# Databricks API Token (Found in User Settings)
api_token = '###############################'

bypass = Bypass(api_token, node_type_id="i3.4xlarge", clusterId="1095-225741-yhdswzetj")
job_id = bypass.create_job()
bypass.run_job(job_id)
```

```
>>> Job located at: https://my-workspace.cloud.databricks.com/?#job/571474934623337
>>> Job Running: run_id is 1535015
```
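Once run_job prints a run_id, you can poll the run's status through the Databricks Runs API. A minimal sketch using plain requests (this is standard REST polling, not a feature of all_purpose_bypass; the workspace URL is a placeholder and the run_id is the one printed above):

```python
import requests

HOST = "https://my-workspace.cloud.databricks.com"  # placeholder workspace URL

# Fetch the current state of a run, e.g. run_id 1535015 from the output above
resp = requests.get(
    f"{HOST}/api/2.1/jobs/runs/get",
    headers={"Authorization": f"Bearer {api_token}"},
    params={"run_id": 1535015},
)
resp.raise_for_status()
state = resp.json()["state"]
# life_cycle_state: PENDING/RUNNING/TERMINATED; result_state appears once finished
print(state.get("life_cycle_state"), state.get("result_state"))
```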
Distributed under the MIT License. See LICENSE.txt for more information.