This initialization action installs H2O Sparkling Water on all nodes of a Google Cloud Dataproc cluster. It works with Dataproc image version 1.3 and newer, except the 1.5 image.
You can use this initialization action to create a new Dataproc cluster with H2O Sparkling Water installed:
- To create a Dataproc 1.3 cluster, use the `conda` initialization action:

  ```bash
  REGION=<region>
  CLUSTER_NAME=<cluster_name>
  gcloud dataproc clusters create ${CLUSTER_NAME} \
      --image-version 1.3 \
      --scopes "cloud-platform" \
      --initialization-actions "gs://goog-dataproc-initialization-actions-${REGION}/conda/bootstrap-conda.sh,gs://goog-dataproc-initialization-actions-${REGION}/h2o/h2o.sh"
  ```
- To create a Dataproc 1.4 cluster, use the `ANACONDA` optional component:

  ```bash
  REGION=<region>
  CLUSTER_NAME=<cluster_name>
  gcloud dataproc clusters create ${CLUSTER_NAME} \
      --image-version 1.4 \
      --optional-components ANACONDA \
      --scopes "cloud-platform" \
      --initialization-actions "gs://goog-dataproc-initialization-actions-${REGION}/h2o/h2o.sh"
  ```
- To create a Dataproc cluster with image version 2.0 or newer, you don't need any additional initialization actions or optional components:

  ```bash
  REGION=<region>
  CLUSTER_NAME=<cluster_name>
  gcloud dataproc clusters create ${CLUSTER_NAME} \
      --image-version 2.0 \
      --scopes "cloud-platform" \
      --initialization-actions "gs://goog-dataproc-initialization-actions-${REGION}/h2o/h2o.sh"
  ```
Submit a sample job:

```bash
REGION=<region>
CLUSTER_NAME=<cluster_name>
gcloud dataproc jobs submit pyspark --cluster ${CLUSTER_NAME} \
    "gs://goog-dataproc-initialization-actions-${REGION}/h2o/sample-script.py"
```
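The contents of `sample-script.py` are not reproduced here; as a rough sketch, a minimal Sparkling Water PySpark job might look like the following (hypothetical code, assuming the `pysparkling` package installed on the cluster by `h2o.sh`):

```python
# Minimal Sparkling Water job sketch (hypothetical; assumes pysparkling and
# h2o are available on the cluster, as installed by the h2o.sh init action).
from pyspark.sql import SparkSession
from pysparkling import H2OContext

spark = SparkSession.builder.appName("h2o-sample").getOrCreate()

# Start an H2O cluster inside this Spark application.
hc = H2OContext.getOrCreate()

# Convert a small Spark DataFrame to an H2OFrame.
df = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "label"])
h2o_frame = hc.asH2OFrame(df)
print(h2o_frame.dim)  # [rows, cols]

hc.stop()
spark.stop()
```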
The initialization action supports the following metadata parameter:

- `H2O_SPARKLING_WATER_VERSION`: Sparkling Water version number. You can find available versions on the Sparkling Water releases page on GitHub. The default is `3.30.1.2-1`.
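For example, to pin the Sparkling Water version at cluster creation, the metadata parameter can be passed with the standard `--metadata` flag (shown here with the default version, as a sketch):

```shell
REGION=<region>
CLUSTER_NAME=<cluster_name>
gcloud dataproc clusters create ${CLUSTER_NAME} \
    --image-version 2.0 \
    --scopes "cloud-platform" \
    --metadata "H2O_SPARKLING_WATER_VERSION=3.30.1.2-1" \
    --initialization-actions "gs://goog-dataproc-initialization-actions-${REGION}/h2o/h2o.sh"
```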