This [initialization action] (https://cloud.google.com/dataproc/init-actions) installs the latest version of HUE on a master node within a Google Cloud Dataproc cluster.
You can use this initialization action to create a new Dataproc cluster with Hue installed by:
-
Uploading a copy of this initialization action (
hue.sh
) to Google Cloud Storage. -
Using the
gcloud
command to create a new cluster with this initialization action. The following command will create a new cluster named<CLUSTER_NAME>
and specify the initialization action stored in<GCS_BUCKET>
gcloud dataproc clusters create <CLUSTER_NAME> \ --initialization-actions gs://<GCS_BUCKET>/hue.sh
Alternatively, you can start a regular dataproc cluster, [ssh to the master node] (https://cloud.google.com/dataproc/submit-job) (see SSH into instance), clone this repository and run ./hue.sh (as sudo)
- Once the cluster has been created, Hue is configured to run on port
8888
on the master node in a Dataproc cluster. To connect to the Hue web interface, you will need to create an SSH tunnel and use a SOCKS 5 Proxy with your web browser as described in the dataproc web interfaces documentation. In the opened web browser go to 'localhost:8888' and you should see the Hue UI.