HUE - Hadoop User Experience

This [initialization action](https://cloud.google.com/dataproc/init-actions) installs the latest version of Hue on the master node of a Google Cloud Dataproc cluster.

Using this initialization action

You can use this initialization action to create a new Dataproc cluster with Hue installed by:

  1. Uploading a copy of this initialization action (hue.sh) to Google Cloud Storage.

  2. Using the `gcloud` command to create a new cluster with this initialization action. The following command creates a new cluster named `<CLUSTER_NAME>` and specifies the initialization action stored in `<GCS_BUCKET>`:

    gcloud dataproc clusters create <CLUSTER_NAME> \
    --initialization-actions gs://<GCS_BUCKET>/hue.sh   
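The upload in step 1 can be sketched as follows. This is a minimal example that assumes the `gsutil` tool from the Google Cloud SDK is installed and that `hue.sh` is in the current directory; the bucket name `my-bucket` is a hypothetical placeholder. The command is only printed, so nothing is uploaded until you remove the leading `echo`.

```shell
# Hypothetical bucket name; replace with your own Cloud Storage bucket.
GCS_BUCKET="my-bucket"

# Location the cluster-creation command will read the script from.
INIT_ACTION_URI="gs://${GCS_BUCKET}/hue.sh"

# Dry run: print the upload command. Remove "echo" to actually copy the file.
echo gsutil cp hue.sh "${INIT_ACTION_URI}"
```

The resulting `gs://` URI is the value to pass to `--initialization-actions` in step 2.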

Alternatively, you can start a regular Dataproc cluster, [SSH to the master node](https://cloud.google.com/dataproc/submit-job) (see "SSH into instance"), clone this repository, and run `./hue.sh` with `sudo`.
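The manual route might look roughly like the sketch below. It assumes a standard single-master cluster, where the master node is named `<CLUSTER_NAME>-m`; the cluster name, zone, and `<REPOSITORY_URL>` are hypothetical placeholders you would replace with your own values. The SSH command is only printed, so remove the leading `echo` to run it.

```shell
# Hypothetical values; replace with your own cluster name and zone.
CLUSTER_NAME="my-cluster"
ZONE="us-central1-a"

# Assumption: single-master cluster, whose master node is "<cluster>-m".
MASTER_NODE="${CLUSTER_NAME}-m"

# Dry run: print the SSH command. Remove "echo" to open the session.
echo gcloud compute ssh "${MASTER_NODE}" --zone "${ZONE}"

# Once logged in to the master node, the remaining steps would be:
#   git clone <REPOSITORY_URL>      # URL of this repository (placeholder)
#   cd <directory containing hue.sh>
#   sudo ./hue.sh
```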

  3. Once the cluster has been created, Hue runs on port 8888 on the master node. To connect to the Hue web interface, create an SSH tunnel and use a SOCKS5 proxy with your web browser, as described in the Dataproc web interfaces documentation. In the proxied browser, go to `localhost:8888` and you should see the Hue UI.
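As a rough illustration of the tunnel described above, the sketch below builds a `gcloud compute ssh` command that opens a SOCKS proxy on a local port. The cluster name, zone, and local port 1080 are assumptions chosen for the example, and the master node name assumes a single-master cluster; the Dataproc web interfaces documentation remains the authoritative reference. The command is printed rather than executed.

```shell
# Hypothetical values; replace with your own cluster name and zone.
CLUSTER_NAME="my-cluster"
ZONE="us-central1-a"
MASTER_NODE="${CLUSTER_NAME}-m"   # assumption: single-master cluster

# Local port for the SOCKS proxy (assumed; any free local port works).
SOCKS_PORT="1080"

# Arguments after "--" are passed to ssh:
#   -D opens a dynamic (SOCKS) port forward, -N runs no remote command.
TUNNEL_CMD="gcloud compute ssh ${MASTER_NODE} --zone ${ZONE} -- -D ${SOCKS_PORT} -N"

# Dry run: print the command. Run it directly to open the tunnel.
echo "${TUNNEL_CMD}"
```

With the tunnel open, configure your browser to use `localhost:1080` as a SOCKS5 proxy before opening the Hue UI.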