Spark is compatible with Hadoop filesystems and formats, which allows it to access both HDFS and S3. The Spark build installed on EMR (described at https://github.com/awslabs/emr-bootstrap-actions/tree/master/spark) lets a Spark application access S3 out of the box, with no additional configuration. For example, if a cluster is created with IAM roles (http://docs.aws.amazon.com/Elast