redshift-benchmark

This project includes the data set for Redshift benchmark.

SlideShare
- amazon redshift is 10x faster and cheaper than hadoop hive (English)
- amazon redshift is 10x faster and cheaper than hadoop hive (Japanese)

Resources

Test Data
- s3://hapyrus-examples/redshift-benchmark/ad-network-examples/case-01/ad_campaigns
- s3://hapyrus-examples/redshift-benchmark/ad-network-examples/case-01/advertisers
- s3://hapyrus-examples/redshift-benchmark/ad-network-examples/case-01/publishers
- s3://hapyrus-examples/redshift-benchmark/ad-network-examples/case-01/imp_logs
- s3://hapyrus-examples/redshift-benchmark/ad-network-examples/case-01/click_logs

Requirements

Redshift cluster
- required to a launch a Redshift cluster
- minimum instance type is enough
  - dw.hs1.xlarge single node
- see Amazon Redshift Getting Started Guide
Local environment
- postgresql client

Import the data set into Redshift cluster

Create tables
- run sql/create_tables_redshift.sql on your Redshift cluster
Copy the test data set on our s3 to your Redshift cluster(it took over 17 hours in our case)
- edit sql/copy_all_[data-size].sql and write your own [aws-access-key-id] and [aws-secret-access-key].
- run sql/copy_all_[data-size].sql on your Redshift cluster
Run the test sql
- run sql/test-query.sql on your Redshift cluster

Hadoop + Hive

See sql/create_tables_hadoop_hive.sql to create Hive tables.
Run the test sql
- see sql/test-query.sql

License

This project is liscensed under the Apache License, Version 2.0 and powered by Hapyrus Inc.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
scripts		scripts
sql		sql
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

redshift-benchmark

Resources

Requirements

Import the data set into Redshift cluster

Hadoop + Hive

License

About

Releases

Packages

Contributors 2

Languages

flydata/redshift-benchmark

Folders and files

Latest commit

History

Repository files navigation

redshift-benchmark

Resources

Requirements

Import the data set into Redshift cluster

Hadoop + Hive

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages