mapreduce: All content about mapreduce in NoSQL databases and polyglot persistence
Saturday, 22 November 2014
Stripe's Hadoop tools open sourced
Stripe has put on GitHub 4 Hadoop related projects they’ve developed internally:
- a dashboard for Hadoop jobs
- a Scala framework for distributed learning
- a database for serving data in SequenceFile format
- a collection of command-line utilities.
As a side note, Stripe is using Cloudera Impala with Parquet.
Original title and link: Stripe’s Hadoop tools open sourced ( ©myNoSQL)