ALL COVERED TOPICS

NoSQLBenchmarksNoSQL use casesNoSQL VideosNoSQL Hybrid SolutionsNoSQL PresentationsBig DataHadoopMapReducePigHiveFlume OozieSqoopHDFSZooKeeperCascadingCascalog BigTableCassandraHBaseHypertableCouchbaseCouchDBMongoDBOrientDBRavenDBJackrabbitTerrastoreAmazon DynamoDBRedisRiakProject VoldemortTokyo CabinetKyoto CabinetmemcachedAmazon SimpleDBDatomicMemcacheDBM/DBGT.MAmazon DynamoDynomiteMnesiaYahoo! PNUTS/SherpaNeo4jInfoGridSones GraphDBInfiniteGraphAllegroGraphMarkLogicClustrixCouchDB Case StudiesMongoDB Case StudiesNoSQL at AdobeNoSQL at FacebookNoSQL at Twitter

NAVIGATE MAIN CATEGORIES

Close

mapreduce: All content about mapreduce in NoSQL databases and polyglot persistence

Stripe's Hadoop tools open sourced

Stripe has put on GitHub 4 Hadoop related projects they’ve developed internally:

  1. a dashboard for Hadoop jobs
  2. a Scala framework for distributed learning
  3. a database for serving data in SequenceFile format
  4. a collection of command-line utilities.

As a side note, Stripe is using Cloudera Impala with Parquet.

Original title and link: Stripe’s Hadoop tools open sourced (NoSQL database©myNoSQL)

via: https://stripe.com/blog/four-new-hadoop-projects