Posts about NoSQL databases and Polyglot persistence from Thursday, 20 May 2010

Lessons from Redis-Land

Short summary of an adventurous journey in the NoSQL world with Redis.

Leaving aside the introductory part, what I enjoyed most is the analysis of a simple use case: determining how to map many-to-many relationships onto Redis. This part follows pretty closely the process of data modeling with key-value stores I’ve described before.

via: http://labs.alcacoop.it/doku.php?id=articles:redis_land
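To make the modeling pattern concrete, here is a minimal sketch of the usual way a many-to-many relationship (say, articles and tags — my example, not necessarily the article’s) is mapped onto a key-value store like Redis: one set per side of the relationship, kept in sync on every write. Plain dicts of sets stand in for a Redis server here; with a real client the calls would be `SADD` and `SINTER`.

```python
from collections import defaultdict

store = defaultdict(set)  # key -> set, standing in for Redis sets

def tag_article(article_id, tag):
    # SADD article:<id>:tags <tag>
    store[f"article:{article_id}:tags"].add(tag)
    # SADD tag:<tag>:articles <id>  -- the reverse index
    store[f"tag:{tag}:articles"].add(article_id)

def articles_with_all(*tags):
    # SINTER tag:<t1>:articles tag:<t2>:articles ...
    sets = [store[f"tag:{t}:articles"] for t in tags]
    return set.intersection(*sets) if sets else set()

tag_article(1, "redis")
tag_article(1, "nosql")
tag_article(2, "nosql")
print(articles_with_all("redis", "nosql"))  # {1}
```

The key design point is that there is no join: the reverse index (`tag:<tag>:articles`) must be written explicitly at update time, which is exactly the kind of denormalization the data-modeling process above walks through.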


Google BigQuery SQL-like API

At Google I/O 2010, Google announced, but hasn’t yet launched, a new API for ad-hoc analysis, reporting, and data exploration over massively large datasets: ☞ BigQuery. What I find interesting is that BigQuery uses ☞ an SQL flavor, instead of MapReduce, Hive, or Pig.

It still strikes me that Google hasn’t yet figured out a way to expose access to its MapReduce implementation. Judging by the numbers in the industry, I’d say that by now Hadoop is probably handling the largest volumes of data.
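The SQL-vs-MapReduce distinction is easy to see on a toy query. This is an illustrative contrast only (not BigQuery’s actual API, and the data is made up): the same “count rows per key” question, once as the one-line SQL you’d send to an SQL-flavored service, once as the map and reduce phases you’d write by hand for a MapReduce-style framework.

```python
from itertools import groupby
from operator import itemgetter

rows = [("us", 1), ("de", 1), ("us", 1), ("fr", 1), ("us", 1)]

# SQL flavor: the whole aggregation is declarative.
sql = "SELECT country, COUNT(*) FROM hits GROUP BY country"

# MapReduce flavor: emit (key, 1) pairs, shuffle/sort, then sum.
def map_phase(rows):
    return [(country, 1) for country, _ in rows]

def reduce_phase(pairs):
    pairs.sort(key=itemgetter(0))  # the shuffle/sort step
    return {k: sum(v for _, v in g)
            for k, g in groupby(pairs, key=itemgetter(0))}

print(reduce_phase(map_phase(rows)))  # {'de': 1, 'fr': 1, 'us': 3}
```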


NoSQL Ecosystem News & Links 2010-05-20

  • Running CouchDB on the Android powered Motorola Droid
  • ☞ Riakfuse: Filesystem backed by Riak. Cool!

NoSQL Case Study: Migrating to HBase/Hadoop to Handle Firefox Crash Reports at Mozilla

What would you do if you had to process 2.5 million crash reports a day, amounting to around 320GB of data, with an architecture like the one below?

[image: Mozilla’s current crash-report processing architecture]

This is exactly the scenario of handling Firefox crash reports at Mozilla. And according to their ☞ blog, all this will change pretty soon with the migration to HBase and Hadoop.

However, we are in the process of moving to Hadoop, and currently all our crashes are also being written to HBase. Soon this will become our main data storage, and we’ll be able to do a lot more interesting things with the data. We’ll also be able to process 100% of crashes.

Some of the steps involved in getting to the new, simplified architecture depicted below:

[…] we will no longer be relying on NFS in production. All crash report submissions are already stored in HBase, but with Socorro 1.7, we will retrieve the data from HBase for processing and store the processed result back into HBase.

[…]

[…] we will migrate the processors and minidump_stackwalk instances to run on our Hadoop nodes, further distributing our architecture. This will give us the ability to scale up to the amount of data we have as it grows over time.
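The collect/process/write-back flow described above can be sketched in a few lines. This is a hypothetical illustration of the pattern, not Socorro’s actual code: a dict stands in for HBase, and the function and field names are my own invention. Raw submissions are written straight to the store, and a processor later reads each raw report, runs its analysis (the stack-walking step in Mozilla’s case), and stores the result back under the same row.

```python
import json

hbase = {}  # crash_id -> {"raw": ..., "processed": ...}; stand-in for HBase

def submit_crash(crash_id, raw_dump):
    # Collector: write the raw submission straight to the store.
    hbase[crash_id] = {"raw": raw_dump, "processed": None}

def process_crash(crash_id):
    # Processor: read the raw data back, analyze it, store the result.
    raw = hbase[crash_id]["raw"]
    processed = {"signature": raw.get("top_frame", "unknown"),
                 "size": len(json.dumps(raw))}
    hbase[crash_id]["processed"] = processed
    return processed

submit_crash("abc-123", {"top_frame": "nsAppShell::Run", "threads": 4})
print(process_crash("abc-123")["signature"])  # nsAppShell::Run
```

Because raw and processed data live in the same store, processors can run anywhere the store is reachable, which is what makes moving them onto the Hadoop nodes straightforward.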

[image: the new, simplified HBase/Hadoop-based architecture]

Nice!