GitHub - dataux/dataux at ead7716b5d974813bd83d5bfc6c63796a2ca5459

SQL Query Proxy to Elasticsearch, Mongo, Etc

A MySQL TCP proxy to Elasticsearch, Mongo, and MySQL backend sources, including joins.

This is an early prototype, not production ready. It is wire compatible with MySQL, and it implements a relational algebra engine to map SQL queries to one or more backends.
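As a rough illustration of mapping one query onto "one or more backends", the sketch below resolves the tables named in a query to the backends that serve them. The `Catalog` type, the `BackendsFor` method, and the table and backend names are all hypothetical and are not taken from the dataux source.

```go
package main

import (
	"fmt"
	"sort"
)

// Catalog maps a table name to the backend that serves it.
// Hypothetical type: the real project may organize schemas differently.
type Catalog map[string]string

// BackendsFor returns the sorted, de-duplicated list of backends
// needed to answer a query that touches the given tables.
func (c Catalog) BackendsFor(tables []string) ([]string, error) {
	seen := map[string]bool{}
	var out []string
	for _, t := range tables {
		b, ok := c[t]
		if !ok {
			return nil, fmt.Errorf("unknown table %q", t)
		}
		if !seen[b] {
			seen[b] = true
			out = append(out, b)
		}
	}
	sort.Strings(out)
	return out, nil
}

func main() {
	// Assumed example layout: one table per backend.
	c := Catalog{"accounts": "mongo", "github_push": "elasticsearch"}
	backends, err := c.BackendsFor([]string{"accounts", "github_push"})
	fmt.Println(backends, err) // prints "[elasticsearch mongo] <nil>"
}
```

A query that resolves to a single backend could be pushed down whole, while one that resolves to several would need the proxy to combine the results itself.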

Why? An experiment to see if it is possible. More and more databases for more and more specialized needs seem to be the norm, and this is an attempt to translate them back into a cohesive data model.

SQL -> Mongo

Mongo	SQL Query
`show collections`	`show tables;`
n/a (runtime inspection)	`describe mytable;`
`db.accounts.find({created:{"$gt":"1/1/2016"}}).count();`	`select count(*) from accounts WHERE created > "1/1/2016";`
	`select min(year), max(year), avg(year), sum(year) from table WHERE exists(a);`
	`select * from table WHERE year IN (2015,2014,2013);`
	`select * from table WHERE year BETWEEN 2012 AND 2014`

SQL -> Elasticsearch Api

ES API	SQL Query
Aliases	`show tables;`
Mapping	`describe mytable;`
hits.total for filter	`select count(*) from table WHERE exists(a);`
aggs min, max, avg, sum	`select min(year), max(year), avg(year), sum(year) from table WHERE exists(a);`
filter: terms	`select * from table WHERE year IN (2015,2014,2013);`
filter: gte, range	`select * from table WHERE year BETWEEN 2012 AND 2014`

See tools/importgithub for a tool that imports 2 days of GitHub data for the examples above.

# to run the queries above, first run the test data import

go get -u github.com/dataux/dataux
cd $GOPATH/src/github.com/dataux/dataux/tools/importgithub
go build
./importgithub  ## imports 200k+ docs from the GitHub archive

Other Projects, Database Proxies & Multi-Data QL

Data-Accessibility: Making it easier to query, access, share, and use data. Protocol shifting (for accessibility). Sharing/replication between db types.
Scalability/Sharding: Implement sharding, connection sharing.

Name	Scaling	Ease Of Access (sql, etc)	Comments
Couchbase N1QL	Y	Y	sql interface to couchbase k/v (and full-text-index)
prestodb		Y	not really a proxy, more of a query front end
cratedb	Y	Y	all-in-one db, not a proxy, sql to es
Vitess	Y		for scaling (sharding), very mature
twemproxy	Y		for scaling memcache
codis	Y		for scaling redis
MariaDB MaxScale	Y		for scaling mysql/mariadb (sharding) mature
Netflix Dynomite	Y		not really sql, just multi-store k/v
redishappy	Y		for scaling redis, haproxy
mixer	Y		simple mysql sharding

We use more and more databases, flat files, message queues, etc. For a given database, the primary reader/writer is fine, but secondary uses such as investigating ad-hoc issues mean we end up learning and using many different query languages.

Credit to mixer: the MySQL connection pieces are derived from it (and it in turn was forked from vitess).

Roadmap(ish)

Elasticsearch: Make Elasticsearch more accessible through SQL
Mongo: Extend SQL with search-specific syntax
Backends: Redis, CSV, Json-FlatFiles, Kafka
Cross-DB Join
Event-Bus for Writes
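One common way to approach the Cross-DB Join item is an in-proxy hash join over rows fetched from two backends. The sketch below is an assumption about how that might look, not the project's planner: the `Row` type, the `HashJoin` function, and the sample column names are hypothetical. It assumes join key values are comparable Go values of the same type on both sides (for example, both `int` or both `string`).

```go
package main

import "fmt"

// Row is one record from any backend, keyed by column name.
type Row map[string]interface{}

// HashJoin performs an inner equi-join of left and right rows
// where left[leftKey] == right[rightKey].
func HashJoin(left, right []Row, leftKey, rightKey string) []Row {
	// Build phase: index the right side by its join key.
	index := make(map[interface{}][]Row)
	for _, r := range right {
		if k, ok := r[rightKey]; ok {
			index[k] = append(index[k], r)
		}
	}
	// Probe phase: look up each left row and merge matching pairs.
	var out []Row
	for _, l := range left {
		k, ok := l[leftKey]
		if !ok {
			continue
		}
		for _, r := range index[k] {
			merged := Row{}
			for col, v := range r {
				merged[col] = v
			}
			for col, v := range l {
				merged[col] = v // left side wins on a column name clash
			}
			out = append(out, merged)
		}
	}
	return out
}

func main() {
	// Pretend these came from two different backends.
	users := []Row{
		{"user_id": 1, "name": "ann"},
		{"user_id": 2, "name": "bob"},
	}
	events := []Row{
		{"actor": 1, "event": "push"},
		{"actor": 1, "event": "fork"},
		{"actor": 3, "event": "push"},
	}
	for _, row := range HashJoin(users, events, "user_id", "actor") {
		fmt.Println(row["name"], row["event"])
	}
	// prints:
	// ann push
	// ann fork
}
```

A hash join keeps only one side in memory, so in practice the smaller result set would be used for the build phase.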

Inspiration/Other works

In Internet architectures, data systems are typically categorized into source-of-truth systems that serve as primary stores for the user-generated writes, and derived data stores or indexes which serve reads and other complex queries. The data in these secondary stores is often derived from the primary data through custom transformations, sometimes involving complex processing driven by business logic. Similarly data in caching tiers is derived from reads against the primary data store, but needs to get invalidated or refreshed when the primary data gets mutated. A fundamental requirement emerging from these kinds of data architectures is the need to reliably capture, flow and process primary data changes.

from Databus

