[B! sql][Hadoop] yassã®ãƒ–ãƒƒã‚¯ãƒžãƒ¼ã‚¯

yass id:yass

sqlã¨Hadoopã«é–¢ã™ã‚‹yassã®ãƒ–ãƒƒã‚¯ãƒžãƒ¼ã‚¯ (21)

${{author_name}}$

{{{comment_expanded}}}

{{label}}

{{#is_bookmark}}ãƒªã‚¹ãƒˆ{{/is_bookmark}}{{^is_bookmark}}ãƒªãƒ³ã‚¯{{/is_bookmark}}

${{author_name}}$
{{author_name}}{{created}}
{{ #comment }}{{ comment }}{{ /comment }}
- {{ label }}

${{author_name}}$

{{{comment_expanded}}}

{{label}}

{{#is_bookmark}}ãƒªã‚¹ãƒˆ{{/is_bookmark}}{{^is_bookmark}}ãƒªãƒ³ã‚¯{{/is_bookmark}}

Inside Yellow Pages' SQL-on-Hadoop Journey
yass 2016/02/11
hadoop

vertica

impala

orc

parquet

sql
ãƒªãƒ³ã‚¯
Discover Scaling Analytics at Amplitude | Amplitude | Amplitude
Get fast and easy access to customer insights at every step of their journey
yass 2015/08/30
hadoop

sql

lambda architecture

s3
ãƒªãƒ³ã‚¯
SQL on Hadoop æ¯”è¼ƒæ¤œè¨¼ ã€2014æœˆ11æ—¥ã«ãŠã‘ã‚‹æ¤œè¨¼ãƒ¬ãƒãƒ¼ãƒˆã€‘
Impala Meetup 2014/10/31 @Tokyo è¬›æ¼”è³‡æ–™ ã€æ³¨æ„äº‹é …ã€‘ æœ¬è³‡æ–™ã§ç´¹ä»‹ã—ã¦ã„ã‚‹æ¤œè¨¼çµæžœã¯2014å¹´å½“æ™‚ã®ã‚‚ã®ã§ã™ã€‚å½“è©²ã‚½ãƒ•ãƒˆã‚¦ã‚§ã‚¢ã¯æˆé•·ã‚„æ”¹å–„ãŒæ—©ãã€ç¾æ™‚ç‚¹ã®ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã§ã¯å¤§ããç•°ãªã‚‹æ©Ÿèƒ½ã‚„æ€§èƒ½ã¨ãªã£ã¦ã„ã¾ã™ã€‚ SQL on Hadoopã®æœ€æ–°æƒ…å ±ã«åŸºã¥ãã‚µãƒ¼ãƒ“ã‚¹ã‚„ã‚·ã‚¹ãƒ†ãƒ ã‚¤ãƒ³ãƒ†ã‚°ãƒ¬ãƒ¼ã‚·ãƒ§ãƒ³ã«ã”èˆˆå‘³ã‚’ãŠæŒã¡ã®æ–¹ã¯ã€NTTãƒ‡ãƒ¼ã‚¿ åŸºç›¤ã‚·ã‚¹ãƒ†ãƒ äº‹æ¥æœ¬éƒ¨ OSSãƒ—ãƒãƒ•ã‚§ãƒƒã‚·ãƒ§ãƒŠãƒ«ã‚µãƒ¼ãƒ“ã‚¹ï¼ˆé›»åãƒ¡ãƒ¼ãƒ«ï¼š hadoop [AT] kits.nttdata.co.jpï¼‰ ã«ã”ç›¸è«‡ãã ã•ã„ã€‚Read less
yass 2014/11/05
Hadoop

benchmark

comparison

hive

Impala

sql

presto

impala

tez
ãƒªãƒ³ã‚¯
SQL on Hadoop in Taiwan
This document discusses SQL engines for Hadoop, including Hive, Presto, and Impala. Hive is best for batch jobs due to its stability. Presto provides interactive queries across data sources and is easier to manage than Hive with Tez. Presto's distributed architecture allows queries to run in parallel across nodes. It supports pluggable connectors to access different data stores and has language bi
yass 2014/09/27
presto

hadoop

sql
ãƒªãƒ³ã‚¯
Cloudera Blog
We are thrilled to announce the general availability of the Cloudera AI Inference service, powered by NVIDIA NIM microservices, part of the NVIDIA AI Enterprise platform, to accelerate generative AI deployments for enterprises. This service supports a range of optimized AI models, enabling seamless and scala ble AI inference. Background The generative AI landscape is evolving [â€¦] Read blog post
yass 2014/09/05
impala

hadoop

cloudera

sql
ãƒªãƒ³ã‚¯
Cloudera Blog
We are thrilled to announce the general availability of the Cloudera AI Inference service, powered by NVIDIA NIM microservices, part of the NVIDIA AI Enterprise platform, to accelerate generative AI deployments for enterprises. This service supports a range of optimized AI models, enabling seamless and scala ble AI inference. Background The generative AI landscape is evolving [â€¦] Read blog post
yass 2014/09/04
" a new approach using a hybrid engine that leverages Tez and something new called Â LLAP (Live Long and Process, #llap online). "

Hadoop

hive

stinger

sql

tez
ãƒªãƒ³ã‚¯
War of the Hadoop SQL engines. And the winner is ...? - Sonra
War of the Hadoop SQL engines. And the winner is â€¦? You may have wondered why we were quiet over the last couple of weeks? Well, we locked ourselves into the basement and did some research and a couple of projects and PoCs on Hadoop, Big Data, and distributed processing frameworks in general. We were also looking at Clickstream data and Web Analytics solutions. Over the next couple of weeks we wil
yass 2014/07/28
" Right now I would run both batch style queries (ETL) and interactive queries on Hive Tez as Hive offers the richest SQL feature set, especially analytic functions and supports a wide set of file formats. "

hadoop

sql

hive

tez

impala

presto

spark

infinidb

drill
ãƒªãƒ³ã‚¯
Apache Drill at ApacheCon2014
Lot of workloads exist for Big data, batch, machine learning, search, interactive SQL, Operational/user facing applicationsApache Drill fits into the interactive SQL category Analytics on Semi-Structured/Nested dataUse standard SQL to query Nested data without upfront flattening/modelingExtensions to ANSI SQL to operate on nested dataGeneric architecture for a broad variety of nested data types (e
yass 2014/06/25
" Current state : Alpha â€¢ Timeline 1.0 Beta (End of Q2, 2014) 1.0 GA (Q3, 2014) "

drill

hadoop

sql

MapR
ãƒªãƒ³ã‚¯
Yahoo Betting on Apache Hive, Tez, and YARN
Low-latency SQL queries, Business Intelligence (BI), and Data Discovery on Big Data are some of the hottest topics these days in the industry with a range of solutions coming to life lately to address them as either proprietary or open-source implementations on top of Hadoop.Â Some of the popular ones talked about in the Big Data communities are Hive, Presto, Impala, Shark, and Drill. Yahoo has tr
yass 2014/05/17
" Hive 0.13 execution times were comparable or better than Shark on a 100 node cluster. "

hadoop

hive

yahoo

stinger

sql

Tez

shrak
ãƒªãƒ³ã‚¯
Apache Drill: Building Highly Flexible, High Performance Query Engines by M.C. Srivas, Co-founder and CTO at MapR
Apache Drill: Building Highly Flexible, High Performance Query Engines by M.C. Srivas, Co-founder and CTO at MapR SQL is one of the most widely used languages to access, analyze, and manipulate structured data. As Hadoop gains traction within enterprise data architectures across industries, the need for SQL for both structured and loosely-structured data on Hadoop is growing rapidly Apache Drill s
yass 2014/05/16
drill

hadoop

sql
ãƒªãƒ³ã‚¯
Under Construction | Home
yass 2014/05/03
impala

vertica

hadoop

sql

benchmark
ãƒªãƒ³ã‚¯
Apache Hadoop Distribution | MapR
Your HPE MyAccount provides you with: Single sign-on to the HPE ecosystem Personalized recommendations Test drives and other trials And many more exclusive benefits
yass 2014/02/28
hadoop

hive

drill

impala

presro

shark

spark

comparison

sql
ãƒªãƒ³ã‚¯
å•†ç”¨Hadoopãƒ‡ã‚£ã‚¹ãƒˆãƒã€ŒPivotal HDã€ã¨DBã‚¨ãƒ³ã‚¸ãƒ³ã€ŒHAWQã€ã‚’æä¾›é–‹å§‹
yass 2014/02/07
hadoop

sql

HAWQ

pivotal
ãƒªãƒ³ã‚¯
Hadoopï¼‹SQLï¼‹ã‚¤ãƒ³ãƒ¡ãƒ¢ãƒªã€ãƒžãƒ«ãƒã‚¯ãƒ©ã‚¦ãƒ‰å¯¾å¿œã®ã€ŒPivotal Oneã€ãƒ—ãƒ©ãƒƒãƒˆãƒ•ã‚©ãƒ¼ãƒ ç™ºè¡¨ã€‚EMC World 2013
Hadoopï¼‹SQLï¼‹ã‚¤ãƒ³ãƒ¡ãƒ¢ãƒªã€ãƒžãƒ«ãƒã‚¯ãƒ©ã‚¦ãƒ‰å¯¾å¿œã®ã€ŒPivotal Oneã€ãƒ—ãƒ©ãƒƒãƒˆãƒ•ã‚©ãƒ¼ãƒ ç™ºè¡¨ã€‚EMC World 2013 EMCãŒãƒ©ã‚¹ãƒ™ã‚¬ã‚¹ã§é–‹å‚¬ä¸ã®ã‚¤ãƒ™ãƒ³ãƒˆã€ŒEMC World 2013ã€ã€‚2æ—¥ç›®ã®åŸºèª¿è¬›æ¼”ã«ã¯ã€EMCã¨VMwareãŒè¨ç«‹ã—ãŸæ–°ä¼šç¤¾ã€ŒPivotalã€ã®CEO ãƒãƒ¼ãƒ«ãƒ»ãƒžãƒªãƒƒãƒ„ï¼ˆPaul Maritzï¼‰æ°ãŒç™»å£‡ã—ã€ã‚¯ãƒ©ã‚¦ãƒ‰æ™‚ä»£ã®ã‚¢ãƒ—ãƒªã‚±ãƒ¼ã‚·ãƒ§ãƒ³åŸºç›¤ã¨ãªã‚‹ã€ŒPivotal Oneã€ã‚’ç™ºè¡¨ã—ã¾ã—ãŸã€‚ Pivotalã¯ã€EMCãŒè²·åŽã—ãŸGreenplumã‚„é–‹ç™ºã‚³ãƒ³ã‚µãƒ«ã‚¿ãƒ³ãƒˆã®Pivotal Labsã€VMwareãŒè²·åŽã—ãŸSpring Sourceã‚„CloudFoundryãªã©ã®ãƒãƒ¼ãƒ ã‚’é›†ã‚ã¦12æœˆã«ç™ºè¶³ã—ãŸçµ„ç¹”ã€‚ä»Šæœˆã‹ã‚‰æ£å¼ãªä¼æ¥ã¨ã—ã¦ã®æ´»å‹•ã‚’é–‹å§‹ã—ã¦ã„ã¾ã™ã€‚ Pivotal Oneã¯ã€ãƒ“ãƒƒã‚°ãƒ‡ãƒ¼ã‚¿ã¨ã‚¯ãƒ©ã‚¦ãƒ‰æ™‚ä»£ã®ã‚¢ãƒ—ãƒªã‚±ãƒ¼ã‚·ãƒ§ãƒ³åŸºç›¤ã¨ã—ã¦ã€åŒç¤¾ãŒä»Šå¹´æœ«ã«ãƒªãƒªãƒ¼ã‚¹äºˆå®š
yass 2014/02/06
hadoop

EMC

pivotal

hdfs

greenplum

sql
ãƒªãƒ³ã‚¯
ã‚ªãƒ¼ãƒ—ãƒ³ã‚½ãƒ¼ã‚¹ã®SQL-in-Hadoopã‚½ãƒªãƒ¥ãƒ¼ã‚·ãƒ§ãƒ³:æˆ‘ã€…ã¯ã„ã¾ã©ã“ã«ï¼Ÿ
Spring Bootã«ã‚ˆã‚‹APIãƒãƒƒã‚¯ã‚¨ãƒ³ãƒ‰æ§‹ç¯‰å®Ÿè·µã‚¬ã‚¤ãƒ‰ ç¬¬2ç‰ˆ ä½•åƒäººã‚‚ã®é–‹ç™ºè€…ãŒã€InfoQã®ãƒŸãƒ‹ãƒ–ãƒƒã‚¯ã€ŒPractical Guide to Building an API Back End with Spring Bootã€ã‹ã‚‰ã€Spring Bootã‚’ä½¿ã£ãŸREST APIæ§‹ç¯‰ã®åŸºç¤Žã‚’å¦ã‚“ã ã€‚ã“ã®æœ¬ã§ã¯ã€å‡ºç‰ˆæ™‚ã«æ–°ã—ããƒªãƒªãƒ¼ã‚¹ã•ã‚ŒãŸãƒãƒ¼ã‚¸ãƒ§ãƒ³ã§ã‚ã‚‹ Spring Boot 2 ã‚’ä½¿ç”¨ã—ã¦ã„ã‚‹ã€‚ã—ã‹ã—ã€Spring Boot3ãŒæœ€è¿‘ãƒªãƒªãƒ¼ã‚¹ã•ã‚Œã€é‡è¦ãªå¤‰...
yass 2014/01/16
hadoop

sql

drill

presto

impala
ãƒªãƒ³ã‚¯
Teradata Presto | Product Details | Open Source
Teradata Blogs When big data becomes vast, what's your data dropping strategy? Read more Support Teradata at Your Service (TAYS) Simple, secure customer access to products, services, education, and support function information. Read more Certifications Teradata Certified Professional Program (TCPP) Management, development, and oversight of the premiere Teradata Certification Program. Read more Con
yass 2013/11/02
" SQL processed by a specialized (Google-inspired) SQL engine that sits on a Hadoop cluster. Both Impala and Drill fall into this category. Impala is inspired by Googleâ€™s F1 project and Drill by Googleâ€™s Dremel project. "

hadoop

impala

drill

stinger

hadapt

hive

sql
ãƒªãƒ³ã‚¯
Don't use Hadoop - your data isn't that big
"So, how much experience do you have with Big Data and Hadoop?" they asked me. I told them that I use Hadoop all the time, but rarely for jobs larger than a few TB. I'm basically a big data neophite - I know the concepts, I've written code, but never at scale. The next question they asked me. "Could you use Hadoop to do a simple group by and sum?" Of course I could, and I just told them I needed t
yass 2013/09/18
" If you have a single table containing many terabytes of data, Hadoop might be a good option for running full table scans on it. If you donâ€™t have such a table, avoid Hadoop like the plague. / Hadoop does not have any conception of indexing. Hadoop has only full table scans. "

hadoop

sql

bigdata
ãƒªãƒ³ã‚¯
Hadoopã®ã‚»ã‚«ãƒ³ãƒ€ãƒªã‚½ãƒ¼ãƒˆã‚’é¿ã‘ã€ã‚ˆã‚Šé«˜é€Ÿã«å€¤ã‚’ã‚½ãƒ¼ãƒˆã™ã‚‹æ–¹æ³•
Hadoopã®Reduceã«æ¸¡ã•ã‚Œã‚‹ã®ã¯ã‚ãƒ¼ã¨å€¤ã®ãƒªã‚¹ãƒˆã ãŒã€ã“ã®ã¨ãå€¤ã®ãƒªã‚¹ãƒˆã«å«ã¾ã‚Œã‚‹å„ã‚¢ã‚¤ãƒ†ãƒ ï¼ˆå€¤ãã®ã‚‚ã®ï¼‰ã¯ã‚½ãƒ¼ãƒˆã•ã‚Œã¦ã„ãªã„ã€‚ã‚½ãƒ¼ãƒˆã•ã‚Œã¦ã„ã¦æ¬²ã—ã„å ´åˆã«ã¯ã‚»ã‚«ãƒ³ãƒ€ãƒªã‚½ãƒ¼ãƒˆã¨å‘¼ã°ã‚Œã‚‹ãƒ†ã‚¯ãƒ‹ãƒƒã‚¯ã‚’ä½¿ã†ã®ãŒå®šçŸ³ã¨ã•ã‚Œã¦ã„ã‚‹ãŒã€ã“ã‚Œã¯å®Ÿè£…ã®é¢ã§ã‚‚æ¦‚å¿µçš„ãªé¢ã§ã‚‚ãƒãƒƒãƒ‰ãƒŽã‚¦ãƒã‚¦çš„ãªå´é¢ãŒã‚ã‚‹ã€‚Hadoopã«ã¯ã€Œã‚ãƒ¼ã‚’ã‚½ãƒ¼ãƒˆã™ã‚‹ã€æ©Ÿèƒ½ã¯å®Ÿè£…ã•ã‚Œã¦ã„ã‚‹ã€‚ãã“ã§ã€å€¤ã‚’ã‚ãƒ¼ã«å…¥ã‚Œã¦ã—ã¾ã„ã€ã“ã®Hadoopã«å‚™ã‚ã£ã¦ã„ã‚‹ã€Œã‚ãƒ¼ã‚’ã‚½ãƒ¼ãƒˆã™ã‚‹ã€æ©Ÿèƒ½ã«ã‚ˆã£ã¦ã€å®Ÿè³ªçš„ã«å€¤ã‚’ã‚½ãƒ¼ãƒˆã—ã‚ˆã†ã¨ã„ã†ã‚ã‘ã ã€‚ Map/Reduceã¨ã„ã†ã®ã¯ã‚ãƒ¼ã”ã¨ã«ãƒ‡ãƒ¼ã‚¿ã‚’åˆ†å‰²ã—ã¦å‡¦ç†ã™ã‚‹æ–¹æ³•ãªã®ã§ã€ã€Œã‚ãƒ¼ã«å€¤ãŒå…¥ã£ãŸã‚‰åˆ†å‰²ãŒãŠã‹ã—ããªã‚‹ã‚“ã˜ã‚ƒï¼Ÿã€ã¨æ€ã†ã®ã¯å½“ç„¶ã§ã‚ã‚‹ã€‚ã‚ãƒ¼ã«å€¤ãŒå…¥ã£ã¦ã„ã¦ã‚‚ã€åˆ†å‰²ã«å½±éŸ¿ã—ãªã„ã‚ˆã†ã€Partitioningã‚¯ãƒ©ã‚¹ã‚’è‡ªåˆ†ã§æ‹¡å¼µã—ã€åˆ†å‰²ã®åŸºæº–ã¨ãªã‚‹å€¤ï¼ˆæœ¬æ¥ã®ã‚ãƒ¼ï¼‰ã«ã¯ã€å€¤ã®å½±éŸ¿ãŒå‡ºãªã„ã‚ˆã†ã«ã™ã‚‹ã®ã ã€‚ãã‚Œ
yass 2013/08/16
" ã¤ã¾ã‚Šã‚»ã‚«ãƒ³ãƒ€ãƒªã‚½ãƒ¼ãƒˆã¯ã‚¦â—‹ã‚³ã ã¨ã„ã†ã“ã¨ãªã®ã§ã‚ã‚‹(w ãã“ã§ã€Javaçµ„ã¿è¾¼ã¿åž‹ã®RDBMSã§ã‚ã‚‹H2ã‚’åˆ©ç”¨ã—ã¦ã€å€¤ã®ã‚½ãƒ¼ãƒˆã‚’è¡Œã†ã¨ã„ã†ãƒ†ã‚¯ãƒ‹ãƒƒã‚¯ã‚’ä½¿ã†ã€‚Reduceã®å‡¦ç†ã«ãŠã„ã¦ã€å˜ç´”ã«ã™ã¹ã¦ã®å€¤ã‚’H2ãƒ‡ãƒ¼ã‚¿ãƒ™ãƒ¼ã‚¹ã«æ ¼ç´"

hadoop

sort

h2

sql

reduce
ãƒªãƒ³ã‚¯
SQL, Pigã®CUBE - wyukawa's diary
SQLã§å°è¨ˆã‚„ç·åˆè¨ˆã‚’æ±‚ã‚ã‚‹æ™‚ã«GROUP BYã‚’åˆ©ç”¨ã™ã‚‹ã“ã¨ãŒå¤šã„ã¨æ€ã„ã¾ã™ãŒã„ã‚ã‚“ãªè»¸ã§é›†è¨ˆã—ãŸã„å ´åˆã«ROLLUP, CUBE, GROUPING SETSã‚’ä½¿ã†ã“ã¨ãŒã§ãã‚‹ã‚ˆã†ã§ã™ã€‚ è©³ã—ãã¯ã“ã¡ã‚‰å‚ç…§ http://homepage2.nifty.com/sak/w_sak3/doc/sysbrd/sq_kj04_4.htm ROLLUP, CUBE, GROUPING SETSã‚’ä½¿ã†ã“ã¨ãŒã§ãã¾ã™ã¨æ–å®šã—ã¦ã„ãªã„ã®ã¯åƒ•ãŒè©¦ã—ã¦ãªã„ã‹ã‚‰ã§ã™ï¼ˆæ±— ãªãœè©¦ã—ã¦ã„ãªã„ã‹ã¨ã„ã†ã¨ã“ã‚Œã‚‰ã®æ©Ÿèƒ½ã‚’åˆ©ç”¨ã§ãã‚‹ã®ãŒOracle, SQL Server, DB2ã ã‹ã‚‰ã§ã™ã€‚Oracle XEã‚’ãƒ€ã‚¦ãƒ³ãƒãƒ¼ãƒ‰ã—ã‚ˆã†ã‹ã¨æ€ã„ã¾ã—ãŸã‘ã©ãƒ¦ãƒ¼ã‚¶ç™»éŒ²ã«å¿ƒãŒæŠ˜ã‚Œã¾ã—ãŸwã€€ã¡ãªã¿ã«MySQLã§ã¯ROLLUPã®ã¿ã‚µãƒãƒ¼ãƒˆã—ã¦ã„ã‚‹ã‚‰ã—ã„ã§ã™ã€‚ ä»Šå›žã¯è€ƒãˆã‚‰ã‚Œã‚‹å…¨ã¦ã®çµ„ã¿åˆã‚ã›ã§é›†è¨ˆã™ã‚‹CUBEã«ã¤ã„ã¦æ›¸ã„ã¦ã¿ãŸã„ã¨æ€
yass 2013/04/26
hadoop

SQL

pig

cube

bi
ãƒªãƒ³ã‚¯
GitHub - intel-hadoop/project-panthera-ase: Analytical SQL Engine (ASE) for Hadoop under "Project Panthera"
Dismiss All your code in one place Over 40 million developers use GitHub together to host and review code, project manage, and build software together across more than 100 million projects. Sign up for free See pricing for teams and enterprises
yass 2013/02/28
hadoop

sql

intel
ãƒªãƒ³ã‚¯
1 2 æ¬¡ã®ãƒšãƒ¼ã‚¸