Impala Meetup 2014/10/31 @Tokyo è¬æ¼è³æ ã注æäºé ã æ¬è³æã§ç´¹ä»ãã¦ããæ¤è¨¼çµæã¯2014å¹´å½æã®ãã®ã§ããå½è©²ã½ããã¦ã§ã¢ã¯æé·ãæ¹åãæ©ããç¾æç¹ã®ãã¼ã¸ã§ã³ã§ã¯å¤§ããç°ãªãæ©è½ãæ§è½ã¨ãªã£ã¦ãã¾ãã SQL on Hadoopã®ææ°æ å ±ã«åºã¥ããµã¼ãã¹ãã·ã¹ãã ã¤ã³ãã°ã¬ã¼ã·ã§ã³ã«ãèå³ããæã¡ã®æ¹ã¯ãNTTãã¼ã¿ åºç¤ã·ã¹ãã äºæ¥æ¬é¨ OSSãããã§ãã·ã§ãã«ãµã¼ãã¹ï¼é»åã¡ã¼ã«ï¼ hadoop [AT] kits.nttdata.co.jpï¼ ã«ãç¸è«ãã ãããRead less
This document discusses SQL engines for Hadoop, including Hive, Presto, and Impala. Hive is best for batch jobs due to its stability. Presto provides interactive queries across data sources and is easier to manage than Hive with Tez. Presto's distributed architecture allows queries to run in parallel across nodes. It supports pluggable connectors to access different data stores and has language bi
We are thrilled to announce the general availability of the Cloudera AI Inference service, powered by NVIDIA NIM microservices, part of the NVIDIA AI Enterprise platform, to accelerate generative AI deployments for enterprises. This service supports a range of optimized AI models, enabling seamless and scalable AI inference. Background The generative AI landscape is evolving [â¦] Read blog post
We are thrilled to announce the general availability of the Cloudera AI Inference service, powered by NVIDIA NIM microservices, part of the NVIDIA AI Enterprise platform, to accelerate generative AI deployments for enterprises. This service supports a range of optimized AI models, enabling seamless and scalable AI inference. Background The generative AI landscape is evolving [â¦] Read blog post
War of the Hadoop SQL engines. And the winner is � You may have wondered why we were quiet over the last couple of weeks? Well, we locked ourselves into the basement and did some research and a couple of projects and PoCs on Hadoop, Big Data, and distributed processing frameworks in general. We were also looking at Clickstream data and Web Analytics solutions. Over the next couple of weeks we wil
Lot of workloads exist for Big data, batch, machine learning, search, interactive SQL, Operational/user facing applicationsApache Drill fits into the interactive SQL category Analytics on Semi-Structured/Nested dataUse standard SQL to query Nested data without upfront flattening/modelingExtensions to ANSI SQL to operate on nested dataGeneric architecture for a broad variety of nested data types (e
Low-latency SQL queries, Business Intelligence (BI), and Data Discovery on Big Data are some of the hottest topics these days in the industry with a range of solutions coming to life lately to address them as either proprietary or open-source implementations on top of Hadoop. Some of the popular ones talked about in the Big Data communities are Hive, Presto, Impala, Shark, and Drill. Yahoo has tr
Apache Drill: Building Highly Flexible, High Performance Query Engines by M.C. Srivas, Co-founder and CTO at MapR SQL is one of the most widely used languages to access, analyze, and manipulate structured data. As Hadoop gains traction within enterprise data architectures across industries, the need for SQL for both structured and loosely-structured data on Hadoop is growing rapidly Apache Drill s
Hadoopï¼SQLï¼ã¤ã³ã¡ã¢ãªããã«ãã¯ã©ã¦ã対å¿ã®ãPivotal Oneããã©ãããã©ã¼ã çºè¡¨ãEMC World 2013 EMCãã©ã¹ãã¬ã¹ã§éå¬ä¸ã®ã¤ãã³ããEMC World 2013ãã2æ¥ç®ã®åºèª¿è¬æ¼ã«ã¯ãEMCã¨VMwareãè¨ç«ããæ°ä¼ç¤¾ãPivotalãã®CEO ãã¼ã«ã»ããªããï¼Paul Maritzï¼æ°ãç»å£ããã¯ã©ã¦ãæ代ã®ã¢ããªã±ã¼ã·ã§ã³åºç¤ã¨ãªããPivotal Oneããçºè¡¨ãã¾ããã Pivotalã¯ãEMCãè²·åããGreenplumãéçºã³ã³ãµã«ã¿ã³ãã®Pivotal LabsãVMwareãè²·åããSpring SourceãCloudFoundryãªã©ã®ãã¼ã ãéãã¦12æã«çºè¶³ããçµç¹ãä»æããæ£å¼ãªä¼æ¥ã¨ãã¦ã®æ´»åãéå§ãã¦ãã¾ãã Pivotal Oneã¯ãããã°ãã¼ã¿ã¨ã¯ã©ã¦ãæ代ã®ã¢ããªã±ã¼ã·ã§ã³åºç¤ã¨ãã¦ãå社ãä»å¹´æ«ã«ãªãªã¼ã¹äºå®
Spring Bootã«ããAPIããã¯ã¨ã³ãæ§ç¯å®è·µã¬ã¤ã 第2ç ä½å人ãã®éçºè ããInfoQã®ããããã¯ãPractical Guide to Building an API Back End with Spring BootããããSpring Bootã使ã£ãREST APIæ§ç¯ã®åºç¤ãå¦ãã ããã®æ¬ã§ã¯ãåºçæã«æ°ãããªãªã¼ã¹ããããã¼ã¸ã§ã³ã§ãã Spring Boot 2 ã使ç¨ãã¦ãããããããSpring Boot3ãæè¿ãªãªã¼ã¹ãããéè¦ãªå¤...
Teradata Blogs When big data becomes vast, what's your data dropping strategy? Read more Support Teradata at Your Service (TAYS) Simple, secure customer access to products, services, education, and support function information. Read more Certifications Teradata Certified Professional Program (TCPP) Management, development, and oversight of the premiere Teradata Certification Program. Read more Con
"So, how much experience do you have with Big Data and Hadoop?" they asked me. I told them that I use Hadoop all the time, but rarely for jobs larger than a few TB. I'm basically a big data neophite - I know the concepts, I've written code, but never at scale. The next question they asked me. "Could you use Hadoop to do a simple group by and sum?" Of course I could, and I just told them I needed t
Hadoopã®Reduceã«æ¸¡ãããã®ã¯ãã¼ã¨å¤ã®ãªã¹ãã ãããã®ã¨ãå¤ã®ãªã¹ãã«å«ã¾ããåã¢ã¤ãã ï¼å¤ãã®ãã®ï¼ã¯ã½ã¼ãããã¦ããªããã½ã¼ãããã¦ãã¦æ¬²ããå ´åã«ã¯ã»ã«ã³ããªã½ã¼ãã¨å¼ã°ãããã¯ããã¯ã使ãã®ãå®ç³ã¨ããã¦ããããããã¯å®è£ ã®é¢ã§ãæ¦å¿µçãªé¢ã§ãããããã¦ãã¦çãªå´é¢ããããHadoopã«ã¯ããã¼ãã½ã¼ããããæ©è½ã¯å®è£ ããã¦ãããããã§ãå¤ããã¼ã«å ¥ãã¦ãã¾ãããã®Hadoopã«åãã£ã¦ããããã¼ãã½ã¼ããããæ©è½ã«ãã£ã¦ãå®è³ªçã«å¤ãã½ã¼ããããã¨ããããã ã Map/Reduceã¨ããã®ã¯ãã¼ãã¨ã«ãã¼ã¿ãåå²ãã¦å¦çããæ¹æ³ãªã®ã§ãããã¼ã«å¤ãå ¥ã£ããåå²ããããããªããããï¼ãã¨æãã®ã¯å½ç¶ã§ããããã¼ã«å¤ãå ¥ã£ã¦ãã¦ããåå²ã«å½±é¿ããªããããPartitioningã¯ã©ã¹ãèªåã§æ¡å¼µããåå²ã®åºæºã¨ãªãå¤ï¼æ¬æ¥ã®ãã¼ï¼ã«ã¯ãå¤ã®å½±é¿ãåºãªãããã«ããã®ã ããã
SQLã§å°è¨ãç·åè¨ãæ±ããæã«GROUP BYãå©ç¨ãããã¨ãå¤ãã¨æãã¾ããããããªè»¸ã§éè¨ãããå ´åã«ROLLUP, CUBE, GROUPING SETSã使ããã¨ãã§ããããã§ãã 詳ããã¯ãã¡ãåç § http://homepage2.nifty.com/sak/w_sak3/doc/sysbrd/sq_kj04_4.htm ROLLUP, CUBE, GROUPING SETSã使ããã¨ãã§ãã¾ãã¨æå®ãã¦ããªãã®ã¯åã試ãã¦ãªãããã§ãï¼æ± ãªã試ãã¦ããªããã¨ããã¨ãããã®æ©è½ãå©ç¨ã§ããã®ãOracle, SQL Server, DB2ã ããã§ããOracle XEããã¦ã³ãã¼ãããããã¨æãã¾ãããã©ã¦ã¼ã¶ç»é²ã«å¿ãæãã¾ããwãã¡ãªã¿ã«MySQLã§ã¯ROLLUPã®ã¿ãµãã¼ããã¦ãããããã§ãã ä»åã¯èããããå ¨ã¦ã®çµã¿åããã§éè¨ããCUBEã«ã¤ãã¦æ¸ãã¦ã¿ããã¨æ
ã©ã³ãã³ã°
ã©ã³ãã³ã°
ã©ã³ãã³ã°
ãªãªã¼ã¹ãé害æ å ±ãªã©ã®ãµã¼ãã¹ã®ãç¥ãã
ææ°ã®äººæ°ã¨ã³ããªã¼ã®é ä¿¡
å¦çãå®è¡ä¸ã§ã
j次ã®ããã¯ãã¼ã¯
kåã®ããã¯ãã¼ã¯
lãã¨ã§èªã
eã³ã¡ã³ãä¸è¦§ãéã
oãã¼ã¸ãéã
{{#tags}}- {{label}}
{{/tags}}