Spark
The docs point to the Dataset API documentation (https://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.Dataset). Before moving on, I'll take a quick look at it and summarize what the description there says. Apart from Dataset, DataFrame…
I'll work through the official Quick Start (https://spark.apache.org/docs/latest/quick-start.html). I'll do it in Scala, which I've recently started studying a little. > spark-shell 2018-05-03 16:40:22 WARN NativeCodeLoader:62 - Unable to load native-hadoop library for…
I feel like all I ever do is install things. Well, that's how it goes. At an in-house study group, we decided to try out Apache Spark a bit. First, checking the environment: > python --version Python 3.6.0 :: Anaconda 4.3.0 (x86_64) > java -version java vers…
Continuing the Quick Start, I'll play around following "More on Dataset Operations". For data, I'll use javaheap data from a certain server from long ago. It looks like this: > head -n5 javaheap.app01 09:00:11 550792720 1543502336 09:01:10 321361976 1543502336 09:02:1…