ãã1年以ä¸åã®è©±ã«ãªãã¾ãããNetflixãSpark対å¿ã®Scalaç¨ãã¼ãããã¯PolynoteãOSSåããã¨ãã話ãããã¾ããã netflixtechblog.com æ¢åã®ãã¼ãããã¯ã§ã¯Scalaã使ã£ã¦ãã¦ãã³ã¼ãè£å®ãªã©ããã¾ãå¹ããªããã®ãå¤ããã¾ã¨ã¾ã£ãã³ã¼ããæ¸ãã¨ãã¯çµå±IDEã使ãã¨ããæãã«ãªããã¡ãªã®ã§ãããPolynoteã¯Scalaã第ä¸è¨èªã¨ãã¦ãµãã¼ãããçãããã¼ãããã¯ã§ãã³ã¼ãè£å®ãªã©ã®æ©è½ãå å®ãã¦ãããããªã®ã§é ãã°ããªãã試ãã¦ã¿ã¾ããã ã¤ã³ã¹ãã¼ã« Sparkã使ãå ´åãã¾ãã¯å ã«Sparkãã¤ã³ã¹ãã¼ã«ãã¦ããå¿ è¦ãããã¾ããã¨ãããããã¼ã«ã«ã¢ã¼ãã§åããã ãã§ããã°Sparkã®ãªãªã¼ã¹ãã£ã¹ããªãã¥ã¼ã·ã§ã³ããã¦ã³ãã¼ããã¦é©å½ãªãã£ã¬ã¯ããªã«å±éãã¦ããã ãã§OKã§ãããPolynoteã¯å é¨çã«spark-submitã³
Sparkã«JDBCã§ã¢ã¯ã»ã¹ããã«ã¯Thriftserverãå ¥ãããHive Metastoreãå¿ è¦ã ã£ããã§è²ã é¢åãªã®ã§ãããåä½ã§å©ç¨å¯è½ãªæ¹æ³ã¯ãªãã®ããªã¨æã£ã¦æ¢ãã¦ã¿ãã¨ããã以ä¸ã®ãã®ãè¦ã¤ããã®ã§è©¦ãã¦ã¿ã¾ããã github.com ãã®JDBCãã©ã¤ãã¯ä»¥ä¸ã®ãããªURLã§JDBCçµç±ã§SparkSQLã使ããã¨ãã§ãã¾ãã com.zensolution.jdbc.spark:/Users/foobar/temp/console?format=csv&csv.header=true&csv.delimiter=; SQLå ã§ã¢ã¯ã»ã¹ããã¦ãããã¼ãã«ã¯ã¯ã¨ãªã®å®è¡åã«ãã³ãã©ãªãã¥ã¼ã¨ãã¦èªåçã«ç»é²ããã¾ãããã¨ãã°ä»¥ä¸ã®ãããªSQLãå®è¡ããã¨ãã¾ãã SELECT * FROM people ãã®JDBCãã©ã¤ãã¯ã¾ãã¯ã¨ãªããã¼ã¹ãããã®ã¯ã¨ãªã®å®è¡ã«p
ArchiveBox - A tool which maintains an additive archive from RSS feeds, bookmarks, and links using wget, Chrome headless, and other methods (formerly Bookmark Archiver). (In Development) archivenow - A Python library to push web resources into on-demand web archives. (Stable) ArchiveWeb.Page - A plugin for Chrome and other Chromium based browsers that lets you interactively archive web pages, repl
ã¯ããã« æè¿ãã¼ã¿ã®æ´å½¢çã«Spark Shellã使ã£ã¦ããã®ã§ããã使ãæ¹ãå¿ããã®ã§åå¿é²çãªï½±ï¾ã§ãã Welcome to ____ __ / __/__ ___ _____/ /__ _\ \/ _ \/ _ `/ __/ '_/ /___/ .__/\_,_/_/ /_/\_\ version 2.4.4 /_/ Using Scala version 2.11.12 (OpenJDK 64-Bit Server VM, Java 1.8.0_222) ãã¼ã¸ã§ã³ã¯ãããªæãã§ãã ãã¼ã¿ã½ã¼ã¹èªã¿è¾¼ã¿ CSVï¼ã¨ã³ã³ã¼ãã£ã³ã°æå®ã»ãããããã«ã©ã åãæ¨å®ï¼ val prods = spark.read.format("csv"). option("header", true). option("encoding", "shift_jis"). load("Prod.cs
ãªãSpark? ããã°ãã¼ã¿ã§ãã¼ã¿ãµã¤ã¨ã³ã¹ã§ãã·ã³ã©ã¼ãã³ã°ã®ã¢ã¼ãã£ãã£ã·ã£ã«ã¤ã³ããªã¸ã§ã³ã¹ã ããã§ããããã°ãã¼ã¿åæã¯Hadoopãããã¡ã¯ãã¹ã¿ã³ãã¼ãã§ããã¨ãããã¨ãæè¿å ¥ç¤¾ããä¼ç¤¾ã§çã¾ãã¦åãã¦ç¥ãã¾ããã Sparkãããã°MapReduceã ãã§ã¯é£ããåæããã¼ã¿å¦çããããã¨ã§ãã¦ãã¾ãã¾ãã ãªãClojure? ç§ã¯OCamlã大好ãã§ããã¤ã¾ãJavaã¨ãã¡ãã£ã¨ããã©ãã§ãããããSparkã¯JVMè¨èªãPython(PySpark)ã使ããã¨åæã¨ãªã£ã¦ãã¾ããOCamlã¯æ®å¿µãªããJVMã§ã¯åããªããPythonã§ãããã¾ããã®ã§ä½¿ãã¾ããã æ®éã ã£ããJavaãScalaãªã®ã§ãããJavaãä¼æ¥ã«ä½¿ãã®ã¯åå¼ãã¦æ¬²ããã§ããfinal List<String> someString = new ArrayList<String>();ã£ã¦ã
ãç¥ãã
ã©ã³ãã³ã°
ã©ã³ãã³ã°
ãªãªã¼ã¹ãé害æ å ±ãªã©ã®ãµã¼ãã¹ã®ãç¥ãã
ææ°ã®äººæ°ã¨ã³ããªã¼ã®é ä¿¡
å¦çãå®è¡ä¸ã§ã
j次ã®ããã¯ãã¼ã¯
kåã®ããã¯ãã¼ã¯
lãã¨ã§èªã
eã³ã¡ã³ãä¸è¦§ãéã
oãã¼ã¸ãéã
{{#tags}}- {{label}}
{{/tags}}