Hadoop Advent Calendar 2013 4æ¥ç®ã®è¨äºã§ã tl;dr explainã¨job historyãèªã 1 reducerã¯æª data skewã¯æª åæ¸ã ã¿ããªå¤§å¥½ãSQLã§Hadoopä¸ã§ã®å¦çãå®è¡ã§ããHiveã«ã¯ã¿ãªããæ®æ®µãããä¸è©±ã«ãªã£ã¦ãããã¨ã§ããããã¡ãã£ã¨èª¿ã¹ç©ã§ã°ã°ã度ã«ç®ã«å ¥ãæãããããã¹ã³ããããèãã å¿ã«æ¸ 涼ãªé¢¨ãã¯ããã§ããã¾ãã ã§ããHiveã®ã¯ã¨ãªè¨èªã¯SQLã§ã¯ãªãHiveQLã§ãããå®è¡ã¨ã³ã¸ã³ãRDBã®ããã¨ã¯å ¨ãç°ãªãMapReduceã§ããSQLã®ã¤ããã§HiveQLãæ¸ãã¦ããã¨å°é·ãè¸ãã§ãã¾ããã¨ãã¾ãã«ããããã¾ããæ¬ã¨ã³ããªã§ã¯é¥ããã¡ãªHiveQLã®è½ã¨ãç©´ã2ã¤ç´¹ä»ãã¾ãã ä¾1 SELECT count(DISTINCT user_id) FROM access_log SQLã«æ £ããæ¹ã§ãã
{{#tags}}- {{label}}
{{/tags}}