Hive on Spark を活用した高速データ分析 - Hadoop / Spark Conference Japan 2016Nagato Kasaki
現在、DMM.comでは、1日あたり1億レコード以上の行動ログを中心に、各サービスのコンテンツ情報や、地域情報のようなオープンデータを収集し、データドリブンマーケティングやマーケティングオートメーションに活用しています。しかし、データの規模が増大し、その用途が多様化するにともなって、データ処理のレイテンシが課題となってきました。本発表では、既存のデータ処理に用いられていたHiveの処理をHive on Sparkに置き換えることで、1日あたりのバッチ処理の時間を3分の1まで削減することができた事例を紹介し、Hive on Sparkの導入方法やメリットを具体的に解説します。
Hadoop / Spark Conference Japan 2016
http://www.eventbrite.com/e/hadoop-spark-conference-japan-2016-tickets-20809016328
The document outlines the divisions and focus areas of a large company, with most resources allocated to technology R&D, including deep learning and CNN. Other divisions include infrastructure, promotions, UI design, SEO, big data, IT, recruiting, staffing services, and administration.
Company Recommendation for New Graduates via Implicit Feedback Multiple Matri...Recruit Technologies
This document describes a method for recommending suitable companies to new graduates based on their browsing history and other data. It proposes using implicit matrix factorization of browsing data along with Bayesian optimization of hyperparameters to focus recommendations on less popular, low-browsed companies. An evaluation on Japanese student and company data showed this approach achieved higher recall of suitable matches for low-browsed companies compared to other methods, especially when incorporating additional student and company profile information.
The document discusses the BIG DATA department of Recruit Technologies. It describes how the department has used Amazon's Elastic MapReduce (EMR) and Elastic Compute Cloud (EC2) services since 2010 to perform big data analytics on datasets using Hadoop. It provides details on how the department configures and manages EMR clusters on EC2 spot instances to perform tasks like log analysis and recommendation algorithms in a cost-effective manner. Various strategies are discussed around optimizing the use of spot instances and EMR to reduce costs while managing reliability.
IoT Devices Compliant with JC-STAR Using Linux as a Container OSTomohiro Saneyoshi
Security requirements for IoT devices are becoming more defined, as seen with the EU Cyber Resilience Act and Japan’s JC-STAR.
It's common for IoT devices to run Linux as their operating system. However, adopting general-purpose Linux distributions like Ubuntu or Debian, or Yocto-based Linux, presents certain difficulties. This article outlines those difficulties.
It also, it highlights the security benefits of using a Linux-based container OS and explains how to adopt it with JC-STAR, using the "Armadillo Base OS" as an example.
Feb.25.2025@JAWS-UG IoT