This document discusses Apache Hivemall, an open-source machine learning library for SQL-on-Hadoop. It can be used with Apache Hive, Spark SQL, Spark DataFrame API, and Pig Latin to add machine learning capabilities to SQL queries. The presentation describes Hivemall's capabilities, how it works with different SQL query engines, and new features in versions 0.5.2 and 0.6 like field-aware factoriza
{{#tags}}- {{label}}
{{/tags}}