1. Akira Chiku is an engineer who works on an engineering team. Their requirements include collecting between 10-20GB of data per day from various sources like Hadoop and Hive.
2. Data is collected from sources like Fluentd and parsed using Query String and stored in Hive. It is then processed and visualized.
3. Data can be stored in S3, processed using services like AWS EMR, and visualized in dashboards for analysis.
70. configuration
property
BQQIJWFXJUIDPOHKTPO
namehive.optimize.s3.query/name
valuetrue/value
descriptionOptimize
query
on
S3/description
/property
property
namejavax.jdo.option.ConnectionURL/name
valuejdbc:mysql://hostname:3306/hive?createDatabaseIfNotExist=true/value
descriptionJDBC
connect
string
for
a
JDBC
metastore/description
/property
property
namejavax.jdo.option.ConnectionDriverName/name
valuecom.mysql.jdbc.Driver/value
descriptionDriver
class
name
for
a
JDBC
metastore/description
/property
property
namejavax.jdo.option.ConnectionUserName/name
valueusername/value
descriptionUsername
to
use
against
metastore
database/description
/property
property
namejavax.jdo.option.ConnectionPassword/name
valuepassword/value
descriptionPassword
to
use
against
metastore
database/description
/property
/configuration