Documente Academic
Documente Profesional
Documente Cultură
==============================
sqoop import-all-tables \
-m 3 \
--connect jdbc:mysql://localhost:3306/retail_db \
--username=retail_dba \
--password=cloudera \
--compression-codec=snappy \
--as-parquetfile \
--warehouse-dir=/user/hive/warehouse \
--hive-import
Check DB
========
mysql -u root -p
pass: cloudera
list database
=============
show databases;
/opt/examples/log_files/access.log.2
====================
=====================================
=================================
=================================
Run in hive
-----------
Sample row
==========
==========
Result row
==========
The tool in CDH best suited for quick analytics on object relationships is Apache Spark.
You can compose a Spark job to do this work and give you insight on product relationships.
Ordinarily when you are deploying a new search schema, there are four steps:
https://www.cloudera.com/developers/get-started-with-hadoop-tutorial/exercise-4.html
Cloudera Search to allow exploration of data in real time, using Flume and Solr and Morphlines