Thursday, December 8, 2016

Big Data

Install Hadoop "hadoop-2.7.3" on Windows 10

http://www.ics.uci.edu/~shantas/Install_Hadoop-2.6.0_on_Windows10.pdf
https://wiki.apache.org/hadoop/Hadoop2OnWindows
http://www.srccodes.com/p/article/45/run-hadoop-wordcount-mapreduce-example-windows
http://toodey.com/2015/08/10/hadoop-installation-on-windows-without-cygwin-in-10-mints/
Get the Windows 10 binaries to put in Hadoop's bin folder: https://github.com/sardetushar/hadooponwindows/

Verify Installation:
Resource Manager at http://localhost:8088,
Node Manager at http://localhost:8042 and
NameNode at http://localhost:50070
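The same check can be scripted instead of opening each page in a browser. This is a minimal sketch, assuming a single-node setup that still uses the Hadoop 2.x default web ports (8088 for the ResourceManager, 8042 for the NodeManager, 50070 for the NameNode). The names WEB_UIS and is_reachable are made up for this example and are not part of Hadoop.

```python
import urllib.error
import urllib.request

# Assumed default web UI addresses for a single-node Hadoop 2.x install.
# Adjust these if yarn-site.xml or hdfs-site.xml overrides the ports.
WEB_UIS = {
    "ResourceManager": "http://localhost:8088",
    "NodeManager": "http://localhost:8042",
    "NameNode": "http://localhost:50070",
}


def is_reachable(url, timeout=3.0):
    """Return True if the URL answers with any HTTP response at all."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied with an error status, so the daemon is running.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or name resolution failure.
        return False


if __name__ == "__main__":
    for name, url in WEB_UIS.items():
        status = "up" if is_reachable(url) else "not reachable"
        print(f"{name:16} {url:24} {status}")
```

If a daemon shows as not reachable, the console window where it was started is the first place to look for errors.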

The Hadoop Ecosystem
https://hadoopecosystemtable.github.io/

Hive, Pig, Spark.

Set up password-less SSH
http://docs.hortonworks.com/HDPDocuments/Ambari-2.2.0.0/bk_Installing_HDP_AMB/content/_set_up_password-less_ssh.html

Case Study
http://hortonworks.com/blog/spark-data-science-case-study/

...The Ultimate Marketing Machine...
http://www.sas.com/content/dam/SAS/en_us/doc/whitepaper2/hbr-ultimate-marketing-machine-107261.pdf

IoT Analytics in Practice
http://docs.media.bitpipe.com/io_13x/io_130443/item_1306139/iot-analytics-in-practice-107941.pdf

HBase bulk load (read these in order and work through the details one by one):
1) HBase bulk load architecture - 
        https://hbase.apache.org/book.html#arch.bulk.load
2) Writing HFiles Directly During Bulk Import -
        https://hbase.apache.org/book.html#_writing_hfiles_directly_during_bulk_import
3) Spark bulk load - 
        https://hbase.apache.org/book.html#_bulk_load

