Hadoop

Door tom 5 december 2014

Everyone talks about big data and Hadoop. Someone even compared it to teenage sex: everone talks about it, everyone knows someone who does it but no-one yet does it. I just tried hadoop to see what it is all about.

I made two attempts to install hadoop.
One attempt was about installing Hadoop 1.0.3. I relied on a paper from Michael Noll ( http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/ ). I noticed it was really important to use the correct versions of the jdk; I first tried to install Hadoop 1.0.3 with a recent version of the Jdk but that failed. A subsequent attempt was successfull. This could be established with help of the interface ( localhost:50070 ).
A second attempt was about installing Hadoop 2.4. I used a blog from Matthew Sharpe ( http://dogdogfish.com/2014/04/26/installing-hadoop-2-4-on-ubuntu-14-04/ ) to get the necessary information. After the installion, I created a small example to check if this worked. It did work.

Hence, I ended up with two working examples of Hadoop.

The problem is that hadoop is now alive and kicking. I verified this by executing a small example. This worked out ok. But how to continue. Working with hadoop isn’t trivial. My next step will be is finding a means to work with Hadoop.

Door tom

Uncategorized

Breaking

Hadoop

Door tom

Gerelateerd bericht

Je miste

Flask and JSON

A webserver from the command line

Use the node.js server as restful app server

Reading a CSV file and translate into dataframe

Hadoop

Door tom

Gerelateerd bericht

Oh my God: how good is open source

Curl and elasticSearch

ElasticSearch

Je miste

Flask and JSON

A webserver from the command line

Use the node.js server as restful app server

Reading a CSV file and translate into dataframe