Monday, May 26, 2014

Learning Hadoop

Hello,

So my younger brother and I have started tinkering with a tool that's new to us: Hadoop.

So, what is Hadoop?

"Hadoop was created by Doug Cutting and Mike Cafarella in 2005. Cutting, who was working at Yahoo! at the time, named it after his son's toy elephant. It was originally developed to support distribution for the Nutch search engine project."

So what was the problem which gave birth to Hadoop?

"Doug Cutting, Cloudera's Chief Architect, helped create Apache Hadoop out of necessity as data from the web exploded, and grew far beyond the ability of traditional systems to handle it. Hadoop was initially inspired by papers published by Google outlining its approach to handling an avalanche of data, and has since become the de facto standard for storing, processing and analyzing hundreds of terabytes, and even petabytes of data."

Hadoop is a software framework, which means it includes a number of components that were specifically designed to solve large-scale distributed data storage, analysis and retrieval tasks.

Starting with the next post, I will cover how to install Hadoop, how to configure it, and the modes in which it can be run.





