الاثنين، 10 نوفمبر 2014

chitchat about big data and hadoop - part1

Hi,,

As it is a blog and I can talk freely in it and not in formal way , I like to talk about big data and hadoop.
you may be not interested or even never heard about hadoop and big data, But if you work at IT field , I want you to know little about it.

What will you do if you have  a data  and you want to store it so you can do queries and analysis on it ?!
you may answer me just store it in database like oracle , DB2 or sql server

Okay , what if it is unstructured data like it contains log files , photos , videos and txt files also it is very large amount of data that your storage system can't store it ??
you may prefer to update your storage system and buy larger one

but what you 'll do if that data increased a lot during time ??

it is so hard to store it in v.expensive server , it is hard to use traditional data bases , that problem will need a solution and good one so you can manage the processing , execute queries and get answer quickly and also handling failure

Google face the same problem so they published a researcher paper and explain its solution by using GFS : Google file system , and MapReduce

but unfortunately this solution is private or specific solution for google and not an open source.

so a man named Doug Cutting decide to make an open source solution like the solution of google and named it Hadoop
by the way ,he named it hadoop after his son's toy and it was like a yellow elephant
interesting , isn't it ?? :)

that is the part 1 about the story of big data and hadoop , and of course by simplified  words , so you can't take it as scientific reference.
I want to hear from you...
seeing you in the next part "N Sha Allah " :) 

0 التعليقات:

إرسال تعليق