Hadoop is a big data framework created by Doug Cutting and Mike Cafarella in 2005. It was originally developed to support distributed processing for the Nutch search engine project. It is an open-source Apache project that provides a framework for storing and processing very large data sets, and it is used to analyze data whose volume is too large for a single machine to handle. It is written mainly in Java and is used for offline (batch) analytical processing. It is used by Google, Facebook, Twitter, Yahoo, LinkedIn, and other companies.