Open Source

        Linux For Humanity

Open Source For You
 Linux For Humanity

Quick Links

Hadoop
Maemo
Virtualization
DistComputing
Web Designing


Hadoop

Apache Hadoop is ideal for organizations with a growing need to process massive application datasets. Hadoop is an open source Java software framework implementing the MapReduce algorithm behind Google's approach to querying the distributed data sets that constitute the internet. The Hadoop Project is inspired by Google's File System and its Map Reduce Programming paradigm implementation.

Hadoop was created by Doug Cutting (now a Cloudera employee), who named it after his child's stuffed elephant. It was originally developed to support distribution for the Nutch search engine project.

Hadoop implements a computational paradigm named map/reduce, where the application data is divided into many small fragments of work, each of which may be executed or executed on any node in the cluster. In addition, it provides a distributed file system HDFS which stores data on the nodes, providing very high aggregate bandwidth across the cluster. Both map/reduce and the distributed file system are designed so that node failures are automatically handled by the framework.

Related Solution

Single Node Hadoop Cluster
Multiple Node Hadoop Cluster
Pig Project
Hive Project
Cloudera

Developed By: Ankit Anand

May The Code Be With You

Copyright ankit anand