Hadoop
Apache Hadoop is well suited to organizations with a growing need to
process massive application datasets. Hadoop is an open source Java
software framework that implements MapReduce, the programming model
behind Google's approach to querying the distributed data sets that
constitute the internet. The Hadoop project is inspired by the Google
File System and by Google's implementation of the MapReduce programming
paradigm.
Hadoop was created by Doug
Cutting (now a Cloudera employee), who named it after his child's
stuffed elephant. It was originally developed to support distribution
for the Nutch search engine project.
Hadoop implements a computational paradigm named map/reduce, where the
application data is divided into many small fragments of work, each of
which may be executed or re-executed on any node in the cluster. In
addition, it provides a distributed file system (HDFS) that stores data
on the nodes, providing very high aggregate bandwidth across the
cluster. Both map/reduce and the distributed file system are designed so
that node failures are automatically handled by the framework.
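The map/reduce idea described above can be illustrated with a minimal single-process sketch in Python. It does not use Hadoop or its API: every function name here (`split_into_fragments`, `map_fragment`, `run_with_retry`, `shuffle`, `reduce_counts`, `word_count`) is hypothetical and chosen only for illustration, and the retry loop merely stands in for the framework re-executing a fragment after a node failure.

```python
from collections import defaultdict


def split_into_fragments(lines, fragment_size):
    """Divide the input into small, independent fragments of work."""
    return [lines[i:i + fragment_size]
            for i in range(0, len(lines), fragment_size)]


def map_fragment(fragment):
    """Map step: emit a (word, 1) pair for every word in the fragment."""
    return [(word.lower(), 1) for line in fragment for word in line.split()]


def run_with_retry(task, fragment, max_attempts=3):
    """Re-execute a failed task; a stand-in for framework failure handling."""
    for attempt in range(1, max_attempts + 1):
        try:
            return task(fragment)
        except RuntimeError:
            # Simulated node failure: give up only after the last attempt.
            if attempt == max_attempts:
                raise


def shuffle(mapped_pairs):
    """Group all emitted values by key so each key is reduced once."""
    groups = defaultdict(list)
    for key, value in mapped_pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups):
    """Reduce step: sum the values collected for each word."""
    return {word: sum(values) for word, values in groups.items()}


def word_count(lines, fragment_size=2):
    """Run the whole pipeline: split, map (with retry), shuffle, reduce."""
    pairs = []
    for fragment in split_into_fragments(lines, fragment_size):
        pairs.extend(run_with_retry(map_fragment, fragment))
    return reduce_counts(shuffle(pairs))


if __name__ == "__main__":
    sample = ["hadoop stores data", "hadoop runs jobs", "data data"]
    print(word_count(sample))
```

Because each fragment is processed independently of the others, a fragment whose task fails can simply be run again, on the same worker or a different one, without affecting the rest of the job. In a real cluster the fragments would be processed in parallel on separate nodes rather than in a loop.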
Related Solutions
Single Node Hadoop Cluster
Multiple Node Hadoop Cluster
Pig Project
Hive Project
Cloudera