IBM developerWorks: "This article, the first in a series on Hadoop, explores the Hadoop framework, including its fundamental elements, such as the Hadoop file system (HDFS), and node types that are commonly used."
on 05/26/2010 – Made popular on 05/26/2010
With the configuration, installation, and use of Hadoop in both single-node and multi-node architectures under your belt, you can now turn to developing applications within the Hadoop infrastructure. This article explores the Hadoop APIs and data flow and demonstrates their use with a simple mapper and reducer application.
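As a rough illustration of the mapper and reducer data flow mentioned above, here is a minimal word-count sketch. It is plain Python that imitates the map, shuffle, and reduce stages in a single process; it does not use the Hadoop APIs and is not taken from the linked article. The function names (`mapper`, `shuffle`, `reducer`, `run_job`) are hypothetical and chosen only for this example.

```python
from collections import defaultdict


def mapper(line):
    """Map stage: emit a (word, 1) pair for each whitespace-separated word."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key and sort the keys.

    This stands in for the shuffle-and-sort step that a MapReduce
    framework performs between the map and reduce stages.
    """
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return sorted(grouped.items())


def reducer(word, counts):
    """Reduce stage: sum all counts collected for one word."""
    return word, sum(counts)


def run_job(lines):
    """Run map -> shuffle -> reduce over an iterable of text lines."""
    mapped = (pair for line in lines for pair in mapper(line))
    return dict(reducer(word, counts) for word, counts in shuffle(mapped))


if __name__ == "__main__":
    print(run_job(["hadoop stores data", "hadoop processes data"]))
```

In a real cluster the map and reduce functions would run on different nodes and the framework would move the intermediate pairs between them; here everything runs in one process so that the flow of key/value pairs is easy to follow.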
Enterprise Storage Forum: "Hadoop is an open-source software framework that facilitates the storage and analysis of large volumes of data. So what's all the fuss about? For one thing, it operates on commodity hardware."
Learn an advanced setup that uses multiple nodes for parallel processing. This article demonstrates the various node types required for multinode clusters, explores MapReduce functionality in a parallel environment, and digs into the management aspects of Hadoop, both command line and Web based.