Friday 8 March 2013

What are the input and output data formats of the Hadoop framework?


The MapReduce framework operates exclusively on <key, value> pairs. That is, the framework views the input to the job as a set of <key, value> pairs and produces a set of <key, value> pairs as the output of the job, conceivably of different types. The types flow through a job as follows:
(input) <k1, v1> -> map -> <k2, v2> -> combine/sorting -> <k2, v2> -> reduce -> <k3, v3> (output)
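
To make the flow above concrete, here is a minimal sketch in plain Python that imitates the three stages for a word count. It does not use Hadoop at all; the function names (map_phase, shuffle_sort, reduce_phase, run_job) and the sample records are hypothetical and chosen only for illustration. The input pairs are assumed to be (byte offset, line of text), which is how a text input is commonly presented to a mapper.

```python
from itertools import groupby
from operator import itemgetter


def map_phase(records):
    """Input pairs (offset, line) -> intermediate pairs (word, 1)."""
    for _offset, line in records:
        for word in line.split():
            yield (word, 1)


def shuffle_sort(pairs):
    """Sort intermediate pairs by key and group their values per key."""
    ordered = sorted(pairs, key=itemgetter(0))
    for key, group in groupby(ordered, key=itemgetter(0)):
        yield key, [value for _key, value in group]


def reduce_phase(grouped):
    """Grouped pairs (word, [1, 1, ...]) -> output pairs (word, total)."""
    for key, values in grouped:
        yield key, sum(values)


def run_job(records):
    """Chain the stages: input -> map -> sort/group -> reduce -> output."""
    return list(reduce_phase(shuffle_sort(map_phase(records))))


if __name__ == "__main__":
    # Hypothetical input: key = byte offset of the line, value = the line.
    sample = [(0, "hello hadoop"), (13, "hello world")]
    for word, count in run_job(sample):
        print(word, count)
```

Note that the key and value types change between stages here (integer offset and string on input, string and integer on output), which is what "conceivably of different types" refers to.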
