API Reference¶
Pilot-MapReduce¶
MapReduce¶
-
class
pmr.
MapReduce
(pmrDesc, coordinationUrl)[source]¶ MapReduce: Class for managing MapReduce Jobs
-
__init__
(pmrDesc, coordinationUrl)[source]¶ Initializes MapReduce with Pilot computes/Data description and coordination system
-
setNbrReduces
(nbrReduces)[source]¶ Set the number of reduces of the MapReduce Job @param nbrReduces: Takes number of Reduces as integer
-
setChunk
(chunkDesc)[source]¶ Registers the chunk task description @param chunkDesc: SAGA Job Description of chunk task
-
setMapper
(mapDesc)[source]¶ Registers the Map task description @param mapDesc: SAGA Job Description of Map task
-
setReducer
(reduceDesc)[source]¶ Registers the Reduce task description @param reduceDesc: SAGA Job Description of Reduce task
-
setOutputPath
(path)[source]¶ Sets the output path to store the final results of MapReduce job @param reduceDesc: SAGA Job Description of Reduce task
-
getDetails
()[source]¶ Returns the execution time of MapReduce phases
@return: dictionary with execution timing details of MapReduce phases
-
chunkOnly
(inputDu)[source]¶ Executes the chunk Job only
@param inputDu: Takes input Data Units as Input argument @return: List of chunk task output Data Units
-
mapOnly
(chunkDus)[source]¶ Executes the Map Job only
@param mapDus: Takes map chunk/split Data Units as Input @return: List of Map task output Data Units
-
submitComputeUnit
(desc)[source]¶ Submits SAGA Job description to Pilot
@param desc: SAGA Job description
-
submitDataUnit
(desc)[source]¶ Submits SAGA Job description to Pilot
@param desc: SAGA Job description
-
Mapper¶
-
class
pmr.
Mapper
(args)[source]¶ mapper: Class for managing Map phase of MapReduce Job
-
__init__
(args)[source]¶ Initializes Map task with parameters passed by MapReduce framework as command line parameters.
-