mrrunner COMMAND [OPTIONS]
start -l <LEVEL> -f <jdf> : configure and deploy an MR cluster
list : list the active MR clusters
master <clusterID> : get the master node of an MR cluster
stop <clusterID> : stop the MR cluster with the given clusterID
LEVEL = FATAL|ERROR|WARN|DEBUG|INFO
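A typical session can be sketched with the four subcommands above. This is a minimal sketch, not an official script: the file name cluster.jdf and the cluster ID 12 are placeholders, and the run wrapper is a hypothetical helper. By default the commands are only printed; set DRY_RUN=0 to actually run them.

```shell
#!/bin/sh
# Sketch of a start/list/master/stop session.
# Assumptions: the JDF file name and the cluster ID are placeholders;
# use the cluster ID that "mrrunner list" reports.
DRY_RUN="${DRY_RUN:-1}"

run() {
    # Print the command; execute it only when DRY_RUN is 0.
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

mr_session() {
    jdf="$1"; cluster_id="$2"
    run mrrunner start -l INFO -f "$jdf"    # configure and deploy the cluster
    run mrrunner list                       # list the active clusters
    run mrrunner master "$cluster_id"       # get the master node
    # ... run Hadoop commands on the master node here ...
    run mrrunner stop "$cluster_id"         # stop the cluster
}
```

For example, "mr_session cluster.jdf 12" prints the four mrrunner commands in order.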
The MR-Runner deploys MapReduce (MR) clusters on demand on the DAS-4 system. It is implemented in Java and currently configures Hadoop-1.0.0 clusters. KOALA schedules the jobs it receives from MR-Runner instances; in this case each job is a complete MR cluster. Based on the requested size of the MR cluster (number of nodes), KOALA places the job on a suitable physical cluster by applying one of its placement policies.
1. Create your own Job Description File:
+(&
( count = "40" )
( maxWallTime = "30" )
( resourcemanagercontact = "fs3.das4.tudelft.nl" )
)
count = the size of the MapReduce cluster (number of machines x 8 processors)
maxWallTime = the duration of the SGE reservation in minutes
resourcemanagercontact = preferred execution site
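A Job Description File with the three fields above can also be generated from the shell. The write_jdf function below is a hypothetical helper, not part of MR-Runner; it simply prints the same layout as the example, with the values filled in.

```shell
#!/bin/sh
# write_jdf is a hypothetical helper, not part of MR-Runner.
# Usage: write_jdf <count> <maxWallTime in minutes> <execution site>
write_jdf() {
    count="$1"; walltime="$2"; site="$3"
    # Print the JDF in the same layout as the example above.
    cat <<EOF
+(&
( count = "$count" )
( maxWallTime = "$walltime" )
( resourcemanagercontact = "$site" )
)
EOF
}
```

For example, "write_jdf 40 30 fs3.das4.tudelft.nl > cluster.jdf" reproduces the example file, which can then be passed to "mrrunner start" with the -f option.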
2. Configuration and log files:
Hadoop configuration files path: ~/.mrcluster/<clusterID>
Hadoop log files path: ~/var/scratch/$USER/logs/
3. Executing Hadoop commands:
All Hadoop commands are executed on the master node of the MR cluster:
e.g. ssh <masterNode> /home/koala/hadoop/bin/hadoop --config ~/.mrcluster/<clusterID> dfs -ls /data
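To avoid retyping the long command line, the pattern above can be wrapped in a small function. This is only a sketch: hadoop_cmd is a hypothetical helper, and the node name node301 and cluster ID 12 are placeholders. It prints the command rather than running it, so the result can be checked first.

```shell
#!/bin/sh
# hadoop_cmd is a hypothetical helper; it only prints the command line.
# Usage: hadoop_cmd <masterNode> <clusterID> <hadoop arguments...>
hadoop_cmd() {
    master="$1"; cluster_id="$2"
    shift 2
    # Same pattern as the example above: Hadoop is run on the master node
    # with the per-cluster configuration directory.
    echo "ssh $master /home/koala/hadoop/bin/hadoop --config ~/.mrcluster/$cluster_id $*"
}

# Example: list /data in HDFS (node name and cluster ID are placeholders).
hadoop_cmd node301 12 dfs -ls /data
```

The master node name is the one reported by "mrrunner master <clusterID>".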
The MR-Runner enables access to the following stack of frameworks for big data processing: