hadoop jar /opt/cloudera/parcels/CDH/hadoop-mapreduce/hadoop-streaming.jar -files /home/wang/mapper.py,/home/wang/reducer.py -D mapred.map.tasks=10 -D mapred.reduce.tasks=1 -input /home/xhl/word -output /home/xhl/output2 -mapper mapper.py -reducer reducer.py
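Set the number of map and reduce tasks: -D mapred.map.tasks=10 -D mapred.reduce.tasks=1
Note that the map count is only a hint; the actual number of map tasks follows the number of input splits. The reduce count is applied as given. In Hadoop 2 and later these two keys are deprecated aliases of mapreduce.job.maps and mapreduce.job.reduces.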
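The contents of mapper.py and reducer.py are not shown above. As a rough illustration only, assuming the job is a word count (suggested by the input path, but not confirmed) and that Python 3 is available on the worker nodes, the two steps might look like the sketch below. The function names map_lines and reduce_lines and the map/reduce command-line argument are hypothetical, introduced here for illustration.

```python
#!/usr/bin/env python3
"""Hypothetical word-count steps for Hadoop Streaming (not the original scripts)."""
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper step: emit one 'word<TAB>1' record per whitespace-separated token."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer step: sum the counts per word.

    Relies on the input being sorted by key, which Hadoop Streaming
    guarantees for the records handed to each reducer.
    """
    # Skip any line that is not a 'key<TAB>value' record.
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines if "\t" in line)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


# Assumed invocation: "python3 wordcount.py map" or "python3 wordcount.py reduce".
if __name__ == "__main__" and len(sys.argv) > 1:
    step = map_lines if sys.argv[1] == "map" else reduce_lines
    for record in step(sys.stdin):
        print(record)
```

To match the command above, each function could live in its own file (mapper.py and reducer.py) that reads sys.stdin and prints the records. A pipeline of the form "mapper | sort | reducer" is a common way to try such scripts locally before submitting the job.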