Hadoop之TaskTraker分析

最新推荐文章于 2019-03-08 09:25:22 发布

VIP文章 ToBeAndNotToBe

最新推荐文章于 2019-03-08 09:25:22 发布

阅读量3.8k

点赞数

分类专栏： hadoop java 文章标签： hadoop exception string localization jvm events

本文链接：https://blog.csdn.net/ToBeAndNotToBe/article/details/7188819

版权

TaskTracker的工作职责之前已经和大家提过，主要负责维护，申请和监控Task，通过heartbeat和JobTracker进行通信。

TaskTracker的init过程：

1.读取配置文件，解析参数

2.将TaskTraker上原有的用户local files删除并新建新的dir和file

3. Map<TaskAttemptID, TaskInProgress> tasks = new HashMap<TaskAttemptID, TaskInProgress>(); 清除map

4. this.runningTasks = new LinkedHashMap<TaskAttemptID, TaskInProgress>();记录task的链表
this.runningJobs = new TreeMap<JobID, RunningJob>();记录job的id信息

5.初始化JVMManager：

  mapJvmManager = new JvmManagerForType(tracker.getMaxCurrentMapTasks(), 
        true, tracker);
    reduceJvmManager = new JvmManagerForType(tracker.getMaxCurrentReduceTasks(),
        false, tracker);

6.初始化RPC，获取JobTracker client用于heartbeat通信；

7.new一个后台线程用于监听map完成的事件

  
    this.mapEventsFetcher = new MapEventsFetcherThread();
    mapEventsFetcher.setDaemon(true);
    mapEventsFetcher.setName(
                             "Map-events fetcher for all reduce tasks " + "on " + 
                             taskTrackerName);
    mapEventsFetcher.start();

后台线程的run方法如下：

 while (running) {
        try {
          List <FetchStatus> fList = null;
          synchronized (runningJobs) {
            while (((fList = reducesInShuffle()).size()) == 0) {
              try {
                runningJobs.wait();
              } catch (InterruptedException e) {
                LOG.info("Shutting down: " + this.getName());
                return;
              }
            }
          }
          // now fetch all the map task events for all the reduce tasks
          // possibly belonging to different jobs
          boolean fetchAgain = false; //flag signifying whether we want to fetch
                                      //immediately again.
          for (FetchStatus f : fList) {
            long currentTime

最低0.47元/天解锁文章

ToBeAndNotToBe

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
Hadoop之TaskTraker分析

TaskTracker的工作职责之前已经和大家提过，主要负责维护，申请和监控Task，通过heartbeat和JobTracker进行通信。 TaskTracker的init过程： 1.读取配置文件，解析参数 2.将TaskTraker上原有的用户local files删除并新建新的dir和file 3. Map tasks = new HashMa
复制链接

扫一扫