MIT 6.824 Lab1 MapReduce

最新推荐文章于 2024-03-28 12:24:18 发布

寒冰陨云

最新推荐文章于 2024-03-28 12:24:18 发布

阅读量404

点赞数 2

分类专栏： MIT6.824分布式系统文章标签： mapreduce hadoop 大数据

本文链接：https://blog.csdn.net/weixin_46840831/article/details/121544638

版权

文章目录

概述
总结

概述

本文章主要讲述lab1的基本实现思路，具体的实验要求见MIT-Lab1。实验代码见Lab代码。

基本需求

一个Coordinator管理多个Worker，通过RPC进行通信
Worker向Corrdinator请求任务，Coordinator向Worker分配任务
Coordinator能够处理Worker Crash

基本数据结构

Coordinator

type Coordinator struct {
   
	// Your definitions here.
	nReduce     int
	nMap        int
	workerLists sync.Map
	startReduce chan bool

	// MapTask
	muMapTask       sync.Mutex
	mapTaskNeedExec int
	mapTaskLists    []*MapTask
	mapTaskQueue    chan *MapTask

	// ReduceTask
	muReduceTask       sync.Mutex
	reduceTaskNeedExec int
	reduceTaskLists    []*ReduceTask
	reduceTaskQueue    chan *ReduceTask
}

workerLists用来管理Worker所有Worker的状态，mapTaskQueue和reduceTaskQueue为并发队列，用于Worker并发获取任务，mapTaskLists和reduceTaskLists用于存储所有的Task。

Worker

type worker struct {
   
	id       string
	nReduce  int
	needExit chan bool
}

needExit同于判断当前Worker是否可以退出，即所有任务已经完成。

具体功能

Worker注册

每个Worker新加入集群时，都要向Coordinator发起注册，Coordinator收到注册请求后，会进行合法性判断，如果合法则加入到workerLists中

// worker.go
func (w *worker) register() {
   
	w.id = strconv.Itoa(os.Getpid())
	reply := RegisterReply{
   }
	args := RegisterArgs{
   WorkerID: w.id}
	call("Coordinator.Register", &args, &reply)
	w.nReduce = reply.ReduceNum
}

// coordinator.go
func (c *Coordinator) Register(args *RegisterArgs, reply *RegisterReply) error {
   
	workerID := args.WorkerID
	_, exist := c.workerLists.Load(workerID)

	if exist {
   
		return errors.New(ErrDuplicateWorker)
	}
	reply.ReduceNum = c.nReduce
	worker := workerRecord

最低0.47元/天解锁文章

寒冰陨云

关注

2
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
MIT 6.824 Lab1 MapReduce

文章目录概述基本需求基本数据结构CoordinatorWorker具体功能Worker注册任务请求与执行宕机处理总结概述本文章主要讲述lab1的基本实现思路，具体的实验要求见MIT-Lab1基本需求一个Coordinator管理多个Worker，通过RPC进行通信 Worker向Corrdinator请求任务，Coordinator向Worker分配任务 Coordinator能够处理Worker Crash基本数据结构Coordinatortype Coordinator stru
复制链接

扫一扫