Golang协程池ants库的学习、使用及源码阅读，协程池与GMP模型关系的理解

最新推荐文章于 2024-10-02 16:38:03 发布

Climber47

最新推荐文章于 2024-10-02 16:38:03 发布

阅读量3.2k

点赞数 81

分类专栏： Golang 文章标签： golang 学习开发语言

本文链接：https://blog.csdn.net/ws_te47/article/details/135499910

版权

Golang 专栏收录该内容

31 篇文章 0 订阅

订阅专栏

前言

在工作时遇到了一个需要使用ants协程池的地方，因此顺带来学习一下他的原理。

在这里插入图片描述

协程池

Golang的资源还是偏少一些…因此先简单的参考学习了一下线程池。

类似于Java中的线程池，协程池也是为了减少协程频繁创建、销毁所带来资源消耗的问题。按默认每个goroutine 8kb内存来算，几十万个goroutine就会占满8Gb内存。同时，由于goroutine的结束需要等待自身运行结束才可以销毁，所以也可能出现goroutine泄露的问题。因此需要使用协程池，来进行管理。

本文是ants协程池的学习，其他优秀的协程池日后有机会再去学习。

`ants`库对性能的提升

直接引用作者的结论“用ants的吞吐性能相较于原生 goroutine 可以保持在 2-6 倍的性能压制，而内存消耗则可以达到 10-20 倍的节省优势。”

在goroutine越多时候，提升越明显。具体请参阅作者文档中给出的性能测试部分。——https://github.com/panjf2000/ants/blob/dev/README_ZH.md

`ants`库工作流程

这个是作者在文档中所给出的流程图。在原文档中还提供了动态图，但我的电脑上他们没动…所以我也就不搬运了。具体还是请参阅作者文档。

在这里插入图片描述

当一个任务被丢尽协程池后，大致分为以下几步：
1、判断是否有可用的worker。若有，分配任务到worker，并执行即可。
2、若无可用的worker，判断是否达到容量上限。若未达到上限，新增一个worker并分配执行即可。
3、若已达到上限，则判断是否允许阻塞。若不允许，则直接返回nil即可。
4、若允许阻塞，则阻塞至有可用worker，再分配执行即可。
5、对于一个执行完成任务的worker，再放回工作池。

`ants`库源码对照阅读

由于笔者学术不精且文笔欠佳，因此此部分仅是简单阅读记录，一篇详细且高质量的源码阅读请参阅Go 每日一库之 ants（源码赏析)。此文阅读、解析的十分详细。

新建Pool

Pool所用到的的数据结构如下。

type Pool struct {
	// capacity of the pool, a negative value means that the capacity of pool is limitless, an infinite pool is used to
	// avoid potential issue of endless blocking caused by nested usage of a pool: submitting a task to pool
	// which submits a new task to the same pool.
	capacity int32

	// running is the number of the currently running goroutines.
	running int32

	// lock for protecting the worker queue.
	lock sync.Locker

	// workers is a slice that store the available workers.
	workers workerQueue

	// state is used to notice the pool to closed itself.
	state int32

	// cond for waiting to get an idle worker.
	cond *sync.Cond

	// workerCache speeds up the obtainment of a usable worker in function:retrieveWorker.
	workerCache sync.Pool

	// waiting is the number of goroutines already been blocked on pool.Submit(), protected by pool.lock
	waiting int32

	purgeDone int32
	stopPurge context.CancelFunc

	ticktockDone int32
	stopTicktock context.CancelFunc

	now atomic.Value

	options *Options
}

其中大致如下几个参数：
capacity：池容量，表示ants最多能创建的 goroutine 数量。负值意味着池的容量是无限的，使用无限池是为了避免由于嵌套使用池而导致的无限阻塞的潜在问题:提交一个任务到池，该池将一个新任务提交到相同的池。
running：是当前正在运行的goroutines的数量。
workers：存储可用worker的切片。works是一个workerQueue类型的接口，位于workerQueue.go这个文件中。
state：记录池子当前是否已关闭（CLOSED）。
waiting：已经被pool.Submit()阻塞的协程数量，由pool.lock保护。
lock：锁。
cond：条件变量。处理任务等待和唤醒。
blockingNum：阻塞等待的任务数量。
其他具体的见源码中

// NewPool instantiates a Pool with customized options.
func NewPool(size int, options ...Option) (*Pool, error) {
	if size <= 0 {
		size = -1
	}

	opts := loadOptions(options...)

	if !opts.DisablePurge {
		if expiry := opts.ExpiryDuration; expiry < 0 {
			return nil, ErrInvalidPoolExpiry
		} else if expiry == 0 {
			opts.ExpiryDuration = DefaultCleanIntervalTime
		}
	}

	if opts.Logger == nil {
		opts.Logger = defaultLogger
	}

	p := &Pool{
		capacity: int32(size),
		lock:     syncx.NewSpinLock(),
		options:  opts,
	}
	p.workerCache.New = func() interface{} {
		return &goWorker{
			pool: p,
			task: make(chan func(), workerChanCap),
		}
	}
	if p.options.PreAlloc {
		if size == -1 {
			return nil, ErrInvalidPreAllocSize
		}
		p.workers = newWorkerQueue(queueTypeLoopQueue, size)
	} else {
		p.workers = newWorkerQueue(queueTypeStack, 0)
	}

	p.cond = sync.NewCond(p.lock)

	p.goPurge()
	p.goTicktock()

	return p, nil
}

NewPool就是新建一个Pool，其接收的参数是(size int, options ...Option) ，第一个是容量，其他可选项。

通过`Submit`进行任务提交

此部分源码如下

func (p *Pool) Submit(task func()) error {
	if p.IsClosed() {
		return ErrPoolClosed
	}

	w, err := p.retrieveWorker()
	if w != nil {
		w.inputFunc(task)
	}
	return err
}

首先判断池子是否关闭，关闭则直接err。
接下来通过retrieveWorker来判断是否有空闲的worker。
若w!=nil 即有空闲worker，则通过w.inputFunc(task)新增一个任务。

w.inputFunc(task)部分如下

func (w *goWorker) inputFunc(fn func()) {
	w.task <- fn
}

是把任务提交到了一个goWorker结构体中的task chan func()。这个goworker是worker接口的实现，他其中chan的任务会进行run。

通过`retrieveWorker`判断空闲`worker`

此部分源码如下，其所返回的是一个可用的worker


// retrieveWorker returns an available worker to run the tasks.
func (p *Pool) retrieveWorker() (w worker, err error) {
	p.lock.Lock()

retry:
	// First try to fetch the worker from the queue.
	if w = p.workers.detach(); w != nil {
		p.lock.Unlock()
		return
	}

	// If the worker queue is empty, and we don't run out of the pool capacity,
	// then just spawn a new worker goroutine.
	if capacity := p.Cap(); capacity == -1 || capacity > p.Running() {
		p.lock.Unlock()
		w = p.workerCache.Get().(*goWorker)
		w.run()
		return
	}

	// Bail out early if it's in nonblocking mode or the number of pending callers reaches the maximum limit value.
	if p.options.Nonblocking || (p.options.MaxBlockingTasks != 0 && p.Waiting() >= p.options.MaxBlockingTasks) {
		p.lock.Unlock()
		return nil, ErrPoolOverload
	}

	// Otherwise, we'll have to keep them blocked and wait for at least one worker to be put back into pool.
	p.addWaiting(1)
	p.cond.Wait() // block and wait for an available worker
	p.addWaiting(-1)

	if p.IsClosed() {
		p.lock.Unlock()
		return nil, ErrPoolClosed
	}

	goto retry
}

首先通过p.workers.detach()判断是否有空闲的worker。若有，则直接返回可用的worker。

若无，通过

if capacity := p.Cap(); capacity == -1 || capacity > p.Running()

这一个判断来判断是否耗尽了池容量。若没有耗尽，则通过p.workerCache.Get().(*goWorker)生成一个新的工作goroutine，并返回workers。

继续执行则是“没有空闲worker且池容量已满”的情况。便通过

	if p.options.Nonblocking || (p.options.MaxBlockingTasks != 0 && p.Waiting() >= p.options.MaxBlockingTasks)

判断，是否处于非阻塞模式，或者待处理的呼叫者数量达到最大值，若是，则退出返回nil。

反之，通过

	p.addWaiting(1)
	p.cond.Wait() // block and wait for an available worker
	p.addWaiting(-1)

进行阻塞等待可用的即可。

此部分整体通过一个retry：标签和goto retry来进行控制，通过一个lock保证了安全。具体看作者源码部分，此处不搬运描述。

run部分

在chan中的任务会依次去run，run的源码部分如下：


// run starts a goroutine to repeat the process
// that performs the function calls.
func (w *goWorker) run() {
	w.pool.addRunning(1)
	go func() {
		defer func() {
			w.pool.addRunning(-1)
			w.pool.workerCache.Put(w)
			if p := recover(); p != nil {
				if ph := w.pool.options.PanicHandler; ph != nil {
					ph(p)
				} else {
					w.pool.options.Logger.Printf("worker exits from panic: %v\n%s\n", p, debug.Stack())
				}
			}
			// Call Signal() here in case there are goroutines waiting for available workers.
			w.pool.cond.Signal()
		}()

		for f := range w.task {
			if f == nil {
				return
			}
			f()
			if ok := w.pool.revertWorker(w); !ok {
				return
			}
		}
	}()
}