Java Concurrent Programming (4): Creating and Using Thread Pools

Latest recommended article published on 2021-02-25 10:46:10

weixin_33768481


Original link: https://my.oschina.net/JackieRiver/blog/2051005


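1. Introduction

Multithreading is unavoidable in real projects. To avoid wasting resources, and to avoid the unpredictable problems that come with an uncontrolled thread count, we normally use a thread pool.

The JDK provides several ready-made factory methods for creating thread pools, and most of them are built on ThreadPoolExecutor, so ThreadPoolExecutor is the foundation. We first look at how to use ThreadPoolExecutor, then compare the pros and cons of the quick factory methods, and finally look at more flexible ways to work with multiple threads.

2. ThreadPoolExecutor

2.1 Constructor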

/**
     * Creates a new {@code ThreadPoolExecutor} with the given initial
     * parameters.
     *
     * @param corePoolSize the number of threads to keep in the pool, even
     *        if they are idle, unless {@code allowCoreThreadTimeOut} is set
     * @param maximumPoolSize the maximum number of threads to allow in the
     *        pool
     * @param keepAliveTime when the number of threads is greater than
     *        the core, this is the maximum time that excess idle threads
     *        will wait for new tasks before terminating.
     * @param unit the time unit for the {@code keepAliveTime} argument
     * @param workQueue the queue to use for holding tasks before they are
     *        executed.  This queue will hold only the {@code Runnable}
     *        tasks submitted by the {@code execute} method.
     * @param threadFactory the factory to use when the executor
     *        creates a new thread
     * @param handler the handler to use when execution is blocked
     *        because the thread bounds and queue capacities are reached
     * @throws IllegalArgumentException if one of the following holds:<br>
     *         {@code corePoolSize < 0}<br>
     *         {@code keepAliveTime < 0}<br>
     *         {@code maximumPoolSize <= 0}<br>
     *         {@code maximumPoolSize < corePoolSize}
     * @throws NullPointerException if {@code workQueue}
     *         or {@code threadFactory} or {@code handler} is null
     */
    public ThreadPoolExecutor(int corePoolSize,
                              int maximumPoolSize,
                              long keepAliveTime,
                              TimeUnit unit,
                              BlockingQueue<Runnable> workQueue,
                              ThreadFactory threadFactory,
                              RejectedExecutionHandler handler) {
        if (corePoolSize < 0 ||
            maximumPoolSize <= 0 ||
            maximumPoolSize < corePoolSize ||
            keepAliveTime < 0)
            throw new IllegalArgumentException();
        if (workQueue == null || threadFactory == null || handler == null)
            throw new NullPointerException();
        this.corePoolSize = corePoolSize;
        this.maximumPoolSize = maximumPoolSize;
        this.workQueue = workQueue;
        this.keepAliveTime = unit.toNanos(keepAliveTime);
        this.threadFactory = threadFactory;
        this.handler = handler;
    }

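The constructor above is the all-arguments constructor; it takes seven parameters:

int corePoolSize : the core pool size, i.e. the number of threads the pool keeps alive even when they are idle.
- When a new task arrives and the pool has fewer than corePoolSize threads, a new thread is created to run it, even if other threads are idle, until the thread count reaches corePoolSize. After that the pool keeps corePoolSize threads alive, unless allowCoreThreadTimeOut is set to true.
- A newly created pool has 0 threads; threads are only created as tasks arrive (unless prestartCoreThread or prestartAllCoreThreads is called).
int maximumPoolSize : the maximum number of threads; it caps the size of the pool.
long keepAliveTime, TimeUnit unit : the keep-alive time and its unit.
- When the pool has more than corePoolSize threads, an excess thread that stays idle for this long after finishing its task is destroyed, until the thread count is back to corePoolSize.
BlockingQueue<Runnable> workQueue : the blocking queue.
- Holds tasks submitted after the thread count has reached corePoolSize.
- BlockingQueue has several implementations, covered in section 2.2.
- When the queue is full, the pool creates new threads to handle incoming tasks, up to the limit set by maximumPoolSize.
ThreadFactory threadFactory : the thread factory, used to set thread attributes such as the name.
RejectedExecutionHandler handler : the rejection policy.
- Decides what happens to a task submitted when the pool has reached maximumPoolSize and the queue is full (or the pool has been shut down). You can use one of the provided policies or implement your own; see section 2.3.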
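The order in which these parameters take effect (core threads first, then the queue, then extra threads, then rejection) can be observed with a small experiment. The following is only a sketch: the class name PoolGrowthDemo and the sizes 1, 2 and 1 are made up for illustration, and the tasks block on a latch so that the pool state is stable while it is inspected.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class PoolGrowthDemo {

    /** Submits four blocking tasks and records what the pool did with each one. */
    static List<String> run() throws InterruptedException {
        // corePoolSize = 1, maximumPoolSize = 2, queue capacity = 1, default AbortPolicy
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 2, 30, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1));
        CountDownLatch release = new CountDownLatch(1);
        Runnable blocker = () -> {
            try {
                release.await(); // keep the worker busy until we are done observing
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };
        List<String> log = new ArrayList<>();
        for (int i = 1; i <= 4; i++) {
            try {
                pool.execute(blocker);
                log.add("task" + i + " threads=" + pool.getPoolSize()
                        + " queued=" + pool.getQueue().size());
            } catch (RejectedExecutionException e) {
                log.add("task" + i + " rejected");
            }
        }
        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return log;
    }

    public static void main(String[] args) throws InterruptedException {
        run().forEach(System.out::println);
    }
}
```

The first task starts the core thread, the second waits in the queue, the third finds the queue full and triggers a second thread, and the fourth is rejected.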

2.2 BlockingQueue

2.2.1 SynchronousQueue

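A queue with no capacity at all; think of it as a queue of size 0. Tasks are handed directly to a thread: once the thread count has reached corePoolSize, a new task goes to an idle thread if one is waiting, otherwise a new thread is created to run it. If the thread count has already reached maximumPoolSize, the task is rejected, which with the default AbortPolicy means a RejectedExecutionException.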
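A minimal sketch of this direct hand-off, assuming a pool with corePoolSize 0 and maximumPoolSize 2 (the class name SynchronousQueueDemo and the sizes are illustrative only):

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.SynchronousQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class SynchronousQueueDemo {

    /** Returns {accepted, rejected, threads} after submitting three blocking tasks. */
    static int[] run() throws InterruptedException {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                0, 2, 30, TimeUnit.SECONDS, new SynchronousQueue<>());
        CountDownLatch release = new CountDownLatch(1);
        int accepted = 0;
        int rejected = 0;
        for (int i = 0; i < 3; i++) {
            try {
                pool.execute(() -> {
                    try {
                        release.await(); // occupy the thread so no worker is idle
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                });
                accepted++;
            } catch (RejectedExecutionException e) {
                rejected++; // no idle thread and no room for a new one
            }
        }
        int threads = pool.getPoolSize();
        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return new int[] {accepted, rejected, threads};
    }

    public static void main(String[] args) throws InterruptedException {
        int[] r = run();
        System.out.println("accepted=" + r[0] + " rejected=" + r[1] + " threads=" + r[2]);
    }
}
```

Nothing is ever stored in the queue: each accepted task gets its own thread, and the third task is rejected because both threads are busy.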

2.2.2 LinkedBlockingQueue

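An unbounded queue by default, implemented internally as a linked list of Node objects. With such a queue the pool never has more than corePoolSize threads and maximumPoolSize has no effect: every task beyond corePoolSize is placed in the queue, and since the queue never fills up, no thread beyond corePoolSize is ever created.

"Unbounded" is not literally true: the no-argument constructor simply uses a capacity of Integer.MAX_VALUE. The queue also has a constructor that takes a capacity; with a bounded capacity, maximumPoolSize takes effect again.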
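The difference between the two constructors can be checked with a sketch like the one below. The class name QueueBoundDemo, the pool sizes (1 core, 4 maximum) and the capacity 2 are assumptions chosen for the demonstration.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class QueueBoundDemo {

    /** Submits five blocking tasks to a 1-core / 4-max pool and returns the pool size. */
    static int poolSizeWith(BlockingQueue<Runnable> queue) throws InterruptedException {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(1, 4, 30, TimeUnit.SECONDS, queue);
        CountDownLatch release = new CountDownLatch(1);
        for (int i = 0; i < 5; i++) {
            pool.execute(() -> {
                try {
                    release.await();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }
        int threads = pool.getPoolSize(); // snapshot while all workers are blocked
        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return threads;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("unbounded:  " + poolSizeWith(new LinkedBlockingQueue<>()));
        System.out.println("capacity 2: " + poolSizeWith(new LinkedBlockingQueue<>(2)));
    }
}
```

With the unbounded queue the pool stays at one thread and four tasks wait; with capacity 2 the fourth and fifth tasks find the queue full, so the pool grows to three threads.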

2.2.3 ArrayBlockingQueue

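A bounded queue backed by an array. A fixed capacity must be given at construction time, hence "bounded". It can be created with either a fair or a non-fair ordering policy for waiting threads.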

2.3 RejectedExecutionHandler

A handler for tasks that cannot be executed by a {@link ThreadPoolExecutor}.

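This is the policy applied to a new task when both the queue and the pool are full. ThreadPoolExecutor ships with the four implementations below; you can also write your own by implementing the interface's rejectedExecution(Runnable r, ThreadPoolExecutor e) method.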
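As a sketch of a custom policy, the hypothetical CountingHandler below counts rejected tasks instead of throwing. Both class names and the pool sizes are invented for this example; a real handler would more likely log the task or report it to a monitoring system.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionHandler;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

class CountingRejectionDemo {

    /** Hypothetical handler: counts rejected tasks and otherwise drops them. */
    static class CountingHandler implements RejectedExecutionHandler {
        final AtomicInteger rejected = new AtomicInteger();

        @Override
        public void rejectedExecution(Runnable r, ThreadPoolExecutor executor) {
            rejected.incrementAndGet();
        }
    }

    /** One thread plus one queue slot, four blocking tasks: returns the rejection count. */
    static int run() throws InterruptedException {
        CountingHandler handler = new CountingHandler();
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1), handler);
        CountDownLatch release = new CountDownLatch(1);
        for (int i = 0; i < 4; i++) {
            pool.execute(() -> {
                try {
                    release.await();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }
        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return handler.rejected.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("rejected tasks: " + run());
    }
}
```

The first task occupies the only thread, the second fills the queue, and the remaining two reach the handler.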

2.3.1 ThreadPoolExecutor.AbortPolicy

Its implementation:

       /**
         * Always throws RejectedExecutionException.
         *
         * @param r the runnable task requested to be executed
         * @param e the executor attempting to execute this task
         * @throws RejectedExecutionException always
         */
        public void rejectedExecution(Runnable r, ThreadPoolExecutor e) {
            throw new RejectedExecutionException("Task " + r.toString() +
                                                 " rejected from " +
                                                 e.toString());
        }

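As the code shows, the task is not processed; a RejectedExecutionException is always thrown.

This policy keeps both the thread count and the queue size under firm control, but the caller should catch the exception and turn it into a status code, for example returning a "service busy, please try again later" message when an app calls the service.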

2.3.2 ThreadPoolExecutor.DiscardPolicy

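Its rejectedExecution method has an empty body. As its Javadoc also says, this policy silently discards the new task, with no notice and no exception.

Because tasks are lost without anyone noticing, it should only be used in specific scenarios where that is acceptable.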

2.3.3 ThreadPoolExecutor.DiscardOldestPolicy

        /**
         * Obtains and ignores the next task that the executor
         * would otherwise execute, if one is immediately available,
         * and then retries execution of task r, unless the executor
         * is shut down, in which case task r is instead discarded.
         *
         * @param r the runnable task requested to be executed
         * @param e the executor attempting to execute this task
         */
        public void rejectedExecution(Runnable r, ThreadPoolExecutor e) {
            if (!e.isShutdown()) {
                e.getQueue().poll();
                e.execute(r);
            }
        }

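The task at the head of the queue (the oldest one waiting) is removed, and the new task is then resubmitted through execute, which normally places it at the tail of the queue.

This policy loses tasks just like 2.3.2, so personally I do not recommend it outside of specific scenarios.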
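Which task gets lost can be seen in a small sketch. The class name DiscardOldestDemo and the task names A, B and C are illustrative; the pool has one thread and one queue slot.

```java
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class DiscardOldestDemo {

    /** Returns the names of the tasks that actually ran, in execution order. */
    static List<String> run() throws InterruptedException {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1),
                new ThreadPoolExecutor.DiscardOldestPolicy());
        CountDownLatch release = new CountDownLatch(1);
        List<String> ran = new CopyOnWriteArrayList<>();
        for (String name : new String[] {"A", "B", "C"}) {
            pool.execute(() -> {
                try {
                    release.await(); // A blocks the only worker while B and C are submitted
                    ran.add(name);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }
        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return ran;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(run());
    }
}
```

A is running when C arrives, so the queued task B is the one that is discarded, and C takes its place in the queue.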

2.3.4 ThreadPoolExecutor.CallerRunsPolicy

        /**
         * Executes task r in the caller's thread, unless the executor
         * has been shut down, in which case the task is discarded.
         *
         * @param r the runnable task requested to be executed
         * @param e the executor attempting to execute this task
         */
        public void rejectedExecution(Runnable r, ThreadPoolExecutor e) {
            if (!e.isShutdown()) {
                r.run();
            }
        }

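If the pool has not been shut down, the task is run directly by the thread that submitted it, instead of waiting for a pool thread.

This policy should be used only when the number of submitting threads in the service is under control, or when we understand the service's thread usage well.

Note that this policy creates no new threads: the task runs on the submitting thread, which is blocked until the task finishes. Under a burst of new tasks this slows the submitters down, but it also ties up threads (request-handling threads, for example) that were never meant to do this work, which goes against the reason for using a thread pool in the first place.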
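The following sketch records which thread runs each task. The class name CallerRunsDemo and the task labels are made up for the example; the pool again has one thread and one queue slot.

```java
import java.util.Map;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class CallerRunsDemo {

    /** Returns a map from task label to the name of the thread that ran it. */
    static Map<String, String> run() throws InterruptedException {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1),
                new ThreadPoolExecutor.CallerRunsPolicy());
        CountDownLatch release = new CountDownLatch(1);
        Map<String, String> ranOn = new ConcurrentHashMap<>();

        pool.execute(() -> {                 // occupies the only pool thread
            try {
                release.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            ranOn.put("first", Thread.currentThread().getName());
        });
        pool.execute(() -> ranOn.put("second", Thread.currentThread().getName())); // queued
        pool.execute(() -> ranOn.put("third", Thread.currentThread().getName()));  // rejected

        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return ranOn;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(run());
    }
}
```

The third task is rejected and therefore runs synchronously inside the execute call, on the submitting thread; the first two run on the pool thread.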

3. Executors: creating thread pools quickly

3.1 ExecutorService newFixedThreadPool(int nThreads)

Creates a thread pool with a fixed number of threads:

   /**
     * Creates a thread pool that reuses a fixed number of threads
     * operating off a shared unbounded queue.  At any point, at most
     * {@code nThreads} threads will be active processing tasks.
     * If additional tasks are submitted when all threads are active,
     * they will wait in the queue until a thread is available.
     * If any thread terminates due to a failure during execution
     * prior to shutdown, a new one will take its place if needed to
     * execute subsequent tasks.  The threads in the pool will exist
     * until it is explicitly {@link ExecutorService#shutdown shutdown}.
     *
     * @param nThreads the number of threads in the pool
     * @return the newly created thread pool
     * @throws IllegalArgumentException if {@code nThreads <= 0}
     */
    public static ExecutorService newFixedThreadPool(int nThreads) {
        return new ThreadPoolExecutor(nThreads, nThreads,
                                      0L, TimeUnit.MILLISECONDS,
                                      new LinkedBlockingQueue<Runnable>());
    }

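The implementation shows that it also builds a ThreadPoolExecutor, just with some of the arguments preset.

corePoolSize and maximumPoolSize are both nThreads, which fixes the number of threads. The unbounded LinkedBlockingQueue lets every task that cannot get a thread wait in the queue, so no task is lost. keepAliveTime is 0 because no thread beyond the core size is ever created, so the value does not matter.

A fixed pool keeps the number of threads in the service under control, prevents it from spiking, and limits CPU usage. The trade-off is that if a large number of tasks pile up in the queue, memory usage will climb. A fixed pool therefore suits workloads whose concurrency is predictable and that do not submit a flood of tasks in a short time.

If many tasks arrive in a short burst but each one is quick to compute, consider the next pool: ExecutorService newCachedThreadPool().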

3.2 ExecutorService newCachedThreadPool()

Cached thread pool:

    public static ExecutorService newCachedThreadPool() {
        return new ThreadPoolExecutor(0, Integer.MAX_VALUE,
                                      60L, TimeUnit.SECONDS,
                                      new SynchronousQueue<Runnable>());
    }

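corePoolSize is 0, so there are no core threads: every thread is destroyed once it has been idle for longer than keepAliveTime. maximumPoolSize is Integer.MAX_VALUE, which is effectively unlimited. The work queue is a SynchronousQueue, so every task is handed to a thread immediately and no task ever waits in the queue.

This pool suits bursts of tasks that each finish quickly. The maximumPoolSize value guarantees that every task is picked up at once by an existing or newly created thread. keepAliveTime = 60L lets threads stay alive after finishing a task and wait for the next one, which avoids the cost of creating a thread every time. So the pool can create many threads without wasting them, and because idle threads are destroyed after the timeout, large numbers of threads do not linger, which keeps the thread count in check.

For long-running tasks, threads stay occupied for a long time, every new task creates a new thread, and the thread count keeps climbing. The cached pool is not suitable in that case, and another kind of pool should be considered.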
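To compare the two factories under the same load of slow tasks, a sketch like the following can be used. The class name FixedVsCachedDemo and the task count are assumptions. The cast to ThreadPoolExecutor relies on the factory implementations quoted in 3.1 and 3.2; the ExecutorService return type does not guarantee it.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class FixedVsCachedDemo {

    /** Submits blocking tasks and returns {threads, queued tasks} while they are all pending. */
    static int[] load(ExecutorService service, int tasks) throws InterruptedException {
        // Assumption: the service is a plain ThreadPoolExecutor, as in the listings above
        ThreadPoolExecutor pool = (ThreadPoolExecutor) service;
        CountDownLatch release = new CountDownLatch(1);
        for (int i = 0; i < tasks; i++) {
            pool.execute(() -> {
                try {
                    release.await(); // simulate a long-running task
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }
        int[] snapshot = {pool.getPoolSize(), pool.getQueue().size()};
        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return snapshot;
    }

    public static void main(String[] args) throws InterruptedException {
        int[] fixed = load(Executors.newFixedThreadPool(2), 5);
        int[] cached = load(Executors.newCachedThreadPool(), 5);
        System.out.println("fixed:  threads=" + fixed[0] + " queued=" + fixed[1]);
        System.out.println("cached: threads=" + cached[0] + " queued=" + cached[1]);
    }
}
```

With five slow tasks the fixed pool stays at two threads and queues the other three, while the cached pool creates one thread per task and queues nothing.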

3.3 ExecutorService newSingleThreadExecutor()

Single-thread pool: a pool with an unbounded queue whose core and maximum sizes are both 1. All tasks are executed serially.
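A short sketch of the serial behaviour (the class name SingleThreadOrderDemo is illustrative): tasks run one at a time, in submission order.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

class SingleThreadOrderDemo {

    /** Submits five tasks and returns the order in which they ran. */
    static List<Integer> run() throws InterruptedException {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        List<Integer> order = Collections.synchronizedList(new ArrayList<>());
        for (int i = 0; i < 5; i++) {
            final int n = i;
            pool.execute(() -> order.add(n)); // each task records its own index
        }
        pool.shutdown();                       // already-submitted tasks still run
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return order;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(run());
    }
}
```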

3.4 ScheduledExecutorService newScheduledThreadPool(int corePoolSize)

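A pool that supports delayed execution; maximumPoolSize is Integer.MAX_VALUE. It can be used to run scheduled and periodic tasks. There are many more flexible ways to use it; for details, the original post links to section 2 of a separate article.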
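A minimal sketch of both styles of scheduling, a one-shot delayed task and a fixed-rate periodic task. The class name ScheduledDemo and the delays of 100 ms and 50 ms are arbitrary choices for illustration.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;

class ScheduledDemo {

    static String run() throws Exception {
        ScheduledExecutorService scheduler = Executors.newScheduledThreadPool(2);
        try {
            // One-shot task: runs once, 100 ms from now
            Callable<String> task = () -> "done";
            ScheduledFuture<String> delayed = scheduler.schedule(task, 100, TimeUnit.MILLISECONDS);

            // Periodic task: starts immediately, then repeats every 50 ms
            CountDownLatch ticks = new CountDownLatch(3);
            ScheduledFuture<?> periodic =
                    scheduler.scheduleAtFixedRate(ticks::countDown, 0, 50, TimeUnit.MILLISECONDS);

            String value = delayed.get();                          // blocks until the delay has passed
            boolean ticked = ticks.await(5, TimeUnit.SECONDS);     // wait for three runs
            periodic.cancel(false);                                // stop further runs
            return value + " ticks=" + ticked;
        } finally {
            scheduler.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(run());
    }
}
```

A periodic task keeps running until it is cancelled, it throws an exception, or the scheduler is shut down, so it is worth holding on to the returned ScheduledFuture.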

3.5 ExecutorService newWorkStealingPool(int parallelism)

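Work-stealing pool. The parameter is the target parallelism level; the no-argument overload uses the number of available processors. The pool is built on ForkJoinPool; for its usage, the original post links to a separate article.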
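From the caller's point of view the pool is used like any other ExecutorService. The sketch below is illustrative only (the class name WorkStealingDemo and the sum-of-squares workload are invented): it submits independent tasks with invokeAll and adds up their results.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class WorkStealingDemo {

    /** Computes 1*1 + 2*2 + ... + n*n with one task per term. */
    static int sumOfSquares(int n) throws Exception {
        ExecutorService pool = Executors.newWorkStealingPool(); // parallelism = available processors
        try {
            List<Callable<Integer>> tasks = new ArrayList<>();
            for (int i = 1; i <= n; i++) {
                final int x = i;
                tasks.add(() -> x * x);
            }
            int sum = 0;
            // invokeAll blocks until every task has completed
            for (Future<Integer> result : pool.invokeAll(tasks)) {
                sum += result.get();
            }
            return sum;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(sumOfSquares(10));
    }
}
```

Because the workers take tasks from several internal queues, the execution order of submitted tasks is not guaranteed; the results here are still collected in submission order through the returned futures.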

3.6 A custom thread pool utility class

Below is a custom thread pool that can be used directly; adjust it to the actual needs of your project:

package com.river.thread;

import java.util.concurrent.*;
import java.util.concurrent.atomic.AtomicInteger;

public enum ContextThreadPool {

    /**
     * The single instance of this class; the enum implements the singleton pattern
     */
    INSTANCE;

    public ThreadPoolExecutor getThreadPool(){
        return ThreadPoolHolder.pool;
    }

    private static class ThreadPoolHolder{
        /**
         * Capacity of the blocking queue
         */
        private final static int CAPACITY = 500;

        private static ThreadPoolExecutor pool ;
        /**
         * Number of available processors
         */
        private static int availableProcessors = Runtime.getRuntime().availableProcessors();

        /**
         * A LinkedBlockingQueue with a capacity of {@link #CAPACITY}
         */
        private static BlockingQueue<Runnable> queue = new LinkedBlockingQueue<>(CAPACITY);

        static {
            pool = new ThreadPoolExecutor(
                    availableProcessors * 2,
                    availableProcessors * 4 + 1,
                    0,
                    TimeUnit.MILLISECONDS,
                    queue,
                    new ThreadFactory() {
                        private AtomicInteger count = new AtomicInteger(0);

                        @Override
                        public Thread newThread(Runnable r) {
                            Thread thread = new Thread(r);
                            String threadName = ContextThreadPool.class.getSimpleName() + "-thread-" + count.incrementAndGet();
                            thread.setName(threadName);
                            return thread;
                        }
                    },
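                    // Custom policy for when the pool is full: the submitting thread blocks until the queue has room
                    (r, executor) -> {
                        try {
                            queue.put(r);
                        } catch (InterruptedException e) {
                            // Restore the interrupt flag so the caller can react to it
                            Thread.currentThread().interrupt();
                        }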
                    }
            );
        }

    }
}

Usage:

ContextThreadPool.INSTANCE.getThreadPool();

4. Future

Now that we have a thread pool, the next step is to submit tasks to it. There are two ways to do so:

void execute(Runnable command)
<T> Future<T> submit(Callable<T> task)

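The former is defined in Executor and is used for tasks without a return value; the latter is defined in ExecutorService and returns a Future from which the task's result can be obtained.

As we all know, there are several ways to define the work a thread runs; two of them are implementing the Runnable interface and implementing the Callable interface. In ordinary use they are much alike, but the difference shows when it comes to getting the result of the execution: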
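A minimal side-by-side sketch of the two calls (the class name ExecuteVsSubmitDemo is illustrative). It also uses the submit(Runnable) overload of ExecutorService, whose Future completes with null:

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class ExecuteVsSubmitDemo {

    static String run() throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(2);
        try {
            // execute: fire and forget, nothing is returned to the caller
            CountDownLatch ran = new CountDownLatch(1);
            pool.execute(ran::countDown);

            // submit with a Callable: the Future carries the computed value
            Future<Integer> sum = pool.submit(() -> 1 + 2);

            // submit with a Runnable: the Future only signals completion, get() returns null
            Future<?> noValue = pool.submit(() -> { });

            ran.await();
            return sum.get() + "," + noValue.get();
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(run());
    }
}
```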

@FunctionalInterface
public interface Runnable {
    // returns void
    public abstract void run();
}

@FunctionalInterface
public interface Callable<V> {
    /**
     * Computes a result, or throws an exception if unable to do so.
     *
     * @return computed result
     * @throws Exception if unable to compute a result
     */
    V call() throws Exception;
}

4.1 get result

V get() throws InterruptedException, ExecutionException;
V get(long timeout, TimeUnit unit) throws InterruptedException, ExecutionException, TimeoutException;

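The task's return value is obtained from the Future through the methods above. get() blocks until the task has finished. When a task takes a long time and we would rather not block, we can first call boolean isDone() to check whether it has completed: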

package com.river.thread;

import lombok.extern.slf4j.Slf4j;

import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Future;
import java.util.concurrent.ThreadPoolExecutor;

@Slf4j
public class FutureTest {

    public static void main(String[] args) throws ExecutionException, InterruptedException {
        ThreadPoolExecutor threadPool = ContextThreadPool.INSTANCE.getThreadPool();
        // Normally I would write this as a lambda; the anonymous class makes the interface implementation explicit
        Future<String> result = threadPool.submit(new Callable<String>() {
            @Override
            public String call() throws Exception {
                return "123";
            }
        });

        while (true){
            if (result.isDone()){
                System.out.println(result.get());
                break;
            }
            log.info("not finish");
        }

    }
}

Log output:

2018-09-17 17:47:37.352 myAppName [main] INFO  com.river.thread.FutureTest - not finish
123

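Here the first isDone() check found the task unfinished, and by the second check the result was available.

Next we fetch the result with the get() overload that takes a timeout: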

public static void main(String[] args) throws ExecutionException, InterruptedException, TimeoutException {
        ThreadPoolExecutor threadPool = ContextThreadPool.INSTANCE.getThreadPool();
        Future<String> result = threadPool.submit(new Callable<String>() {
            @Override
            public String call() throws Exception {
                Thread.sleep(2000);
                return "123";
            }
        });

        log.info("get result");
        log.info(result.get(3, TimeUnit.SECONDS));

    }

2018-09-17 17:52:43.348 myAppName [main] INFO  com.river.thread.FutureTest - get result
2018-09-17 17:52:45.349 myAppName [main] INFO  com.river.thread.FutureTest - 123

The result arrived after 2 seconds. What if the task takes longer than the timeout passed to get()?

We change the sleep argument to 5000:

2018-09-17 17:55:14.380 myAppName [main] INFO  com.river.thread.FutureTest - get result
Exception in thread "main" java.util.concurrent.TimeoutException
	at java.util.concurrent.FutureTask.get(FutureTask.java:205)
	at com.river.thread.FutureTest.main(FutureTest.java:21)

As we can see, an exception is thrown.

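Note that this TimeoutException is raised by get() itself; the task keeps running in the pool. An exception thrown inside the task behaves differently: as long as we do not call get(), it is never surfaced. In other words, the Future holds on to the exception. When we call get() after the task has finished, the exception is rethrown wrapped in an ExecutionException, which we can catch and handle accordingly.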
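Both kinds of failure can be handled as in the sketch below. The class name FutureExceptionDemo, the "boom" message and the timings are illustrative. Cancelling the task after a timeout is one possible reaction, not something the original example does.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

class FutureExceptionDemo {

    /** The task throws; the exception only surfaces when get() is called. */
    static String failureCause() throws InterruptedException {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Callable<String> failing = () -> {
                throw new IllegalStateException("boom");
            };
            Future<String> future = pool.submit(failing); // nothing is thrown here
            try {
                return future.get();
            } catch (ExecutionException e) {
                return e.getCause().getMessage();         // the original exception is the cause
            }
        } finally {
            pool.shutdown();
        }
    }

    /** The task is slower than the timeout; we give up and cancel it. */
    static boolean timesOut() throws Exception {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Callable<String> slowTask = () -> {
                Thread.sleep(2000);
                return "123";
            };
            Future<String> slow = pool.submit(slowTask);
            try {
                slow.get(100, TimeUnit.MILLISECONDS);
                return false;
            } catch (TimeoutException e) {
                slow.cancel(true); // interrupts the sleeping task so the thread is freed
                return true;
            }
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println("cause: " + failureCause());
        System.out.println("timed out: " + timesOut());
    }
}
```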

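Typically we submit a whole batch of tasks to the pool, keep each returned Future in a collection, and finally iterate over the collection to combine the individual results into the final one.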
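This pattern might look like the following sketch. The class name CollectFuturesDemo, the pool size of 4 and the squaring workload are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class CollectFuturesDemo {

    /** Submits one task per number, then combines the results in submission order. */
    static int run() throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            List<Future<Integer>> results = new ArrayList<>();
            for (int i = 1; i <= 5; i++) {
                final int n = i;
                Callable<Integer> square = () -> n * n;
                results.add(pool.submit(square)); // returns immediately with a Future
            }
            int total = 0;
            for (Future<Integer> result : results) {
                total += result.get();            // blocks only if this task is not finished yet
            }
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(run());
    }
}
```

All tasks are submitted before the first get() call, so they run concurrently and the total waiting time is roughly that of the slowest task rather than the sum of all of them.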

Reposted from: https://my.oschina.net/JackieRiver/blog/2051005