分析 Java 线程池 Callable 任务执行原理

KariShaffer 8年前
   <p>上一篇分析了线程池的执行原理,主要关于线程池的生命周期和任务如何在池里创建、运行和终止。不过上次研究的是execute方法,执行的是Runnable任务,它不返回任何值。如果希望任务完成后返回结果,那么需要使用Callable接口,这也是本文要研究的主题。</p>    <pre>  <code class="language-java">ExecutorService es = Executors.newSingleThreadExecutor();  Future<?> task = es.submit(new MyThread());  try {      //限定时间获取结果      task.get(5, TimeUnit.SECONDS);  } catch (TimeoutException e) {      //超时触发线程中止      System.out.println("thread over time");  } catch (ExecutionException e) {     //抛出执行异常      throw e;  } finally {     //如果任务还在运行,执行中断      boolean mayInterruptIfRunning = true;      task.cancel(mayInterruptIfRunning);  }</code></pre>    <p>上面代码是Future的一个简单例子:MyThread实现Callable接口,执行时要求在限定时间内获取结果,超时执行会抛出TimeoutException,执行异常会抛出ExecutionException。最后在finally里,如果任务还在执行,就进行取消;如果任务已经执行完,取消操作也没有影响。</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/bc192907d3008855e6aa848394f550a7.png"></p>    <p>图1 FutureTask</p>    <p>Future接口代表一个异步任务的结果,提供了相应方法判断任务是否完成或者取消。从图1可知,RunnableFuture同时继承了Future和Runnable,是一个可运行、可知结果的任务,FutureTask是具体的实现类。</p>    <h3><strong>FutureTask的状态</strong></h3>    <pre>  <code class="language-java">private volatile int state;  private static final int NEW          = 0;  private static final int COMPLETING   = 1;  private static final int NORMAL       = 2;  private static final int EXCEPTIONAL  = 3;  private static final int CANCELLED    = 4;  private static final int INTERRUPTING = 5;  private static final int INTERRUPTED  = 6;</code></pre>    <p>FutureTask有7种状态,初始状态从NEW开始,状态转换路径可以归纳为图2所示。在后文的代码,会使用int的大小比较判断状态处于哪个范围,需要留意上面状态的排列顺序。</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/beda8415725d631b0f347860c37b7366.png"></p>    <p>图2 FutureTask状态路径</p>    <p>FutureTask的状态路径,取决于run和cancel的调用顺序,在后文分析时,对号入座这几条路径。</p>    <ol>     <li>NEW -> COMPLETING -> NORMAL 正常的流程</li>     <li>NEW -> COMPLETING -> EXCEPTIONAL 异常的流程</li>     <li>NEW -> CANCELLED 被取消流程</li>     <li>NEW -> INTERRUPTING -> INTERRUPTED 被中断流程</li>    </ol>    <h3><strong>FutureTask的变量</strong></h3>    <ul>     <li>int state</li>     <li>Thread runner</li>     <li>WaitNode waiters</li>     <li>Callable<V> callable</li>     <li>Object outcome</li>    </ul>    <p>state、runner、waiters三个变量没有使用原子类,而是使用Unsafe对象进行原子操作。代码中会见到很多形如compareAndSwap的方法,入门原理可以看我以前写的 认识非阻塞的同步机制CAS 。</p>    <p>callable是要执行的任务,runner是执行任务的线程,outcome是返回的结果(正常结果或Exception结果)</p>    <pre>  <code class="language-java">static final class WaitNode {      volatile Thread thread;      volatile WaitNode next;      WaitNode() { thread = Thread.currentThread(); }  }</code></pre>    <p>waiters的数据结构是WaitNode,保存了Thread和下个WaitNode的引用。waiters保存了等待结果的线程,每次操作只会增减头,所以是一个栈结构,详细见后文对get方法的分析。</p>    <h3><strong>FutureTask的创建</strong></h3>    <pre>  <code class="language-java">public FutureTask(Callable<V> callable) {      if (callable == null)          throw new NullPointerException();      this.callable = callable;      this.state = NEW;       // ensure visibility of callable  }    public FutureTask(Runnable runnable, V result) {      this.callable = Executors.callable(runnable, result);      this.state = NEW;       // ensure visibility of callable  }</code></pre>    <p>FutureTask可以接受Callable或者Runnable,state从NEW开始。如果是Runnable,需要调用Executors.callable转成Callable,返回的结果是预先传入的result。转换过程使用一个实现了Callable的RunnableAdapter包装Runnable和result,代码比较简单。</p>    <pre>  <code class="language-java">static final class RunnableAdapter<T> implements Callable<T> {      final Runnable task;      final T result;      RunnableAdapter(Runnable task, T result) {          this.task = task;          this.result = result;      }      public T call() {          task.run();          return result;      }  }</code></pre>    <p>提交FutureTask到线程池的submit定义在AbstractExecutorService,根据入参的不同,有三个submit方法。下面以提交Callable为例:</p>    <pre>  <code class="language-java">public <T> Future<T> submit(Callable<T> task) {     if (task == null) throw new NullPointerException();     RunnableFuture<T> ftask = newTaskFor(task);     execute(ftask);     return ftask;  }    protected <T> RunnableFuture<T> newTaskFor(Callable<T> callable) {         return new FutureTask<T>(callable);  }</code></pre>    <p>FutureTask在newTaskFor创建,然后调用线程池的execute执行,最后返回Future。获取Future后,就可以调用get获取结果,或者调用cancel取消任务。</p>    <h3><strong>FutureTask的运行</strong></h3>    <p>FutureTask实现了Runnable,在线程池里执行时调用的方法是run。</p>    <pre>  <code class="language-java">public void run() {      //1      if (state != NEW ||          !UNSAFE.compareAndSwapObject(this, runnerOffset,null, Thread.currentThread()))          return;      //2      try {          Callable<V> c = callable;          if (c != null && state == NEW) {              V result;              boolean ran;              try {                  result = c.call();                  ran = true;              } catch (Throwable ex) {                  result = null;                  ran = false;                  setException(ex);              }              if (ran)                  set(result);          }      } finally {         //3          runner = null;          int s = state;          if (s >= INTERRUPTING)              handlePossibleCancellationInterrupt(s);      }  }</code></pre>    <p>标记1处检查FutureTask的状态,如果不是处于NEW,说明状态已经进入四条路径之一,也就没有必要继续了。如果状态是NEW,则将执行任务的线程交给runner。</p>    <p>标记2处开始正式执行任务,调用call方法获取结果,没有异常就算成功,最后执行set方法;出现异常就调用setException方法。</p>    <p>标记3处,无论任务执行是否成功,都需要将runner重新置为空。</p>    <pre>  <code class="language-java">protected void set(V v) {      if (UNSAFE.compareAndSwapInt(this, stateOffset, NEW, COMPLETING)) {          outcome = v;          UNSAFE.putOrderedInt(this, stateOffset, NORMAL); // final state          finishCompletion();      }  }    protected void setException(Throwable t) {      if (UNSAFE.compareAndSwapInt(this, stateOffset, NEW, COMPLETING)) {          outcome = t;          UNSAFE.putOrderedInt(this, stateOffset, EXCEPTIONAL); // final state          finishCompletion();      }  }</code></pre>    <p>任务执行成功与失败,分别对应NEW -> COMPLETING -> NORMAL和NEW -> COMPLETING -> EXCEPTIONAL两条路径。这里先将状态修改为中间状态,再对结果赋值,最后再修改为最终状态。</p>    <pre>  <code class="language-java">private void finishCompletion() {      // assert state > COMPLETING;      for (WaitNode q; (q = waiters) != null;) {          if (UNSAFE.compareAndSwapObject(this, waitersOffset, q, null)) {              for (;;) {                  Thread t = q.thread;                  if (t != null) {                      q.thread = null;                      LockSupport.unpark(t);                  }                  WaitNode next = q.next;                  if (next == null)                      break;                  q.next = null; // unlink to help gc                  q = next;              }              break;          }      }      done();      callable = null;        // to reduce footprint  }</code></pre>    <p>最后调用finishCompletion执行任务完成,唤醒并删除所有在waiters中等待的线程。done方法是空的,供子类实现,最后callable也设置为空。</p>    <p>FutureTask还有个runAndReset,逻辑和run类似,但没有调用set方法来设置结果,执行完成后将任务重新初始化。</p>    <pre>  <code class="language-java">protected boolean runAndReset() {      if (state != NEW ||          !UNSAFE.compareAndSwapObject(this, runnerOffset,                                       null, Thread.currentThread()))          return false;      boolean ran = false;      int s = state;      try {          Callable<V> c = callable;          if (c != null && s == NEW) {              try {                  c.call(); // don't set result                  ran = true;              } catch (Throwable ex) {                  setException(ex);              }          }      } finally {          // runner must be non-null until state is settled to          // prevent concurrent calls to run()          runner = null;          // state must be re-read after nulling runner to prevent          // leaked interrupts          s = state;          if (s >= INTERRUPTING)              handlePossibleCancellationInterrupt(s);      }      return ran && s == NEW;  }</code></pre>    <h3><strong>FutureTask的取消</strong></h3>    <p>对于已经提交执行的任务,可以调用cancel执行取消。</p>    <pre>  <code class="language-java">public boolean cancel(boolean mayInterruptIfRunning) {     //1      if (!(state == NEW &&            UNSAFE.compareAndSwapInt(this, stateOffset, NEW,                mayInterruptIfRunning ? INTERRUPTING : CANCELLED)))          return false;      try {    // in case call to interrupt throws exception         //2          if (mayInterruptIfRunning) {              try {                  Thread t = runner;                  if (t != null)                      t.interrupt();              } finally { // final state                  UNSAFE.putOrderedInt(this, stateOffset, INTERRUPTED);              }          }      } finally {          finishCompletion();      }      return true;  }</code></pre>    <p>标记1处判断任务状态,为NEW才能被取消。如果mayInterruptIfRunning是true,代表任务需要被中断,走NEW -> INTERRUPTING -> INTERRUPTED流程。否则代表任务被取消,走NEW -> CANCELLED流程。</p>    <p>标记2处理任务被中断的情况,这里仅仅是对线程发出中断请求,不确保任务能检测并处理中断,详细原理去看Java的中断机制。</p>    <p>最后调用finishCompletion完成收尾工作。</p>    <pre>  <code class="language-java">public boolean isCancelled() {      return state >= CANCELLED;  }</code></pre>    <p>判断任务是否被取消,具体逻辑是判断state >= CANCELLED,包括了被中断一共两条路径的结果。</p>    <h3><strong>FutureTask获取结果</strong></h3>    <p>调用FutureTask的get方法获取任务的执行结果,可以阻塞直到获取结果,也可以限制范围时间内获取结果,否则抛出TimeoutException。</p>    <pre>  <code class="language-java">public V get() throws InterruptedException, ExecutionException {      int s = state;      if (s <= COMPLETING)          s = awaitDone(false, 0L);      return report(s);  }    public V get(long timeout, TimeUnit unit)      throws InterruptedException, ExecutionException, TimeoutException {      if (unit == null)          throw new NullPointerException();      int s = state;      if (s <= COMPLETING &&          (s = awaitDone(true, unit.toNanos(timeout))) <= COMPLETING)          throw new TimeoutException();      return report(s);  }</code></pre>    <p>get的核心实现调用了awaitDone,入参为是否开启时间限制和最大的等待时间。</p>    <pre>  <code class="language-java">private int awaitDone(boolean timed, long nanos)      throws InterruptedException {      final long deadline = timed ? System.nanoTime() + nanos : 0L;      WaitNode q = null;      boolean queued = false;      for (;;) {          if (Thread.interrupted()) {              removeWaiter(q);              throw new InterruptedException();          }            int s = state;          if (s > COMPLETING) {    //1              if (q != null)                  q.thread = null;              return s;          }          else if (s == COMPLETING) // cannot time out yet    //2              Thread.yield();          else if (q == null)     //3              q = new WaitNode();          else if (!queued)    //4              queued = UNSAFE.compareAndSwapObject(this, waitersOffset,                                                   q.next = waiters, q);          else if (timed) {    //5              nanos = deadline - System.nanoTime();              if (nanos <= 0L) {                  removeWaiter(q);                  return state;              }              LockSupport.parkNanos(this, nanos);          }          else     //6              LockSupport.park(this);      }  }</code></pre>    <p>awaitDone主要逻辑是一个无限循环,首先判断线程是否被中断,是的话移除waiter并抛出中断异常。接下来是一串if-else,一共六种情况。</p>    <ol>     <li>判断任务状态是否已经完成,是就直接返回;</li>     <li>任务状态是COMPLETING,代表在set结果时被阻塞了,这里先让出资源;</li>     <li>如果WaitNode为空,就为当前线程初始化一个WaitNode;</li>     <li>如果当前的WaitNode还没有加入waiters,就加入;</li>     <li>如果是限定时间执行,判断有无超时,超时就将waiter移出,并返回结果,否则阻塞一定时间;</li>     <li>如果没有限定时间,就一直阻塞到下次被唤醒。</li>    </ol>    <p>LockSupport是用来创建锁和其他同步类的基本线程阻塞原语。park和unpark的作用分别是阻塞线程和解除阻塞线程。</p>    <pre>  <code class="language-java">private V report(int s) throws ExecutionException {     Object x = outcome;     if (s == NORMAL)         return (V)x;     if (s >= CANCELLED)         throw new CancellationException();     throw new ExecutionException((Throwable)x);  }</code></pre>    <p>最后get调用report,使用outcome返回结果。</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/d7c532dd4fdbd4277271d131ef69e9a8.png"></p>    <p>图3</p>    <p>看图3,如果多个线程向同一个FutureTask实例get结果,但FutureTask又没有执行完毕,线程将会阻塞并保存在waiters中。待FutureTask获取结果后,唤醒waiters等待的线程,并返回同一个结果。</p>    <h3><strong>总结</strong></h3>    <p style="text-align:center"><img src="https://simg.open-open.com/show/2032bd1bf05efa607d34b723ce0597b4.png"></p>    <p>图4</p>    <p>图4归纳了FutureTask的作用,任务的调用线程Caller和线程池的工作线程通过FutureTask交互。对比线程池的执行原理,FutureTask是比较简单的。</p>    <p> </p>    <p>来自:http://www.jianshu.com/p/f624934b9a23</p>    <p> </p>