分析 Java 线程池 Callable 任务执行原理

KariShaffer 8年前
   <p>上一篇分析了线程池的执行原理，主要关于线程池的生命周期和任务如何在池里创建、运行和终止。不过上次研究的是execute方法，执行的是Runnable任务，它不返回任何值。如果希望任务完成后返回结果，那么需要使用Callable接口，这也是本文要研究的主题。</p>    <pre>  <code class="language-java">ExecutorService es = Executors.newSingleThreadExecutor();  Future<?> task = es.submit(new MyThread());  try {      //限定时间获取结果      task.get(5, TimeUnit.SECONDS);  } catch (TimeoutException e) {      //超时触发线程中止      System.out.println("thread over time");  } catch (ExecutionException e) {     //抛出执行异常      throw e;  } finally {     //如果任务还在运行，执行中断      boolean mayInterruptIfRunning = true;      task.cancel(mayInterruptIfRunning);  }</code></pre>    <p>上面代码是Future的一个简单例子：MyThread实现Callable接口，执行时要求在限定时间内获取结果，超时执行会抛出TimeoutException，执行异常会抛出ExecutionException。最后在finally里，如果任务还在执行，就进行取消；如果任务已经执行完，取消操作也没有影响。</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/bc192907d3008855e6aa848394f550a7.png"></p>    <p>图1 FutureTask</p>    <p>Future接口代表一个异步任务的结果，提供了相应方法判断任务是否完成或者取消。从图1可知，RunnableFuture同时继承了Future和Runnable，是一个可运行、可知结果的任务，FutureTask是具体的实现类。</p>    <h3><strong>FutureTask的状态</strong></h3>    <pre>  <code class="language-java">private volatile int state;  private static final int NEW          = 0;  private static final int COMPLETING   = 1;  private static final int NORMAL       = 2;  private static final int EXCEPTIONAL  = 3;  private static final int CANCELLED    = 4;  private static final int INTERRUPTING = 5;  private static final int INTERRUPTED  = 6;</code></pre>    <p>FutureTask有7种状态，初始状态从NEW开始，状态转换路径可以归纳为图2所示。在后文的代码，会使用int的大小比较判断状态处于哪个范围，需要留意上面状态的排列顺序。</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/beda8415725d631b0f347860c37b7366.png"></p>    <p>图2 FutureTask状态路径</p>    <p>FutureTask的状态路径，取决于run和cancel的调用顺序，在后文分析时，对号入座这几条路径。</p>    <ol>     <li>NEW -> COMPLETING -> NORMAL 正常的流程</li>     <li>NEW -> COMPLETING -> EXCEPTIONAL 异常的流程</li>     <li>NEW -> CANCELLED 被取消流程</li>     <li>NEW -> INTERRUPTING -> INTERRUPTED 被中断流程</li>    </ol>    <h3><strong>FutureTask的变量</strong></h3>    <ul>     <li>int state</li>     <li>Thread runner</li>     <li>WaitNode waiters</li>     <li>Callable<V> callable</li>     <li>Object outcome</li>    </ul>    <p>state、runner、waiters三个变量没有使用原子类，而是使用Unsafe对象进行原子操作。代码中会见到很多形如compareAndSwap的方法，入门原理可以看我以前写的 认识非阻塞的同步机制CAS 。</p>    <p>callable是要执行的任务，runner是执行任务的线程，outcome是返回的结果（正常结果或Exception结果）</p>    <pre>  <code class="language-java">static final class WaitNode {      volatile Thread thread;      volatile WaitNode next;      WaitNode() { thread = Thread.currentThread(); }  }</code></pre>    <p>waiters的数据结构是WaitNode，保存了Thread和下个WaitNode的引用。waiters保存了等待结果的线程，每次操作只会增减头，所以是一个栈结构，详细见后文对get方法的分析。</p>    <h3><strong>FutureTask的创建</strong></h3>    <pre>  <code class="language-java">public FutureTask(Callable<V> callable) {      if (callable == null)          throw new NullPointerException();      this.callable = callable;      this.state = NEW;       // ensure visibility of callable  }    public FutureTask(Runnable runnable, V result) {      this.callable = Executors.callable(runnable, result);      this.state = NEW;       // ensure visibility of callable  }</code></pre>    <p>FutureTask可以接受Callable或者Runnable，state从NEW开始。如果是Runnable，需要调用Executors.callable转成Callable，返回的结果是预先传入的result。转换过程使用一个实现了Callable的RunnableAdapter包装Runnable和result，代码比较简单。</p>    <pre>  <code class="language-java">static final class RunnableAdapter<T> implements Callable<T> {      final Runnable task;      final T result;      RunnableAdapter(Runnable task, T result) {          this.task = task;          this.result = result;      }      public T call() {          task.run();          return result;      }  }</code></pre>    <p>提交FutureTask到线程池的submit定义在AbstractExecutorService，根据入参的不同，有三个submit方法。下面以提交Callable为例：</p>    <pre>  <code class="language-java">public <T> Future<T> submit(Callable<T> task) {     if (task == null) throw new NullPointerException();     RunnableFuture<T> ftask = newTaskFor(task);     execute(ftask);     return ftask;  }    protected <T> RunnableFuture<T> newTaskFor(Callable<T> callable) {         return new FutureTask<T>(callable);  }</code></pre>    <p>FutureTask在newTaskFor创建，然后调用线程池的execute执行，最后返回Future。获取Future后，就可以调用get获取结果，或者调用cancel取消任务。</p>    <h3><strong>FutureTask的运行</strong></h3>    <p>FutureTask实现了Runnable，在线程池里执行时调用的方法是run。</p>    <pre>  <code class="language-java">public void run() {      //1      if (state != NEW ||          !UNSAFE.compareAndSwapObject(this, runnerOffset,null, Thread.currentThread()))          return;      //2      try {          Callable<V> c = callable;          if (c != null && state == NEW) {              V result;              boolean ran;              try {                  result = c.call();                  ran = true;              } catch (Throwable ex) {                  result = null;                  ran = false;                  setException(ex);              }              if (ran)                  set(result);          }      } finally {         //3          runner = null;          int s = state;          if (s >= INTERRUPTING)              handlePossibleCancellationInterrupt(s);      }  }</code></pre>    <p>标记1处检查FutureTask的状态，如果不是处于NEW，说明状态已经进入四条路径之一，也就没有必要继续了。如果状态是NEW，则将执行任务的线程交给runner。</p>    <p>标记2处开始正式执行任务，调用call方法获取结果，没有异常就算成功，最后执行set方法；出现异常就调用setException方法。</p>    <p>标记3处，无论任务执行是否成功，都需要将runner重新置为空。</p>    <pre>  <code class="language-java">protected void set(V v) {      if (UNSAFE.compareAndSwapInt(this, stateOffset, NEW, COMPLETING)) {          outcome = v;          UNSAFE.putOrderedInt(this, stateOffset, NORMAL); // final state          finishCompletion();      }  }    protected void setException(Throwable t) {      if (UNSAFE.compareAndSwapInt(this, stateOffset, NEW, COMPLETING)) {          outcome = t;          UNSAFE.putOrderedInt(this, stateOffset, EXCEPTIONAL); // final state          finishCompletion();      }  }</code></pre>    <p>任务执行成功与失败，分别对应NEW -> COMPLETING -> NORMAL和NEW -> COMPLETING -> EXCEPTIONAL两条路径。这里先将状态修改为中间状态，再对结果赋值，最后再修改为最终状态。</p>    <pre>  <code class="language-java">private void finishCompletion() {      // assert state > COMPLETING;      for (WaitNode q; (q = waiters) != null;) {          if (UNSAFE.compareAndSwapObject(this, waitersOffset, q, null)) {              for (;;) {                  Thread t = q.thread;                  if (t != null) {                      q.thread = null;                      LockSupport.unpark(t);                  }                  WaitNode next = q.next;                  if (next == null)                      break;                  q.next = null; // unlink to help gc                  q = next;              }              break;          }      }      done();      callable = null;        // to reduce footprint  }</code></pre>    <p>最后调用finishCompletion执行任务完成，唤醒并删除所有在waiters中等待的线程。done方法是空的，供子类实现，最后callable也设置为空。</p>    <p>FutureTask还有个runAndReset，逻辑和run类似，但没有调用set方法来设置结果，执行完成后将任务重新初始化。</p>    <pre>  <code class="language-java">protected boolean runAndReset() {      if (state != NEW ||          !UNSAFE.compareAndSwapObject(this, runnerOffset,                                       null, Thread.currentThread()))          return false;      boolean ran = false;      int s = state;      try {          Callable<V> c = callable;          if (c != null && s == NEW) {              try {                  c.call(); // don't set result                  ran = true;              } catch (Throwable ex) {                  setException(ex);              }          }      } finally {          // runner must be non-null until state is settled to          // prevent concurrent calls to run()          runner = null;          // state must be re-read after nulling runner to prevent          // leaked interrupts          s = state;          if (s >= INTERRUPTING)              handlePossibleCancellationInterrupt(s);      }      return ran && s == NEW;  }</code></pre>    <h3><strong>FutureTask的取消</strong></h3>    <p>对于已经提交执行的任务，可以调用cancel执行取消。</p>    <pre>  <code class="language-java">public boolean cancel(boolean mayInterruptIfRunning) {     //1      if (!(state == NEW &&            UNSAFE.compareAndSwapInt(this, stateOffset, NEW,                mayInterruptIfRunning ? INTERRUPTING : CANCELLED)))          return false;      try {    // in case call to interrupt throws exception         //2          if (mayInterruptIfRunning) {              try {                  Thread t = runner;                  if (t != null)                      t.interrupt();              } finally { // final state                  UNSAFE.putOrderedInt(this, stateOffset, INTERRUPTED);              }          }      } finally {          finishCompletion();      }      return true;  }</code></pre>    <p>标记1处判断任务状态，为NEW才能被取消。如果mayInterruptIfRunning是true，代表任务需要被中断，走NEW -> INTERRUPTING -> INTERRUPTED流程。否则代表任务被取消，走NEW -> CANCELLED流程。</p>    <p>标记2处理任务被中断的情况，这里仅仅是对线程发出中断请求，不确保任务能检测并处理中断，详细原理去看Java的中断机制。</p>    <p>最后调用finishCompletion完成收尾工作。</p>    <pre>  <code class="language-java">public boolean isCancelled() {      return state >= CANCELLED;  }</code></pre>    <p>判断任务是否被取消，具体逻辑是判断state >= CANCELLED，包括了被中断一共两条路径的结果。</p>    <h3><strong>FutureTask获取结果</strong></h3>    <p>调用FutureTask的get方法获取任务的执行结果，可以阻塞直到获取结果，也可以限制范围时间内获取结果，否则抛出TimeoutException。</p>    <pre>  <code class="language-java">public V get() throws InterruptedException, ExecutionException {      int s = state;      if (s <= COMPLETING)          s = awaitDone(false, 0L);      return report(s);  }    public V get(long timeout, TimeUnit unit)      throws InterruptedException, ExecutionException, TimeoutException {      if (unit == null)          throw new NullPointerException();      int s = state;      if (s <= COMPLETING &&          (s = awaitDone(true, unit.toNanos(timeout))) <= COMPLETING)          throw new TimeoutException();      return report(s);  }</code></pre>    <p>get的核心实现调用了awaitDone，入参为是否开启时间限制和最大的等待时间。</p>    <pre>  <code class="language-java">private int awaitDone(boolean timed, long nanos)      throws InterruptedException {      final long deadline = timed ? System.nanoTime() + nanos : 0L;      WaitNode q = null;      boolean queued = false;      for (;;) {          if (Thread.interrupted()) {              removeWaiter(q);              throw new InterruptedException();          }            int s = state;          if (s > COMPLETING) {    //1              if (q != null)                  q.thread = null;              return s;          }          else if (s == COMPLETING) // cannot time out yet    //2              Thread.yield();          else if (q == null)     //3              q = new WaitNode();          else if (!queued)    //4              queued = UNSAFE.compareAndSwapObject(this, waitersOffset,                                                   q.next = waiters, q);          else if (timed) {    //5              nanos = deadline - System.nanoTime();              if (nanos <= 0L) {                  removeWaiter(q);                  return state;              }              LockSupport.parkNanos(this, nanos);          }          else     //6              LockSupport.park(this);      }  }</code></pre>    <p>awaitDone主要逻辑是一个无限循环，首先判断线程是否被中断，是的话移除waiter并抛出中断异常。接下来是一串if-else，一共六种情况。</p>    <ol>     <li>判断任务状态是否已经完成，是就直接返回；</li>     <li>任务状态是COMPLETING，代表在set结果时被阻塞了，这里先让出资源；</li>     <li>如果WaitNode为空，就为当前线程初始化一个WaitNode；</li>     <li>如果当前的WaitNode还没有加入waiters，就加入；</li>     <li>如果是限定时间执行，判断有无超时，超时就将waiter移出，并返回结果，否则阻塞一定时间；</li>     <li>如果没有限定时间，就一直阻塞到下次被唤醒。</li>    </ol>    <p>LockSupport是用来创建锁和其他同步类的基本线程阻塞原语。park和unpark的作用分别是阻塞线程和解除阻塞线程。</p>    <pre>  <code class="language-java">private V report(int s) throws ExecutionException {     Object x = outcome;     if (s == NORMAL)         return (V)x;     if (s >= CANCELLED)         throw new CancellationException();     throw new ExecutionException((Throwable)x);  }</code></pre>    <p>最后get调用report，使用outcome返回结果。</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/d7c532dd4fdbd4277271d131ef69e9a8.png"></p>    <p>图3</p>    <p>看图3，如果多个线程向同一个FutureTask实例get结果，但FutureTask又没有执行完毕，线程将会阻塞并保存在waiters中。待FutureTask获取结果后，唤醒waiters等待的线程，并返回同一个结果。</p>    <h3><strong>总结</strong></h3>    <p style="text-align:center"><img src="https://simg.open-open.com/show/2032bd1bf05efa607d34b723ce0597b4.png"></p>    <p>图4</p>    <p>图4归纳了FutureTask的作用，任务的调用线程Caller和线程池的工作线程通过FutureTask交互。对比线程池的执行原理，FutureTask是比较简单的。</p>    <p> </p>    <p>来自：http://www.jianshu.com/p/f624934b9a23</p>    <p> </p>
分析 Java 线程池 Callable 任务执行原理

相关经验

目录