Java Concurrency 3 - The Fork/Join Framework

date

Feb 26, 2022

slug

the-fork-join-framework

status

Published

Working with RecursiveTask

To submit tasks to this pool, you have to create a subclass of RecursiveTask<R>, where R is the type of the result produced by the parallelized task (and each of its subtasks) or of RecursiveAction if the task returns no result (it could be updating other nonlocal structures, though).

To define RecursiveTasks you need only implement its single abstract method, compute:

protected abstract R compute();

Implementation of this method often resembles the following pseudocode:

if (task is small enough or no longer divisible) {
	compute task sequentially
} else {
	split task in two subtasks
	call this method recursively possibly further splitting each subtask
	wait for the completion of all subtasks
	combine the results of each subtask
}

Best Practices To Use It Effectively

Invoking the join method on a task blocks the caller until the result produced by that task is ready. For this reason, it’s necessary to call it after the computation of both subtasks has been started.

The invoke method of a ForkJoinPool shouldn’t be used from within a RecursiveTask.

Calling the fork method on a subtask is the way to schedule it on the ForkJoinPool. Doing this allows you to reuse the same thread for one of the two subtasks and avoid the overhead caused by the unnecessary allocation of a further task on the pool.

Debugging a parallel computation using the fork/join framework can be tricky. In particular, it’s ordinarily quite common to browse a stack trace in your favorite IDE to discover the cause of a problem, but this can’t work with a fork/join computation because the call to compute occurs in a different thread than the conceptual caller, which is the code that called fork.

You should consider other things when comparing the performance of the sequential and parallel versions of the same algorithm. Like any other Java code, the fork/join framework needs to be “warmed up,” or executed, a few times before being optimized by the JIT compiler.

Work Stealing

The tasks are more or less evenly divided on all the threads in the ForkJoinPool. Each of these threads holds a doubly linked queue of the tasks assigned to it, and as soon as it completes a task it pulls another one from the head of the queue and starts executing it. One thread might complete all the tasks assigned to it much faster than the others, which means its queue will become empty while the other threads are still pretty busy. In this case, instead of becoming idle, the thread randomly chooses a queue of a different thread and “steals” a task, taking it from the tail of the queue. This process continues until all the tasks are executed, and then all the queues become empty. That’s why having many smaller tasks, instead of only a few bigger ones, can help in better balancing the workload among the worker threads.