ii. The process of dividing a computation into smaller parts, some or all of which may potentially be executed in parallel, is called decomposition.
iii. Tasks are programmer-defined units of computation into which the main computation is subdivided by means of decomposition.
Preliminaries
Decomposition, Tasks, and Dependency Graphs (1/3)
Dense Matrix-Vector Multiplication
Note that all tasks in Figure 3.1 are independent
and can be performed all together or in any
sequence.
However, in general, some tasks may use data
produced by other tasks and thus may need to
wait for these tasks to finish execution.
An abstraction used to express such dependencies
among tasks and their relative order of execution
is known as a task dependency graph.
$y[i] = \sum_{j=1}^{n} A[i,j] \cdot b[j], \quad 1 \le i \le n$
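The row-wise decomposition described above can be sketched as follows: each task computes one output element y[i] as the dot product of row i with b. This is a minimal illustrative sketch, not the book's code; the function names (`matvec_task`, `matvec`) are ours.

```python
def matvec_task(A_row, b):
    """One independent task: the dot product of a single row of A with b."""
    return sum(a * x for a, x in zip(A_row, b))

def matvec(A, b):
    # The n tasks share no data except read-only access to b, so they
    # could run in any order, or all in parallel, as Figure 3.1 notes.
    return [matvec_task(row, b) for row in A]

A = [[1, 2], [3, 4]]
b = [5, 6]
print(matvec(A, b))  # [17, 39]
```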
Decomposition, Tasks, and Dependency Graphs (2/3)
Database Query Processing
Table 3.1 shows a relational database of vehicles.
Each row of the table is a record that contains data
corresponding to a particular vehicle, such as its ID,
model, year, color, etc. in various fields.
Consider the computations performed in processing
the following query:
MODEL="Civic" AND YEAR="2001" AND
(COLOR="Green" OR COLOR="White")
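One natural decomposition of this query makes each predicate scan an independent leaf task, whose results are then combined according to the dependency graph: the two COLOR filters can run concurrently, their union feeds the AND with the MODEL and YEAR results. The records below are illustrative placeholders, not Table 3.1 itself.

```python
# Toy table standing in for Table 3.1 (IDs and rows are made up).
records = [
    {"ID": 4523, "MODEL": "Civic",   "YEAR": "2001", "COLOR": "Green"},
    {"ID": 3476, "MODEL": "Corolla", "YEAR": "2001", "COLOR": "White"},
    {"ID": 7623, "MODEL": "Civic",   "YEAR": "2001", "COLOR": "White"},
    {"ID": 9834, "MODEL": "Civic",   "YEAR": "1998", "COLOR": "Green"},
]

# Independent leaf tasks: one table scan per predicate.
civic = {r["ID"] for r in records if r["MODEL"] == "Civic"}
y2001 = {r["ID"] for r in records if r["YEAR"] == "2001"}
green = {r["ID"] for r in records if r["COLOR"] == "Green"}
white = {r["ID"] for r in records if r["COLOR"] == "White"}

# Interior tasks: each depends on earlier results (edges of the graph).
green_or_white = green | white
result = civic & y2001 & green_or_white
print(sorted(result))  # [4523, 7623]
```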
Decomposition, Tasks, and Dependency Graphs (3/3)
Database Query Processing
Granularity, Concurrency, and Task-Interaction (1/4)
The number and size of tasks into which a
problem is decomposed determines the
granularity of the decomposition.
Another important factor is the interaction among tasks running on different physical processors: the tasks that a problem is decomposed into often share input, output, or intermediate data.
For example, in the decomposition for matrix-vector multiplication, although all tasks are
independent, they all need access to the entire input vector b. Since originally there is only one copy
of the vector b, tasks may have to send and receive messages for all of them to access the entire
vector in the distributed-memory paradigm.
The pattern of interaction among tasks is captured by what is known as a task-interaction graph. The
nodes in a task-interaction graph represent tasks and the edges connect tasks that interact with each
other.
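A task-interaction graph can be represented as a simple adjacency structure. The sketch below, under our own assumptions rather than the text's figures, builds the graph for the matrix-vector example with a distributed b: since every task needs the pieces of b owned by all the others, every pair of tasks interacts and the graph is complete.

```python
def interaction_graph(n):
    """Undirected interaction graph as an adjacency-set dict.

    Nodes are the n tasks; an edge joins two tasks that share data.
    Here each task needs all of b, so every pair interacts.
    """
    return {i: {j for j in range(n) if j != i} for i in range(n)}

g = interaction_graph(4)
print(g[0])  # {1, 2, 3}
```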
Granularity, Concurrency, and Task-Interaction (4/4)
Sparse matrix-vector multiplication
Processes and Mapping
The tasks into which a problem is decomposed run on physical processors. We use the term process in this chapter to refer to a processing or computing agent that performs tasks. In addition to performing computations, a process may synchronize or communicate with other processes, if needed.
In order to obtain any speedup over a sequential implementation, a parallel program must have several
processes active simultaneously, working on different tasks.
The mechanism by which tasks are assigned to processes for execution is called mapping.
The task-dependency and task-interaction graphs that result from a choice of decomposition play an important role in the selection of a good mapping for a parallel algorithm.
Processes and Mapping (1/2)
Keys to good mapping technique
i. Maximize the use of concurrency by mapping independent tasks onto different processes.
ii. Minimize the total completion time by ensuring that processes are available to execute the tasks on
the critical path as soon as such tasks become executable.
iii. Minimize interaction among processes by mapping tasks with a high degree of mutual interaction
onto the same process.
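Heuristics (i) and (iii) pull in opposite directions, so a mapper must trade them off. The toy sketch below, entirely our own construction, greedily places each task on the process where it saves the most interaction, while penalizing load imbalance; the task set and interaction weights are illustrative assumptions.

```python
def greedy_map(tasks, interacts, n_procs):
    """Greedy mapping sketch.

    interacts[(a, b)] = interaction weight between tasks a and b.
    Returns a dict mapping each task to a process index.
    """
    assignment = {}
    load = [0] * n_procs
    for t in tasks:
        def gain(p):
            # Interaction saved by co-locating t with tasks already on p.
            return sum(w for (a, b), w in interacts.items()
                       if (a == t and assignment.get(b) == p)
                       or (b == t and assignment.get(a) == p))
        # Trade off heuristics (i) and (iii): reward co-location with
        # interacting tasks, penalize piling work onto one process.
        best = max(range(n_procs), key=lambda p: gain(p) - load[p])
        assignment[t] = best
        load[best] += 1
    return assignment

interacts = {(0, 1): 5, (2, 3): 4, (1, 2): 1}
print(greedy_map([0, 1, 2, 3], interacts, 2))  # {0: 0, 1: 0, 2: 1, 3: 1}
```

The heavily interacting pairs (0, 1) and (2, 3) each land on one process, while the lightly interacting pair (1, 2) is split across the two, keeping both processes busy.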
Processes and Mapping (2/2)
Figure 3.7 shows efficient mappings
for the decompositions and the
task-interaction graphs of Figure 3.5
onto four processes.
Note that, in this case, a maximum
of four processes can be employed
usefully, although the total number
of tasks is seven.
This is because the maximum
degree of concurrency is only four.
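The maximum degree of concurrency can be read off a task-dependency graph as the widest "level" of simultaneously ready tasks in a topological layering. The sketch below uses an illustrative seven-task graph (four independent leaves feeding a binary reduction tree), not Figure 3.5 itself; its widest level happens to be four, matching the slide's observation.

```python
from collections import defaultdict

def max_concurrency(deps, n_tasks):
    """deps maps each task to the set of tasks it depends on."""
    indeg = {t: len(deps.get(t, set())) for t in range(n_tasks)}
    children = defaultdict(set)
    for t, parents in deps.items():
        for p in parents:
            children[p].add(t)
    ready = [t for t in range(n_tasks) if indeg[t] == 0]
    widest = 0
    while ready:
        widest = max(widest, len(ready))  # width of this level
        nxt = []
        for t in ready:
            for c in children[t]:
                indeg[c] -= 1
                if indeg[c] == 0:
                    nxt.append(c)
        ready = nxt
    return widest

# Tasks 0-3 are independent; 4 and 5 combine pairs; 6 combines 4 and 5.
deps = {4: {0, 1}, 5: {2, 3}, 6: {4, 5}}
print(max_concurrency(deps, 7))  # 4
```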
Processes vs Processors
In the context of parallel algorithm design, processes are logical computing agents that perform tasks.
Processors are the hardware units that physically perform computations.
In most cases, when we refer to processes in the context of a parallel algorithm, there is a one-to-one
correspondence between processes and processors and it is appropriate to assume that there are as
many processes as the number of physical CPUs on the parallel computer.
However, sometimes a higher level of abstraction may be required to express a parallel algorithm,
especially if it is a complex algorithm with multiple stages or with different forms of parallelism.
Decomposition Techniques
Introduction
As mentioned earlier, one of the fundamental steps in solving a problem in parallel is to split the computations to be performed into a set of tasks for concurrent execution, whose relative order is defined by the task-dependency graph.