In your group you are going to do the following:

1. Develop a pseudocode solution

First, come up with an algorithm for parallelizing finding the max value in an array of size N using M threads. Assume that you are starting with an already initialized array of N values, and you can start out assuming that M evenly divides N (the case when it doesn’t is an extension to try after you get this case working).

Some questions to think about as you determine how to parallelize computing max:

  1. What part of the cumulative operation does each thread do?

  2. How does a thread know which work it will do?

  3. Do threads need to coordinate/synchronize their actions in some way? If so, when and how and how frequently?

  4. What are the limits on concurrency?

  5. What global state will you need? What local state?

2. Implement your algorithm

Try implementing your algorithm in pthreads. You can do this on paper (and in fact this might be the preferred way at least as a first step), or you can try implementing and testing it using some starting point code:

$ cd ~/cs31/
$ mkdir -p inclass/w13
$ cd inclass
$ cp -r ~richardw/public/cs31/inclass/w13/ ./
$ cd w13/max
$ ls
Makefile max.c

# To run:  ./max N M
# example: ./max 10000 16
  1. There is already code in this file to get the command line arguments and to initialize an array of N values to random long values.

  2. You should add all the pthread code to implement your parallel max algorithm.

  3. Use the hello_i.c example from last week to help you with pthread functions and syntax. (you can also use ./hello_i as an example to see thread execution info from top. Run ./hello_i 16 in one terminal and top -H in another to see.

  4. Look at the man pages for pthread functions:

    $ man pthread_create
    $ man pthread_join
    $ man pthread_mutex_lock
  5. Look at the weekly lab page for other pthread examples.

3. Extensions

If you get your solution to this working and debugged, think about a slightly different version of this problem that does two things:

  1. Compute max in parallel when the number of threads, M, does not evenly divide the number of elements in the array, N.

  2. Compute the number of times the max value occurs in the array (also in parallel).

Your solutions should only spawn off worker threads one time and only join them when all parallel computation from both parts is done.