In a typical computer, data, which is stored on disk, flows into memory, where it is held until required by the processors. Data then flows from memory into the processors, which perform multiplications and additions.

The following parameters are used to characterize the computer:

Configuration D = Disk transfer rate
(Megabytes per second).
D is the flow rate from disk to memory. If several disks are performing transfers in parallel (file striping), D is the aggregate transfer rate of all disks operating together when a single file record is transferred.

M = Memory size
(64-bit words).
M is the free memory available for buffer space in 64-bit words. M does not include memory required for the operating system, program instructions, or other program data.

C = Processor speed
(Millions of floating point operations per second, Mflops).
When several processors are operated in parallel, C is the aggregate sum of all processors operating together.

R = Reuse
(operations per word).
R is the number of times, on the average, the processors use each memory word for a multiply or add operation. R is determined by the algorithm.

In a properly balanced system, the data flowrate from the disks, times the reuse from memory, equals the computational rate as shown by the following:

DR = 8C

If DR < 8C, the process is I/O bound.

If DR > 8C, the process is CPU bound.

The reuse, R, is determined by the algorithm being used. For factoring PROFILE matrices, R is 2B, where B is the half bandwidth of the matrix. The factor 2 accounts for one multiplication and one addition per term.

During vector solution, R is 2(NUMRHS), where NUMRHS is the number of right-hand side vectors. For most machines, the ratio 4C/D exceeds the number of right-hand side vectors and the solution is I/O bound. The solution subroutines in FMS are designed to process multiple right-hand side vectors simultaneously to minimize I/O time. If the number of right-hand side vectors is small, you may also store them in memory to eliminate I/O.

The reuse R is different for matrices stored in PROFILE format and BLOCK format.

The most cost effective method of increasing performance is to increase the disk transfer rate, D. The FMS File Striping System was designed to do that task.

These equations can be used to estimate machine configurations. Any specific machine will be used for a variety of applications and problem sizes. These equations should be considered only as a guideline to prevent configurations which are extremely I/O bound or CPU bound.