CSC373/406: 5 [5/13] |
Figure 5.2: Performance of vector sum functions.
Figure 5.3: Vector abstract data type.
Figure 5.8: Comparative performance of lower-case conversion routines.
Figure 5.11: Block diagram of a modern processor.
Figure 5.13: Operations for first iteration of inner loop of combine4 for integer multiplication.
Figure 5.14: Operations for first iteration of inner loop of combine4 for integer addition.
Figure 5.15: Scheduling of operations for integer multiplication with unlimited number of execution units.
Figure 5.16: Scheduling of operations for integer addition with unbounded resource constraints.
Figure 5.17: Scheduling of operations for integer multiplication with actual resource constraints.
Figure 5.18: Scheduling of operations for integer addition with actual resource constraints.
Figure 5.20: Operations for first iteration of inner loop of three-way unrolled integer addition.
Figure 5.21: Scheduling of operations for three-way unrolled integer sum with bounded resource constraints.
Figure 5.25: Operations for first iteration of inner loop of two-way unrolled, two-way parallel integer multiplication.
Figure 5.26: Scheduling of operations for two-way unrolled, two-way parallel integer multiplication with unlimited resources.
Figure 5.31: Scheduling of operations for list length function.
Figure 5.33: Code to write and read memory locations, along with illustrative executions.
Figure 5.34: Detail of load and store units.
Figure 5.35: Timing of write_read for example A.
Figure 5.36: Timing of write_read for example B.
Figure 5.37: Profile results for different versions of word frequency counting program.