Intel IA-32 Computer Accessories User Manual


 
xx
Figure 6-2 Memory Access Latency and Execution Without Prefetch ..............6-23
Figure 6-3 Memory Access Latency and Execution With Prefetch ...................6-23
Figure 6-4 Prefetch and Loop Unrolling ............................................................6-29
Figure 6-5 Memory Access Latency and Execution With Prefetch ...................6-31
Figure 6-6 Cache Blocking – Temporally Adjacent and Non-adjacent
Passes.............................................................................................6-35
Figure 6-7 Examples of Prefetch and Strip-mining for Temporally
Adjacent and Non-Adjacent Passes Loops .....................................6-36
Figure 6-8 Single-Pass Vs. Multi-Pass 3D Geometry Engines .........................6-42
Figure 7-1 Amdahl’s Law and MP Speed-up ......................................................7-3
Figure 7-2 Single-threaded Execution of Producer-consumer
Threading Model................................................................................7-9
Figure 7-3 Execution of Producer-consumer Threading Model on
a Multi-core Processor.....................................................................7-10
Figure 7-4 Interlaced Variation of the Producer Consumer Model....................7-12
Figure 7-5 Batched Approach of Producer Consumer Model ...........................7-40
Figure 9-1 Performance History and State Transitions .......................................9-3
Figure 9-2 Active Time Versus Halted Time of a Processor ...............................9-4
Figure 9-3 Application of C-states to Idle Time...................................................9-6
Figure 9-4 Profiles of Coarse Task Scheduling and Power Consumption.........9-12
Figure 9-5 Thread Migration in a Multi-Core Processor....................................9-17
Figure 9-6 Progression to Deeper Sleep ..........................................................9-18
Figure A-1 Sampling Analysis of Hotspots by Location.....................................A-10
Figure A-2 Intel Thread Checker Can Locate Data Race Conditions................A-18
Figure A-3 Intel Thread Profiler Can Show Critical Paths of Threaded
Execution Timelines.........................................................................A-20
Figure B-1 Relationships Between the Cache Hierarchy, IOQ, BSQ
and Front Side Bus ..........................................................................B-10
Figure D-1 Stack Frames Based on Alignment Type.......................................... D-3
Figure E-1 Pentium II, Pentium III and Pentium 4 Processors Memory
Pipeline Sketch..................................................................................E-4
Figure E-2 Execution Pipeline, No Preloading or Prefetch..................................E-6
Figure E-3 Compute Bound Execution Pipeline ..................................................E-7
Figure E-4 Another Compute Bound Execution Pipeline.....................................E-8
Figure E-5 Memory Throughput Bound Pipeline...............................................E-10
Figure E-6 Accesses per Iteration, Example 1 ..................................................E-12
Figure E-7 Accesses per Iteration, Example 2 ..................................................E-13