AMD 250 Computer Hardware User Manual


 
Chapter 5 Cache and Memory Optimizations 111
Software Optimization Guide for AMD64 Processors
25112 Rev. 3.06 September 2005
iteration. However, unrolling the loop may not be feasible if the original loop stride is very small (for
example, only two bytes).
Prefetch at Least 64 Bytes Away from Surrounding Stores
The prefetch instructions can be affected by false dependencies on stores. If there is a store to an
address that matches a request, that request (the prefetch instruction) may be blocked until the store is
written to the cache. Therefore, code should prefetch data that is located at least 64 bytes away from
any surrounding store’s data address.