Intel IA-32 Computer Accessories User Manual


 
General Optimization Guidelines 2
2-67
Assembly/Compiler Coding Rule 32. (H impact, L generality) Minimize the
number of changes to the rounding mode. Do not use changes in the rounding
mode to implement the floor and ceiling functions if this involves a total of
more than two values of the set of rounding, precision and infinity bits.
Precision
If single precision is adequate, use it instead of double precision. This is
true because:
Single precision operations allow the use of longer SIMD vectors,
since more single precision data elements can fit in a register.
If the precision control (PC) field in the x87 FPU control word is set
to “Single Precision,” the floating-point divider can complete a
single-precision computation much faster than either a
double-precision computation or an extended double-precision
computation. If the PC field is set to “Double Precision,” this will
enable those x87 FPU operations on double-precision data to
complete faster than extended double-precision computation. These
characteristics affect computations including floating-point divide
and square root.
positive:
fstp dword ptr[ecx] ; 17-18 result of subtraction
mov ecx,[ecx] ; dword of diff(single precision)
add esp,16
add ecx,7fffffffh ; if diff<0 then decrement integer
sbb eax,0 ; dec eax (subtract CARRY flag)
ret
integer_QnaN_or_zero:
test edx,7fffffffh
jnz arg_is_not_integer_QnaN
add esp,16
ret
Example 2-23 Algorithm to Avoid Changing the Rounding Mode (continued)