I am experimenting with non-temporal instructions, and am already familiar with how fences with ordinary load/stores operate.
Intel defines an intrinsic, _mm_sfence, in relation with non-temporal operations, which the manual defines as:
Guarantees that every preceding store is globally visible before any subsequent store.
I have some questions about this operation.
- Is this just inserting a SFENCE instruction? If not, what does this translate to?
- If this is not just a SFENCE, does a SFENCE instruction also carry the ordering guarantee for non-temporal stores?