The hint as to why this works can be found in the first sentence of the description on the page you linked (emphasis mine):
> `std::memory_order` specifies how memory accesses, including regular, non-atomic memory accesses, are to be ordered **around** an atomic operation.
Notice how this talks not about the memory access on the atomic itself, but rather about the memory accesses surrounding it. Concurrent accesses to a single atomic always have strict ordering requirements; otherwise it would be impossible to reason about their behavior in the first place.
In the case of the counter, you get the guarantee that `fetch_add` will behave pretty much as expected: the counter is incremented one step at a time, no values are skipped, and no value is counted twice. You can easily verify this by inspecting the return values of the individual `fetch_add` calls. You get these guarantees always, regardless of the memory ordering.
Things get interesting as soon as you assign meaning to those counter values in the context of the surrounding program logic. For instance, you could use a certain counter value to indicate that a particular piece of data has been made available by an earlier computation step. If that relationship between the counter and the data needs to hold across threads, you need memory orderings: with relaxed ordering, at the point where you observe the counter value you are waiting for, you have no guarantee that the data you are waiting for is ready as well. Even if the producing thread sets the counter only after it has written the data, that ordering of memory operations does not automatically translate across thread boundaries. You need to specify a memory order that orders the write to the data with respect to the change of the counter across threads.
The crucial thing to understand here is that while the operations are guaranteed to happen in a certain order within one thread, that ordering is no longer guaranteed when observing the same data from a different thread.
So the rule of thumb is: If you're only manipulating an atomic in isolation, you don't need any ordering. As soon as that manipulation is interpreted in the context of other unrelated memory accesses (even if those accesses are themselves atomics!) you need to worry about using the correct ordering.
The usual advice applies: unless you have really, really, really good reasons for doing otherwise, just stick with the default `memory_order_seq_cst`. As an application developer you don't want to mess with memory orderings unless you have strong empirical evidence that it is worth the trouble you will undoubtedly run into.