MongoDB Aggregation - Does $unwind order documents the same way as the nested array order

Question

I am wandering whether using $unwind operator in aggregation pipeline for document with nested array will return the deconstructed documents in the same order as the order of the items in the array. Example: Suppose I have the following documents

{ "_id" : 1, "item" : "foo", values: [ "foo", "foo2", "foo3"] }
{ "_id" : 2, "item" : "bar", values: [ "bar", "bar2", "bar3"] }
{ "_id" : 3, "item" : "baz", values: [ "baz", "baz2", "baz3"] }

I would like to use paging for all values in all documents in my application code. So, my idea is to use mongo aggregation framework to:

sort the documents by _id
use $unwind on values attribute to deconstruct the documents
use $skip and $limit to simulate paging

So the question using the example described above is:

Is it guaranteed that the following aggregation pipeline:

[
    {$sort: {"_id": 1}},
    {$unwind: "$values"}
]

will always result to the following documents with exactly the same order?:

{ "_id" : 1, "item" : "foo", values: "foo" }
{ "_id" : 1, "item" : "foo", values: "foo2" }
{ "_id" : 1, "item" : "foo", values: "foo3" }
{ "_id" : 2, "item" : "bar", values: "bar" }
{ "_id" : 2, "item" : "bar", values: "bar2" }
{ "_id" : 2, "item" : "bar", values: "bar3" }
{ "_id" : 3, "item" : "baz", values: "baz" }
{ "_id" : 3, "item" : "baz", values: "baz2" }
{ "_id" : 3, "item" : "baz", values: "baz3" }

I can't rely on this online example. In the real life I will have a lot more documents and Items in the array (maybe millions). I am looking for official answer. Just like the order of the documents is not guaranteed with simple find({}), but testing it with a small set of documents the returned order is the same — Aleydin Karaimin
This is not explicitly documented, if you are paying for Atlas or MongoDB Enterprise I suggest you go through the official support channels. — D. SM
MongoDB source code calls libunwind function which iterates frames in the chain. Since such functions read binary files on disk, it's not possible alterate order, skip, swap frames... — Valijon
@Valijon. Thank you for this answer. I am not aware how wiredTiger engine actually stores documents on disk. But it looks logically since the the array order is guaranteed, the order of the destructed documents to be always the same. Even, the examples from the official documentation are ordered :), but is not mentioned if we can rely on that order. — Aleydin Karaimin

GitGitBoom GitGitBoom · Accepted Answer · 2020-08-05T16:57:09

In the case that you do run into issues with order. You could use includeArrayIndex to guarantee order.

[
 {$unwind: {
   path: 'values',
   includeArrayIndex: 'arrayIndex'
 }},
 {$sort: {
   _id: 1,
   arrayIndex: 1
 }},
 { $project: {
    index: 0
 }}
]

MongoDB Aggregation - Does $unwind order documents the same way as the nested array order

3 Answers