I am just trying to understand the mechanism of distributed system and their data storage mechanism. Most of the distributed system has "replicated data within the cluster to prevent data loss".
But what I am trying to understand is, suppose if I have a cluster of 3 systems with a memory of 1TB each, and the master node is replicating the data in other 2 nodes. And suppose if the master node utilizes 500GB of memory, then the other 2 nodes should utilize the same amount of memory for replicating data.
And if this holds true, then in that case, how should I increase the memory capacity of my cluster. Because in this case, it will hold the same amount of data (i.e. 1TB maximum) even on a cluster which has memory capacity way more than the data it is holding.