This is with reference to SimpleConsumer Example and High Level Consumer Example.
As per the documentation, it seems to suggest that SimpleConsumers are responsible for managing the offsets themselves and they can choose to read a message multiple times or consume only a subset of the partitions in a topic. All this is possible because they can form their request and specify what offset they want.
Now, if I have two clusters of simple consumers and both use a different zookeeper to store the offsets, then it is very likely that both the clusters will read duplicate messages. Is that understanding correct? To void duplication among them, they have to use a single zookeeper-cluster to store the offsets.
The concept of consumer-group applies only to the High-Level consumer. So if I have two clusters of high-level consumers and both use the same group-ID, then then they will not get any duplicate messages.
Please suggest if the above is not correct.