Kafka data behaviour on partition creation

Question

I'm testing adding Kafka partitions in a running system, but isn't clear to me how Kafka manage the existent data if you add partitions to an existing topic.

For example:

I have a Kafka instance with a topic named test with 1 partition and 1 replica.
The producer group starts to insert into that topic and the consumer group start to consume.
I alter the topic to add another partition.

What happen with the topic data in this case? Is rebalanced between both partitions or only new produced data will use the new partition?

Liju John Liju John · Accepted Answer · 2018-04-12T13:26:07

Adding partitions doesn't change the partitioning of existing data for that matter it is based on philosophy of "append-only".For example if data is partitioned by hash(key) % number_of_partitions then this partitioning will potentially be shuffled by adding partitions , kafka will not attempt to rebalance/redistribute it.

Kafka data behaviour on partition creation

2 Answers