Streaming from particular partition within a topic (Kafka Streams)

Question

As far as I understand after reading Kafka Streams documentation, it's not possible to use it for streaming data from only one partition from given topic, one always have to read it whole.

Is that correct?

If so, are there any plans to provide such an option to the API in the future?

The question is not so clear to me. The source of your streaming application can be a topic with only one partition. But it's possible I haven't understood the question ... can you elaborate please ? — ppatierno
I will give an example. Lets assume that I have topic "A" with 10 partitions and I want to stream data from this topic but only from partition 4 without gathering data from other paritions. — Purple
Then you need to copy only the data in partition 4 into another topic with only 1 partition and use that as input to Streams. — Hans Jespersen

ppatierno ppatierno · Accepted Answer · 2017-06-20T15:59:04

No you can't do that because the internal consumer subscribes to the topic joining a consumer group which is specified through the application-id so the partitions are assigned automatically. Btw why do you want do that ? Without re-balancing you lose the scalability feature provided by Kafka Stream because just adding/removing instances of your streaming application you can scale the entire process, thanks to the re-balancing on partitions.

Streaming from particular partition within a topic (Kafka Streams)

4 Answers