2 votes

I am using storm-kafka-0.9.3 to read data from Kafka and process it in Storm. Below is the Kafka spout I am using. The problem is that when I kill the Storm cluster and restart it, it does not read the old data that was sent while it was down; it starts reading from the latest offset.

BrokerHosts hosts = new ZkHosts(Constants.ZOOKEEPER_HOST);

SpoutConfig spoutConfig = new SpoutConfig(hosts, CommonConstants.KAFKA_TRANSACTION_TOPIC_NAME,
        "/" + CommonConstants.KAFKA_TRANSACTION_TOPIC_NAME, UUID.randomUUID().toString());
spoutConfig.scheme = new SchemeAsMultiScheme(new StringScheme());
// Should never be set to true
spoutConfig.forceFromStart = false;
spoutConfig.startOffsetTime = -2; // -2 = earliest offset (kafka.api.OffsetRequest.EarliestTime())

KafkaSpout kafkaSpout = new KafkaSpout(spoutConfig);
return kafkaSpout;
Can you please try commenting out the spoutConfig.forceFromStart=false; line, or setting spoutConfig.forceFromStart=true? – user2720864

Tried that, but same issue. Assume I have 100 messages in Kafka and Storm is processing them. Now assume that after the 100th message Storm goes down and my HTTP endpoint pushes 300 more messages into Kafka. Since Storm processed only 100 messages, I expect that when Storm wakes up it should start processing from message 101, where it left off. – user1249655

So what exactly is happening? In your post you mentioned it starts reading from the latest offset... isn't that what you are looking for? – user2720864

Basically, when Storm comes back it starts reading from message 401 instead of 101. – user1249655

3 Answers

2 votes

Thanks all. Since I was running the topology in local mode, Storm did not store the offset in ZooKeeper; when I ran the topology in production mode it got resolved.
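For context, a minimal sketch of that difference, with a hypothetical buildKafkaSpout() standing in for the spout construction shown in the question: in local mode Storm runs against an in-process ZooKeeper that disappears with the JVM, so the offsets the spout commits are lost on restart, whereas submitting to a real cluster commits them to the external ZooKeeper ensemble.

import backtype.storm.Config;
import backtype.storm.LocalCluster;
import backtype.storm.StormSubmitter;
import backtype.storm.generated.StormTopology;
import backtype.storm.topology.TopologyBuilder;
import storm.kafka.KafkaSpout;

public class TopologyRunner {

    // Hypothetical stand-in for the SpoutConfig/KafkaSpout construction in the question.
    static KafkaSpout buildKafkaSpout() {
        throw new UnsupportedOperationException("wire in the SpoutConfig from the question");
    }

    public static void main(String[] args) throws Exception {
        TopologyBuilder builder = new TopologyBuilder();
        builder.setSpout("kafka-spout", buildKafkaSpout());
        StormTopology topology = builder.createTopology();
        Config conf = new Config();

        if (args.length > 0 && "local".equals(args[0])) {
            // Local mode: runs against an in-process ZooKeeper, so the offsets
            // the spout commits vanish when the JVM exits.
            new LocalCluster().submitTopology("kafka-topology", conf, topology);
        } else {
            // Cluster ("prod") mode: offsets are committed to the external
            // ZooKeeper ensemble and survive topology restarts.
            StormSubmitter.submitTopology("kafka-topology", conf, topology);
        }
    }
}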

Sougata

1 vote

I believe this happens because while the topology is running it keeps all its state information in ZooKeeper under the path SpoutConfig.zkRoot + "/" + SpoutConfig.id, so that in case of failure it can resume from the last offset written to ZooKeeper.

From the docs:

Important: When re-deploying a topology make sure that the settings for SpoutConfig.zkRoot and SpoutConfig.id were not modified, otherwise the spout will not be able to read its previous consumer state information (i.e. the offsets) from ZooKeeper -- which may lead to unexpected behavior and/or to data loss, depending on your use case.

In your case, since SpoutConfig.id is a random value (UUID.randomUUID().toString()), the spout is not able to retrieve the last committed offset.

Also from the same page:

when a topology has run once the setting KafkaConfig.startOffsetTime will not have an effect for subsequent runs of the topology because now the topology will rely on the consumer state information (offsets) in ZooKeeper to determine from where it should begin (more precisely: resume) reading. If you want to force the spout to ignore any consumer state information stored in ZooKeeper, then you should set the parameter KafkaConfig.ignoreZkOffsets to true. If true, the spout will always begin reading from the offset defined by KafkaConfig.startOffsetTime as described above
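For completeness, forcing a re-read would look something like the lines below. Note that in storm-kafka 0.9.3 (the version in the question) the flag is still called forceFromStart; ignoreZkOffsets is the name it was given in later releases:

// Only if you deliberately want to ignore the offsets stored in ZooKeeper:
spoutConfig.ignoreZkOffsets = true; // on storm-kafka 0.9.3: spoutConfig.forceFromStart = true
spoutConfig.startOffsetTime = kafka.api.OffsetRequest.EarliestTime(); // equivalent to -2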

You could try using a static id to see whether the spout is then able to retrieve its previous offsets, as in the sketch below.
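A sketch of the spout from the question with a stable id; the string "transaction-spout" is just illustrative, any value works as long as it stays the same across redeploys:

BrokerHosts hosts = new ZkHosts(Constants.ZOOKEEPER_HOST);
// Fixed id: on redeploy the spout finds its committed offsets under
// zkRoot + "/" + id in ZooKeeper and resumes from there.
SpoutConfig spoutConfig = new SpoutConfig(hosts, CommonConstants.KAFKA_TRANSACTION_TOPIC_NAME,
        "/" + CommonConstants.KAFKA_TRANSACTION_TOPIC_NAME,
        "transaction-spout");
spoutConfig.scheme = new SchemeAsMultiScheme(new StringScheme());
KafkaSpout kafkaSpout = new KafkaSpout(spoutConfig);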

0 votes

You need to set spoutConfig.zkServers and spoutConfig.zkPort; if they are left unset, the spout stores its offsets in the ZooKeeper instance Storm itself runs on, which in local mode is in-process and discarded on shutdown:

BrokerHosts hosts = new ZkHosts(Constants.ZOOKEEPER_HOST);
SpoutConfig spoutConfig = new SpoutConfig(hosts, CommonConstants.KAFKA_TRANSACTION_TOPIC_NAME,
        "/" + CommonConstants.KAFKA_TRANSACTION_TOPIC_NAME, "test");

// Tell the spout explicitly where to store its consumer offsets.
spoutConfig.zkPort = Constants.ZOOKEEPER_PORT;
spoutConfig.zkServers = Constants.ZOOKEEPER_SERVERS;

spoutConfig.scheme = new SchemeAsMultiScheme(new StringScheme());

KafkaSpout kafkaSpout = new KafkaSpout(spoutConfig);
return kafkaSpout;
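Note that in storm-kafka's SpoutConfig, zkServers is a List&lt;String&gt; and zkPort is an Integer, so Constants.ZOOKEEPER_SERVERS and Constants.ZOOKEEPER_PORT need to match those types.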