how to specify consumer group in Kafka Spark Streaming using direct stream

Question

How to specify consumer group id for kafka spark streaming using direct stream API.

HashMap<String, String> kafkaParams = new HashMap<String, String>();
kafkaParams.put("metadata.broker.list", brokers);
kafkaParams.put("auto.offset.reset", "largest");
kafkaParams.put("group.id", "app1");

    JavaPairInputDStream<String, String> messages = KafkaUtils.createDirectStream(
            jssc, 
            String.class, 
            String.class,
            StringDecoder.class, 
            StringDecoder.class, 
            kafkaParams, 
            topicsSet
    );

though i have specified the configuration not sure if missing something. using spark1.3

kafkaParams.put("group.id", "app1");

What do you mean by not sure if missing something? Please ask a specific question. Something like I tried X to achieve Y using library Z but got exception E with stacktrace S is appropriate from StackOverflow. — Debosmit Ray
@DebosmitRay I tried "group.id" using spark kafka direct stream to specify consumer group. Not getting any exception but want to know if this the right way to specify consumer group while using createDirectStream API method. Does it help now??? — Faisal Ahmed Siddiqui

C4stor C4stor · Accepted Answer · 2016-04-10T12:24:40

The direct stream API use the low level Kafka API, and as so doesn't use consumer groups in anyway. If you want to use consumer groups with Spark Streaming, you'll have to use the receiver based API.

Full details are available in the doc !

how to specify consumer group in Kafka Spark Streaming using direct stream

2 Answers