1
votes

Problem

We'd like to consume from an Event Hubs topic in Azure using a Dataflow pipeline in Google Cloud.

Question

Is it known whether KafkaIO can consume from Event Hubs directly in an Apache Beam/Google Cloud Dataflow job? (see this post)

Alternative approaches for getting the Event Hubs data into Pub/Sub are also appreciated (e.g. publishing from Azure Stream Analytics to Pub/Sub).

Thank you!


2 Answers

3
votes

Azure Event Hubs supports the Apache Kafka protocol 1.0 and later, so you should be able to consume events from Event Hubs using KafkaIO (which supports Kafka versions 0.10.1 and newer). Google Cloud has a guide for processing messages from Kafka in Dataflow here.
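To make that concrete, here is a minimal sketch of the Kafka consumer settings that Event Hubs' Kafka endpoint generally expects: TLS on port 9093, SASL PLAIN, and the literal username `$ConnectionString` with the namespace connection string as the password. The helper name `event_hubs_kafka_config`, its parameters, and the default group id are my own placeholders, not part of any library, and I haven't run this against a live namespace.

```python
def event_hubs_kafka_config(namespace: str, connection_string: str,
                            group_id: str = "beam-reader") -> dict:
    """Build Kafka consumer properties for an Event Hubs namespace.

    `namespace` is the Event Hubs namespace name (hypothetical input),
    `connection_string` is the namespace-level connection string, and
    `group_id` is an arbitrary consumer group name chosen for illustration.
    """
    # Event Hubs authenticates Kafka clients via SASL PLAIN, using the
    # literal username "$ConnectionString" and the connection string
    # itself as the password.
    jaas = (
        "org.apache.kafka.common.security.plain.PlainLoginModule required "
        'username="$ConnectionString" '
        f'password="{connection_string}";'
    )
    return {
        # The Kafka endpoint of a namespace listens on port 9093.
        "bootstrap.servers": f"{namespace}.servicebus.windows.net:9093",
        "security.protocol": "SASL_SSL",
        "sasl.mechanism": "PLAIN",
        "sasl.jaas.config": jaas,
        "group.id": group_id,
    }
```

The resulting dictionary should be usable as the `consumer_config` argument of `ReadFromKafka` in the Beam Python SDK, or as the map passed to `KafkaIO.read().withConsumerConfigUpdates(...)` in Java, with the event hub name as the topic. In practice you would load the connection string from a secret store rather than hard-coding it.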

1
votes

Azure Event Hubs is compatible with the Kafka protocol at the wire level, so any Kafka client or pipeline should be able to consume events from an Event Hubs topic.