As described I am currently setting up a Kafka Connect Sink to sink data from Kafka to Google Cloud Storage.
Everything is going smoothly, however - it is only using the latest available offset. That is, once it begins running, it only sinks the newly produced messages to GCS and not the already existing messages from Kafka. I have tried deleting the kafka connect storage/offset topics, creating a new connector name, etc. However, it always starts at the latest offset.
If there anyway to configure earliest offset for Kafka Connect GCS Sink? I have not seen any configurations to handle this on
https://docs.confluent.io/current/connect/kafka-connect-gcs/configuration_options.html
or
https://docs.confluent.io/current/connect/references/allconfigs.html
I've tried deleting any kafka connect topics/file storage, as well as starting with a new connector name
I am seeing the Kafka Connect sink messages that were produced after the connector started.
I am expecting/need messages to sink from the earliest available offset, ie. start from earliest message if no offset is committed for the connector