
We are considering using Kafka as a critical messaging middleware.
However, the message durability guarantee looks optimistic in Kafka's replication design:

For better performance, each follower sends an acknowledgment after the message is written to memory. So, for each committed message, we guarantee that the message is stored in multiple replicas in memory. However, there is no guarantee that any replica has persisted the committed message to disk.

In the worst case, if the whole cluster goes down at the same time before acknowledged messages are flushed to disk, some data may be lost. Is it possible to avoid this case?


2 Answers


There are several broker configurations that adjust the frequency of log flushes. You can decrease the interval at which the flush scheduler thread checks whether a flush is necessary (log.flush.scheduler.interval.ms), and you can decrease the number of messages needed to trigger a flush (log.flush.interval.messages).
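
For instance, a minimal sketch of those settings in server.properties (the values are illustrative, not recommendations):

    # server.properties -- illustrative values, tune for your workload
    # Force a flush after every 10,000 messages per log.
    log.flush.interval.messages=10000
    # Have the flush scheduler check every second whether a flush is due.
    log.flush.scheduler.interval.ms=1000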

That said, you would rarely need to worry about this case if you are able to replicate across different data centers.


I don't think it is possible to guarantee that an acknowledged message will never be lost. However, we can reduce the probability of loss by taking the measures listed below (a producer-side sketch follows the list):

  1. Increase the replication factor of the topic.

  2. In the producer configuration, set acks=all.

  3. Keep min.insync.replicas high.

For example, with a replication factor of 5, min.insync.replicas=4, and acks=all, a message will not be acknowledged until at least 4 replicas have received it (though not necessarily persisted it to disk!).

The higher these numbers, the less likely it is that your message will be lost.
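
A minimal producer sketch in Java along these lines (the broker address, topic name, and serializers are placeholder assumptions; the topic is assumed to have been created with replication factor 5 and min.insync.replicas=4):

    import java.util.Properties;
    import org.apache.kafka.clients.producer.KafkaProducer;
    import org.apache.kafka.clients.producer.ProducerConfig;
    import org.apache.kafka.clients.producer.ProducerRecord;
    import org.apache.kafka.common.serialization.StringSerializer;

    public class DurableProducer {
        public static void main(String[] args) {
            Properties props = new Properties();
            // Placeholder broker address for this sketch.
            props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
            props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
            props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
            // Do not treat a send as successful until all in-sync replicas have it.
            props.put(ProducerConfig.ACKS_CONFIG, "all");
            // Retry transient failures instead of silently dropping the record.
            props.put(ProducerConfig.RETRIES_CONFIG, Integer.MAX_VALUE);

            try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
                // "events" is a placeholder topic, assumed to have
                // replication factor 5 and min.insync.replicas=4.
                producer.send(new ProducerRecord<>("events", "key", "value"),
                        (metadata, exception) -> {
                            if (exception != null) {
                                // Not enough in-sync replicas acknowledged the write.
                                exception.printStackTrace();
                            }
                        });
            }
        }
    }

With this combination, if fewer than 4 replicas are in sync, the send callback receives an exception (e.g. NotEnoughReplicasException) rather than a false success.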