In Kafka cluster with a defined quotas, in order to control the broker resources that are used by clients a broker computes the amount of delay needed to bring the violating client under its quota and returns a response with the delay immediately Source from Kafka apache documentation.
In this topic, I would know what does delay refered to (with concret example please)?
In my knowleage, when we define for example the producer_byte_rate=1024
, the broker monitors the producer to not override the 1024o/sec, if the producer overrides the 1024o/sec the broker gonna bring down the client under its quota by throttling the pipe reserved for that client.