I am looking at the kafka broker reported various latency metrics to include them on grafana dashboard but i have difficulty understanding the reported metrics. i have exported the metrics to prometheus through JMX exporter. for e.g let's take the Produce Request Total time metric. (kafka.network:type=RequestMetrics,name=TotalTimeMs,request=Produce)
- when i query prometheus with
"kafka_network_requestmetrics_totaltimems_count{request="Produce"}"
, i get some large number. e.g. 56459366. what is big number mean ?
- when i query prometheus with
"kafka_network_requestmetrics_totaltimems{request="Produce"}"
, i get 6 rows. e.g. following
kafka_network_requestmetrics_totaltimems{instance="10.130.12.24:8020",job="kubernetes-pods",pod="kafka-0",quantile="0.50",request="Produce"} 2
kafka_network_requestmetrics_totaltimems{instance="10.130.12.24:8020",job="kubernetes-pods",pod="kafka-0",quantile="0.75",request="Produce"} 2
kafka_network_requestmetrics_totaltimems{instance="10.130.12.24:8020",job="kubernetes-pods",pod="kafka-0",quantile="0.95",request="Produce"} 3
kafka_network_requestmetrics_totaltimems{instance="10.130.12.24:8020",job="kubernetes-pods",pod="kafka-0",quantile="0.98",request="Produce"} 12.42
kafka_network_requestmetrics_totaltimems{instance="10.130.12.24:8020",job="kubernetes-pods",pod="kafka-0",quantile="0.99",request="Produce"} 21
kafka_network_requestmetrics_totaltimems{instance="10.130.12.24:8020",job="kubernetes-pods",pod="kafka-0",quantile="0.999",request="Produce"} 54
what these various quantile metric mean and how i can calculate the avg values out of these ?
- how frequently these metrics are updated by the broker ?