I am trying to understand quantization in tensorflow and I am following this tutorial.
In the tutorial it says that, the quantization equation is:
- r is the real value (usually float32)
- q is its quantized representation as a B-bit integer (uint8, uint32, etc.)
- S (float32) and z (uint) are the factors by which we scale and shift the number line. z is the quantized ‘zero-point’ which will always map back exactly to 0.f.
I am struggling to understand the meaning of the zero point and was hoping somebody could explain it with an example?
