I am trying to implement an LSB embedding steganalysis algorithm.
The database of images consists of 24-bit bmp color images (a few thousand). Almost all steganalysis of LSB embedding steganography research work focuses on grayscale images. I want to try to use inter-color correlations within the image for hidden message presence detection. Not just as three time grayscale.
I searched works on this topic and found a very simple algorithm (I can't provide the link). It uses data compression for detecting hidden messages.
In short:
It states that the more information is hidden in the file, the greater the size of its archive (for data compression methods such as rar, nanzip, png etc), because data compression algorithms use inter-color correlations.
There was no proof link for the statements. I don't know how data compression algorithms work, intuitively I agree with those statements but I want to know for sure is it true or not, for example, for gzip or zip algorithms.