LZ4 match search algorithm (fast scan)

Question

I implemented a LZ77/LZ4 (no entropy encoding) based compression algorithm based on hash chains of infinite depth. It works well and its speed is acceptable, but its compression ratio is close to LZ4. Reading documentation and browsing source code from LZ4 project I understand that it uses a hash chain of depth1, but if I fix my implementation's depthto 1, LZ4 outperforms it.

I can't understand how LZ4 match search algorithm (fast scan) works. Can someone explain it?

Thanks.

qinxian qinxian · Accepted Answer · 2017-08-21T16:00:34

The scan process employs hash searching. like the following:

Old Bytes--------------anchor------------New bytes----current
h=hash[int4]
reference=hash.get(h)
hash.put(h,current) for later matching
int(reference)==int(current)? handle matches: retry
handle matches

The seachMatchNb variable is a skip way, for fast matching but maybe loss minute matches or not.

The hash table is a JIT style, which only saves offsets. readIntEquals function does key matching.

Just ignore it in study mode.

LZ4 match search algorithm (fast scan)

1 Answers