Hey guys,
just wanted to share a survey I did for a software (IPFS) about rolling chunker vs fixed size chunking with different bounds/chunk sizes and compared it to light compression via zstd -1 --long
for different data types:
Sidenote: Buzhash and Rabin are just two different algorithms to do rolling chunking and the default parameters for Buzhash are min: 128 KByte and max: 512 KByte. For rabin this is min: ~85 KByte, avg: 256 KByte and max: 384 KByte.