TY - GEN
T1 - On the Randomness of Compressed Data
AU - Klein, Shmuel T.
AU - Shapira, Dana
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/5/10
Y1 - 2019/5/10
N2 - It seems reasonable to expect from a good compression method that its output should not be further compressible, because it should behave essentially like random data. We investigate this premise for a variety of known compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method. Arithmetic coding seems to produce perfectly random output, whereas that of Huffman or Ziv-Lempel coding still contains many dependencies. In particular, the output of Huffman coding has already been proven to be random under certain conditions, and we show here that arithmetic coding may produce an output that is identical to that of Huffman.
AB - It seems reasonable to expect from a good compression method that its output should not be further compressible, because it should behave essentially like random data. We investigate this premise for a variety of known compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method. Arithmetic coding seems to produce perfectly random output, whereas that of Huffman or Ziv-Lempel coding still contains many dependencies. In particular, the output of Huffman coding has already been proven to be random under certain conditions, and we show here that arithmetic coding may produce an output that is identical to that of Huffman.
KW - Kullback Leibler
KW - Lossless Compression
KW - Randomness
UR - http://www.scopus.com/inward/record.url?scp=85066321472&partnerID=8YFLogxK
U2 - 10.1109/DCC.2019.00093
DO - 10.1109/DCC.2019.00093
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85066321472
T3 - Data Compression Conference Proceedings
SP - 581
BT - Proceedings - DCC 2019
A2 - Storer, James A.
A2 - Serra-Sagrista, Joan
A2 - Bilgin, Ali
A2 - Marcellin, Michael W.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 Data Compression Conference, DCC 2019
Y2 - 26 March 2019 through 29 March 2019
ER -