TY - GEN
T1 - Fibonacci based compressed suffix array
AU - Benza, Ekaterina
AU - Klein, Shmuel T.
AU - Shapira, Dana
N1 - Publisher Copyright:
© Proceedings of the Prague Stringology Conference, PSC 2018. All rights reserved.
PY - 2018/7/19
Y1 - 2018/7/19
N2 - We suggest the usage of Fibonacci Codes instead of Elias' C γ code. The implementation requires 1.44 n H k+n+o(n) bits of space, while retaining the searching functionalities. We used a less common variant of the Fibonacci code which was found to be often preferable for the encoding. This variant is constructed from the traditional Fibonacci code by omitting the rightmost 1-bit of every codeword and dropping those codewords that start with 0. As a result, every codeword now starts and ends with a 1-bit, so codeword boundaries may still be detected by the occurrence of the string 11. In order to obtain Φ[i], i mod b codewords need to be decoded. The traditional approach is to decode each codeword and add the decoded values. One of the advantages of using a Fibonacci based representation of the integers is that it is possible to perform this addition directly on the compressed form, without individually decoding each summand.
AB - We suggest the usage of Fibonacci Codes instead of Elias' C γ code. The implementation requires 1.44 n H k+n+o(n) bits of space, while retaining the searching functionalities. We used a less common variant of the Fibonacci code which was found to be often preferable for the encoding. This variant is constructed from the traditional Fibonacci code by omitting the rightmost 1-bit of every codeword and dropping those codewords that start with 0. As a result, every codeword now starts and ends with a 1-bit, so codeword boundaries may still be detected by the occurrence of the string 11. In order to obtain Φ[i], i mod b codewords need to be decoded. The traditional approach is to decode each codeword and add the decoded values. One of the advantages of using a Fibonacci based representation of the integers is that it is possible to perform this addition directly on the compressed form, without individually decoding each summand.
KW - Theoretical bounds
KW - Compressed suffix array
UR - http://www.scopus.com/inward/record.url?scp=85086075898&partnerID=8YFLogxK
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85086075898
T3 - Proceedings of the Prague Stringology Conference, PSC 2018
SP - 3
EP - 11
BT - Proceedings of the Prague Stringology Conference, PSC 2018
A2 - Holub, Jan
A2 - Zdarek, Jan
T2 - 22nd Prague Stringology Conference, PSC 2018
Y2 - 27 August 2018 through 28 August 2018
ER -