Fibonacci based compressed suffix array

Ekaterina Benza, Shmuel T. Klein, Dana Shapira

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

We suggest the usage of Fibonacci Codes instead of Elias' C γ code. The implementation requires 1.44 n H k+n+o(n) bits of space, while retaining the searching functionalities. We used a less common variant of the Fibonacci code which was found to be often preferable for the encoding. This variant is constructed from the traditional Fibonacci code by omitting the rightmost 1-bit of every codeword and dropping those codewords that start with 0. As a result, every codeword now starts and ends with a 1-bit, so codeword boundaries may still be detected by the occurrence of the string 11. In order to obtain Φ[i], i mod b codewords need to be decoded. The traditional approach is to decode each codeword and add the decoded values. One of the advantages of using a Fibonacci based representation of the integers is that it is possible to perform this addition directly on the compressed form, without individually decoding each summand.

Original languageEnglish
Title of host publicationProceedings of the Prague Stringology Conference, PSC 2018
EditorsJan Holub, Jan Zdarek
Pages3-11
Number of pages9
ISBN (Electronic)9788001064849
StatePublished - 19 Jul 2018
Event22nd Prague Stringology Conference, PSC 2018 - Prague, Czech Republic
Duration: 27 Aug 201828 Aug 2018

Publication series

NameProceedings of the Prague Stringology Conference, PSC 2018

Conference

Conference22nd Prague Stringology Conference, PSC 2018
Country/TerritoryCzech Republic
CityPrague
Period27/08/1828/08/18

Keywords

  • Theoretical bounds
  • Compressed suffix array

Fingerprint

Dive into the research topics of 'Fibonacci based compressed suffix array'. Together they form a unique fingerprint.

Cite this