TY - GEN
T1 - Streaming pattern matching with d wildcards
AU - Golan, Shay
AU - Kopelowitz, Tsvi
AU - Porat, Ely
N1 - Publisher Copyright:
© Shay Golan, Tsvi Kopelowitz, and Ely Porat.
PY - 2016/8/1
Y1 - 2016/8/1
N2 - In the pattern matching with d wildcards problem we are given a text T of length n and a pattern P of length m that contains d wildcard characters, each denoted by a special symbol '?'. A wildcard character matches any other character. The goal is to establish for each m-length substring of T whether it matches P. In the streaming model variant of the pattern matching with d wildcards problem the text T arrives one character at a time and the goal is to report, before the next character arrives, if the last m characters match P while using only o(m) words of space. In this paper we introduce two new algorithms for the d wildcard pattern matching problem in the streaming model. The first is a randomized Monte Carlo algorithm that is parameterized by a constant 0 ≤ δ ≤ 1. This algorithm uses Õ(d1-δ) amortized time per character and Õ(d1+δ) words of space. The second algorithm, which is used as a black box in the first algorithm, is a randomized Monte Carlo algorithm which uses O(d + log m) worst-case time per character and O(dlogm) words of space.
AB - In the pattern matching with d wildcards problem we are given a text T of length n and a pattern P of length m that contains d wildcard characters, each denoted by a special symbol '?'. A wildcard character matches any other character. The goal is to establish for each m-length substring of T whether it matches P. In the streaming model variant of the pattern matching with d wildcards problem the text T arrives one character at a time and the goal is to report, before the next character arrives, if the last m characters match P while using only o(m) words of space. In this paper we introduce two new algorithms for the d wildcard pattern matching problem in the streaming model. The first is a randomized Monte Carlo algorithm that is parameterized by a constant 0 ≤ δ ≤ 1. This algorithm uses Õ(d1-δ) amortized time per character and Õ(d1+δ) words of space. The second algorithm, which is used as a black box in the first algorithm, is a randomized Monte Carlo algorithm which uses O(d + log m) worst-case time per character and O(dlogm) words of space.
KW - Don't-cares
KW - Fingerprints
KW - Streaming pattern matching
KW - Wildcards
UR - https://www.scopus.com/pages/publications/85013011679
U2 - 10.4230/LIPIcs.ESA.2016.44
DO - 10.4230/LIPIcs.ESA.2016.44
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85013011679
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 24th Annual European Symposium on Algorithms, ESA 2016
A2 - Zaroliagis, Christos
A2 - Sankowski, Piotr
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 24th Annual European Symposium on Algorithms, ESA 2016
Y2 - 22 August 2016 through 24 August 2016
ER -