תקציר
The String-to-Dictionary Matching Problem is defined, in which a string is searched for in all the possible concatenations of the elements of a given dictionary, with applications to compressed matching in variable to fixed-length encodings, such as Tunstall's. Two algorithms based on suffix trees are suggested, the one focusing on the dictionary, the other on the pattern to be searched for. The problem is then extended to deal also with patterns that include gaps. Experiments on natural language text suggest that compressed search might use less comparisons for long enough patterns, in spite of a potentially large number of encodings.
שפה מקורית | אנגלית |
---|---|
עמודים (מ-עד) | 1347-1356 |
מספר עמודים | 10 |
כתב עת | Computer Journal |
כרך | 55 |
מספר גיליון | 11 |
מזהי עצם דיגיטלי (DOIs) | |
סטטוס פרסום | פורסם - נוב׳ 2012 |
פורסם באופן חיצוני | כן |