Truly Subquadratic-Time Extension Queries and Periodicity Detection in Strings with Uncertainties

Costas S. Iliopoulos, Jakub Radoszewski

Strings with don't care symbols, also called partial words, and more general indeterminate strings are a natural representation of strings containing uncertain symbols. A considerable effort has been made to obtain efficient algorithms for pattern matching and periodicity detection in such strings. Among those, a number of algorithms have been proposed that behave well on random data, but still their worst-case running time is Θ(n²). We present the first truly subquadratic-time solutions for a number of such problems on partial words. We show that n longest common compatible prefix queries (which correspond to longest common extension queries in regular strings) can be answered on-line in O(n√n log n) time after O(n√n log n)-time preprocessing. We also present O(n√n log n)-time algorithms for computing the prefix array and two types of border array of a partial word. We show how our solutions can be adapted to indeterminate strings over a constant-sized alphabet and prove that, unless the Strong Exponential Time Hypothesis is false, the considered problems cannot be solved efficiently over a general alphabet.
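To make the queried objects concrete, the following is a minimal sketch of the naive baseline that the abstract contrasts against, not the authors' subquadratic algorithm. It assumes the don't care symbol is written as `?`; the names `WILDCARD`, `compatible`, `lccp`, and `prefix_array` are illustrative and do not come from the paper. Two symbols are treated as compatible when they are equal or when at least one of them is a don't care, and a longest common compatible prefix query is answered by direct scanning, which takes linear time per query and hence quadratic time for n queries in the worst case.

```python
# Assumption: "?" stands for the don't care symbol of a partial word.
WILDCARD = "?"


def compatible(a: str, b: str) -> bool:
    """Two symbols match if they are equal or either one is a don't care."""
    return a == b or a == WILDCARD or b == WILDCARD


def lccp(s: str, i: int, j: int) -> int:
    """Naive longest common compatible prefix of the suffixes s[i:] and s[j:].

    Scans symbol by symbol, so a single query costs O(n) time.
    """
    n = len(s)
    k = 0
    while i + k < n and j + k < n and compatible(s[i + k], s[j + k]):
        k += 1
    return k


def prefix_array(s: str) -> list[int]:
    """Naive prefix array: entry i is lccp(s, 0, i). Worst case O(n^2) time."""
    return [lccp(s, 0, i) for i in range(len(s))]


if __name__ == "__main__":
    print(lccp("ab?ab", 0, 3))
    print(prefix_array("a?a"))
```

Note that compatibility is not transitive: `a` is compatible with `?`, and `?` with `b`, but `a` is not compatible with `b`. This is one reason the sketch falls back to direct comparison instead of reusing equality-based tools for regular strings.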

Original language: English
Title of host publication: Leibniz International Proceedings in Informatics, LIPIcs
Place of Publication: Germany
Publisher: Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing
Number of pages: 12
ISBN (Print): 9783959770125
Publication status: Published - 1 Jun 2016
Event: 27th Annual Symposium on Combinatorial Pattern Matching, CPM 2016 - Tel Aviv, Israel
Duration: 27 Jun 2016 to 29 Jun 2016


Conference: 27th Annual Symposium on Combinatorial Pattern Matching, CPM 2016
City: Tel Aviv


  • Indeterminate string
  • Longest common conservative prefix queries
  • Partial word
  • Prefix array
  • String with don't cares


