BIROn - Birkbeck Institutional Research Online

    Approximate Circular Pattern Matching under Edit Distance

    Charalampopoulos, Panagiotis and Pissis, Solon P. and Radoszewski, Jakub and Rytter, Wojciech and Walen, Tomasz and Zuba, Wiktor (2024) Approximate Circular Pattern Matching under Edit Distance. Leibniz International Proceedings in Informatics (LIPIcs) 289 , 24:1-24:22. ISSN 1868-8969.

    [img]
    Preview
    Text
    53169a.pdf - Published Version of Record
    Available under License Creative Commons Attribution.

    Download (1MB) | Preview

    Abstract

    In the k-Edit Circular Pattern Matching (k-Edit CPM) problem, we are given a length-n text T, a length-m pattern P, and a positive integer threshold k, and we are to report all starting positions of the substrings of T that are at edit distance at most k from some cyclic rotation of P. In the decision version of the problem, we are to check if any such substring exists. Very recently, Charalampopoulos et al. [ESA 2022] presented O(nk^2)-time and O(nk \log^3 k)-time solutions for the reporting and decision versions of k-Edit CPM, respectively. Here, we show that the reporting and decision versions of k-Edit CPM can be solved in O(n+(n/m)k^6) time and O(n+(n/m)k^5 \log^3 k) time, respectively, thus obtaining the first algorithms with a complexity of the type O(n+(n/m) poly(k)) for this problem. Notably, our algorithms run in O(n) time when m=Ω(k^6) and are superior to the previous respective solutions when m=ω(k^4). We provide a meta-algorithm that yields efficient algorithms in several other interesting settings, such as when the strings are given in a compressed form (as straight-line programs), when the strings are dynamic, or when we have a quantum computer. We obtain our solutions by exploiting the structure of approximate circular occurrences of P in T, when T is relatively short w.r.t. P. Roughly speaking, either the starting positions of approximate occurrences of rotations of P form O(k^4) intervals that can be computed efficiently, or some rotation of P is almost periodic (is at a small edit distance from a string with small period). Dealing with the almost periodic case is the most technically demanding part of this work; we tackle it using properties of locked fragments (originating from [Cole and Hariharan, SICOMP 2002]).

    Metadata

    Item Type: Article
    Additional Information: 41st International Symposium on Theoretical Aspects of Computer Science (STACS 2024)
    School: Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences
    Depositing User: Panagiotis Charalampopoulos
    Date Deposited: 09 Apr 2024 13:46
    Last Modified: 09 Apr 2024 15:35
    URI: https://eprints.bbk.ac.uk/id/eprint/53169

    Statistics

    Activity Overview
    6 month trend
    9Downloads
    6 month trend
    37Hits

    Additional statistics are available via IRStats2.

    Archive Staff Only (login required)

    Edit/View Item Edit/View Item