eng
Schloss Dagstuhl – Leibniz-Zentrum für Informatik
Leibniz International Proceedings in Informatics
1868-8969
2023-08-21
54:1
54:15
10.4230/LIPIcs.MFCS.2023.54
article
On the Complexity of Computing Time Series Medians Under the Move-Split-Merge Metric
Holznigenkemper, Jana
1
Komusiewicz, Christian
2
https://orcid.org/0000-0003-0829-7032
Morawietz, Nils
1
Seeger, Bernhard
1
Fachbereich Mathematik und Informatik, Philipps-Universität Marburg, Germany
Institute of Computer Science, Friedrich-Schiller-Universität Jena, Germany
We initiate a study of the complexity of MSM-Median, the problem of computing a median of a set of k real-valued time series under the move-split-merge distance. This distance measure is based on three operations: moves, which may shift a data point in a time series; splits, which replace one data point in a time series by two consecutive data points of the same value; and merges, which replace two consecutive data points of equal value by a single data point of the same value. The cost of a move operation is the difference of the data point value before and after the operation, the cost of split and merge operations is defined via a given constant c.
Our main results are as follows. First, we show that MSM-Median is NP-hard and W[1]-hard with respect to k for time series with at most three distinct values. Under the Exponential Time Hypothesis (ETH) our reduction implies that a previous dynamic programming algorithm with running time |I|^𝒪(k) [Holznigenkemper et al., Data Min. Knowl. Discov. '23] is essentially optimal. Here, |I| denotes the total input size. Second, we show that MSM-Median can be solved in 2^𝒪(d/c)⋅|I|^𝒪(1) time where d is the total distance of the median to the input time series.
https://drops.dagstuhl.de/storage/00lipics/lipics-vol272-mfcs2023/LIPIcs.MFCS.2023.54/LIPIcs.MFCS.2023.54.pdf
Parameterized Complexity
Median String
Time Series
ETH