Derivative Dynamic Time Warping (2001), Eamonn J. Keogh & Michael J. Pazzani

ww大魔王丷

Last modified 2024-12-11 17:09:18

First published 2023-03-29 11:13:24

Article link: https://blog.csdn.net/qq_40292148/article/details/129820086

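This post discusses the problems of classic dynamic time warping (DTW) when comparing time series: it over-explains variability in the Y-axis, which produces singularities, and it can fail to find natural alignments. It then presents an improved algorithm, Derivative DTW (DDTW), which addresses these problems by considering the first derivative of the sequences, making the alignment more sensitive to shape features. Experiments show that DDTW reduces unnecessary warping and handles local differences between sequences better.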

Shortcomings of DTW / problems addressed by DDTW

Algorithm details

Experimental results

Spurious warping

Finding the correct warping

Summary

Introduction

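What DTW is for

A common task with time series is comparing two sequences for similarity. In some domains a very simple distance measure, such as Euclidean distance, is enough. Often, however, two sequences have approximately the same overall shape but are not aligned along the X-axis. Figure 1 shows a simple example.

To measure the similarity of such sequences, the time axis of one (or both) of them must be "warped" to achieve a better match. DTW is an effective technique for performing this warping.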
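As a rough illustration of the warping idea, here is a minimal sketch of classic DTW. It assumes a squared pointwise distance and the usual three-way recurrence (diagonal, vertical, horizontal step); the function name `dtw` and the toy sequences are made up for this example.

```python
from math import inf


def dtw(q, c):
    """Return (cumulative cost, warping path) for sequences q and c."""
    n, m = len(q), len(c)
    # gamma[i][j]: cheapest cumulative cost of aligning q[:i] with c[:j]
    gamma = [[inf] * (m + 1) for _ in range(n + 1)]
    gamma[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (q[i - 1] - c[j - 1]) ** 2  # assumed pointwise distance
            gamma[i][j] = d + min(gamma[i - 1][j - 1],
                                  gamma[i - 1][j],
                                  gamma[i][j - 1])
    # Backtrack from the last cell, preferring the diagonal on ties.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [(gamma[i - 1][j - 1], i - 1, j - 1),
                 (gamma[i - 1][j], i - 1, j),
                 (gamma[i][j - 1], i, j - 1)]
        _, i, j = min(steps, key=lambda s: s[0])
    path.reverse()
    return gamma[n][m], path


# Same shape, shifted by one step along the time axis.
q = [0, 0, 1, 2, 1, 0]
c = [0, 1, 2, 1, 0, 0]
euclidean_sq = sum((a - b) ** 2 for a, b in zip(q, c))
cost, path = dtw(q, c)
print("squared Euclidean:", euclidean_sq)
print("DTW cost:", cost)
print("warping path:", path)
```

On this toy pair the point-by-point comparison penalizes the shift, while the warped alignment absorbs it.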

Problems with DTW

1. Singularities

DTW may try to explain variability in the Y-axis by warping the X-axis. This can produce unintuitive alignments in which a single point on one sequence maps onto many points of the other. The paper calls instances of this behavior "singularities".
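To make the definition concrete, the sketch below scans a warping path (a list of index pairs) and reports every point that is matched to several points of the other sequence. The function name `singularities` and the threshold `min_size` are hypothetical; the paper does not fix a numeric cutoff for what counts as a singularity.

```python
from collections import Counter


def singularities(path, min_size=3):
    """Find points matched to at least `min_size` points of the other sequence.

    `path` is a list of (i, j) pairs; `min_size` is an illustrative threshold.
    Returns two dicts mapping index -> number of matches, one per sequence.
    """
    q_counts = Counter(i for i, _ in path)
    c_counts = Counter(j for _, j in path)
    q_sing = {i: k for i, k in q_counts.items() if k >= min_size}
    c_sing = {j: k for j, k in c_counts.items() if k >= min_size}
    return q_sing, c_sing


# Hand-written path in which point 1 of the first sequence absorbs
# points 1, 2 and 3 of the second sequence.
example_path = [(0, 0), (1, 1), (1, 2), (1, 3), (2, 4)]
q_sing, c_sing = singularities(example_path)
print(q_sing, c_sing)
```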

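Many of the methods proposed to address this problem essentially constrain the range of possible warpings, so they may fail to find some "correct" warpings.

In simulated cases, the correct warping can be determined by warping a time series and then trying to recover the original. In natural cases, the "correct" warping means the intuitively obvious feature-to-feature alignment.

2. Failure to find obvious, natural alignments

DTW may fail to find an obvious, natural alignment between two sequences simply because a feature in one sequence is slightly higher or lower than the corresponding feature in the other. Features include peaks, valleys, inflection points, and plateaus. Figure 2 illustrates the problem.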

Modifying DTW to address these problems

The classic dynamic time warping algorithm

Summary of earlier remedies: Constraining the classic DTW algorithm

1) Windowing

Berndt, D. & Clifford, J. (1994) Using dynamic time warping to find patterns in time series. AAAI-94 Workshop on Knowledge Discovery in Databases (KDD-94), Seattle, Washington.

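The elements of the matrix that the path may visit are restricted to a warping window: | i - ( n / ( m/j )) | < R, where R is a positive integer giving the window width. In effect this prunes the corners of the matrix, as shown by the dashed lines in Figure 3.

Many studies have experimented with warping windows of different shapes.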

Rabiner, L., Rosenberg, A. & Levinson, S. (1978). Considerations in dynamic time warping algorithms for discrete word recognition. IEEE Trans. Acoustics, Speech, and Signal Proc., Vol. ASSP-26, 575-582.

Tappert, C. & Das, S. (1978). Memory and time improvements in a dynamic programming algorithm for matching speech patterns. IEEE Trans. Acoustics, Speech, and Signal Proc., Vol. ASSP-26, 583-586.

Myers, C., Rabiner, L. & Rosenberg, A. (1980). Performance tradeoffs in dynamic time warping algorithms for isolated word recognition. IEEE Trans. Acoustics, Speech, and Signal Proc., Vol. ASSP-28, 623-635.

This approach limits the maximum size of a singularity but does not prevent singularities from occurring.
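A minimal sketch of the windowing constraint, assuming the usual three-way DTW recurrence with a squared pointwise distance. Cells that violate | i - ( n / ( m/j )) | < R (with 1-based i and j) are simply left unreachable. The function name `dtw_windowed` is made up for this example.

```python
from math import inf


def dtw_windowed(q, c, R):
    """DTW cost with cells restricted to |i - n/(m/j)| < R (1-based i, j)."""
    n, m = len(q), len(c)
    gamma = [[inf] * (m + 1) for _ in range(n + 1)]
    gamma[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if abs(i - n * j / m) >= R:
                continue  # cell lies outside the warping window
            d = (q[i - 1] - c[j - 1]) ** 2  # assumed pointwise distance
            gamma[i][j] = d + min(gamma[i - 1][j - 1],
                                  gamma[i - 1][j],
                                  gamma[i][j - 1])
    return gamma[n][m]


q = [0, 0, 1, 2, 1, 0]
c = [0, 1, 2, 1, 0, 0]
narrow = dtw_windowed(q, c, 1)  # only the diagonal is allowed
wider = dtw_windowed(q, c, 2)   # a shift of one step is allowed
print(narrow, wider)
```

With equal-length sequences and R = 1 only the diagonal survives, so the result reduces to the point-by-point squared distance. Widening the window lets the path absorb the one-step shift, which also shows why choosing R matters.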

2) Slope Weighting

Kruskal, J. B. & Liberman, M. (1983). The symmetric time warping algorithm: From continuous to discrete. In Time Warps, String Edits and Macromolecules: The Theory and Practice of String Comparison. Addison-Wesley.

Sakoe, H. & Chiba, S. (1978). Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoustics, Speech, and Signal Proc., Vol. ASSP-26, 43-49.

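Equation (5) is replaced with γ(i,j) = d(i,j) + min{ γ(i-1,j-1), X·γ(i-1,j), X·γ(i,j-1) }, where X is a positive real number. Warping is constrained by changing the value of X: the larger X is, the closer the warping path stays to the diagonal.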
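A minimal sketch of slope weighting, assuming the weighted recurrence γ(i,j) = d(i,j) + min{ γ(i-1,j-1), X·γ(i-1,j), X·γ(i,j-1) } and a squared pointwise distance. The function name `dtw_slope_weighted` and the toy data are made up for illustration.

```python
from math import inf


def dtw_slope_weighted(q, c, X):
    """DTW cost where non-diagonal predecessors are multiplied by X > 0."""
    n, m = len(q), len(c)
    gamma = [[inf] * (m + 1) for _ in range(n + 1)]
    gamma[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (q[i - 1] - c[j - 1]) ** 2  # assumed pointwise distance
            gamma[i][j] = d + min(gamma[i - 1][j - 1],
                                  X * gamma[i - 1][j],
                                  X * gamma[i][j - 1])
    return gamma[n][m]


q = [0, 3, 3]
c = [1, 1, 3]
unweighted = dtw_slope_weighted(q, c, 1.0)  # X = 1 is plain DTW
weighted = dtw_slope_weighted(q, c, 2.0)    # off-diagonal steps cost more
print(unweighted, weighted)
```

On this toy pair, X = 1 finds a cheaper off-diagonal path, while X = 2 makes that path expensive enough that the result matches the plain diagonal alignment.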

3) Step Patterns (Slope constraints)

Itakura, F. (1975). Minimum prediction residual principle applied to speech recognition. IEEE Trans. Acoustics, Speech, and Signal Proc., Vol. ASSP-23, 52-72.

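Equation (5) can be visualized as a diagram of admissible step-patterns, as in Figure 4.A. The arrows show the steps the warping path may take at each stage.

If equation (5) is replaced with γ(i,j) = d(i,j) + min{ γ(i-1,j-1), γ(i-1,j-2), γ(i-2,j-1) }, the step-pattern diagram becomes Figure 4.B. Many other step patterns exist; see the reference below for a review.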

Rabiner, L. & Juang, B. (1993). Fundamentals of speech recognition. Englewood Cliffs, N.J, Prentice Hall.
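A minimal sketch of one alternative step pattern, assuming the recurrence γ(i,j) = d(i,j) + min{ γ(i-1,j-1), γ(i-1,j-2), γ(i-2,j-1) } with a squared pointwise distance. The boundary handling (the path must start at the first pair of points, and anything off the matrix is unreachable) is an assumption of this sketch, as is the function name `dtw_step_pattern`.

```python
from math import inf


def dtw_step_pattern(q, c):
    """DTW cost with steps (1,1), (1,2) and (2,1) only; inf if unreachable."""
    n, m = len(q), len(c)
    gamma = [[inf] * m for _ in range(n)]  # 0-based indices

    def g(i, j):
        # Cells outside the matrix are treated as unreachable.
        return gamma[i][j] if i >= 0 and j >= 0 else inf

    for i in range(n):
        for j in range(m):
            d = (q[i] - c[j]) ** 2  # assumed pointwise distance
            if i == 0 and j == 0:
                gamma[i][j] = d  # the path must start at the first pair
            else:
                gamma[i][j] = d + min(g(i - 1, j - 1),
                                      g(i - 1, j - 2),
                                      g(i - 2, j - 1))
    return gamma[n - 1][m - 1]


print(dtw_step_pattern([0, 3, 3], [1, 1, 3]))
print(dtw_step_pattern([0, 1], [0, 0, 0, 0, 1]))  # lengths too different
```

Because every step advances both indices and neither by more than two, sequences whose lengths differ by more than roughly a factor of two cannot be aligned at all. This is one way such constraints can miss a warping that unconstrained DTW would find.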

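All three classes of methods mitigate the singularity problem at the cost of possibly missing the correct warping. In addition, choosing the parameters is itself difficult (R for windowing, X for slope weighting, and the choice of step pattern).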

Derivative dynamic time warping

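Shortcomings of DTW / problems addressed by DDTW

Some global differences that affect the entire sequence are relatively easy to remove, for example different means (offset translation), different scalings (amplitude scaling), and linear trends.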

Keogh, E., & Pazzani, M. (1998). An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback. Proceedings of the 4th International Conference of Knowledge Discovery and Data Mining. pp 239-241, AAAI Press.

Agrawal, R., Lin, K. I., Sawhney, H. S., & Shim, K. (1995). Fast similarity search in the presence of noise, scaling, and translation in times-series databases. In VLDB, September.
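One common way to remove such global differences is sketched below. This is a generic preprocessing recipe rather than the specific procedure of the cited papers: z-normalization handles offset and amplitude scaling, and subtracting a least-squares line handles a linear trend. The function names `z_normalize` and `detrend` are made up for this example, and `z_normalize` assumes the sequence is not constant.

```python
from statistics import fmean, pstdev


def z_normalize(x):
    """Remove offset and amplitude scaling (assumes x is not constant)."""
    mu, sigma = fmean(x), pstdev(x)
    return [(v - mu) / sigma for v in x]


def detrend(x):
    """Subtract the least-squares straight line fitted to x."""
    n = len(x)
    t_mean = (n - 1) / 2
    x_mean = fmean(x)
    num = sum((t - t_mean) * (v - x_mean) for t, v in enumerate(x))
    den = sum((t - t_mean) ** 2 for t in range(n))
    slope = num / den
    return [v - (x_mean + slope * (t - t_mean)) for t, v in enumerate(x)]


a = [0, 1, 3, 1, 0]
b = [10 + 2 * v for v in a]  # same shape, different offset and scale
print(z_normalize(a))
print(z_normalize(b))        # matches the line above up to rounding
print(detrend([1, 3, 5, 7])) # a pure linear trend leaves only zeros
```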

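DTW can go wrong when two sequences differ locally in the Y-axis in addition to having local accelerations or decelerations along the X-axis. Figure 5 gives an example: for two identical sequences, DTW clearly produces a one-to-one alignment; but if a single local feature is changed slightly (the depth of a valley), DTW tends to explain the difference through the time axis and produces two singularities.

DTW's weakness lies in the feature it considers: only the Y-value of each data point. For example, consider two data points q_i and c_j with identical values, where q_i is part of a rising trend and c_j is part of a falling trend. DTW regards the mapping between these two points as ideal because their values are equal, yet intuitively we would not want to map a rising trend onto a falling trend.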
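The rising-versus-falling case can be checked with a few lines. The sketch compares a value-based distance with a slope-based one at the matching point. The slope is estimated with a plain central difference, which is an illustrative choice and not necessarily the derivative estimate used by DDTW.

```python
def slope(x, i):
    """Central-difference slope estimate at interior index i (illustrative)."""
    return (x[i + 1] - x[i - 1]) / 2


q = [1, 2, 3]  # rising through the value 2
c = [3, 2, 1]  # falling through the value 2

value_dist = (q[1] - c[1]) ** 2                 # what classic DTW compares
slope_dist = (slope(q, 1) - slope(c, 1)) ** 2   # compares local trends
print(value_dist, slope_dist)
```

The value-based distance sees a perfect match, while the slope-based distance is clearly nonzero, which is the intuition behind comparing derivatives instead of raw values.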