Research Paper:
Two Fuzzy Clustering Algorithms Based on ARMA Model
Tomoki Nomura and Yuchi Kanzawa
Shibaura Institute of Technology
3-7-5 Toyosu, Koto-ku, Tokyo 135-8548, Japan
Corresponding author
This study proposes two fuzzy clustering algorithms based on autoregressive moving average (ARMA) model for series data. The first, referred to as Tsallis entropy-regularized fuzzy c-ARMA model (TFCARMA), is created from k-ARMA, a conventional hard clustering algorithm for series data. TFCARMA is motivated by the relationship between the two clustering algorithms for vectorial data: k-means and Tsallis entropy-regularized fuzzy c-means. The second, referred to as q-divergence-based fuzzy c-ARMA model (QFCARMA), is created from ARMA mixtures, a conventional probabilistic clustering algorithm for series data. QFCARMA is motivated by the relationship between the two clustering algorithms for vectorial data: Gaussian mixture model and q-divergence-based fuzzy c-means. Based on numerical experiments using an artificial dataset, we observed the effects of fuzzification parameters in the proposed algorithms and relationship between the proposed and conventional algorithms. Moreover, numerical experiments using seven real datasets compared the clustering accuracy among the proposed and conventional algorithms.
- [1] J. C. Bezdek, “Pattern Recognition with Fuzzy Objective Function Algorithms,” Plenum Press, 1981. https://doi.org/10.1007/978-1-4757-0450-1
- [2] M. Yasuda, “Tsallis entropy based fuzzy c-means clustering with parameter adjustment,” The 6th Int. Conf. on Soft Computing and Intelligent Systems, and the 13th Int. Symp. on Advanced Intelligence Systems, pp. 1534-1539, 2012. https://doi.org/10.1109/SCIS-ISIS.2012.6505118
- [3] Y. Kanzawa, “On fuzzy clustering based on Tsallis entropy-regularization,” Proc. of the 30th Fuzzy System Symp., pp. 452-457, 2014 (in Japanese). https://doi.org/10.14864/fss.30.0_452
- [4] D. O. Hoare, D. S. Matteson, and M. T. Wells, “K-ARMA models for clustering time series data,” arXiv:2207.00039, 2022. https://doi.org/10.48550/arXiv.2207.00039
- [5] Y. Xiong and D.-Y. Yeung, “Mixtures of ARMA models for model-based time series clustering,” 2002 IEEE Int. Conf. on Data Mining, pp. 717-720, 2002. https://doi.org/10.1109/ICDM.2002.1184037
- [6] K. Kalpakis, “Mining of science time-series data.” https://redirect.cs.umbc.edu/kalpakis/TS-mining/ [Accessed January 28, 2024]
- [7] H. A. Dau et al., “UCR time series classification archive.” https://www.cs.ucr.edu/eamonn/time_series_data_2018/ [Accessed December 12, 2021]
- [8] L. Hubert and P. Arabie, “Comparing partitions,” J. of Classification, Vol.2, No.1, pp. 193-218, 1985. https://doi.org/10.1007/BF01908075
- [9] Y. Benjamini and Y. Hochberg, “Controlling the false discovery rate: A practical and powerful approach to multiple testing,” J. of the Royal Statistical Society: Series B (Methodological), Vol.57, No.1, pp. 289-300, 1995. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
- [10] I. Gath and A. B. Geva, “Unsupervised optimal fuzzy clustering,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.11, No.7, pp. 773-780, 1989. https://doi.org/10.1109/34.192473
- [11] X. L. Xie and G. Beni, “A validity measure for fuzzy clustering,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.13, No.8, pp. 841-847, 1991. https://doi.org/10.1109/34.85677
- [12] Y. Fukuyama and M. Sugeno, “A new method for choosing the number of clusters for the fuzzy c-means method,” Proc. of the 5th Fuzzy System Symp., pp. 247-250, 1989.
- [13] D. L. Davies and D. W. Bouldin, “A cluster separation measure,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.1, No.2, pp. 224-227, 1979. https://doi.org/10.1109/TPAMI.1979.4766909
- [14] G. E. P. Box, G. M. Jenkins, G. C. Reinsel, and G. M. Ljung, “Time Series Analysis: Forecasting and Control,” John Wiley & Sons, Inc., 2016.
This article is published under a Creative Commons Attribution-NoDerivatives 4.0 Internationa License.