Two Fuzzy Clustering Algorithms Based on ARMA Model

Tomoki Nomura; Yuchi Kanzawa

doi:10.20965/jaciii.2024.p1251

single-jc.php

« previous

JACIII Vol.28 No.6 pp. 1251-1262

doi: 10.20965/jaciii.2024.p1251

(2024)

Research Paper:

Views over last 60 days: 388

Two Fuzzy Clustering Algorithms Based on ARMA Model

Tomoki Nomura^† and Yuchi Kanzawa

Shibaura Institute of Technology
3-7-5 Toyosu, Koto-ku, Tokyo 135-8548, Japan

^†Corresponding author

Received:

March 19, 2024

Accepted:

June 21, 2024

Published:

November 20, 2024

Keywords:

fuzzy clustering, series data, autoregressive moving average model

Abstract

This study proposes two fuzzy clustering algorithms based on autoregressive moving average (ARMA) model for series data. The first, referred to as Tsallis entropy-regularized fuzzy c-ARMA model (TFCARMA), is created from k-ARMA, a conventional hard clustering algorithm for series data. TFCARMA is motivated by the relationship between the two clustering algorithms for vectorial data: k-means and Tsallis entropy-regularized fuzzy c-means. The second, referred to as q-divergence-based fuzzy c-ARMA model (QFCARMA), is created from ARMA mixtures, a conventional probabilistic clustering algorithm for series data. QFCARMA is motivated by the relationship between the two clustering algorithms for vectorial data: Gaussian mixture model and q-divergence-based fuzzy c-means. Based on numerical experiments using an artificial dataset, we observed the effects of fuzzification parameters in the proposed algorithms and relationship between the proposed and conventional algorithms. Moreover, numerical experiments using seven real datasets compared the clustering accuracy among the proposed and conventional algorithms.

Fuzzy clustering for series data

Cite this article as:

T. Nomura and Y. Kanzawa, “Two Fuzzy Clustering Algorithms Based on ARMA Model,” J. Adv. Comput. Intell. Intell. Inform., Vol.28 No.6, pp. 1251-1262, 2024.

Data files:

References

[1] J. C. Bezdek, “Pattern Recognition with Fuzzy Objective Function Algorithms,” Plenum Press, 1981. https://doi.org/10.1007/978-1-4757-0450-1
[2] M. Yasuda, “Tsallis entropy based fuzzy c-means clustering with parameter adjustment,” The 6th Int. Conf. on Soft Computing and Intelligent Systems, and the 13th Int. Symp. on Advanced Intelligence Systems, pp. 1534-1539, 2012. https://doi.org/10.1109/SCIS-ISIS.2012.6505118
[3] Y. Kanzawa, “On fuzzy clustering based on Tsallis entropy-regularization,” Proc. of the 30th Fuzzy System Symp., pp. 452-457, 2014 (in Japanese). https://doi.org/10.14864/fss.30.0_452
[4] D. O. Hoare, D. S. Matteson, and M. T. Wells, “K-ARMA models for clustering time series data,” arXiv:2207.00039, 2022. https://doi.org/10.48550/arXiv.2207.00039
[5] Y. Xiong and D.-Y. Yeung, “Mixtures of ARMA models for model-based time series clustering,” 2002 IEEE Int. Conf. on Data Mining, pp. 717-720, 2002. https://doi.org/10.1109/ICDM.2002.1184037
[6] K. Kalpakis, “Mining of science time-series data.” https://redirect.cs.umbc.edu/kalpakis/TS-mining/ [Accessed January 28, 2024]
[7] H. A. Dau et al., “UCR time series classification archive.” https://www.cs.ucr.edu/eamonn/time_series_data_2018/ [Accessed December 12, 2021]
[8] L. Hubert and P. Arabie, “Comparing partitions,” J. of Classification, Vol.2, No.1, pp. 193-218, 1985. https://doi.org/10.1007/BF01908075
[9] Y. Benjamini and Y. Hochberg, “Controlling the false discovery rate: A practical and powerful approach to multiple testing,” J. of the Royal Statistical Society: Series B (Methodological), Vol.57, No.1, pp. 289-300, 1995. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
[10] I. Gath and A. B. Geva, “Unsupervised optimal fuzzy clustering,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.11, No.7, pp. 773-780, 1989. https://doi.org/10.1109/34.192473
[11] X. L. Xie and G. Beni, “A validity measure for fuzzy clustering,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.13, No.8, pp. 841-847, 1991. https://doi.org/10.1109/34.85677
[12] Y. Fukuyama and M. Sugeno, “A new method for choosing the number of clusters for the fuzzy c-means method,” Proc. of the 5th Fuzzy System Symp., pp. 247-250, 1989.
[13] D. L. Davies and D. W. Bouldin, “A cluster separation measure,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.1, No.2, pp. 224-227, 1979. https://doi.org/10.1109/TPAMI.1979.4766909
[14] G. E. P. Box, G. M. Jenkins, G. C. Reinsel, and G. M. Ljung, “Time Series Analysis: Forecasting and Control,” John Wiley & Sons, Inc., 2016.

This article is published under a Creative Commons Attribution-NoDerivatives 4.0 Internationa License.

[1] [1] J. C. Bezdek, “Pattern Recognition with Fuzzy Objective Function Algorithms,” Plenum Press, 1981. https://doi.org/10.1007/978-1-4757-0450-1

[2] [2] M. Yasuda, “Tsallis entropy based fuzzy c-means clustering with parameter adjustment,” The 6th Int. Conf. on Soft Computing and Intelligent Systems, and the 13th Int. Symp. on Advanced Intelligence Systems, pp. 1534-1539, 2012. https://doi.org/10.1109/SCIS-ISIS.2012.6505118

[3] [3] Y. Kanzawa, “On fuzzy clustering based on Tsallis entropy-regularization,” Proc. of the 30th Fuzzy System Symp., pp. 452-457, 2014 (in Japanese). https://doi.org/10.14864/fss.30.0_452

[4] [4] D. O. Hoare, D. S. Matteson, and M. T. Wells, “K-ARMA models for clustering time series data,” arXiv:2207.00039, 2022. https://doi.org/10.48550/arXiv.2207.00039

[5] [5] Y. Xiong and D.-Y. Yeung, “Mixtures of ARMA models for model-based time series clustering,” 2002 IEEE Int. Conf. on Data Mining, pp. 717-720, 2002. https://doi.org/10.1109/ICDM.2002.1184037

[6] [6] K. Kalpakis, “Mining of science time-series data.” https://redirect.cs.umbc.edu/kalpakis/TS-mining/ [Accessed January 28, 2024]

[7] [7] H. A. Dau et al., “UCR time series classification archive.” https://www.cs.ucr.edu/eamonn/time_series_data_2018/ [Accessed December 12, 2021]

[8] [8] L. Hubert and P. Arabie, “Comparing partitions,” J. of Classification, Vol.2, No.1, pp. 193-218, 1985. https://doi.org/10.1007/BF01908075

[9] [9] Y. Benjamini and Y. Hochberg, “Controlling the false discovery rate: A practical and powerful approach to multiple testing,” J. of the Royal Statistical Society: Series B (Methodological), Vol.57, No.1, pp. 289-300, 1995. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x

[10] [10] I. Gath and A. B. Geva, “Unsupervised optimal fuzzy clustering,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.11, No.7, pp. 773-780, 1989. https://doi.org/10.1109/34.192473

[11] [11] X. L. Xie and G. Beni, “A validity measure for fuzzy clustering,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.13, No.8, pp. 841-847, 1991. https://doi.org/10.1109/34.85677

[12] [12] Y. Fukuyama and M. Sugeno, “A new method for choosing the number of clusters for the fuzzy c-means method,” Proc. of the 5th Fuzzy System Symp., pp. 247-250, 1989.

[13] [13] D. L. Davies and D. W. Bouldin, “A cluster separation measure,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.1, No.2, pp. 224-227, 1979. https://doi.org/10.1109/TPAMI.1979.4766909

[14] [14] G. E. P. Box, G. M. Jenkins, G. C. Reinsel, and G. M. Ljung, “Time Series Analysis: Forecasting and Control,” John Wiley & Sons, Inc., 2016.

Two Fuzzy Clustering Algorithms Based on ARMA Model

Tomoki Nomura† and Yuchi Kanzawa

Tomoki Nomura^† and Yuchi Kanzawa