Health Big Data Classification Based on Collaborative Training Optimization Algorithm

Jianwei Zhang; Haiyan Liu

doi:10.20965/jaciii.2024.p1313

JACIII Vol.28 No.6 pp. 1313-1323 (2024)

Research Paper:

Jianwei Zhang^*,† and Haiyan Liu^**

^*College of Health, Zhejiang Industry Polytechnic College
No.151 Qutun Road, Yuecheng District, Shaoxing 312000, China

^†Corresponding author

^**College of Huangjiu, Zhejiang Industry Polytechnic College
No.151 Qutun Road, Yuecheng District, Shaoxing 312000, China

Received: February 3, 2024
Accepted: September 3, 2024
Published: November 20, 2024

Keywords: collaborative training, health big data, ECoRec, machine learning, tri-training

Abstract

In semisupervised learning, and particularly in health big data classification, optimizing classifier performance remains a challenge. This study therefore explores an optimization algorithm based on collaborative training to better handle health big data. First, the tri-training and decision tree classification models were compared: the average classification accuracy of the tri-training model was 4.20% higher than that of the decision tree model. Subsequently, the standard tri-training classifier was compared with these two classifiers; its classification accuracy was 3.88% and 4.33% higher, respectively. Finally, with 10% of the samples labeled, the performance of the collaborative training optimization algorithm was verified with three different base classifiers. The results demonstrate the effectiveness of optimization algorithms based on collaborative training for health big data classification. By improving classifier performance, health big data can be predicted and analyzed more accurately, which in turn improves the accuracy and efficiency of medical decision-making. The optimization algorithm also suggests new research directions for other semisupervised learning problems.

Cite this article as:

J. Zhang and H. Liu, “Health Big Data Classification Based on Collaborative Training Optimization Algorithm,” J. Adv. Comput. Intell. Intell. Inform., Vol.28 No.6, pp. 1313-1323, 2024.

This article is published under a Creative Commons Attribution-NoDerivatives 4.0 International License.
