single-jc.php

JACIII Vol.29 No.4 pp. 829-837
doi: 10.20965/jaciii.2025.p0829
(2025)

Research Paper:

Tourism Information Analysis of SNS Data Using Text Mining

Yoshiyuki Matsumoto ORCID Icon

Department of Data Science, Shimonoseki City University
2-1-1 Daigakucho, Shimonoseki, Yamaguchi 751-8510, Japan

Received:
January 31, 2025
Accepted:
March 31, 2025
Published:
July 20, 2025
Keywords:
text mining, tourism information, social network, correspondence analysis, sentiment analysis
Abstract

In modern society, the internet plays a crucial role in collecting tourism-related information. Traditionally, travelers collected tourism information from travel magazines, television, and travel agencies. However, with the widespread use of the internet, these methods of information collection have quickly shifted online. Travelers can easily access real-time, detailed information from official tourist destination websites, travel review sites, and social networking services (SNSs) through the internet. In addition, experiences, photos, and reviews posted by visitors on SNS serve as valuable reference information for other travelers. Such information frequently serves as a key factor in travelers’ decision-making when selecting destinations and planning their trips. Personal experience-based information is conveyed with reliability and familiarity on SNS, making it influential for many people. Therefore, collecting and analyzing tourism-related information from SNS is considered extremely beneficial for promoting tourism. This study uses text mining to analyze data collected from SNS. Furthermore, it is hypothesized that the most critical source of information on SNS is personal experience-based content. Therefore, this research also explores methods for extracting personal review information from the collected data.

Co-occurrence networks

Co-occurrence networks

Cite this article as:
Y. Matsumoto, “Tourism Information Analysis of SNS Data Using Text Mining,” J. Adv. Comput. Intell. Intell. Inform., Vol.29 No.4, pp. 829-837, 2025.
Data files:
References
  1. [1] Y. Matsumoto, “Analysis using text mining of the proximity area information collected from SNS,” J. of Biomedical Fuzzy Systems Association, Vol.18, No.2, pp. 41-48, 2016 (in Japanese). https://doi.org/10.24466/jbfsa.18.2_41
  2. [2] Y. Matsumoto and N. Inoue, “Analysis of tourism information using Twitter in the Shimonoseki Area,” J. of Biomedical Fuzzy Systems Association, Vol.22, No.2, pp. 59-66, 2020 (in Japanese). https://doi.org/10.24466/jbfsa.22.2_59
  3. [3] M. A. Hearst, “Untangling text data mining,” Proc. of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, pp. 3-10, 1999. https://doi.org/10.3115/1034678.1034679
  4. [4] Y. Ichimura, T. Hasegawa, I. Watanabe, and M. Sato, “Text mining: Case studies,” J. of the Japanese Society for Artificial Intelligence, Vol.16, No.2, pp. 192-200, 2001 (in Japanese). https://doi.org/10.11517/jjsai.16.2_192
  5. [5] M. Nii, K. Takahama, S. Miyake, A. Uchinuno, and R. Sakashita, “Rule representation for nursing-care process evaluation using decision tree techniques,” J. Adv. Comput. Intell. Intell. Inform., Vol.18, No.6, pp. 918-925, 2014. https://doi.org/10.20965/jaciii.2014.p0918
  6. [6] S. Nakajima, J. Tatemura, Y. Hara, K. Tanaka, and S. Uemura, “A method of blog thread analysis to discover important bloggers,” J. of Japan Society for Fuzzy Theory and Intelligent Informatics, Vol.19, No.2, pp. 156-166, 2007 (in Japanese). https://doi.org/10.3156/jsoft.19.156
  7. [7] T. Murata and K. Saito, “Extraction and visualization of Web users’ interests using site-keyword graphs,” J. of Japan Society for Fuzzy Theory and Intelligent Informatics, Vol.18, No.5, pp. 701-710, 2006 (in Japanese). https://doi.org/10.3156/jsoft.18.701
  8. [8] S. Araki, A. Komura, and H. Hirai, “Relationship between ranking and word-of-mouth on Japanese travel information websites,” J. of Japan Industrial Management Association, Vol.73, No.1, pp. 15-26, 2022 (in Japanese). https://doi.org/10.11221/jima.73.15
  9. [9] Y. Okada, A. Kuroyanagi, and R. Sugahara, “Attractive factors and mental landscapes of islands captured from visitors’ SNS posts,” Papers on Environmental Information Science, Vol.36, pp. 156-160, 2022 (in Japanese). https://doi.org/10.11492/ceispapers.ceis36.0_156
  10. [10] A. Vaswani et al., “Attention is all you need,” Proc. of the 31st Int. Conf. on Neural Information Processing Systems, pp. 6000-6010, 2017.
  11. [11] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” Proc. of the 2019 Conf. of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol.1 (Long and Short Papers), pp. 4171-4186, 2019. https://doi.org/10.18653/v1/N19-1423
  12. [12] Y. Baba, M. Fujiu, and Y. Morisaki, “Proposal for a system for extracting points for improvement in tourist attractions using word-of-mouth data posted on travel information websites,” Artificial Intelligence and Data Science, Vol.4, No.3, pp. 942-951, 2023 (in Japanese). https://doi.org/10.11532/jsceiii.4.3_942
  13. [13] J.-P. Benzécri, “L’analyse des données (Tome 2): L’analyse des correspondances,” Dunod, 1973 (in French).
  14. [14] KH Coder. https://khcoder.net/ [Accessed November 1, 2024]
  15. [15] Hugging Face. https://huggingface.co/ [Accessed November 1, 2024]

*This site is desgined based on HTML5 and CSS3 for modern browsers, e.g. Chrome, Firefox, Safari, Edge, Opera.

Last updated on Jul. 19, 2025