Research Paper:
Tourism Information Analysis of SNS Data Using Text Mining
Yoshiyuki Matsumoto

Department of Data Science, Shimonoseki City University
2-1-1 Daigakucho, Shimonoseki, Yamaguchi 751-8510, Japan
In modern society, the internet plays a crucial role in collecting tourism-related information. Traditionally, travelers collected tourism information from travel magazines, television, and travel agencies. However, with the widespread use of the internet, these methods of information collection have quickly shifted online. Travelers can easily access real-time, detailed information from official tourist destination websites, travel review sites, and social networking services (SNSs) through the internet. In addition, experiences, photos, and reviews posted by visitors on SNS serve as valuable reference information for other travelers. Such information frequently serves as a key factor in travelers’ decision-making when selecting destinations and planning their trips. Personal experience-based information is conveyed with reliability and familiarity on SNS, making it influential for many people. Therefore, collecting and analyzing tourism-related information from SNS is considered extremely beneficial for promoting tourism. This study uses text mining to analyze data collected from SNS. Furthermore, it is hypothesized that the most critical source of information on SNS is personal experience-based content. Therefore, this research also explores methods for extracting personal review information from the collected data.

Co-occurrence networks
- [1] Y. Matsumoto, “Analysis using text mining of the proximity area information collected from SNS,” J. of Biomedical Fuzzy Systems Association, Vol.18, No.2, pp. 41-48, 2016 (in Japanese). https://doi.org/10.24466/jbfsa.18.2_41
- [2] Y. Matsumoto and N. Inoue, “Analysis of tourism information using Twitter in the Shimonoseki Area,” J. of Biomedical Fuzzy Systems Association, Vol.22, No.2, pp. 59-66, 2020 (in Japanese). https://doi.org/10.24466/jbfsa.22.2_59
- [3] M. A. Hearst, “Untangling text data mining,” Proc. of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, pp. 3-10, 1999. https://doi.org/10.3115/1034678.1034679
- [4] Y. Ichimura, T. Hasegawa, I. Watanabe, and M. Sato, “Text mining: Case studies,” J. of the Japanese Society for Artificial Intelligence, Vol.16, No.2, pp. 192-200, 2001 (in Japanese). https://doi.org/10.11517/jjsai.16.2_192
- [5] M. Nii, K. Takahama, S. Miyake, A. Uchinuno, and R. Sakashita, “Rule representation for nursing-care process evaluation using decision tree techniques,” J. Adv. Comput. Intell. Intell. Inform., Vol.18, No.6, pp. 918-925, 2014. https://doi.org/10.20965/jaciii.2014.p0918
- [6] S. Nakajima, J. Tatemura, Y. Hara, K. Tanaka, and S. Uemura, “A method of blog thread analysis to discover important bloggers,” J. of Japan Society for Fuzzy Theory and Intelligent Informatics, Vol.19, No.2, pp. 156-166, 2007 (in Japanese). https://doi.org/10.3156/jsoft.19.156
- [7] T. Murata and K. Saito, “Extraction and visualization of Web users’ interests using site-keyword graphs,” J. of Japan Society for Fuzzy Theory and Intelligent Informatics, Vol.18, No.5, pp. 701-710, 2006 (in Japanese). https://doi.org/10.3156/jsoft.18.701
- [8] S. Araki, A. Komura, and H. Hirai, “Relationship between ranking and word-of-mouth on Japanese travel information websites,” J. of Japan Industrial Management Association, Vol.73, No.1, pp. 15-26, 2022 (in Japanese). https://doi.org/10.11221/jima.73.15
- [9] Y. Okada, A. Kuroyanagi, and R. Sugahara, “Attractive factors and mental landscapes of islands captured from visitors’ SNS posts,” Papers on Environmental Information Science, Vol.36, pp. 156-160, 2022 (in Japanese). https://doi.org/10.11492/ceispapers.ceis36.0_156
- [10] A. Vaswani et al., “Attention is all you need,” Proc. of the 31st Int. Conf. on Neural Information Processing Systems, pp. 6000-6010, 2017.
- [11] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” Proc. of the 2019 Conf. of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol.1 (Long and Short Papers), pp. 4171-4186, 2019. https://doi.org/10.18653/v1/N19-1423
- [12] Y. Baba, M. Fujiu, and Y. Morisaki, “Proposal for a system for extracting points for improvement in tourist attractions using word-of-mouth data posted on travel information websites,” Artificial Intelligence and Data Science, Vol.4, No.3, pp. 942-951, 2023 (in Japanese). https://doi.org/10.11532/jsceiii.4.3_942
- [13] J.-P. Benzécri, “L’analyse des données (Tome 2): L’analyse des correspondances,” Dunod, 1973 (in French).
- [14] KH Coder. https://khcoder.net/ [Accessed November 1, 2024]
- [15] Hugging Face. https://huggingface.co/ [Accessed November 1, 2024]
This article is published under a Creative Commons Attribution-NoDerivatives 4.0 Internationa License.