PREDICTIVE CLASSIFICATION MODELLING OF SUSTAINABLE TOURISM PRACTICES USING ONLINE TRAVEL AGENT PLATFORM: A MALAYSIAN CASE STUDY
DOI:
https://doi.org/10.24191/18pddh71Keywords:
Sustainable Tourism, Machine Learning, Online Travel Agent, Data Scraping, Predictive ModelAbstract
Tourism is a key pillar of Malaysia’s economy, recognized in the Economic Transformation Programme (ETP) as one of the 12 National Key Economic Areas (NKEA) that contribute to high-income status. However, the sector faces challenges, particularly in accommodation, due to environmental concerns and difficulties in measuring sustainable tourism. This study applies supervised machine learning classification techniques to develop a predictive model that classifies future sustainable tourism practices within Malaysia’s accommodation sector. Using data from an Online Travel Agent (OTA) platform, the model addresses data gaps and provides valuable insights for informed decision-making, aligned with the environmental conservation. The research is divided into four phases: literature review, data collection via web scraping, data modelling, and model evaluation. As the outcome, the multi-layer perceptron (MLP) model with 22 sets of features outperformed others, achieving an F1-score of 58%, recall of 83%, and Precision Recall Area Under the Curve (PRAUC) of 65%. These findings enhance both the understanding and forecasting of sustainability trends in tourist accommodation using online data.
References
Ab Rahman, S. A., Bachok, S., & Mahamod, L. H. (2023). Effects Of Travel Characteristics on Tourism Expenditure: A Case Study of Malaysia Young Outbound Tourists. In Planning Malaysia (Vol. 21). Malaysian Institute of Planners. https://doi.org/10.21837/pm.v21i25.1233
Adil, M., Ansari, M. F., Alahmadi, A., Wu, J. Z., & Chakrabortty, R. K. (2021). Solving the problem of class imbalance in the prediction of hotel cancelations: A hybridized machine learning approach. Processes, 9(10). https://doi.org/10.3390/pr9101713
Alija, S., Beqiri, E., Gaafar, A. S., & Hamoud, A. K. (2023). Predicting Students Performance Using Supervised Machine Learning Based on Imbalanced Dataset and Wrapper Feature Selection. Informatica (Slovenia), 47(1), 11–20. https://doi.org/10.31449/INF.V47I1.4519
Ardito, L., Cerchione, R., Del Vecchio, P., & Raguseo, E. (2019). Big data in smart tourism: challenges, issues and opportunities. In Current Issues in Tourism (Vol. 22, Issue 15, pp. 1805–1809). Routledge. https://doi.org/10.1080/13683500.2019.1612860
Asmelash, A. G., & Kumar, S. (2019). Assessing progress of tourism sustainability: Developing and validating sustainability indicators. Tourism Management, 71, 67–83. https://doi.org/10.1016/j.tourman.2018.09.020
Booking Holdings. (2022). Sustainability Report. https://www.bookingholdings.com/wp-content/uploads/2023/04/BKNG-Sustainability-Report-2022.pdf
Booking.com. (2021a). Booking.com Launches First-of-its-Kind Travel Sustainable Badge to Lead Industry in Showcasing a Wider Variety of Sustainable Stays. https://globalnews.booking.com/bookingcom-launches-first-of-its-kind-travel-sustainable-badge/
Booking.com. (2021b). Sustainable Travel Report 2021. https://5bf1d946-e0b5-464d-82d0-
Cahigas, M. M. L., Ong, A. K. S., & Prasetyo, Y. T. (2023). Super Typhoon Rai’s Impacts on Siargao Tourism: Deciphering Tourists’ Revisit Intentions through Machine-Learning Algorithms. Sustainability (Switzerland), 15(11). https://doi.org/10.3390/su15118463
Chang, J. R., Chen, M. Y., Chen, L. S., & Tseng, S. C. (2019). Why Customers Don’t Revisit in Tourism and Hospitality Industry? IEEE Access, 7, 146588–146606. https://doi.org/10.1109/ACCESS.2019.2946168
Čumlievski, N., Brkić Bakarić, M., & Matetić, M. (2022). A Smart Tourism Case Study: Classification of Accommodation Using Machine Learning Models Based on Accommodation Characteristics and Online Guest Reviews. Electronics (Switzerland), 11(6). https://doi.org/10.3390/electronics11060913
Economic and Social Commission for Asia and the Pacific. (2024). Asia and The Pacific SDG Progress Report 2024 : Showcasing Transformative Actions. UNITED NATIONS.
Economic Research Institute for ASEAN and East Asia. (2022). Study to develop a framework on Sustainable Tourism Development in ASEAN in the Post COVID-19 Era. https://asean.org/wp-content/uploads/2023/01/Study-to-Develop-a-Framework-on-Sustainable-Tourism-Development-in-ASEAN.pdf
Expedia Group. (2022). Sustainable Travel Study Consumer Attitudes, Values, and Motivations in Making Conscientious Choices. https://go2.advertising.expedia.com/rs/185-EIA-216/images/April_2022-Sustainable-Travel-Study-PDF-No-URL.pdf
Gregg, A., Blasco-Puyuelo, J., Jordá-Muñoz, R., Martín Martínez, I., Burrieza-Galán, J., & Cantú Ros, O. G. (2024). Airport accessibility surveys and mobile phone records data fusion for the analysis of air travel behaviour. Transportation Research Procedia, 76, 269–282. https://doi.org/10.1016/j.trpro.2023.12.054
Guilarte, Y. P., & Quintáns, D. B. (2019). Using big data to measure tourist sustainability: Myth or reality? In Sustainability (Switzerland) (Vol. 11, Issue 20). MDPI. https://doi.org/10.3390/su11205641
Hoffmann, F. J., Braesemann, F., & Teubner, T. (2022). Measuring sustainable tourism with online platform data. EPJ Data Science 2022 11:1, 11(1), 1–21. https://doi.org/10.1140/EPJDS/S13688-022-00354-6
Lm, L. L., Tilaki, M. J. M., Abdul Rahim, A., & Marzbali, M. H. (2024). Covid-19’s Impact on Food Stall Workers: Night Markets’ Resilience In Penang, Malaysia. In Planning Malaysia (Vol. 22). Malaysian Institute of Planners. https://doi.org/10.21837/pm.v22i34.1604
Khan, M. R., Khan, H. U. R., Lim, C. K., Tan, K. L., & Ahmed, M. F. (2021). Sustainable tourism policy, destination management and sustainable tourism development: A moderated-mediation model. Sustainability (Switzerland), 13(21). https://doi.org/10.3390/su132112156
Lenzen, M., Sun, Y. Y., Faturay, F., Ting, Y. P., Geschke, A., & Malik, A. (2018). The carbon footprint of global tourism. Nature Climate Change, 8(6), 522–528. https://doi.org/10.1038/s41558-018-0141-x
Miller, G., & Torres-Delgado, A. (2023). Measuring sustainable tourism: a state of the art review of sustainable tourism indicators. Journal of Sustainable Tourism. https://doi.org/10.1080/09669582.2023.2213859
Ministry of Tourism, Arts & Culture. (2020). National Tourism Policy 2020 - 2030. In 2020. https://www.tourism.gov.my/files/uploads/Executive_Summary.pdf
Nilashi, M., Samad, S., Ahani, A., Ahmadi, H., Alsolami, E., Mahmoud, M., Majeed, H. D., & Abdulsalam Alarood, A. (2021). Travellers decision making through preferences learning: A case on Malaysian spa hotels in TripAdvisor. Computers and Industrial Engineering, 158. https://doi.org/10.1016/j.cie.2021.107348
Obiedat, R., Qaddoura, R., Al-Zoubi, A. M., Al-Qaisi, L., Harfoushi, O., Alrefai, M., & Faris, H. (2022). Sentiment Analysis of Customers’ Reviews Using a Hybrid Evolutionary SVM-Based Approach in an Imbalanced Data Distribution. IEEE Access, 10, 22260–22273. https://doi.org/10.1109/ACCESS.2022.3149482
Prime Minister’s Office. (2012). Economic Transformation Programme Annual Report 2012. https://www.pmo.gov.my/dokumenattached/Eng_ETP2012_Full.pdf
Qasim, H. M., Ata, O., Ansari, M. A., Alomary, M. N., Alghamdi, S., & Almehmadi, M. (2021). Hybrid feature selection framework for the parkinson imbalanced dataset prediction problem. Medicina https://doi.org/10.3390/medicina57111217 (Lithuania), 57(11).
Rasoolimanesh, S. M., Ramakrishna, S., Hall, C. M., Esfandiar, K., & Seyfi, S. (2020). A systematic scoping review of sustainable tourism indicators in relation to the sustainable development goals. Journal of Sustainable Tourism. https://doi.org/10.1080/09669582.2020.1775621
Saraswathi, N., Sasi Rooba, T., & Chakaravarthi, S. (2023). Improving the accuracy of sentiment analysis using a linguistic rule-based feature selection method in tourism reviews. Measurement: Sensors, 29. https://doi.org/10.1016/j.measen.2023.100888
Streimikiene, D., Svagzdiene, B., Jasinskas, E., & Simanavicius, A. (2021). Sustainable tourism development and competitiveness: The systematic literature review. In Sustainable Development (Vol. 29, Issue 1, pp. 259–271). John Wiley and Sons Ltd. https://doi.org/10.1002/sd.2133
Sullivan, J. H., Warkentin, M., & Wallace, L. (2021). So many ways for assessing outliers: What really works and does it matter? Journal of Business Research, 132, 530–543. https://doi.org/10.1016/j.jbusres.2021.03.066
Tourism Malaysia. (2021). Malaysia Tourism Key Performance Indicators 2021. https://mytourismdata.tourism.gov.my/wp-content/uploads/2023/01/KEY PERFORMANCE-INDICATORS-2021.pdf
United Nation. (2023). The Sustainable Development Goals Report 2023. https://unstats.un.org/sdgs/report/2023/The-Sustainable-Development-Goals-Report-2023.pdf
United Nations Environment Programme. (2016). Consumer Information Tools and Climate Change Facilitating low-carbon choices in Tourism, Buildings and Food Systems Guidance for Policy Makers and Business Leaders. http://www.oneplanetnetwork.org/consumer-information-scp/,orcontact:ciscp@un.org.
Weidenfeld, A. (2018). Tourism diversification and its implications for smart specialisation. Sustainability (Switzerland), 10(2). https://doi.org/10.3390/su10020319
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Fiza Abdul Rahim, Nazmirul Izzad Nassir, Raja Marzyani Raja Mazlan, Ramona Ramli

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.




