Optimization of K value in the K-NN For Classification Review Pagar Alam City Tourism using Expectation Maximization and Grid Search CV
DOI:
https://doi.org/10.36050/aqj3h053Keywords:
Expectation Maximixation, Grid Search CV, K-NN, pagar alam, tourismAbstract
The development of the tourism sector in Pagar Alam City requires a sentiment analysis system capable of accurately capturing tourists’ perceptions. Sentiment analysis of online reviews can serve as a valuable foundation for decision-making in managing and improving tourism destinations. This study aims to produce an accurate sentiment classification by optimizing the K value in the K-Nearest Neighbor (KNN) algorithm using Expectation Maximization (EM) and Grid Search Cross Validation (GS-CV). The research employs the Cross Industry Standard Process for Data Mining (CRISP-DM) methodology, consisting of six stages: business understanding, data understanding, data preparation, modeling, evaluation, and deployment. Data were collected through web scraping of online tourism reviews, resulting in 4,806 reviews across eight major tourist attractions, including Tugu Rimau, Mount Dempo, and Tujuh Kenangan Waterfall. The results indicate that Tugu Rimau received the highest positive sentiment score (0.678), while Tujuh Kenangan Waterfall showed the lowest (0.006). Model performance evaluation revealed that KNN accuracy improved from 81% to 89% after optimization using EM and GS-CV, achieving 88% precision, 89% recall, and an F1-score of 85%. These findings demonstrate that the integration of EM and GS-CV effectively enhances the classification accuracy of KNN in sentiment analysis for Pagar Alam’s tourism reviews
References
1. Atikah, N. (2019). Application Of Expectation-Maximization (EM) Algorithm In Grouping Popularity Tourism Objects In Malang Raya Based On Indicator Of Many Visitors. Jurnal Matematika “MANTIK,” 5(2), 123–134. Https://Doi.Org/10.15642/Mantik.2019.5.2.123-134
2. Aziza, L. N., Astuti, R. Y., Maulana, B. A., & Hidayati, N. (2024). Penerapan Algoritma K-Nearest Neighbor Untuk Klasifikasi Ketahanan Pangan Di Provinsi Jawa Tengah. MALCOM: Indonesian Journal Of Machine Learning And Computer Science, 4(2), 404–412. Https://Doi.Org/10.57152/Malcom.V4i2.1201
3. Azizah, N., Firdaus, M. R., Suyaningsih, R., Indrayatna, F., & Padjadjaran, U. (2023). Penerapan Algoritma Klasifikasi K-Nearest Neighbor Pada Penyakit Diabetes. Http://Prosiding.Snsa.Statistics.Unpad.Ac.Id
4. Cholil, S. R., Handayani, T., Prathivi, R., & Ardianita, T. (2021). Implementasi Algoritma Klasifikasi K-Nearest Neighbor (KNN) Untuk Klasifikasi Seleksi Penerima Beasiswa. IJCIT (Indonesian Journal On Computer And Information Technology), 6(2), 118–127.
5. F. Putrawansyah, “Penerapan Metode Support Vector Machine Terhadap Klasifikasi Jenis Jambu Biji,” JIKO (Jurnal Informatika dan Komputer), vol. 8, no. 1, p. 193, Feb. 2024, doi: 10.26798/jiko.v8i1.988.
6. Hamied Nababan, A., & Hutagalung, M. Y. (2023). Hyperparameter Tuning Pada Model Stance Detection Menggunakan Gridsearchcv. Jurnal Sains Dan Teknologi, 5(1), 205–209. Https://Doi.Org/10.55338/Saintek.V5i1.1505
7. Jamiluddin, F., Faisal, S., Lestari, S. A. P., & Fauzi, A. (2024). Implementasi Hyperparameter Tuning Grid Search CV Pada Prediksi Produksi Padi Menggunakan Algoritma Linear Regresi. Journal Of Information System Research (JOSH), 6(1), 480–488. Https://Doi.Org/10.47065/Josh.V6i1.5930
8. Marisa Efendi, D., Sartika, D., Isnayah Waspah, A., Afandi, A., Informasi, S., & Dian Cipta Cendikia Kotabumi, S. (2022). Expectation Maximization Algorithm Memprediksi Penjualan Susu Murni Pada Pt. Sewu Primatama Indonesia Lampung Tengah. In Jurnal Teknik Informatika Musirawas) Aik Isnayah Waspah, Asep Afandi (Vol. 7, Issue 1).
9. Miya Juwita, R., Haerani, E., Kurnia Gusti, S., & Siti Ramadhani, Dan. (2022). Klasifikasi Berita Menggunakan Metode K-Nearest Neighbor. Jurnal Nasional Komputasi Dan Teknologi Informasi, 5(2).
10. M. Mustakim, K. Kurnia, N. Noviarni, F. Putrawansyah, A. Kurniati And R. Rimet, "Implementation Of Convolutional Neural Network For Sentiment Analysis On Hotel Customer Reviews," 2024 International Conference On Decision Aid Sciences And Applications (DASA), Manama, Bahrain, 2024, Pp. 1-6, Doi: 10.1109/DASA63652.2024.10836631.
11. Nadiya Citra Dewi, & Edi Surya Negara. (2023). Klasifikasi teks pada Ulasan Objek Wisata Di Kota Pagar Alam Menggunakan Pendekatan Machine Learning. IJCS, 12 No 5, 3027–3042.
12. Ni Ketut Intan Setiawati, & I Gede Arta Wibawa. (2022). Penerapan Algoritma K-Nearest Neighbor Dalam Klasifikasi Penyakit Gagal Jantung. Jnatia, 1 Nomor 1, 347–352.
13. Putrawansyah, F., Rahayu, C., & Dhiniati, F. (2024). Application Of Particle Swarm Optimization Toimprove The Performance Of The K-Nearestneighbor In Stunting Classification In Southsumatra, Indonesia. International Journal Of Education And Management Engineering, 14(6), 32–43. Https://Doi.Org/10.5815/Ijeme.2024.06.03
14. Safira, A., Masyarakat…, A. S., & Hasan, F. N. (2023). Analisis Sentimen Masyarakat Terhadap Paylater Menggunakan Metode Naive Bayes Classifier. Jurnal Sistem Informasi, 5(1).
15. Simarmata, J. E. (2021). Application Of Expectation Maximization Algorithm In Estimating Parameter Values Of Maximum Likelihood Model. Journal Of Research In Mathematics Trends And Technology, 3(1), 34–39. Https://Doi.Org/10.32734/Jormtt.V3i1.8331
16. Thet, T. T., Na, J. C., & Khoo, C. S. G. (2023). Aspect-Based Sentiment Analysis Of Movie Reviews On Discussion Boards. Journal Of Information Science, 36(6), 823–848. Https://Doi.Org/10.1177/0165551510388123
17. Trihardianingsih, L., Santos Lasatira, G., Kunci-Gridsearrchcv, K., & Udara, K. (2024). Optimasi Hyperparameter Gridsearchcv Pada Klasifikasi Kualitas Udara Menggunakan Support Vector Machine. In Jurnal Informasi Dan Teknologi) (Vol. 1, Issue 2). Https://Data.Jakarta.Go.Id/.
18. Ummami, R., & Winarno, B. (2023). Gaussian Mixture Model Dengan Algoritme Expectation Maximization Untuk Pengelompokan Data Distribusi Air Bersih Di Jawa Barat. PRISMA, Prosiding Seminar Nasional Matematika, 6, 745–750. Https://Journal.Unnes.Ac.Id/Sju/Index.Php/Prisma/
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.






