Pendekatan Model TF-IDF dan Cosine Similarity Pada Rekomendasi Dosen Pembimbing Skripsi
DOI:
https://doi.org/10.36050/bzdpf251Keywords:
cosine similarity, text processing, similarity, supervisor recommendation, TF-IDFAbstract
The selection of a thesis supervisor is recommended based on the alignment between the student's research topic and the supervisor's academic expertise. Currently, the supervisor selection process often relies on manual methods based on subjective preferences or supervisor availability, which risks creating a mismatch between the supervisor's competencies and the research topic. This study aims to implement machine learning model Term Frequency–Inverse Document Frequency (TF-IDF) and Cosine similarity approach for recommending thesis supervisors to enhance the objectivity and efficiency of the academic process. The methodology involved collecting a dataset of research titles and abstracts from lecturers in the Health Administration study program at STIKes Budi Mulia Sriwijaya. This was followed by text preprocessing stages, including case folding, tokenization, stopword removal, and stemming. Subsequently, term weighting was calculated using the TF-IDF algorithm, and the semantic similarity between documents was measured using Cosine Similarity. The analysis results indicate a highest TF-IDF score of 95.81, signifying a high degree of topic focus. Meanwhile, the highest Cosine similarity score was 0.77. The visualization of results in a heatmap illustrates the clustering of relationships between titles based on their level of topical similarity
References
[1] T.-C. Yang and Z.-S. Lin, “Enhancing elementary school students’ computational thinking and programming learning with graphic organizers,” Comput. Educ., vol. 209, p. 104962, 2024.
[2] B. Williamson, J. Komljenovic, and K. N. Gulson, “Introduction: Digitalisation of education in the era of algorithms, automation and artificial intelligence,” in World yearbook of education 2024, Routledge, 2023, pp. 1–19.
[3] M. Carr and F. Lesniewska, “Internet of Things, cybersecurity and governing wicked problems: learning from climate change governance,” Int. Relations, vol. 34, no. 3, pp. 391–412, 2020.
[4] I. Citaristi, “United Nations Educational, Scientific and Cultural Organization—UNESCO,” in The Europa Directory of International Organizations 2022, Routledge, 2022, pp. 369–375.
[5] Q. Nabila, A. Luthfiarta, M. Syabilla, A. Ahmad, and R. Riyanto, “Comparative Analysis of Vectorization Methods for Academic Supervisor Recommendations,” J. Teknol. Inf. dan Terap., vol. 11, no. 2, pp. 115–123, 2024.
[6] A. Syafi’i, A. Munir, S. Anam, and S. Suhartono, “Unleashing the power of supervisory feedback in academic writing: Strategies for timely undergraduate thesis completion,” Teflin J., vol. 35, no. 2, pp. 330–351, 2024.
[7] S. Chen, L. Zhang, and M. Li, “Doctoral students’ self-regulated learning: the roles of academic buoyancy and perceived autonomy support,” Educ. Psychol., vol. 45, no. 2, pp. 148–167, 2025.
[8] H. R. Adli, M. Munir, and R. Megasari, “Implementation of Inverse Document Frequency (TF-IDF) and Cosine Similarity Terms in Determining Research Reviewers for Indonesian Education University Lecturers,” J. Comput. Soc., vol. 4, no. 2, pp. 73–82.
[9] M. Rashmi, Introduction to Information Retrieval Systems, vol. 3, no. 4. Cambridge university press, 2015. doi: 10.17762/ijritcc2321-8169.150462.
[10] S. Qaiser and R. Ali, “Text mining: use of TF-IDF to examine the relevance of words to documents,” Int. J. Comput. Appl., vol. 181, no. 1, pp. 25–29, 2018.
[11] K. Popper, “The Logic of Scientific Discovery,” 2012.
[12] F. Ricci, L. Rokach, and B. Shapira, “Recommender systems: Techniques, applications, and challenges,” Recomm. Syst. Handb., pp. 1–35, 2021.
[13] W. Pedrycz, “The benefits and drawbacks of data mining technologies,” Wiley Interdiscip. Rev. Data Min. Knowl. Discov., vol. 10, no. 1, 2020, doi: 10.1002/widm.1344.
[14] N. Azizah and A. F. Rozi, “Sistem Rekomendasi Produk Somethinc Menggunakan Metode Content-based Filtering,” J. Teknol. Dan Sist. Inf. Bisnis, vol. 6, no. 3, pp. 461–468, 2024, doi: 10.47233/jteksis.v6i3.1411.
[15] Dino Akbar Pratondo, “Pengembangan Sistem Rekomendasi Berbasis Content-Based Filtering Pada data Dinamis,” Universitas Islam Negeri SYARIF HIDAYATULLAH JAKARTA. Fakultas Sains dan Teknologi UIN Syarif Hidayatullah Jakarta, pp. 1–89, 2023. [Online]. Available: https://repository.uinjkt.ac.id/dspace/bitstream/123456789/66813/1/DINO AKBAR PRATONDO-FST.pdf
[16] R. Insan Pratama Siagian, N. Khoiriah, S. Audy Priscilia, M. Raffi Akbar Tanjung, and A. Perdana, “Penerapan Machine Learning Untuk Rekomendasi Film Berdasarkan Preferensi Pengguna,” JATI (Jurnal Mhs. Tek. Inform., vol. 9, no. 4, pp. 5658–5662, 2025, doi: 10.36040/jati.v9i4.13884.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Heki Aprianto (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.






