UTILIZING TEXT MINING FOR ENCRYPTION ALGORITHM RECOMMENDATION USING CONTENT-BASED FILTERING
DOI:
https://doi.org/10.33480/jitk.v11i4.7663Keywords:
Content-Based Filtering, Cosine Similarity, Data Security, Encryption Algorithm, Text MiningAbstract
The selection of an appropriate encryption algorithm is crucial in ensuring data security, as each algorithm has distinct advantages and disadvantages in terms of speed, efficiency, and security level. Many users struggle to determine the most suitable algorithm due to limited technical knowledge and the vast amount of literature that must be reviewed. Therefore, this study proposes a recommendation system based on Content-Based Filtering (CBF) integrated with text mining to facilitate faster, more accurate, and data-driven algorithm selection. The objective of this research is to develop a recommendation system capable of analyzing the technical characteristics of encryption algorithms from scientific literature and providing relevant suggestions according to user needs. The methodology includes collecting 300 articles from the Garuda Kemdikbud portal using web scraping, performing data preprocessing such as tokenization, stop word removal, and case folding, representing text with TF-IDF, and calculating similarity using Cosine Similarity. The results indicate that the most frequently discussed algorithms are RSA (52 articles), AES (40 articles), and RC4 (25 articles), reflecting research trends focusing on modern public-key and symmetric cryptography. The evaluation results show that the system achieved Precision@3 of 1.0000 and Average Precision (AP) of 0.0583, indicating that the top recommendations generated are highly relevant to user needs. The developed system successfully generated recommendations tailored to specific needs, such as suggesting AES as the primary choice for “fast encryption of sensitive data.” This study demonstrates that combining text mining and CBF is effective in assisting the selection of encryption algorithms through literature-based analysis.
Downloads
References
[1] M. Qamal, F. Fajriana, and M. Mardhatillah, “Metode Naive Bayes untuk Menentukan Rekomendsi Tempat Wisata Terbaik di Aceh,” TECHSI - J. Tek. Inform., vol. 13, no. 1, pp. 81–91, Apr. 2021, doi: https://doi.org/10.29103/techsi.v13i1.3132.
[2] X. Wei and S. Sun, “Personalized online learning resource recommendation based on artificial intelligence and educational psychology,” Frontiers in Psychology, vol. 12, p. 767837, 2021, doi: 10.3389/fpsyg.2021.767837
[3] S. Sharma, V. Rana, and M. Malhotra, “Automatic recommendation system based on hybrid filtering algorithm,” Educ. Inf. Technol., vol. 27, no. 2, pp. 1523–1538, 2022, doi: httpS://doi.org/10.1007/s10639-021-10643-8.
[4] M. Zahrawi and A. Mohammad, “Implementing Recommender Systems using Machine Learning and Knowledge Discovery Tools,” Knowledge-Based Engineering and Sciences, vol. 2, no. 2, pp. 44–53, 2021, doi: 10.51526/kbes.2021.2.2.44-53
[5] U. Javed, K. Shaukat, I. A. Hameed, F. Iqbal, T. M. Alam, and S. Luo, “A Review of Content-Based and Context-Based Recommendation Systems,” International Journal of Emerging Technologies in Learning (iJET), vol. 16, no. 3, pp. 274–306, 2021, doi: 10.3991/ijet.v16i03.18851
[6] R. Madhumala, B. Vineetha, M. R. Shree, R. Sanjesh, and B. Charanraj, “A semantic driven model for extraction of text using TF ‑ IDF , SFLA and XGBoost,” Int. J. Inf. Technol., vol. 17, no. 6, pp. 3659–3664, 2025, doi: https://doi.org/10.1007/s41870-025-02549-2.
[7] N.-T.-K. Ho, H. Ho-Dac, and T.-A. Le, “Job Recommendation: An Approach to Match Job-Seeker’s Interest with Enterprise’s Requirement,” in Research in Intelligent and Computing in Engineering, 2021, pp. 361–367. doi: https://doi.org/10.1007/978-981-15-7527-3_35.
[8] Y. Afrilia, L. Rosnita, D. Siska, M. Rigayatsyah, and N. Nurqamarina, “Analisis Sentimen Ciutan Twitter Terkait Penerapan Permendikbudristek Nomor 30 Tahun 2021 Menggunakan TextBlob dan Support Vector Machine,” G-Tech J. Teknol. Terap., vol. 6, no. 2, pp. 387–394, 2022, doi: https://doi.org/10.33379/gtech.v6i2.1778.
[9] T. Ridwansyah, B. Subartini, and S. Sylviani, “Penerapan Metode Content-Based Filtering pada Sistem Rekomendasi,” Math. Sci. Appl. J., vol. 4, no. 2, pp. 70–77, 2024, doi: https://doi.org/10.22437/msa.v4i2.32136.
[10] M. Ikhsan and R. Kurniawan, “Penerapan Text Mining pada Sistem Rekomendasi Pembimbing Skripsi Mahasiswa Menggunakan Algoritma Naive Bayes Classifier,” Indones. J. Comput. Sci., vol. 12, no. 1, pp. 4196–4207, 2023, doi: https://doi.org/10.33022/ijcs.v12i6.3500.
[11] W. Lemus Leiva, M.-L. Li, and C.-Y. Tsai, “A Two-Phase Deep Learning-Based Recommender System: Enhanced by a Data Quality Inspector,” Applied Sciences, vol. 11, no. 20, p. 9667, 2021, doi: 10.3390/app11209667
[12] I. M. A. Bhaskara, M. Pasek, M. P. A. Ariawan, I. B. A. Peling, and I. P. A. Prayudha, “Studi Literatur : Analisa Perbandingan Teori Tentang Tingkat Keamanan Antar Algoritma Simetris,” J. Bangkit Indones., vol. 13, no. 01, pp. 40–45, 2024, doi: https://doi.org/10.52771/bangkitindonesia.v13i1.278.
[13] M. A. Khder, “Web Scraping or Web Crawling: State of Art, Techniques, Approaches and Application,” Int. J. Adv. Soft Comput. its Appl., vol. 13, no. 3, pp. 145–68, 2021, doi: https://doi.org/10.15849/IJASCA.211128.11.
[14] V. Krotov and M. Tennyson, “Web Scraping in the R Language: A Tutorial,” Journal of the Midwest Association for Information Systems (JMWAIS), vol. 2021, no. 1, Art. no. 5, 2021, doi: 10.17705/3jmwa.000066
[15] A. Weichselbraun, “Inscriptis - A Python-based HTML to text conversion library optimized for knowledge extraction from the Web,” Journal of Open Source Software, vol. 6, no. 66, p. 3557, 2021, doi: 10.21105/joss.03557
[16] M. U. Albab, Y. Karuniyawati, and M. N. Fawaiq, “Optimization of the Stemming Technique on Text preprocessing President 3 Periods Topic,” J. Transform., vol. 20, no. 2, pp. 1–10, 2023, doi: https://doi.org/10.26623/transformatika.v20i2.5374.
[17] H. Yan, Y. Sun, X. Li, and X. Qiu, “An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition,” in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Jul. 2023, pp. 1442–1452. doi: https://doi.org/10.18653/v1/2023.acl-short.123.
[18] Z. Abidin, A. Junaidi, and W. Wamiliana, “Text Stemming and Lemmatization of Regional Languages in Indonesia : A Systematic Literature Review,” J. Inf. Syst. Eng. Bus. Intell., vol. 10, no. 2, pp. 217–231, 2024, doi: https://doi.org/10.20473/jisebi.10.2.217-231.
[19] N. Nilawati, H. Husaini, and J. Salat, “Penggunaan Metode Cosine Similarity dan TF-IDF untuk Klasifikasi Judul Seminar Proposal pada Fakultas Teknik Universitas Jabal Ghafur,” Sagita Acad. J., vol. 2, no. 1, pp. 72–79, 2024, doi: https://doi.org/10.61579/sagita.v2i1.60 p-ISSN:
[20] S. Jain, S. K. Jain, and S. Vasal, “An Effective TF-IDF Model to Improve the Text Classification Performance,” in 2024 IEEE 13th International Conference on Communication Systems and Network Technologies (CSNT), 2024, pp. 1–4. doi: https://doi.org/10.1109/CSNT60213.2024.10545818.
[21] R. Oktavian and F. Amin, “Perancangan Sistem Rekomendasi Laptop dengan Model Prototyping dan Penerapan Content-Based Filtering Approach pada ELS Computer Shop Semarang,” G-Tech J. Teknol. Terap., vol. 8, no. 1, pp. 66–80, 2024, doi: https://doi.org/10.33379/gtech.v8i1.3490.
[22] H. Singhdev, S. Gupta, V. Srivastava, and A. Saxena, “Text recognition using improved dual attention based on textual double embedding network with aquila optimization algorithm,” Int. J. Inf. Technol., 2024, doi: https://doi.org/10.1007/s41870-024-01984-x.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Mukti Qamal, Muhammad Iqbal, Yesy afrillia

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.






-a.jpg)
-b.jpg)











