IMPROVING SENTIMENT ANALYSIS OF WOMEN IN STEM DISCOURSE USING SMOTE-ENHANCED SVM–VADER
DOI:
https://doi.org/10.33480/jitk.v11i3.7353Keywords:
Sentiment Analysis, SVM Kernels, Vader Lexicon, Women in STEMAbstract
The participation of women in Science, Technology, Engineering, and Mathematics (STEM) remains shaped by complex social and structural factors. This study investigates public sentiment regarding the role of technology in supporting women’s participation in STEM through a machine learning–based sentiment analysis. Using 1,533 social media comments, sentiment classification was performed by integrating Support Vector Machine (SVM) and VADER-based automatic labeling, with imbalance handling to improve classification reliability. The results indicate a dominance of positive sentiment (98%), suggesting an optimistic tendency within the analyzed dataset, although this may be influenced by dataset characteristics and methodological bias. Among the evaluated models, a linear-kernel SVM achieved the highest accuracy (98.31%). This study contributes methodologically by demonstrating the effectiveness of integrating lexicon-based labeling with supervised learning for public sentiment analysis on gender equality in STEM, offering empirical insights to inform technology-driven policy interventions.
Downloads
References
[1] A. Suryaningsih And A. H. Sanjaya, “Pemberdayaan Perempuan Dalam Mewujudkan Kesetaraan Gender: Strategi Dan Tantangan Di Era Globalisasi,” Jurnal Pendidikan Sejarah Dan Riset Sosial Humaniora, Vol. 4, No. 2, Pp. 2621–119, 2024.
[2] C. Dwi Anggola, F. Prawita, And D. Putri Lestarika, “Peran Pendidikan Dalam Mengurangi Kesenjangan Gender Di Tempat Kerja,” Vol. 02, No. 1, Pp. 531–537, 2024, [Online]. Available: Https://Jurnal.Kopusindo.Com/Index.Php/Jkhkp
[3] Amelia, R. N., Mafikah, A. D., and Rif’ah, S., “Kesetaraan Gender dalam Manajemen Sumber Daya Insani: Tantangan dan Peluang,” EQUALITY: Journal of Gender, Child, and Humanity Studies, vol. 2, no. 1, pp. 30–40, 2024.
[4] F. Hotman, S. Damanik, O. Sukmana, And W. Winarjo, “Sosiologi Kritis Dan Transformasi Pendidikan: Menggugat Ketidaksetaraan Gender Di Indonesia,” 2025. [Online]. Available: Https://Jurnaldidaktika.Org2031
[5] East.Vc, “Hari Perempuan Sedunia: Menyoroti Kontribusi Perempuan Di Bidang Stem,” East.Vc. Accessed: Mar. 22, 2025. [Online]. Available: Https://East.Vc/Id/Berita/Insights-Id/Hari-Perempuan-Sedunia-Perempuan-Stem/
[6] Word Economic Forum, “Global Gender Gap Report 2023,” Jun. 2023. Accessed: Mar. 22, 2025. [Online]. Available: Https://Www.Weforum.Org/Publications/Global-Gender-Gap-Report-2023/
[7] L. Sonia And K. Sassi, “Menjelajahi Kesenjangan Gender Dalam Pendidikan: Studi Perbandingan Antara Swedia Dan Afghanistan,” Vol. 5, No. 4, Nov. 2024, [Online]. Available: Https://Ejurnals.Com/Ojs/Index.Php/
[8] A. Permata, “Analisis Sentimen Media Sosial: Mengurai Opini Publik Dengan Data,” Teknologipintar.Org, Vol. 4, No. 3, Pp. 2024–2025, 2024.
[9] D. Andini Putri And D. Ayu Muthia, “Implementasi Metode Lexicon Based Dan Support Vector Machine Pada Analisis Sentimen Ulasan Pengguna Chatgpt,” Ijcit (Indonesian Journal On Computer And Information Technology), Vol. 9, No. 2, Pp. 80–86, 2024.
[10] L. Geni, E. Yulianti, And D. I. Sensuse, “Sentiment Analysis Of Tweets Before The 2024 Elections In Indonesia Using Bert Language Models,” Jurnal Ilmiah Teknik Elektro Komputer Dan Informatika, Vol. 9, No. 3, Pp. 746–757, Aug. 2023, Doi: 10.26555/Jiteki.V9i3.26490.
[11] S. Mariam And I. Nurhaida, “Edumatic: Jurnal Pendidikan Informatika Analisis Sentimen Berbasis Deep Learning Terhadap Kesetaraan Gender Di Bidang Stem: Perspektif Dan Implikasinya,” Vol. 9, No. 1, Pp. 69–78, 2025, Doi: 10.29408/Edumatic.V9i1.29071.
[12] A. Saepudin Et Al., “Analisis Sentimen Pemanfaatan Artificial Intelligence Di Dunia Pendidikan Menggunakan Svm Berbasis Particle Swarm Optimization,” 2024. [Online]. Available: Http://Jurnal.Bsi.Ac.Id/Index.Php/Co-Science
[13] S. Ernawati And R. Wati, “Evaluasi Performa Kernel Svm Dalam Analisis Sentimen Review Aplikasi Chatgpt Menggunakan Hyperparameter Dan Vader Lexicon,” 2024.
[14] M. Ibnu Choldun Rachmatullah And S. Armiati, “Menerapkan Smote Pada Klasifikasi Data Penyakit Stroke,” Vol. 17, No. 1, 2025.
[15] F. S. Pratiwi, M. Agung Barata, And A. D. Ardianti, “Implementasi Metode Smote Dan Random Over-Sampling Pada Algoritma Machine Learning Untuk Prediksi Customer Churn Di Sektor Perbankan,” Jurnal Sistem Informasi Dan Informatika (Simika), Vol. 8, No. 1, 2025, [Online]. Available: Https://Www.Kaggle.Com/Datasets/Gauravtopre/Bank-Customer-Churn-Dataset/Data
[16] F. Dewi, N. Cahyo, H. Wibowo, M. R. Handayani, And K. Umam, “Evaluasi Hyperparamter Tuning Pada Support Vector Machine (Svm) Dalam Klasifikasi Ulasan Hotel Di Tripadvisor,” Vol. 10, No. 3, Pp. 2584–2593, 2025, Doi: 10.29100/Jipi.V10i3.7774.
[17] V. Renedominick And S. Barus, “Analisis Sentimen Pada Trailer Deadpool Vs Wolverine Menggunakan Model Machine Learning,” Jurnal Pustaka Ai (Pusat Akses Kajian Teknologi Artificial Intelligence), Vol. 5, No. 1, Pp. 01–06, Apr. 2025, Doi: 10.55382/Jurnalpustakaai.V5i1.892.
[18] Utari, E. L. and Wibowo, S. H., “Analisis Komparatif Algoritma SVM, Naive Bayes, dan LSTM pada Sentimen Komentar Lagu Labour,” Jurnal Informatika Teknologi dan Sains (Jinteks), vol. 7, no. 3, pp. 1276–1286, 2025.
[19] N. Fauziah, “Analisis Sentimen Publik Terhadap Kenaikan Tarif Ppn Di Indonesia Dengan Pendekatan Vader,” Jurnal Akuntansi Dan Keuangan, Vol. 12, No. 2, P. 228, Sep. 2024, Doi: 10.29103/Jak.V12i2.16796.
[20] D. Nasien Et Al., “Perbandingan Implementasi Machine Learning Menggunakan Metode Knn, Naive Bayes, Dan Logistik Regression Untuk Mengklasifikasi Penyakit Diabetes,” 2024.
[21] A. R. Hanum Et Al., “Analisis Kinerja Algoritma Klasifikasi Teks Bert Dalam Mendeteksi Berita Hoaks,” Vol. 11, No. 3, Pp. 537–546, 2024, Doi: 10.25126/Jtiik938093.
[22] Hizbul Izzi, Arief Setyanto, And Anggit Dwi Hartanto, “Optimalisasi Akurasi Algoritma Naïve Bayes Dengan Metode Syntetic Minority Oversampling Technique (Smote) Pada Data Numerik,” Infotek: Jurnal Informatika Dan Teknologi, Vol. 8, No. 1, Pp. 217–227, Jan. 2025, Doi: 10.29408/Jit.V8i1.28340.
[23] I. Maulana And S. Ernawati, “Meningkatkan Klasifikasi Penyakit Diabetes Menggunakan Metode Ensemble Softvoting Dengan Smote-Enn Dan Optimasi Bayesian,” Jurnal Sains Dan Manajemen, Vol. 13, No. 1, 2025.
[24] K. Tri Putra, S. Anggraini, L. Sutriani, A. Impron, And J. Informatika, “Analisis Sentimen Masyarakat Kalimantan Tengah Terhadap Perkebunan Kelapa Sawit Menggunakan Tf-Idf Dan Support Vector Machine,” 2025.
[25] E. Rifut Nur Mustaqim, U. Pagalay, And C. Crysdian, “Prediksi Tingkat Kepercayaan Masyarakat Terhadap Pilpres 2024 Menggunakan Tf-Idf Dan Bow Menggunakan Metode Svm.”
[26] T. Baskoro And S. R. Nuddin, “Analisa Kinerja Chatgpt Dalam Menghasilkan Teks Bahasa Indonesia Menggunakan Metode Support Vector Machines (Svm),” Journal Of Informatics And Computer Science, Vol. 06, 2024.
[27] M. A. R. N. M. Celine Mutiara Putri, “Perbandingan Evaluasi Kernel Support Vector Machine dalam Analisis Sentimen Chatbot AI pada Ulasan Google Play Store,” Jurnal Teknologi Sistem Informasi dan Aplikasi, vol. 7, Jul. 2024.
[28] A. W. Pradana and M. Hayaty, “The Effect of Stemming and Removal of Stopwords on the Accuracy of Sentiment Analysis on Indonesian-language Texts,” Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, pp. 375–380, Oct. 2019, doi: 10.22219/kinetik.v4i4.912.
[29] T. Hevianto Saputro and A. Hermawan, “The Accuracy Improvement of Text Mining Classification on Hospital Review through The Alteration in The Preprocessing Stage,” 2021. [Online]. Available: www.ijcit.com140
[30] M. A. Rosulan and R. Rosli, “Key Dimensions and Impact Factors on STEM Identity Among Female Students: A Systematic Literature Review”, doi: 10.47772/IJRISS.
[31] M. Stella, “Text-mining forma mentis networks reconstruct public perception of the STEM gender gap in social media,” Mar. 2020, doi: 10.7717/peerj-cs.295.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Dwi Andini Putri, Siti Nurwahyuni

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.






-a.jpg)
-b.jpg)











