EVALUATING REGRESSION AND NEURAL NETWORKS FOR FIVE TRAIT TEXT-BASED PERSONALITY PREDICTION
DOI:
https://doi.org/10.33480/jitk.v11i1.6980Keywords:
neural networks, personality trait prediction , random forest , ridge regressionAbstract
The aim of this study is to evaluate the effectiveness of several predictive modeling techniques in mapping the five major personality traits (extraversion, neuroticism, agreeableness, conscientiousness, and openness) from text-based data. The dataset consists of text-based features extracted from publicly available social media posts, providing a realistic basis for personality prediction. Performance was measured using mean absolute error (MAE), mean squared error (MSE), and R² score to evaluate prediction accuracy and generalization quality, along with training time for computational efficiency. The research compares linear regression, ridge regression, random forest, and neural networks implemented in PyTorch. Results indicate that ridge regression and random forest outperform linear regression and neural networks in error metrics and explained variance, with ridge regression offering a favorable balance between accuracy and training time. Random forest achieves slightly better accuracy but with significantly longer training duration, reducing its practicality for real-time use. Despite theoretical advantages in modeling non-linear relationships, neural networks showed suboptimal results, likely due to limited hyperparameter tuning and dataset size. These findings highlight trade-offs among model complexity, accuracy, and efficiency, suggesting ridge regression as a pragmatic choice for current personality prediction from text while encouraging future research on advanced neural architectures and enhanced datasets
Downloads
References
G. Rajalakshmi, A. Kansal, S. Thomas, M. P. Mali, D. Anandhasilambarasan, and D. Sharma, “A Comprehensive Study on the Real-Time Text Categorization of User Generated Content using Artificial Intelligence,” in 2024 Global Conference on Communications and Information Technologies, GCCIT 2024, Institute of Electrical and Electronics Engineers Inc., 2024. doi: 10.1109/GCCIT63234.2024.10862668.
A. D. Hartanto, E. Utami, Kusrini, and A. Setyanto, “A Survey of Semantic Approaches in Personality Traits Profiling Analysis,” in 2024 International Conference on Smart Computing, IoT and Machine Learning, SIML 2024, Institute of Electrical and Electronics Engineers Inc., 2024, pp. 35–42. doi: 10.1109/SIML61815.2024.10578100.
M. Liang and T. Niu, “Research on Text Classification Techniques Based on Improved TF-IDF Algorithm and LSTM Inputs,” in Procedia Computer Science, Elsevier B.V., 2022, pp. 460–470. doi: 10.1016/j.procs.2022.10.064.
E. M. Dharma, F. Lumban Gaol, H. Leslie, H. S. Warnars, and B. Soewito, “THE ACCURACY COMPARISON AMONG WORD2VEC, GLOVE, AND FASTTEXT TOWARDS CONVOLUTION NEURAL NETWORK (CNN) TEXT CLASSIFICATION,” J Theor Appl Inf Technol, vol. 31, no. 2, 2022, [Online]. Available: www.jatit.org
G. Angelini, “Big five model personality traits and job burnout: a systematic literature review,” BMC Psychol, vol. 11, no. 1, Dec. 2023, doi: 10.1186/s40359-023-01056-y.
E. Utami, A. D. Hartanto, S. Adi, I. Oyong, and S. Raharjo, “Profiling analysis of DISC personality traits based on Twitter posts in Bahasa Indonesia,” Journal of King Saud University - Computer and Information Sciences, Oct. 2022, doi: 10.1016/j.jksuci.2019.10.008.
A. A. Azmee, F. Nweke, M. Pederson, M. A. A. H. Khan, and Y. Pei, “Human AI Collaboration Framework for Detecting Mental Illness Causes from Social Media,” Jun. 01, 2025, Elsevier B.V. doi: 10.1016/j.smhl.2025.100566.
Idris, E. Utami, A. D. Hartanto, and S. Raharjo, “Systematic Literature Review of Profiling Analysis Personality from Social Media,” in Journal of Physics: Conference Series, IOP Publishing Ltd, Mar. 2021. doi: 10.1088/1742-6596/1823/1/012115.
G. D. Salsabila and E. B. Setiawan, “Semantic Approach for Big Five Personality Prediction on Twitter,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 5, no. 4, pp. 680–687, Aug. 2021, doi: 10.29207/resti.v5i4.3197.
M. A. Parameswara, F. S. P. Mahendra, M. R. C. Mardika, I. Puspasari, and N. P. Utama, “Big Five Personality Prediction Based on Indonesian Tweets and Personality Test,” in 2023 International Conference on Electrical Engineering and Informatics (ICEEI), IEEE, Oct. 2023, pp. 1–6. doi: 10.1109/ICEEI59426.2023.10346812.
W. Maharani and V. Effendy, “Big five personality prediction based in Indonesian tweets using machine learning methods,” International Journal of Electrical and Computer Engineering, vol. 12, no. 2, pp. 1973–1981, Apr. 2022, doi: 10.11591/ijece.v12i2.pp1973-1981.
A. Perera and P. Fernando, “Cyberbullying Detection System on Social Media Using Supervised Machine Learning,” Procedia Comput Sci, vol. 239, pp. 506–516, Jan. 2024, doi: 10.1016/J.PROCS.2024.06.200.
Winarno, K. A. Azzahra, and S. W. Sihwi, “Dark Triad Personality Traits Prediction through Twitter Analysis using Support Vector Machine Method,” in APICS 2022 - 2022 1st International Conference on Smart Technology, Applied Informatics, and Engineering, Proceedings, Institute of Electrical and Electronics Engineers Inc., 2022, pp. 102–107. doi: 10.1109/APICS56469.2022.9918721.
N. A. Utami, W. Maharani, and I. Atastina, “Personality Classification of Facebook Users According to Big Five Personality Using SVM (Support Vector Machine) Method,” in Procedia Computer Science, Elsevier B.V., 2021, pp. 348–355. doi: 10.1016/j.procs.2021.12.023.
R. P. Pratama and W. Maharani, “Predicting Big Five Personality Traits Based on Twitter User U sing Random Forest Method*,” in 2021 International Conference on Data Science and Its Applications, ICoDSA 2021, Institute of Electrical and Electronics Engineers Inc., 2021, pp. 110–117. doi: 10.1109/ICoDSA53588.2021.9617501.
J.-T. Sohns, M. Schmitt, F. Jirasek, H. Hasse, and H. Leitte, “Attribute-based Explanation of Non-Linear Embeddings of High-Dimensional Data,” IEEE Trans Vis Comput Graph, vol. 28, no. 1, pp. 540–550, 2022, doi: 10.1109/TVCG.2021.3114870.
M. A. Kosan, H. Karacan, and B. A. Urgen, “Predicting personality traits with semantic structures and LSTM-based neural networks,” Alexandria Engineering Journal, vol. 61, no. 10, pp. 8007–8025, Oct. 2022, doi: 10.1016/j.aej.2022.01.050.
Z. Wang, T. Peng, and Z. Lu, “Comparative Research on Forest Fire Image Segmentation Algorithms Based on Fully Convolutional Neural Networks,” Forests, vol. 13, no. 7, Jul. 2022, doi: 10.3390/f13071133.
R. P. Putra and E. B. Setiawan, “RoBERTa as Semantic Approach for Big Five Personality Prediction using Artificial Neural Network on Twitter,” in ICACNIS 2022 - 2022 International Conference on Advanced Creative Networks and Intelligent Systems: Blockchain Technology, Intelligent Systems, and the Applications for Human Life, Proceeding, Institute of Electrical and Electronics Engineers Inc., 2022. doi: 10.1109/ICACNIS57039.2022.10054870.
X. Chen, J. He, E. Swanson, Z. Cai, and X. Fan, “Big Five Personality Traits and Second Language Learning: a Meta-analysis of 40 Years’ Research,” Educ Psychol Rev, vol. 34, no. 2, pp. 851–887, Jun. 2022, doi: 10.1007/s10648-021-09641-6.
A. Naz, H. U. Khan, A. Bukhari, B. Alshemaimri, A. Daud, and M. Ramzan, “Machine and deep learning for personality traits detection: a comprehensive survey and open research challenges,” Artif Intell Rev, vol. 58, no. 8, Aug. 2025, doi: 10.1007/s10462-025-11245-3.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Anggit Dwi Hartanto, Ema Utami, Arief Setyanto, Kusrini Kusrini

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.