ANALISIS SENTIMEN PADA REVIEW RESTORAN DENGAN TEKS BAHASA INDONESIA MENGUNAKAN ALGORITMA NAIVE BAYES

  • Dinda Ayu Muthia Manajemen Informatika AMIK BSI Bekasi
Keywords: Sentiment Analysis, Review, Restaurant, Naïve Bayes, Text Classification.

Abstract

In the era of the web as it is now, some information is now flowing through the network. Because of the variety of web content includes subjective opinion and objective information, it is now common for people to gather information about products and services they want to buy. However, because there is quite a lot of information in text form without any numerical scale, it is difficult to classify the evaluation of information efficiently without reading the complete text. Sentiment analysis aims to address this problem by automatically classifying user review be a positive or negative opinion. Naïve Bayes classifier is a popular machine learning techniques for text classification because it is very simple, efficient and performs well in many domains. However, Naïve Bayes has the disadvantage that is very sensitive to feature too much, resulting in a classification accuracy becomes low. Therefore, this study used the method of selecting features, namely Genetic algorithm in order to improve the accuracy of Naïve Bayes classifier. This research resulted in the classification of the text in the form of a positive or negative review of the restaurant. Measurement is based on the accuracy of Naive Bayes before and after the addition of feature selection methods. The evaluation was done using a 10 fold cross-validation. While the measurement accuracy is measured by confusion matrix and ROC curves. The results showed an increase in the accuracy of Naïve Bayes from 86.50% to 90.50%.

Downloads

Download data is not yet available.

References

Chen, J., Huang, H., Tian, S., & Qu, Y. (2009). Feature selection for text classification with Naïve Bayes. Expert Systems with Applications, 36(3), 5432–5435.

Feldman, R. (2013). Techniques and applications for sentiment analysis. Communications of the ACM, 56(4), 82.

Gorunescu, F. (2011). Data Mining Concept Model Technique.

Gunal, S. (2012). Hybrid feature selection for text classification ¨, 20.

Haddi, E., Liu, X., & Shi, Y. (2013). The Role of Text Pre-processing in Sentiment Analysis. Procedia Computer Science, 17, 26–32.

Han, J., & Kamber, M. (2007). Data Mining Concepts and Techniques.

Lee, M. (2010). M ULTICLASS S ENTIMENT A NALYSIS WITH RESTAURANT REVIEWS.

Markov, Z., & Daniel, T. (2007). Uncovering Patterns in.

Santoso, Budi. 2007. Data Mining Teknik Pemanfaatan Data Untuk Keperluan Bisnis. Yogyakarta: Graha Ilmu.

Uysal, A. K., & Gunal, S. (2012). A novel probabilistic feature selection method for text classification. Knowledge-Based Systems, 36, 226–235.

V, S. R. R., Somayajulu, D. V. L. N., & Dani, A. R. (2010). Classification of Movie Reviews Using Complemented Naive Bayesian Classifier, 1(4), 162–167.

Ye, Q., Zhang, Z., & Law, R. (2009). Expert Systems with Applications Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Systems With Applications, 36(3), 6527–6535.
Published
2017-02-15
How to Cite
[1]
D. Muthia, “ANALISIS SENTIMEN PADA REVIEW RESTORAN DENGAN TEKS BAHASA INDONESIA MENGUNAKAN ALGORITMA NAIVE BAYES”, jitk, vol. 2, no. 2, pp. 39-45, Feb. 2017.