TICKER SYMBOL IDENTIFICATION WITH CIMA ON NON-STATIONARY STOCK PRICE DATASET
Abstract
Ticker symbol identification based on stock price data in investor decisions has been proven to be pivotal. Though research exists on stock price forecasting, ticker symbol identification is still a research opportunity. Meanwhile, some temporal-sequential classification methods are available, such as classification-integrated moving average (CIMA) and recurrent neural network (RNN)-based deep learning such as long short-term memory (LSTM), and gated recurrent unit (GRU). Our research aim is to prove that CIMA can perform ticker symbol identification on non-stationary stock price datasets. This research collects ten most well-known stock price dataset from Kaggle and performs pre-processing. Then it designs CIMA with non-stationary data and the benchmark deep learning methods. Both methods are optimized with hyperparameter tuning and model selection between adaptive boosting (AdaBoost) and legacy k-nearest neighbors (KNN). The test results show five non-stationary features in the stock price dataset must go through a differentiation process. Then, AdaBoost has an accuracy of 0.9967 ± 0.001, while KNN has an accuracy of 0.9971 ± 0.001, with no significant difference based on t-test. Meanwhile, AdaBoost has a significantly smaller model size and testing and prediction time than KNN. In benchmarking, CIMA+AdaBoost is superior to the three other methods for accuracy, precision, recall, and f1-score, all of which have a value of 0.996. Our research contribution is ticker symbol identification based on stock price using CIMA on multiple-class sequential classification with non-stationary data. For future research, we advice to perform this method on other stock price data.
Downloads
References
A. Singh and V. Devaser, “Analysis For Stock Market Investment Decision Using ML Models,” in 2023 3rd International Conference on Innovative Practices in Technology and Management (ICIPTM), IEEE, 2023, pp. 1–6. Accessed: May 21, 2024, doi: 10.1109/ICIPTM57143.2023.10118262.
X. R. Chen, “Stock price prediction using machine learning strategies,” in 2022 International Conference on Company Management, Accounting and Marketing, pp. 68–73, 2022.
A. Shah, M. Doshi, M. Parekh, N. Deliwala, and P. P. M. Chawan, “Identifying Trades Using Technical Analysis and ML/DL Models,” Apr. 12, 2023. doi: 10.15680/IJIRCCE.2023.1104038.
A. Kumar, S. Hooda, R. Gill, D. Ahlawat, D. Srivastva, and R. Kumar, “Stock Price Prediction Using Machine Learning,” in 2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES), Greater Noida, India: IEEE, Apr. 2023, pp. 926–932. doi: 10.1109/CISES58720.2023.10183515.
J. W. Luo, “A Study on Stock Graph Recognition Based on Wavelet Denoising and DTW Algorithm,” Math. Probl. Eng., vol. 2021, pp. 1–15, Jan. 2021, doi: 10.1155/2021/6641749.
V. Khandelwal, H. Varshney, and G. Munjal, “Sentiment Analysis based Stock Price Prediction using Machine Learning,” in 2024 2nd International Conference on Advancement in Computation & Computer Technologies (InCACCT), IEEE, 2024, pp. 182–187. Accessed: Jul. 22, 2024, doi: 10.1109/InCACCT61598.2024.10551038.
L. Wood, K. Anderson, P. Gerstoft, R. Bell, R. Subbaraman, and D. Bharadia, “Deep Learning Object Detection Approaches to Signal Identification,” Nov. 01, 2022, arXiv: arXiv:2210.16173. doi: 10.48550/arXiv.2210.16173.
V. Clerico, J. González-López, G. Agam, and J. Grajal, “LSTM Framework for Classification of Radar and Communications Signals,” in 2023 IEEE Radar Conference (RadarConf23), IEEE, 2023, pp. 1–6. Accessed: Dec. 26, 2023, doi: 10.1109/RadarConf2351548.2023.10149618.
A. G. Putrada, M. Abdurohman, D. Perdana, and H. H. Nuha, “Synthetic Data with Nested Markov Chain for CIMA-Based Smart Lighting Control Deployment Simulation,” in 2023 11th International Conference on Information and Communication Technology (ICoICT), IEEE, 2023, pp. 148–153. Accessed: Dec. 06, 2023, doi: 10.1109/ICoICT58202.2023.10262430.
D. Batabyal, D. Bandopadhyay, B. Sadhukhan, N. Das, and S. Mukherjee, “Exploring stationarity and fractality in stock market time-series,” in 2023 international conference on intelligent systems, advanced computing and communication (ISACC), IEEE, 2023, pp. 1–6. Accessed: Jul. 22, 2024, doi: 10.1109/ISACC56298.2023.10084056.
S. F. Pane, A. G. Putrada, N. Alamsyah, and M. N. Fauzan, “The Influence of The COVID-19 Pandemics in Indonesia On Predicting Economic Sectors,” in 2022 Seventh International Conference on Informatics and Computing (ICIC), IEEE, 2022, pp. 1–6. Accessed: Dec. 26, 2023, doi: 10.1109/ICIC56845.2022.10006897.
W. Khan, M. A. Ghazanfar, M. A. Azam, A. Karami, K. H. Alyoubi, and A. S. Alfakeeh, “Stock market prediction using machine learning classifiers and social media, news,” J. Ambient Intell. Humaniz. Comput., pp. 1–24, 2020.
X. Xu, “Research on the Stock Price Forecasting of Netflix Based on Linear Regression, Decision Tree, and Gradient Boosting Models,” in 2022 2nd International Conference on Business Administration and Data Science (BADS 2022), Atlantis Press, pp. 1232–1242, 2022, doi: 10.2991/978-94-6463-102-9_127.
J. M. Omoware, O. J. Abiodun, and A. I. Wreford, “Predicting Stock Series of Amazon and Google using Long Short-Term Memory (LSTM),” Asian Res. J. Curr. Sci., vol. 5, no.1, pp. 205–217, 2023.
I. Chatterjee, J. Gwan, Y. J. Kim, M. S. Lee, and M. Cho, “An NLP and LSTM based stock prediction and recommender system for KOSDAQ and KOSPI,” in Intelligent Human Computer Interaction: 12th International Conference, IHCI 2020, Daegu, South Korea, November 24–26, 2020, Proceedings, Part I 12, Springer, 2021, pp. 403–413. doi: 10.1007/978-3-030-68449-5_40.
N. D. Thuan, N. M. Nhut, N. T. V. Huong, and D. V. P. Uyen, “Forecasting the Opening and Closing Price Trends of Stock Using Hybrid Models and Artificial Intelligence Algorithm,” in Future Data and Security Engineering. Big Data, Security and Privacy, Smart City and Industry 4.0 Applications, vol. 1688, T. K. Dang, J. Küng, and T. M. Chung, Eds., in Communications in Computer and Information Science, vol. 1688. , Singapore: Springer Nature Singapore, pp. 532–546, 2022, doi: 10.1007/978-981-19-8069-5_36.
R. M. Nabi, S. A. Saeed, and A. M. Abdi, “Feature Engineering for Stock Price Prediction,” Int J Adv Sci Technol, vol. 29, no. 12s, pp. 2486–2496, 2020.
H. Wang, “Stock Price Prediction Based on Machine Learning and Deep Learning Methods,” in Proceedings of the 2nd International Academic Conference on Blockchain, Information Technology and Smart Finance (ICBIS 2023), Atlantis Press, pp. 1087–1098. 2023, doi: 10.2991/978-94-6463-198-2_112.
D. Ress, “Stock Market Expansion and the Ticker’s Siren Song,” in The Kansas Blue Sky Act of 1911, Cham: Springer Nature Switzerland, 2023, pp. 125–137. doi: 10.1007/978-3-031-43831-8_9.
P. Goyal and M. Goyal, “Impact of corporate name change on stock prices: evidence from BSE,” Gyan Manag. J., vol. 17, no. 1, pp. 85–94, 2023.
V. K. Mishra, R. Binyala, P. Sharma, and S. Singh, “Predictive Analysis of Stock Prices Through Scikit‐Learn: Machine Learning in Python,” in Mathematics and Computer Science Volume 1, 1st ed., S. Ghosh, M. Niranjanamurthy, K. Deyasi, B. B. Mallik, and S. Das, Eds., Wiley, pp. 397–404, 2023, doi: 10.1002/9781119879831.ch20.
R. Rodríguez-Aguilar and J.-A. Marmolejo-Saucedo, “Identification of Trading Strategies Using Markov Chains and Statistical Learning Tools,” in Trends in Data Engineering Methods for Intelligent Systems, vol. 76, J. Hemanth, T. Yigit, B. Patrut, and A. Angelopoulou, Eds., in Lecture Notes on Data Engineering and Communications Technologies, vol. 76. , Cham: Springer International Publishing, pp. 732–741, 2021, doi: 10.1007/978-3-030-79357-9_67.
J. L. Buchanan and M. D. Piercey, “How simple changes to language and tick marks can curtail the ghost ticking of audit procedures,” Audit. J. Pract. Theory, vol. 40, no. 4, pp. 53–77, 2021.
M. R. Roostaee and A. A. Abin, “Forecasting financial signal for automated trading: an interpretable approach,” Expert Syst. Appl., vol. 211, p. 118570, 2023.
L.-J. Liu, W.-K. Shen, and J.-M. Zhu, “Research on risk identification system based on random forest algorithm-high-order moment model,” Complexity, vol. 2021, pp. 1–10, 2021.
J. Shang, S. Cui, C. Ke, Z. Xia, S. Fu, and D. Liu, “Symbol rate identification using asynchronous delayed sampling,” in CLEO: QELS_Fundamental Science, Optica Publishing Group, 2015, pp. JTh2A-55. Accessed: May 21, 2024, doi: 10.1364/CLEO_AT.2015.JTh2A.55.
N. Sinha, R. K. Tripathy, and A. Das, “ECG beat classification based on discriminative multilevel feature analysis and deep learning approach,” Biomed. Signal Process. Control, vol. 78, p. 103943, 2022.
A. G. Putrada and N. G. Ramadhan, “MDIASE-Autoencoder: A Novel Anomaly Detection Method for Increasing The Performance of Credit Card Fraud Detection Models,” in 2023 29th International Conference on Telecommunications (ICT), IEEE, pp. 1–6, 2023, doi: 10.1109/ICT60153.2023.10374051.
R. Baidya and S.-W. Lee, “Addressing the Non-Stationarity and Complexity of Time Series Data for Long-Term Forecasts,” Appl. Sci., vol. 14, no. 11, p. 4436, 2024.
C. Fowler, X. Cai, J. T. Baker, J.-P. Onnela, and L. Valeri, “Testing unit root non-stationarity in the presence of missing data in univariate time series of mobile health studies,” J. R. Stat. Soc. Ser. C Appl. Stat., vol. 73, no. 3, pp. 755–773, 2024.
A. G. Putrada, N. Alamsyah, S. F. Pane, and M. N. Fauzan, “Feature Importance on Text Analysis for a Novel Indonesian Movie Recommender System,” in 2023 11th International Conference on Information and Communication Technology (ICoICT), IEEE, pp. 34–39, 2023, doi: 10.1109/ICoICT58202.2023.10262504
A. G. Putrada, M. Abdurohman, D. Perdana, and H. H. Nuha, “EdgeSL: Edge-Computing Architecture on Smart Lighting Control With Distilled KNN for Optimum Processing Time,” IEEE Access, vol. 11, pp. 64697–64712, 2023, doi: https://doi.org/10.1109/ACCESS.2023.3288425.
I. D. Oktaviani and A. G. Putrada, “KNN imputation to missing values of regression-based rain duration prediction on BMKG data,” J. Infotel, vol. 14, no. 4, pp. 249–254, 2022.
A. G. Putrada, M. Abdurohman, D. Perdana, and H. H. Nuha, “Q8KNN: A Novel 8-Bit KNN Quantization Method for Edge Computing in Smart Lighting Systems with NodeMCU,” in Intelligent Systems and Applications, vol. 824, K. Arai, Ed., in Lecture Notes in Networks and Systems, vol. 824. , Cham: Springer Nature Switzerland, 2024, pp. 598–615. doi: 10.1007/978-3-031-47715-7_41.
R. I. Fakhruddin, M. Abdurohman, and A. G. Pu-trada, “Improving PIR Sensor Network-Based Activity Recognition with PCA and KNN,” in 2021 International Conference on Intelligent Cybernetics Technology & Applications (ICICyTA), pp. 138–143, 2021
M. N. Fauzan, A. G. Putrada, N. Alamsyah, and S. F. Pane, “PCA-AdaBoost Method for a Low Bias and Low Dimension Toxic Comment Classification.,” in 2022 International Conference on Advanced Creative Networks and Intelligent Systems (ICACNIS), IEEE, pp. 1–6, 2022, doi: https://doi.org/10.1109/ICACNIS57039.2022.10055017.
A. N. Iman, A. G. Putrada, S. Prabowo, and D. Perdana, “Peningkatan Kinerja AMG8833 sebagai Thermocam dengan Metode Regresi AdaBoost untuk Pelaksanaan Protokol COVID-19,” J. Elektro Dan Telekomun. Terap. E-J., vol. 8, no. 1, pp. 978–985, 2021, doi: https://doi.org/10.25124/jett.v8i1.3894.
B. Pavithra, M. Niranjanamurthy, G. V. Smitha, R. Kiran, and M. Chandrika, “Transactional and Sequential model for pre-processing the textual data,” in 2023 4th International conference on intelligent engineering and management (ICIEM), IEEE, pp. 1-6, 2023, doi: 10.1109/ICIEM59379.2023.10167357
M. Lupión, J. Medina-Quero, J. F. Sanjuan, and P. M. Ortigosa, “DOLARS, a Distributed On-Line Activity Recognition System by Means of Heterogeneous Sensors in Real-Life Deployments—A Case Study in the Smart Lab of The University of Almería,” Sensors, vol. 21, no. 2, p. 405, Jan. 2021, doi: 10.3390/s21020405.
L. Zhao, Y. Zuo, and K. Yada, “Sequential classification of customer behavior based on sequence-to-sequence learning with gated-attention neural networks,” Adv. Data Anal. Classif., vol. 17, no. 3, pp. 549–581, 2023.
A. G. Putrada, M. Abdurohman, D. Perdana, and H. H. Nuha, “Recurrent Neural Network Architectures Comparison in Time-Series Binary Classification on IoT-Based Smart Lighting Control,” in 2022 10th International Conference on Information and Communication Technology (ICoICT), IEEE, pp. 391–396, 2022.
C. B. Vennerød, A. Kjærran, and E. S. Bugge, “Long Short-term Memory RNN,” May 14, 2021, arXiv: arXiv:2105.06756. Accessed: Jan. 03, 2024. [Online]. Available: http://arxiv.org/abs/2105.06756
A. G. Putrada, N. Alamsyah, S. F. Pane, and M. N. Fauzan, “GRU-MF: A Novel Appliance Classification Method for Non-Intrusive Load Monitoring Data,” in 2022 IEEE International Conference on Communication, Networks and Satellite (COMNETSAT), Solo, Indonesia: IEEE, Nov. 2022, pp. 200–205. doi: 10.1109/COMNETSAT56033.2022.9994409.
M. Varlı and H. Yılmaz, “Multiple classification of EEG signals and epileptic seizure diagnosis with combined deep learning,” J. Comput. Sci., vol. 67, p. 101943, 2023.
P. Privietha and V. J. Raj, “Hybrid Activation Function in Deep Learning for Gait Analysis,” in 2022 International Virtual Conference on Power Engineering Computing and Control: Developments in Electric Vehicles and Energy Sector for Sustainable Future (PECCON), Chennai, India: IEEE, May 2022, pp. 1–7. doi: 10.1109/PECCON55017.2022.9851128.
A. A. Bataineh and D. Kaur, “Immunocomputing-Based Approach for Optimizing the Topologies of LSTM Networks,” IEEE Access, vol. 9, pp. 78993–79004, 2021, doi: 10.1109/ACCESS.2021.3084131.
M. Sharaf, E. E.-D. Hemdan, A. El-Sayed, and N. A. El-Bahnasawy, “StockPred: a framework for stock Price prediction,” Multimed. Tools Appl., vol. 80, pp. 17923–17954, 2021.
M. Diqi, M. E. Hiswati, and A. S. Nur, “StockGAN: robust stock price prediction using GAN algorithm,” Int. J. Inf. Technol., vol. 14, no. 5, pp. 2309–2315, 2022, doi: 10.1007/s41870-022-00929-6.
A. Mahadik, D. Vaghela, and A. Mhaisgawali, “Stock price prediction using lstm and arima,” in 2021 Second International Conference on Electronics and Sustainable Communication Systems (ICESC), IEEE, pp. 1594–1601, 2021, doi: 10.1109/ICESC51422.2021.9532655
M. R. Hossain, M. T. Ismail, and S. A. B. A. Karim, “Improving stock price prediction using combining forecasts methods,” IEEE Access, vol. 9, pp. 132319–132328, 2021, doi: 10.1109/ACCESS.2021.3114809.
Y. H. Prassetio, D. Setiawan, and M. Mutmainah, “Perubahan Nama Perusahaan Terhadap Tingkat Likuiditas dan Abnormal Return Saham,” J. Akunt. Dan Bisnis, vol. 22, no. 1, pp. 135–153, 2022, doi: 10.20961/jab.v22i1.730.
M. R. Nazar, “Investor Sentiment Stock Price on Indonesia Stock Exchange,” Int. J. Entrep. Bus. Manag., vol. 1, no. 2, pp. 139–145, 2022, doi: 10.54099/ijebm.v1i2.376.
Y. Long, W. Su, and Y. Tan, “Does a Share Name Change Have an Impact on the Pricing Efficiency of the Share?,” SAGE Open, vol. 13, no. 4, p. 21582440231219071, Oct. 2023, doi: 10.1177/21582440231219071.
X. Teng, T. Wang, X. Zhang, L. Lan, and Z. Luo, “Enhancing stock price trend prediction via a time-sensitive data augmentation method,” Complexity, vol. 2020, pp. 1–8, 2020, doi: 10.1155/2020/6737951.
G. Li, G. Tang, and L. Liu, “Ticker symbol and comovement: Evidence from a unique dataset in China”, Accessed: Jan. 07, 2024. [Online]. Available: https://www.researchgate.net/profile/Guangzi-Li/publication/264881512_Ticker_symbol_and_comovement_Evidence_from_a_unique_dataset_in_China/links/57dfe31d08ae484409238fb1/Ticker-symbol-and-comovement-Evidence-from-a-unique-dataset-in-China.pdf
Y. Li, “Study on the relationship between Word Preference and Stock Returns under the background of intelligent information,” in E3S Web of Conferences, EDP Sciences, p. 01048, 2021, doi: 10.1051/e3sconf/202127501048
A. G. Putrada, M. Abdurohman, D. Perdana, and H. H. Nuha, “CIMA: A Novel Classification-Integrated Moving Average Model for Smart Lighting Intelligent Control Based on Human Presence,” Complexity, vol. 2022, no. Article ID 4989344, p. 19, 2022, doi: 10.1155/2022/4989344.
Copyright (c) 2024 Aji Gautama Putrada, Maman Abdurohman, Doan Perdana, Hilal Hudan Nuha
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.