Product Success Classification for E-Commerce Summer Sales: Evidence from the Wish Dataset
DOI:
https://doi.org/10.31181/msa31202650Keywords:
E-Commerce, Product Classification, Machine Learning Algorithms, Wish Platform, Random Forest, K-Nearest NeighborsAbstract
This study addresses the task of accurately classifying products on the Wish platform using machine learning (ML) algorithms. The Wish platform is an online e-commerce platform where sellers and buyers transact. This study aims to evaluate and compare the performance of different ML algorithms for product classification. The main methods employed in this study involved collecting a dataset from the Wish platform, consisting of various product attributes. Four ML algorithms, namely Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and K-Nearest Neighbors (KNN), were implemented and trained on the dataset. Performance metrics such as accuracy, precision, recall, F1-score, and area under the ROC curve (AUC) were used to evaluate the algorithms. The results of this study showed that RF achieved the highest accuracy (78.93%) and AUC score among the evaluated ML algorithms. LR and KNN also demonstrated competitive performance, while DT had relatively lower accuracy. Feature importance analysis revealed the most influential feature for all models, providing insights into the key factors contributing to product classification on the Wish platform. These findings have implications for decision-making on e-commerce platforms, enabling more accurate classification and targeted strategies for successful product promotion and inventory management.
Downloads
References
Stranieri, F., Fadda, E., & Stella, F. (2024). Combining deep reinforcement learning and multi-stage stochastic programming to address the supply chain inventory management problem. International Journal of Production Economics, 268, 109099. https://doi.org/10.1016/j.ijpe.2023.109099.
Canyakmaz, C., Özekici, S., & Karaesmen, F. (2024). Risk management through financial hedging in inventory systems with stochastic price processes. International Journal of Production Economics, 270, 109189. https://doi.org/10.1016/j.ijpe.2024.109189.
Helmi, R. A. A., Johar, M. G. M., & Hafiz, M. A. S. B. M. (2023). Online phishing detection using machine learning. In 2023 1st International Conference on Advanced Innovations in Smart Cities (ICAISC) (pp. 1-4). IEEE. https://doi.org/10.1109/ICAISC56366.2023.10085377.
Helmi, R. A. A., Elghanuni, R. H., & Abdullah, M. I. (2021). Effect the graph metric to detect anomalies and non-anomalies on Facebook using machine learning models. In 2021 IEEE 12th Control and System Graduate Research Colloquium (ICSGRC) (pp. 7-12). IEEE. https://doi.org/10.1109/ICSGRC53186.2021.9515227.
Kaur, R., Kumar, R., & Aggarwal, H. (2025). Systematic Review of Artificial Intelligence, Machine Learning, and Deep Learning in Machining Operations: Advancements, Challenges, and Future Directions. Archives of Computational Methods in Engineering, 32(8), 4983-5036. https://doi.org/10.1007/s11831-025-10290-z.
Lin, S. W., & Lu, W. M. (2024). Using inverse DEA and machine learning algorithms to evaluate and predict suppliers’ performance in the apple supply chain. International Journal of Production Economics, 271, 109203. https://doi.org/10.1016/j.ijpe.2024.109203.
Do, N., Tham, J., Azam, S., & Khatibia, A. (2020). Analysis of customer behavioral intentions towards mobile payment: Cambodian consumer’s perspective. Accounting, 6(7), 1391-1402. https://doi.org/10.5267/j.ac.2020.8.010.
Aldousari, A. A., Delafrooz, N., Ab Yajid, M. S., & Ahmed, Z. U. (2016). Determinants of consumers’ attitudes toward online shopping. Journal of Transnational Management, 21(4), 183-199. https://doi.org/10.1080/15475778.2016.1226658.
Mgm, J., Yajid, M. S. A., & Khatibi, A. (2020). A situation analysis of integrated logistics berhad Malaysia. Systematic Reviews in Pharmacy, 11(1), 734-741. https://doi.org/10.5530/srp.2020.1.94.
Pallathadka, H., Mustafa, M., Sanchez, D. T., Sajja, G. S., Gour, S., & Naved, M. (2023). Impact of machine learning on management, healthcare and agriculture. Materials Today: Proceedings, 80, 2803-2806. https://doi.org/10.1016/j.matpr.2021.07.042.
Hammann, D. (2024). Big data and machine learning in cost estimation: An automotive case study. International Journal of Production Economics, 269, 109137. https://doi.org/10.1016/j.ijpe.2023.109137.
Tirkolaee, E. B., Sadeghi, S., Mooseloo, F. M., Vandchali, H. R., & Aeini, S. (2021). Application of machine learning in supply chain management: a comprehensive overview of the main areas. Mathematical Problems in Engineering, 2021(1), 1476043. https://doi.org/10.1155/2021/1476043.
Selukar, M., Jain, P., & Kumar, T. (2022). Inventory control of multiple perishable goods using deep reinforcement learning for sustainable environment. Sustainable Energy Technologies and Assessments, 52, 102038. https://doi.org/10.1016/j.seta.2022.102038.
Zhang, B., Tseng, M. L., Qi, L., Guo, Y., & Wang, C. H. (2023). A comparative online sales forecasting analysis: Data mining techniques. Computers & Industrial Engineering, 176, 108935. https://doi.org/10.1016/j.cie.2022.108935.
Tsai, M. J., & Chen, T. M. (2022). A Non-Bottleneck Residual Approach for QR Code Printed Source Identification. In 2022 IEEE Eighth International Conference on Big Data Computing Service and Applications (BigDataService) (pp. 132-136). IEEE. https://doi.org/10.1109/BigDataService55688.2022.00028.
Kerzel, U. (2023). Demand Models for Supermarket Demand Forecasting. International Journal of Supply & Operations Management, 10(1), 89-104. https://doi.org/10.22034/ijsom.2022.109044.2145.
Leenatham, A., & Khemavuk, P. (2020). Demand forecasting using artificial neural network based on quantitative and qualitative data. In 2020 1st International Conference on Big Data Analytics and Practices (IBDAP) (pp. 1-6). IEEE. https://doi.org/10.1109/IBDAP50342.2020.9245614.
Sudimanto, Lumban Gaol, F., Warnars, H. L. H. S., & Soewito, B. (2022). Inventory Control with Machine Learning Approach: A Bibliometric Analysis. In: Ranganathan, G., Bestak, R., Palanisamy, R., Rocha, Á. (eds) Pervasive Computing and Social Networking. Lecture Notes in Networks and Systems, vol 317. Springer, Singapore. https://doi.org/10.1007/978-981-16-5640-8_21.
Belhadi, A., Mani, V., Kamble, S. S., Khan, S. A. R., & Verma, S. (2024). Artificial intelligence-driven innovation for enhancing supply chain resilience and performance under the effect of supply chain dynamism: an empirical investigation. Annals of Operations Research, 333(2), 627-652. https://doi.org/10.1007/s10479-021-03956-x.
Qu, T., Zhang, J. H., Chan, F. T., Srivastava, R. S., Tiwari, M. K., & Park, W. Y. (2017). Demand prediction and price optimization for semi-luxury supermarket segment. Computers & Industrial Engineering, 113, 91-102. https://doi.org/10.1016/j.cie.2017.09.004.
Patil, H., & Divekar, B. R. (2014). Inventory management challenges for B2C e-commerce retailers. Procedia Economics and Finance, 11, 561-571. https://doi.org/10.1016/S2212-5671(14)00221-4.
Li, J., Wang, T., Chen, Z., & Luo, G. (2019). Machine learning algorithm generated sales prediction for inventory optimization in cross-border E-commerce. International Journal of Frontiers in Engineering Technology, 1(1), 62-74.
Pallathadka, H., Ramirez-Asis, E. H., Loli-Poma, T. P., Kaliyaperumal, K., Ventayen, R. J. M., & Naved, M. (2023). Applications of artificial intelligence in business management, e-commerce and finance. Materials Today: Proceedings, 80, 2610-2613. https://doi.org/10.1016/j.matpr.2021.06.419.
Qi, Y., Li, C., Deng, H., Cai, M., Qi, Y., & Deng, Y. (2019). A deep neural framework for sales forecasting in e-commerce. In Proceedings of the 28th ACM international conference on information and knowledge management (pp. 299-308). https://doi.org/10.1145/3357384.3357883.
Boute, R. N., Gijsbrechts, J., Van Jaarsveld, W., & Vanvuchelen, N. (2022). Deep reinforcement learning for inventory control: A roadmap. European Journal of Operational Research, 298(2), 401-412. https://doi.org/10.1016/j.ejor.2021.07.016.
Qi, M., Shi, Y., Qi, Y., Ma, C., Yuan, R., Wu, D., & Shen, Z. J. (2023). A practical end-to-end inventory management model with deep learning. Management Science, 69(2), 759-773. https://doi.org/10.1287/mnsc.2022.4564.
Jackson, I., Tolujevs, J., & Kegenbekov, Z. (2020). Review of inventory control models: a classification based on methods of obtaining optimal control parameters. Transport and Telecommunication Journal, 21(3), 191-202. https://doi.org/10.2478/ttj-2020-0015.
Çardak, H., Bardak, S., Bardak, T., Capraz, O., Ozcetin, S., & Kızılırmak, S. (2025). Predicting Consumer Preferences for Furniture Products on E-commerce Platforms: An Analysis Using Machine Learning and Favorite Listing Data. BioResources, 20(4), 9768-9784. https://doi.org/10.15376/biores.20.4.9768-9784.
Feng, J., Su, L., & Liu, X. (2025). Research on Inventory Management of Cross-Border e-commerce Through Retail Demand Forecasting. Journal of The Institution of Engineers (India): Series C, 106(6), 1755-1761. https://doi.org/10.1007/s40032-025-01251-3.
Zhang, M., Fu, J., Zhang, Y., Ma, Q., & Chen, J. (2025). Integrated label correction for e-commerce data: Boosting accuracy with baseline attention and enhanced Bayesian updating. Computers in Industry, 173, 104392. https://doi.org/10.1016/j.compind.2025.104392.
Li, Z., & Tham, J. (2023). Network accounting information security based on classification and regression tree algorithm. In International Conference on Algorithms, High Performance Computing, and Artificial Intelligence (AHPCAI 2023) (Vol. 12941, pp. 112-119). SPIE. https://doi.org/10.1117/12.3011569.
Thakur, V., Kumar, R., Kumar, R., Singh, R., & Kumar, V. (2024). Hybrid additive manufacturing of highly sustainable Polylactic acid-Carbon Fiber-Polylactic acid sandwiched composite structures: Optimization and machine learning. Journal of Thermoplastic Composite Materials, 37(2), 466-492. https://doi.org/10.1177/08927057231180186.
Abdullah, M. I., Hao, S. K., Abdullah, I., & Faizah, S. (2023, August). Parkinson’s disease symptom detection using hybrid feature extraction and classification model. In 2023 IEEE 14th Control and System Graduate Research Colloquium (ICSGRC) (pp. 93-98). IEEE. https://doi.org/10.1109/ICSGRC57744.2023.10215477.
Dissanayake, K., & Md Johar, M. G. (2021). Comparative study on heart disease prediction using feature selection techniques on classification algorithms. Applied Computational Intelligence and Soft Computing, 2021(1), 5581806. https://doi.org/10.1155/2021/5581806.
Chhetri, B., Goyal, L. M., & Mittal, M. (2023). How machine learning is used to study addiction in digital healthcare: A systematic review. International Journal of Information Management Data Insights, 3(2), 100175. https://doi.org/10.1016/j.jjimei.2023.100175.
Dissanayake, K., & Johar, M. G. M. (2023). Two-level boosting classifiers ensemble based on feature selection for heart disease prediction. Indonesian Journal of Electrical Engineering and Computer Science, 32(1), 381-391. https://doi.org/10.11591/ijeecs.v32.i1.pp381-391.
Chikkalingaiah, S., Padmanabha Rao Hari Prasad, S. A., & Uggregowda, L. D. (2023). Classification techniques using gray level co-occurrence matrix features for the detection of lung cancer using computed tomography imaging. International Journal of Electrical & Computer Engineering, 13(5). https://doi.org/10.11591/ijece.v13i5.pp5135-5146.
Mohammadi, S., Dehghani, M., Khatibi, A., Sanderman, R., & Hagedoorn, M. (2015). Caregivers' attentional bias to pain: does it affect caregiver accuracy in detecting patient pain behaviors?. Pain, 156(1), 123-130. https://doi.org/10.1016/j.pain.0000000000000015.
Pawanr, S., Garg, G. K., & Routroy, S. (2023). Prediction of energy efficiency, power factor and associated carbon emissions of machine tools using soft computing techniques. International Journal on Interactive Design and Manufacturing (IJIDeM), 17(3), 1165-1183. https://doi.org/10.1007/s12008-022-01089-4.
Garg, A., Lam, J. S. L., & Gao, L. (2016). Power consumption and tool life models for the production process. Journal of Cleaner Production, 131, 754-764. https://doi.org/10.1016/j.jclepro.2016.04.099.
Mondal, S., & Goswami, S. S. (2024). Rise of Intelligent Machines: Influence of Artificial Intelligence on Mechanical Engineering Innovation. Spectrum of Engineering and Management Sciences, 2(1), 46-55. https://doi.org/10.31181/sems1120244h.
Sarkar, A., Goswami, S. S., & Sahoo, S. K. (2026). AI-Powered Threats and Solutions: A Theoretical Analysis of Risks, Governance, and Ethical Safeguards. Applied Research Advances, 2(1), 1-23. https://doi.org/10.65069/ara2120267.
Sitinjak, C., Simic, V., & Simanullang, W. F. (2025). Promoting the Adoption Dynamics of Autonomous and Shared Autonomous Vehicles: A Scientific Mixed-Methods Approach. International Scientific Spectrum, 1(1), 1-29.
Ibrahim, Z., Rahman, N. R. A., & Johar, M. G. M. (2019). A job satisfaction of emotional intelligence, leadership, employee performance with information technology. International Journal of Recent Technology and Engineering, 8(2), 712-718. https://doi.org/10.35940/ijrte.B1148.0982S919.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Raman Kumar, Vladimir Simic, Rupinder Kaur, Harkirat Kaur, Jasleen Kaur, Kanishka Sharma, Vivek Kumar, Vivek John (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.









