Application of the C4.5 and Random Forest Algorithms to Sales Levels of Somethinc Serum on Shopee

Authors

  • Rismayanti, Department of Statistics, IPB University
  • Muhammad Nur Aidi, Department of Statistics, IPB University
  • Hari Wijayanto, Department of Statistics, IPB University

DOI:

https://doi.org/10.29244/xplore.v12i3.1150

Keywords:

C4.5, Random Forest, SMOTE, discretization, online shop

Abstract

Online buying and selling in Indonesia is growing. Shopee was the most-visited online marketplace in Indonesia in the fourth quarter of 2022, and beauty products are its highest-transaction category. Somethinc is a highly successful local beauty brand on Shopee, with the highest serum sales in Indonesia. This study applies the C4.5 and Random Forest classification methods to identify the variables that matter most for sales of Somethinc serum on Shopee. The predictors are taken from store profiles: number of followers, number of products, chat performance, store rating, and time since joining. Continuous sales data are discretized with k-means into three ordinal levels: low, medium, and high. Because the resulting sales classes are imbalanced, the SMOTE technique is applied. The C4.5 algorithm produces a decision tree whose branches give classification rules. Random Forest ranks the variables by Mean Decrease Gini (MDG); in descending order of importance they are: number of followers, number of products, chat performance, time since joining, and store rating.
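
The two preprocessing steps named in the abstract, k-means discretization and SMOTE oversampling, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`kmeans_1d`, `discretize`, `smote_like`) are hypothetical, the numbers are made up for illustration, and `smote_like` is a simplified SMOTE-style interpolation between a minority point and one of its nearest minority neighbours rather than a full library version.

```python
import random


def kmeans_1d(values, k=3, iters=100):
    """Lloyd's algorithm on one-dimensional data; returns centroids in ascending order."""
    ordered = sorted(values)
    # Deterministic start: pick k points spread evenly over the sorted data.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centroids[j]))
            clusters[nearest].append(v)
        # An empty cluster keeps its previous centroid.
        updated = [sum(c) / len(c) if c else centroids[j]
                   for j, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    return sorted(centroids)


def discretize(values, centroids, labels=("low", "medium", "high")):
    """Map each value to the label of its nearest centroid (centroids ascending)."""
    return [labels[min(range(len(centroids)), key=lambda j: abs(v - centroids[j]))]
            for v in values]


def smote_like(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points, each on the segment between a minority
    point and one of its k nearest minority neighbours (needs >= 2 points)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, other)))
    return synthetic


# Made-up sales figures, not data from the study.
sales = [5, 8, 12, 10, 150, 160, 170, 900, 950]
levels = discretize(sales, kmeans_1d(sales))

# Made-up minority-class rows, e.g. (followers, products) for "high" stores.
high_stores = [(1.0, 2.0), (2.0, 3.0), (3.0, 5.0)]
extra_rows = smote_like(high_stores, n_new=4)
```

In this sketch the "high" class has the fewest members after discretization, which is the kind of imbalance that oversampling is meant to correct before a classifier such as C4.5 or Random Forest is fitted.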

Published

2023-12-31

How to Cite

Rismayanti, Aidi, M. N., & Wijayanto, H. (2023). Penerapan Algoritma C4.5 dan Random Forest pada Tingkat Penjualan Serum Somethinc di Shopee. Xplore: Journal of Statistics, 12(3), 263–273. https://doi.org/10.29244/xplore.v12i3.1150
