Author: Jahanshahi, Hadi; Ozyegen, Ozan; Cevik, Mucahit; Bulut, Beste; Yigit, Deniz; Gonen, Fahrettin F.; Bacsar, Aycse
Title: Text Classification for Predicting Multi-level Product Categories Cord-id: dmh3nivz Document date: 2021_9_2
ID: dmh3nivz
Snippet: In an online shopping platform, a detailed classification of the products facilitates user navigation. It also helps online retailers keep track of the price fluctuations in a certain industry or special discounts on a specific product category. Moreover, an automated classification system may help to pinpoint incorrect or subjective categories suggested by an operator. In this study, we focus on product title classification of the grocery products. We perform a comprehensive comparison of six d
Document: In an online shopping platform, a detailed classification of the products facilitates user navigation. It also helps online retailers keep track of the price fluctuations in a certain industry or special discounts on a specific product category. Moreover, an automated classification system may help to pinpoint incorrect or subjective categories suggested by an operator. In this study, we focus on product title classification of the grocery products. We perform a comprehensive comparison of six different text classification models to establish a strong baseline for this task, which involves testing both traditional and recent machine learning methods. In our experiments, we investigate the generalizability of the trained models to the products of other online retailers, the dynamic masking of infeasible subcategories for pretrained language models, and the benefits of incorporating product titles in multiple languages. Our numerical results indicate that dynamic masking of subcategories is effective in improving prediction accuracy. In addition, we observe that using bilingual product titles is generally beneficial, and neural network-based models perform significantly better than SVM and XGBoost models. Lastly, we investigate the reasons for the misclassified products and propose future research directions to further enhance the prediction models.
Search related documents:
Co phrase search for related documents- adam optimizer and machine learning: 1, 2
- additional advantage and low accuracy: 1
- additional advantage and machine learning: 1
- additional dataset and machine learning: 1, 2, 3
- additional information and low accuracy: 1, 2
- additional information and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
- long lstm short term memory network and lstm architecture: 1, 2, 3, 4, 5
- long lstm short term memory network and lstm short term memory network: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
- long lstm short term memory network and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
- low accuracy and lstm architecture: 1
- low accuracy and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
- lstm architecture and machine learning: 1, 2, 3, 4
- lstm short term memory network and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
Co phrase search for related documents, hyperlinks ordered by date