Prasetyo, Vincentius Riandaru and Benarkah, Njoto and Rahmad, Bayu Aji Hamengku (2025) Improving the Performance of Machine Learning Classifiers in Sentiment Analysis of Jenius Application Using Latent Dirichlet Allocation in Text Preprocessing. Jurnal Teknik Informatika (JUTIF), 6 (5). pp. 3033-3050. ISSN 2723-3863; E-ISSN 2723-3871
|
PDF
document.pdf - Published Version Download (849kB) |
Abstract
Sentiment analysis aims to classify a person’s opinion into a specific sentiment, such as positive or negative. The choice of preprocessing used can influence the performance of a sentiment analysis model. The Latent Dirichlet Allocation (LDA) method, commonly used for topic modelling, can be employed as an additional preprocessing step to identify relevant words associated with a particular sentiment label. This study aims to assess whether the LDA method, implemented in the preprocessing stage, can enhance the performance of machine learning models, including Naïve Bayes, Decision Tree, KNN, Logistic Regression, and SVM. This study utilized a dataset comprising 1,800 reviews, with 900 labelled as positive and 900 as negative. Words with an LDA score of at least 0.15 were given additional weight in the TF-IDF stage before model training. After the model was developed, evaluation was carried out by calculating accuracy, precision, recall, and F1-score. The use of LDA in preprocessing improved the performance of all classification models by 1-3% across most evaluation metrics. Specifically, the Logistic Regression model achieved the best performance, followed by SVM and KNN. This performance improvement is aligned with the use of LDA to reduce semantic noise and improve feature representation. Furthermore, this research is also helpful for monitoring customer opinions in the digital banking sector, enabling the rapid and accurate identification of priority issues. Further research could explore the comparison of performance with other topic modelling and feature extraction methods, as well as expanding the dataset and utilizing multiclass models.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | Digital Banking, Jenius Application, Latent Dirichlet Allocation, Machine Learning Classifiers, Sentiment Analysis, Text Preprocessing |
| Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science Q Science > QA Mathematics > QA76 Computer software |
| Divisions: | Faculty of Engineering > Department of Informatic |
| Depositing User: | Njoto Benarkah 61120 |
| Date Deposited: | 24 Oct 2025 01:39 |
| Last Modified: | 24 Oct 2025 01:39 |
| URI: | http://repository.ubaya.ac.id/id/eprint/49739 |
Actions (login required)
![]() |
View Item |
