Machine Learning Shrewd Approach For An Imbalanced Dataset Conversion Samples
Keywords:
Classification, Machine learning, SMOTE, Spread Subsampling, Class imbalanceAbstract
The imbalance data applies to at least one of the classes, which are typically exceeded by the other ones. The Machine Learning Algorithm (Classifier) trained with an imbalance dataset predicts the majority class (frequently occurring) more than the other minority classes (rarely occurring). Training with an imbalance dataset poses challenges for classifiers; however, applying suitable techniques for reducing class imbalance issues can enhance the classifier’s performance. We take an imbalanced dataset from an educational context. Initially, all shortcomings regarding classification of imbalanced dataset have been examined. After that, we apply data-level algorithms for class balancing and compare the performance of classifiers. The performance of the classifier is measured using the underlying information in their confusion matrices such as accuracy, precision, recall, and f-measure. It shows that classification with an imbalance dataset may produce higher accuracy but low precision and recall for the minority class. The analysis confirms that both undersampling and oversampling are effective for balancing datasets, however, oversampling dominates.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Thank you for your interest in submitting your manuscript to the Journal of Engineering and Technology (JET).
JET publishes only original works. Manuscripts must not be previously published or under consideration by any other publications. Papers published in JET may not be published again in whole or in part without permission. Please review these guidelines for researching, writing, formatting and submitting your manuscript. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0).
Those submitting manuscripts should be carefully checked to ensure that all works contributed to the manuscript are acknowledged. The list of authors should include all those who can legitimately claim authorship. Manuscript should only be submitted for consideration once consent is given by all contributing authors using Transfer of Copyright form.