A support vector machine credit card fraud detection model based on high imbalance dataset

Kehinde Musliudeen Odeyale, Oyelakin A Moruff, Salau Ibrahim T Taofeekat, Saka M Kayode

Abstract


Credit card transactions are exposed to fraudulent activities owing to their sensitive nature. The illegal activities of the fraudsters have been reported to cost financial institutions a lot of money globally as reported in many notable research works. In the past, several machine learning-based approaches have been proposed for the detection of credit card fraud. However, little attention has been given to classification of fraud in high imbalance dataset. Generally, if a dataset is imbalanced, a learning algorithm will give a bias result in terms of the accuracy resulting in poor performance of the model. This study focuses on using Synthetic Minority Oversampling Technique (SMOTE) to address the class imbalance in the selected credit card dataset. Then, ANOVA-F statistic was applied for the selection of promising features. Both the class imbalance and attribute selection techniques were targeted at improving the SVM-based credit card fraud classification. With the balanced dataset, the study achieved an accuracy of 93.9%, recall of 97.3%, precision of 90.3%, and f1 score of 93.5% respectively. It was observed that the result of the Support Vector VM based credit card fraud detection model with class imbalance is better than that of the standard SVM. The study concluded that the class imbalance addressing and attribute selection techniques used were very promising for the credit card fraud detection tasks.


Keywords


ANOVA F-Test;Fraud detection;Imbalanced Data Machine learning;SMOTE;Support Vector Machine.

Full Text:

PDF

References


Ahirwar, A., Sharma, N., and Bano, A. (2020). Enhanced SMOTE & fast random forest techniques for credit card fraud detection. Solid State Technology, 63(6), 4721-4733.

Ali, M. A., Azad, M. A., Centeno, M. P., Hao, F., and van Moorsel, A. (2019). Consumer-facing technology fraud: Economics, attack methods and potential solutions. Future Generation Computer Systems, 100, 408-427.

Ata, O., and Hazim, L. (2020). Comparative analysis of different distributions dataset by using data mining techniques on credit card fraud detection. Tehnički vjesnik, 27(2), 618-626.

Budhram, T. (2012). Lost, stolen or skimmed: Overcoming credit card fraud in South Africa. South African Crime Quarterly, 40, 31-37.

Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Syntheticminority over– sampling technique.Journal of Artificial Intelligent Research,16, 321–357.

Dutta, R., Smith, D., Rawnsley, R., Bishop-Hurley, G., Hills, J., Timms, G., & Henry, D. (2015). Dynamic cattle behavioural classification using supervised ensemble classifiers. Computers and electronics in agriculture, 111, 18-28.

Eseye, A. T., Lehtonen, M., Tukia, T., Uimonen, S., & Millar, R. J. (2019). Machine learning based integrated feature selection approach for improved electricity demand forecasting in decentralized energy systems. IEEE Access, 7, 91463-91475.

Kadam, S., Kumari, S., Trivedi, S., & Shah, V. (2021) Credit Card Fraud Detection Using Machine Learning Algorithms. 9(6), 7496-7499.

Lee, J., and Kwon, K. N. (2002). Consumers’ use of credit cards: store credit card usage as an alternative payment and financing medium. Journal of Consumer Affairs, 36(2), 239-262.

Moradi, S., & Mokhatab Rafiei, F. (2019). A dynamic credit risk assessment model with data mining techniques: evidence from Iranian banks. Financial Innovation, 5(1), 1-27.

Oyelakin, A. M., & Jimoh, R. G. (2020). Towards building an improved botnet detection model in highly imbalance botnet dataset-a methodological framework. Volume, 3(3), 2020.

Sarker, I. H. (2021). Machine learning: Algorithms, real-world applications and research directions. SN computer science, 2(3), 160.

Sharma, P., Banerjee, S., Tiwari, D., & Patni, J. C. (2021). Machine learning model for credit card fraud detection-a comparative analysis. Int. Arab J. Inf. Technol., 18(6), 789-796.

Sivakumar, N., & Balasubramanian, R. (2015). Fraud detection in credit card transactions: classification, risks and prevention techniques. International Journal of Computer Science and Information Technologies, 6(2), 1379-1386.

Trivedi, N. K., Simaiya, S., Lilhore, U. K., & Sharma, S. K. (2020). An efficient credit card fraud detection model based on machine learning methods. International Journal of Advanced Science and Technology, 29(5), 3414-3424.




DOI: https://doi.org/10.17509/jcs.v5i2.70802

Refbacks

  • There are currently no refbacks.