e-space
Manchester Metropolitan University's Research Repository

    Handling class imbalance in online transaction fraud detection

    Kanika, Singla, Jimmy, Bashir, Ali Kashif ORCID logoORCID: https://orcid.org/0000-0001-7595-2522, Nam, Yunyoung, Hasan, Najam UI and Tariq, Usman (2021) Handling class imbalance in online transaction fraud detection. Computers, Materials and Continua, 70 (2). pp. 2861-2877. ISSN 1546-2218

    [img]
    Preview
    Published Version
    Available under License Creative Commons Attribution.

    Download (362kB) | Preview

    Abstract

    With the rise of internet facilities, a greater number of people have started doing online transactions at an exponential rate in recent years as the online transaction system has eliminated the need of going to the bank physically for every transaction. However, the fraud cases have also increased causing the loss of money to the consumers. Hence, an effective fraud detection system is the need of the hour which can detect fraudulent transactions automatically in real-time. Generally, the genuine transactions are large in number than the fraudulent transactions which leads to the class imbalance problem. In this research work, an online transaction fraud detection system using deep learning has been proposed which can handle class imbalance problem by applying algorithm-level methods which modify the learning of the model to focus more on the minority class i.e., fraud transactions. A novel loss function named Weighted Hard- Reduced Focal Loss (WH-RFL) has been proposed which has achieved maximum fraud detection rate i.e., True Positive Rate (TPR) at the cost of misclassification of few genuine transactions as high TPR is preferred over a high True Negative Rate (TNR) in fraud detection system and same has been demonstrated using three publicly available imbalanced transactional datasets. Also, Thresholding has been applied to optimize the decision threshold using cross-validation to detect maximum number of frauds and it has been demonstrated by the experimental results that the selection of the right thresholding method with deep learning yields better results.

    Impact and Reach

    Statistics

    Activity Overview
    6 month trend
    64Downloads
    6 month trend
    80Hits

    Additional statistics for this dataset are available via IRStats2.

    Altmetric

    Actions (login required)

    View Item View Item