Credit Card Fraud Detection System Using Gradient Boosting with Deep Feature Engineering and Ensemble Stacking
Keywords:
Credit Card Fraud Detection, Gradient Boosting, XGBoost, LightGBM, CatBoost, Deep Feature Engineering, Ensemble Stacking, Imbalanced Data Classification, Machine Learning, Financial Transaction SecurityAbstract
Credit card fraud detection remains a critical challenge due to highly imbalanced transaction data and evolving fraudulent behaviors. This paper presents an effective Credit Card Fraud Detection System using gradient boosting models integrated with deep feature engineering and ensemble stacking. The proposed framework begins with data acquisition and preprocessing to handle missing values and class imbalance, followed by advanced feature engineering to extract meaningful transactional patterns. Multiple gradient boosting classifiers, namely XGBoost, LightGBM, and CatBoost, are trained independently to capture complex non-linear relationships in the data. Their predictions are combined using an ensemble stacking strategy to enhance overall detection performance. The proposed system is evaluated on a real-world credit card transaction dataset containing 284,807 transactions, with a fraud rate of 0.173%. Experimental results demonstrate that the model achieves an accuracy of 99.94%, precision of 0.8472, recall of 0.8243, F1-score of 0.8356, and a ROC–AUC score of 0.9657. The confusion matrix further confirms effective fraud detection with 61 true positives, 11 false positives, and 13 false negatives. These results highlight the robustness of the proposed ensemble-based approach in accurately detecting fraudulent transactions while minimizing false alarms, making it suitable for real-time financial fraud detection systems.
Downloads
References
Y., Liu, H., & Zhou, J. (2020). Real-time fraud detection with Apache Spark: Techniques and case studies. International Journal of Big Data, 6(3), 45-57.
Tanvir Rahman, A., Md Sultanul Arefin, S., & Md Shakil, I. (2024). Investigating Innovative Approaches to Identify Financial Fraud in Real-Time. American Journal of Economics and Business Management, 7(11), 1262-1265.
Goodfellow, I., Bengio, Y., & Courville, A. (2019). Deep learning for fraud detection: Applications and techniques. Journal of Machine Learning Applications, 12(2), 89-107.
Gupta, R., & Garg, S. (2021). Sentiment analysis for fraud detection using NLP techniques. Expert Systems with Applications, 167, 114212.
Khan, A., Rahman, T., & Singh, V. (2021). Behavioral biometrics in fraud detection: A comprehensive review. ACM Transactions on Security, 9(4), 23-37.
Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems, 30, 4765-4774.
Nguyen, H. P., Wang, Y., & Zhang, T. (2020). Hybrid learning models for anomaly detection in financial systems. Journal of AI Research, 56(5), 134-152.
Patel, R., Singh, A., & Mehta, P. (2022). Challenges and solutions in real-time financial fraud detection. Big Data Research, 35, 100234.
Raji, I. D., Binns, R., & Gillon, C. (2020). Ethical AI frameworks for financial systems. AI &Society, 35(2), 43-58.
Singh, S., & Patel, A. (2021). Real-time anomaly detection models in financial fraud systems. Computational Intelligence, 38(3), 567-580.
Tian, Y., Zhu, M., & Wei, X. (2022). Blockchain analytics for fraud detection: Advances and challenges. IEEE Transactions on Blockchain, 8(2), 241-254.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 International Journal of Scientific Research in Science and Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.
https://creativecommons.org/licenses/by/4.0