With the rise of digital payments, credit card fraud has become a significant challenge for financial institutions. Traditional rule-based systems fail to detect subtle patterns used by fraudsters. This project aims to develop a machine learning-based fraud detection system that can intelligently flag suspicious transactions and help reduce financial loss. We also integrate Power BI to visualize trends and improve decision-making for fraud analysts.
This project uses a real-world credit card transactions dataset that includes over 280,000 records with anonymized features. The goal is to build a classification model that can distinguish between genuine and fraudulent transactions with high accuracy and recall, especially since fraudulent cases represent a very small percentage of the data (imbalanced dataset problem).
The ML output is then visualized using Power BI, allowing analysts to interact with fraud trends, risk indicators, and transaction behavior.
β βββ data/ β #βββ raw_data.csv # Original dataset β βββ cleaned_data.csv # Cleaned and preprocessed data β βββ prediction_results.csv # Model predictions for Power BI β βββ notebooks/ β βββ eda_visualization.ipynb # EDA using Plotly/Matplotlib β βββ fraud_detection_model.ipynb # Model training and evaluation β βββ powerbi_dashboard/ β βββ fraud_dashboard.pbix # Power BI report file β βββ images/ β βββ fraud_dashboard.png # Screenshot of dashboard β βββ models/ β βββ xgboost_model.pkl # Trained model (pickle format) β βββ README.md # Project overview and instructions βββ requirements.txt # Python libraries required

