Online Payment Fraud Detection is a data science project aimed at identifying fraudulent transactions in online payment systems. The project leverages machine learning techniques to analyze transaction data and distinguish between legitimate and fraudulent activities. By building and evaluating predictive models, the project aims to enhance security measures and reduce financial losses due to fraud.
With the rise of e-commerce and digital transactions, online payment fraud has become a significant threat. This project focuses on building robust machine learning models to detect fraudulent transactions effectively. The models are trained on historical transaction data and are designed to generalize well to unseen data, ensuring reliable fraud detection in real-world scenarios.
The dataset used for this project contains records of online transactions, each labeled as fraudulent or legitimate. It includes features such as transaction amount, payment method, location, and time of transaction. The dataset is cleaned, preprocessed, and split into training and testing sets to develop and evaluate the models.
- Dataset link: Kaggle link
An example of the data structure (columns: `step`, `type`, `amount`, `nameOrig`, `oldbalanceOrg`, `newbalanceOrig`, `nameDest`, `oldbalanceDest`, `newbalanceDest`, `isFraud`, `isFlaggedFraud`):

```
1,PAYMENT,9839.64,C1231006815,170136.0,160296.36,M1979787155,0.0,0.0,0,0
1,PAYMENT,1864.28,C1666544295,21249.0,19384.72,M2044282225,0.0,0.0,0,0
```

The data file `onlinefraud.csv` should be placed in the root directory of the project.
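As a minimal sketch of the loading step, assuming the PaySim-style column layout shown above (the inline rows stand in for `onlinefraud.csv` so the snippet is self-contained):

```python
import io

import pandas as pd

# Inline stand-in for onlinefraud.csv, built from the sample rows above;
# the column names assume the PaySim-style layout of this Kaggle dataset.
csv_text = """step,type,amount,nameOrig,oldbalanceOrg,newbalanceOrig,nameDest,oldbalanceDest,newbalanceDest,isFraud,isFlaggedFraud
1,PAYMENT,9839.64,C1231006815,170136.0,160296.36,M1979787155,0.0,0.0,0,0
1,PAYMENT,1864.28,C1666544295,21249.0,19384.72,M2044282225,0.0,0.0,0,0
"""

# In the project itself this would be: data = pd.read_csv("onlinefraud.csv")
data = pd.read_csv(io.StringIO(csv_text))
print(data.shape)                 # (2, 11)
print(data.isnull().sum().sum())  # 0 missing values in this sample
```

The same `pd.read_csv` call against the real file gives the full dataset used for training.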
To run this project locally, you'll need to have Python installed. Follow these steps to set up the environment:
- Clone the Repository:
```bash
git clone https://github.com/Vaibhav-kesarwani/Online_Payment_Fraud_Detection.git
cd Online_Payment_Fraud_Detection
```
- Create a Virtual Environment (optional but recommended):
```bash
python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`
```
- Install Required Packages: install the dependencies by running
```bash
pip install -r requirements.txt
```
To run the fraud detection project, follow these steps:
- Prepare the Dataset: ensure that your `onlinefraud.csv` file is in the root directory. This file should contain the transaction records and their fraud labels, separated by commas.
- Run the Notebook: open and execute the main notebook to load the data and train the fraud detection model
```bash
jupyter notebook main.ipynb
```
- Output: the notebook prints the first few rows of the dataset, showing the transaction features and their associated fraud labels.
To understand the distribution of different transaction types within the dataset, a pie chart was created using Plotly Express. This chart provides a clear visual representation of the proportion of each transaction type (for example, PAYMENT and TRANSFER) in the dataset.
The steps involved include:
- Counting the occurrences of each transaction type.
- Extracting the transaction types and their respective counts.
- Creating a donut-style pie chart to visualize the distribution, with a hole in the center to emphasize the relative sizes of each slice.
The resulting chart helps in quickly identifying which transaction type is more prevalent, offering valuable insights into the dataset's composition.
```python
import plotly.express as px

# Count the occurrences of each transaction type
# (renamed from `type` to avoid shadowing the Python builtin)
type_counts = data['type'].value_counts()
transactions = type_counts.index
quantity = type_counts.values

# Donut-style pie chart of the distribution
figure = px.pie(data, values=quantity, names=transactions, hole=0.5,
                title="Distribution of Transaction Type")
figure.show()
```
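Before training, the categorical `type` column has to be converted to numbers, since `DecisionTreeClassifier` expects numeric features. The mapping below is a hypothetical choice (not confirmed by this repository), shown on a small stand-in frame; it is at least consistent with the prediction example later in this README, which passes `type` as the integer 4:

```python
import pandas as pd

# Small stand-in frame; in the project, `data` is the loaded onlinefraud.csv
data = pd.DataFrame({"type": ["PAYMENT", "TRANSFER", "CASH_OUT"]})

# Hypothetical integer codes for the PaySim transaction types
type_codes = {"CASH_OUT": 1, "PAYMENT": 2, "CASH_IN": 3, "TRANSFER": 4, "DEBIT": 5}
data["type"] = data["type"].map(type_codes)
print(data["type"].tolist())  # [2, 4, 1]
```

Any consistent encoding works, as long as the same codes are used at prediction time.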
The model training is performed within the `main.ipynb` notebook, which processes the transaction data and trains a `DecisionTreeClassifier` from scikit-learn. You can modify the model, the training parameters, or the data processing steps within this notebook.
```python
# Training the machine learning model
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Features and target label (isFraud)
x = data[['type', 'amount', 'oldbalanceOrg', 'newbalanceOrig']]
y = data['isFraud']

xtrain, xtest, ytrain, ytest = train_test_split(x, y, test_size=0.20, random_state=42)
model = DecisionTreeClassifier()
model.fit(xtrain, ytrain)
print(model.score(xtest, ytest))  # accuracy on the held-out test set
```
After training the model, you can use it to predict whether a new transaction is fraudulent. Implement the prediction logic in a separate script or extend `main.ipynb` to include a prediction function.
```python
# Prediction
import numpy as np

# features = [type, amount, oldbalanceOrg, newbalanceOrig]
features = np.array([[4, 9000.60, 9000.60, 0.0]])
print(model.predict(features))
```
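Fraud data is heavily imbalanced (far more legitimate than fraudulent transactions), so the plain accuracy from `model.score` can look high even for a model that misses most fraud. A hedged sketch, using synthetic data rather than the project's dataset, of checking per-class precision and recall instead:

```python
import numpy as np
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(42)

# Synthetic, imbalanced stand-in: roughly 2% positive ("fraud") labels
X = rng.normal(size=(2000, 4))
y = (X[:, 0] + X[:, 1] > 2.8).astype(int)

Xtr, Xte, ytr, yte = train_test_split(X, y, test_size=0.2, random_state=42)
clf = DecisionTreeClassifier(random_state=42)
clf.fit(Xtr, ytr)

# Per-class precision and recall show how many frauds are actually caught
print(classification_report(yte, clf.predict(Xte), digits=3))
```

The recall on the positive class is the figure to watch: it is the fraction of fraudulent transactions the model actually flags.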
Here is an overview of the project directory structure:
```
Online_Payment_Fraud_Detection/
├── Images
│   └── visualise.png
│
├── main.ipynb           # Jupyter notebook (source code)
├── onlinefraud.csv      # Kaggle dataset
├── .gitattributes       # For handling large files on GitHub
├── .gitignore           # Ignores unwanted files like .venv
├── requirements.txt     # List of dependencies
├── CODE_OF_CONDUCT.md   # Code of conduct
├── README.md            # Project documentation
└── LICENSE              # License file
```
Contributions are welcome! If you'd like to contribute to this project, please follow these steps:
- Fork and star the repository
- Create a new branch (`git checkout -b feature`)
- Make your changes
- Commit your changes (`git commit -am 'Add new feature'`)
- Push to the branch (`git push origin feature`)
- Create a new Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
If you have any questions or suggestions, feel free to reach out to me at: