Skip to content
#

imbalance-classification

Here are 37 public repositories matching this topic...

Credit card fraud is a burden for organizations across the globe. Specifically, $24.26 billion were lost due to credit card fraud worldwide in 2018, according to shiftprocessing.com. In this project, our goal was to build an effective and efficient model to predict fraud. We analyzed a real-world dataset that contained a list of government related credit card transactions over the 2010 calendar year. The data presented a supervised problem as it included a column showing the transaction’s fraud label (whether a transaction was fraudulent or not). It also contained identifying information about each transaction such as the credit card number, merchant, merchant state, etc. The dataset had 96,753 records and 10 data fields. We first described and visualized each of the 10 data fields, cleaned the dataset, and filled in missing values. Then we created many variables and performed feature selection. Finally, we created a variety of machine learning models (both linear and nonlinear) and highlighted our results.

  • Updated Jun 21, 2020
  • Jupyter Notebook

This project is about detecting fraudulent credit card transactions. The dataset tends to be highly imbalanced, with less than 0.2% of the observations labelled as fraudulent. To address this issue we have to take into account the bank's objective (maximizing precision or recall) and restrictions. The performance and efficiency of many classification algorithms (Logistic Regression, XGBoost, Random Forests) were tested and compared.

  • Updated Apr 8, 2021
  • Jupyter Notebook

Contained in this repository are the Jupyter notebooks that contain the scripts used in this project. Examples include: exploratory data analysis, creation of training, validation and test data sets, and CNN model development and data extraction.

  • Updated Jul 7, 2021
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the imbalance-classification topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the imbalance-classification topic, visit your repo's landing page and select "manage topics."

Learn more