Real-Time Fraud Detection System 🚨💳

This project implements a Real-Time Fraud Detection System using Apache Kafka, Spark Streaming, scikit-learn, PostgreSQL, and Grafana. The system processes up to 1 million transactions every day and detects fraudulent transactions using Machine Learning techniques. The pipeline includes low-latency processing with Apache Kafka and Spark Streaming for real-time transaction monitoring. Fraudulent transactions are flagged using an Isolation Forest model, achieving high precision and recall.

Key Features 🌟

Real-time processing of up to 1 million transactions/day 🏃‍♂️
Fraud detection using the Isolation Forest algorithm 🤖
Low-latency streaming pipeline integrated with Apache Kafka and Spark Streaming ⏱️
Anomaly detection and alerting on fraudulent transactions 🚨
Grafana Dashboard for real-time monitoring 📊

Technologies Used 🛠️

Python 🐍
Apache Kafka 🛠️
Spark Streaming ⚡
scikit-learn 📚
PostgreSQL 🗄️
Grafana 📊

Prerequisites 📋

Before running the project, make sure you have the following installed:

Python 3.x 🐍
Apache Kafka 🛠️
Apache Spark ⚡
PostgreSQL 🗄️
Grafana 📊

Installation 🚀

Clone the repository:

git clone https://github.com/ujwalakopparthi/fraud-detection-system.git
cd fraud-detection-system

Install the dependencies: pip install -r requirements.txt
Set up PostgreSQL:
- Create a database and user for the system.
- Run the SQL scripts in database/ to create the necessary tables for transaction data.
Set up Apache Kafka:
- Follow the instructions here to install and run Kafka locally.
- Create a Kafka topic for transaction data: kafka-topics.sh --create --topic transaction_topic --bootstrap-server localhost:9092 --partitions 1 --replication-factor 1
Set up Spark Streaming:
- Install Apache Spark and start the Spark streaming context.
Run the project:
- Start the Kafka Producer to send sample transaction data: python producer/kafka_producer.py
- Start the Kafka Consumer to process transactions: python consumer/kafka_consumer.py
- Start the Spark Streaming Processor: python processor/stream_processor.py
- Run the main application: python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
config		config
consumer		consumer
dashboard		dashboard
database		database
models		models
processor		processor
producer		producer
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-Time Fraud Detection System 🚨💳

Key Features 🌟

Technologies Used 🛠️

Prerequisites 📋

Installation 🚀

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Real-Time Fraud Detection System 🚨💳

Key Features 🌟

Technologies Used 🛠️

Prerequisites 📋

Installation 🚀

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages