SOC Real-Time Attack Detection Platform

ML-powered Intrusion Detection System with real-time attack simulation, Zeek PCAP analysis, and MITRE ATT&CK mapping.

Features

Random Forest Binary Classifier - Trained on UNSW-NB15 dataset (Normal vs Attack)
Attack Detection: Binary classification with ground truth comparison
MITRE ATT&CK Mapping - Automatic technique/tactic mapping for detected attacks
Zeek PCAP Analysis - Process PCAP files through Zeek for ML-based detection
Explainability (XAI) - Shows why each alert was flagged
Real-time Dashboard - Live monitoring with VM simulation windows
Terminal Commands - Full control via command-line interface

Quick Start

Prerequisites

Python 3.8+
UNSW-NB15 Dataset (place in Dataset/ folder)
Zeek (optional, for PCAP analysis - install in WSL on Windows)

Installation

# Clone the repository
git clone https://github.com/Aamod007/EDA.git
cd EDA

# Create virtual environment
python -m venv .venv
.venv\Scripts\activate  # Windows
source .venv/bin/activate  # Linux/Mac

# Install dependencies
pip install -r backend/requirements.txt

# Train the Random Forest model
python train_rf_binary.py

Running the Platform

# Start the server
python -m backend.server

# Or use the batch file (Windows)
start_soc.bat

Open http://127.0.0.1:8765 in your browser.

Commands

Command	Description
`/attack`	Simulate random attacks from dataset
`/attack <type>`	Simulate specific attack (dos, exploits, recon, etc.)
`/attacks`	List available attack types with MITRE mappings
`/zeek pcap <file>`	Analyze PCAP file with Zeek + ML detection
`/zeek status`	Check Zeek status
`/help`	Show all commands
`/status`	Show system status
`/stats`	Show rolling statistics
`/clear`	Reset counters and alerts
`/export json`	Export alerts to JSON
`/export csv`	Export alerts to CSV

How Predictions Work

Dataset Mode (`/attack`)

Loads samples from Dataset/UNSW-NB15_*.csv files with real IPs
Extracts 41 network features per sample
XGBoost model predicts attack type
Shows ML prediction vs ground truth comparison

PCAP Mode (`/zeek pcap`)

Zeek processes PCAP file → generates conn.log
Pipeline maps Zeek fields to 41 UNSW-NB15 features
Same XGBoost model makes predictions
Results displayed in VM-style windows

39 Features Used

dur, proto, service, state, spkts, dpkts, sbytes, dbytes,
sttl, dttl, sload, dload, sloss, dloss, sinpkt, dinpkt,
sjit, djit, swin, dwin, tcprtt, synack, ackdat, smean, dmean,
trans_depth, response_body_len, ct_srv_src, ct_state_ttl,
ct_dst_ltm, ct_src_dport_ltm, ct_dst_sport_ltm, ct_dst_src_ltm,
is_ftp_login, ct_ftp_cmd, ct_flw_http_mthd, ct_src_ltm,
ct_srv_dst, is_sm_ips_ports

Architecture

├── backend/
│   ├── server.py          # WebSocket server
│   ├── simulator.py       # Attack simulation (dataset)
│   ├── pipeline.py        # Zeek log processing + ML
│   ├── ml_inference.py    # XGBoost model wrapper
│   ├── zeek_controller.py # Zeek process management
│   ├── mitre.py           # MITRE ATT&CK mappings
│   └── explainability.py  # XAI features
├── model/
│   └── attack_classifier.joblib  # Trained XGBoost model
├── Dataset/
│   └── UNSW-NB15_*.csv    # Training data with real IPs
├── PCAP/
│   └── *.pcap             # Sample PCAP files
├── attack-detection-viz.html     # Frontend UI
└── start_soc.bat                 # Windows launcher

Model Details

Algorithm: Random Forest Binary Classifier
Dataset: UNSW-NB15 (82K training samples)
Features: 39 network features
Classes: 2 (Normal, Attack)
Accuracy: 97.15%
Precision (Attack): 98.51%
Recall (Attack): 96.27%
Class Weight: Balanced
n_estimators: 300

Training the Model

python train_rf_binary.py

This will:

Load UNSW-NB15 dataset
Create binary labels (Normal=0, Attack=1)
Train Random Forest with balanced class weights
Save model to model/rf_binary_classifier.joblib

Zeek Setup (Windows/WSL)

# In WSL
sudo apt update
sudo apt install zeek

# Or install from source
# Zeek should be at /opt/zeek/bin/zeek

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github		.github
Dataset		Dataset
PCAP		PCAP
backend		backend
docs		docs
model		model
zeek-live		zeek-live
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.renderignore		.renderignore
COMMAND_REFERENCE.md		COMMAND_REFERENCE.md
Dockerfile		Dockerfile
Procfile		Procfile
README.md		README.md
attack-detection-viz.html		attack-detection-viz.html
railway.json		railway.json
render.yaml		render.yaml
requirements.txt		requirements.txt
runtime.txt		runtime.txt
setup_zeek_wsl.sh		setup_zeek_wsl.sh
start_soc.bat		start_soc.bat
train_rf_binary.py		train_rf_binary.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SOC Real-Time Attack Detection Platform

Features

Quick Start

Prerequisites

Installation

Running the Platform

Commands

How Predictions Work

Dataset Mode (`/attack`)

PCAP Mode (`/zeek pcap`)

39 Features Used

Architecture

Model Details

Training the Model

Zeek Setup (Windows/WSL)

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SOC Real-Time Attack Detection Platform

Features

Quick Start

Prerequisites

Installation

Running the Platform

Commands

How Predictions Work

Dataset Mode (/attack)

PCAP Mode (/zeek pcap)

39 Features Used

Architecture

Model Details

Training the Model

Zeek Setup (Windows/WSL)

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Dataset Mode (`/attack`)

PCAP Mode (`/zeek pcap`)

Packages