RLiG: Hybrid RL for Bayesian Network Structure Learning

This repository implements RLiG, a reinforcement learning agent for Bayesian Network (BN) structure learning.
Unlike traditional greedy methods (Hill-Climbing, GES, PC), RLiG optimizes a hybrid objective that balances:

Structural fitness (BIC)
Generative fidelity (simulating data from the learned BN and comparing against held-out samples)

To keep the generative cost tractable, the agent evaluates generative metrics only at selected steps within fixed-length tiles, while relying on BIC at other steps.

🚀 Features

Tabular Q-learning agent for BN structure learning.
Hybrid reward function combining BIC and generative metrics (held-out log-likelihood, MMD, JS divergence).
Budgeted meta-controller (GateLearn) to decide when to trigger generative evaluations.
Flexible config system via YAML for datasets, environment parameters, and training settings.
Evaluation on the ASIA benchmark dataset with classical baselines (Hill-Climb, GES, PC).
Visualization of learned DAGs and training curves.

📂 Project Structure


.
├── config.py             # Config loader + typed helpers
├── core.py               # Core RL environment (RLiGEnv, scoring, actions, rewards)
├── dag.py                # DAG class with edge ops + visualization
├── gatelearn_run.py      # Meta-controller (GateLearn) training & evaluation
├── qlearn_run.py         # Q-learning training & evaluation
├── data/
│   └── asia_10000.csv    # Example dataset (ASIA benchmark)
├── reports/
│   ├── figures/          # Plots of returns, BIC, DAGs
│   └── tables/           # Training logs, ablation tables
└── README.md             # Project documentation

⚙️ Installation

Clone the repo and install requirements:

git clone https://github.com/<YOUR_USERNAME>/<YOUR_REPO>.git
cd <YOUR_REPO>
pip install -r requirements.txt

▶️ Usage

Run Q-learning agent

python qlearn_run.py --config configs/asia.yaml

Run Gate meta-controller

python gatelearn_run.py --config configs/asia.yaml

Config Example (`configs/asia.yaml`)

data:
  path: data/asia_10000.csv
  train_frac: 0.8
seed: 42

env:
  k: 2
  L: 4
  T: 20
  I_g: [1, 3]

score:
  alpha: 1.0
  beta: 0.05
  Ns: 1000
  gen_metric: "mmd"
  alpha_dirichlet: 1.0
  bic_per_sample: true

qlearn:
  episodes: 3000
  eps_start: 1.0
  eps_end: 0.05
  eps_decay: 0.995
  gamma: 0.99
  alpha_lr: 0.25
  optimistic_q: 0.0
  snapshot_every: 100
  start_mode: "hc"
  warm_start_best: true

📊 Results

On the ASIA dataset:

RL agent learns competitive structures compared to HC, GES, PC.
Gate meta-controller reduces generative cost while preserving accuracy.
Metrics reported: SHD, Precision, Recall, F1, BIC, Generative metrics.

📈 Visualization

DAGs can be visualized using DAG.visualize().
Training curves (returns, BIC) and gate usage are automatically saved under reports/figures/.

📚 References

👤 Author

Developed by Sunain Mushtaq (Computer Science, Deakin University) as part of Advanced Algorithms coursework.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Q_outputs		Q_outputs
__pycache__		__pycache__
configs		configs
data		data
.gitignore		.gitignore
11Hd___Advanced_Algorithms.pdf		11Hd___Advanced_Algorithms.pdf
Report.docx		Report.docx
code2.ipynb		code2.ipynb
config.py		config.py
core.py		core.py
dag.py		dag.py
gatelearn_run.py		gatelearn_run.py
load.R		load.R
qlearn_run.py		qlearn_run.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RLiG: Hybrid RL for Bayesian Network Structure Learning

🚀 Features

📂 Project Structure

⚙️ Installation

▶️ Usage

Run Q-learning agent

Run Gate meta-controller

Config Example (`configs/asia.yaml`)

📊 Results

📈 Visualization

📚 References

👤 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RLiG: Hybrid RL for Bayesian Network Structure Learning

🚀 Features

📂 Project Structure

⚙️ Installation

▶️ Usage

Run Q-learning agent

Run Gate meta-controller

Config Example (configs/asia.yaml)

📊 Results

📈 Visualization

📚 References

👤 Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Config Example (`configs/asia.yaml`)

Packages