This repository contains code for our project "Differentially Private Conditional Text Generation with RL-Boosted Control".
It implements the following modules:
- annotation: Extracting features from texts according to a given schema, by prompting an LLM (Gemini).
- privacy_accounting: Privacy accounting for two two-stage DP mechanisms: the composition of DP-SGD with DP-SGD, and the composition of AIM with DP-SGD.
- AIM: Generating tabular schemas via AIM.
- DPSFT: Fine-tuning an LLM (gemma-3-1b-pt) via differentially private supervised fine-tuning and generating synthetic texts. Supports both baseline SFT and our conditional SFT.
- RL: Improving the instruction-following capability of the conditional generation module via our proposed anchored RL algorithm.
- evaluation: Evaluating the quality of synthetic texts via multiple metrics, including MAUVE, feature divergence, and domain classification.
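As a rough intuition for how the two-stage privacy budgets above combine, here is a minimal sketch of basic sequential (ε, δ)-composition. This is an illustrative upper bound only; the actual privacy_accounting module may use tighter accountants (e.g. RDP- or PRV-based), and the function name and budget values below are hypothetical.

```python
def compose_basic(eps1, delta1, eps2, delta2):
    """Basic sequential composition of two (eps, delta)-DP mechanisms.

    Illustrative only: tighter composition bounds exist and may be
    what the privacy_accounting module actually implements.
    """
    return eps1 + eps2, delta1 + delta2

# Hypothetical example: AIM run at (1.0, 1e-6), DP-SGD at (3.0, 1e-6).
eps_total, delta_total = compose_basic(1.0, 1e-6, 3.0, 1e-6)
print(eps_total, delta_total)  # 4.0 2e-06
```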
Each module contains a separate README file with detailed instructions.
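For illustration, a feature-divergence metric of the kind listed under evaluation could be sketched as the total variation distance between empirical distributions of a categorical feature. This is a hypothetical sketch, not necessarily the definition implemented in the evaluation module; the function name and example values are assumptions.

```python
from collections import Counter

def total_variation(real_features, synthetic_features):
    """Total variation distance between the empirical distributions of a
    single categorical feature in real vs. synthetic texts.

    Illustrative sketch only; the evaluation module may define feature
    divergence differently.
    """
    p, q = Counter(real_features), Counter(synthetic_features)
    n_p, n_q = len(real_features), len(synthetic_features)
    support = set(p) | set(q)
    return 0.5 * sum(abs(p[x] / n_p - q[x] / n_q) for x in support)

# Hypothetical feature values extracted from real and synthetic texts.
real = ["news", "news", "sports", "tech"]
synth = ["news", "sports", "sports", "tech"]
print(total_variation(real, synth))  # 0.25
```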