The network_ops.generate_data module generates synthetic voting datasets for training and testing multi-winner voting rules. It creates preference profiles using various probability models and computes optimal/worst committees according to specified axioms.
Run the script with command-line arguments in `key=value` format:

```shell
python -m network_ops.generate_data n_profiles=1000 prefs_per_profile=50 m=5 num_winners=3 pref_model="IC" axioms="all" out_folder="data"
```

Required arguments:

- `n_profiles` (int): Number of preference profiles to generate
- `prefs_per_profile` (int): Number of voters per profile (mean if `varied_voters=True`)
- `m` (int): Number of candidates in each profile
- `num_winners` (int): Size of committees to elect
- `pref_model` (str): Preference generation model (see options below)
- `axioms` (str): Which axiom set to evaluate (`"all"`, `"root"`, `"both"`)
- `out_folder` (str): Directory to save generated datasets
Optional arguments:

- `varied_voters` (bool): If True, vary the number of voters per profile. Default: False
- `voters_std_dev` (int): Standard deviation for voter count when `varied_voters=True`. Default: 0
- `rwd_folder` (str): Path to real-world data folder (required for `pref_model="real_world"`)
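All scripts in this suite take arguments in `key=value` form rather than `--flag` style. A minimal sketch of how such arguments can be parsed (illustrative only; the actual parser in the scripts may differ):

```python
import ast


def parse_kwargs(argv):
    """Parse command-line arguments given as key=value pairs into a dict.

    Values are interpreted with ast.literal_eval where possible (ints,
    bools, lists); anything unparseable is kept as a plain string.
    """
    kwargs = {}
    for arg in argv:
        key, _, value = arg.partition("=")
        try:
            kwargs[key] = ast.literal_eval(value)
        except (ValueError, SyntaxError):
            kwargs[key] = value  # e.g. pref_model=IC stays the string "IC"
    return kwargs
```

For example, `parse_kwargs(["n_profiles=1000", "pref_model=IC", "varied_voters=True"])` yields `{"n_profiles": 1000, "pref_model": "IC", "varied_voters": True}`.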
Available preference models include:
- `"IC"` - Impartial Culture (uniform random)
- `"IAC"` - Impartial Anonymous Culture
- `"URN-R"` - Urn model
- `"identity"` - All voters have identical preferences
- `"MALLOWS-RELPHI-R"` - Mallows model
- `"euclidean__args__dimensions=3_-_space=gaussian_ball"`
- `"euclidean__args__dimensions=10_-_space=gaussian_ball"`
- `"euclidean__args__dimensions=3_-_space=uniform_ball"`
- `"euclidean__args__dimensions=10_-_space=uniform_ball"`
- `"euclidean__args__dimensions=3_-_space=gaussian_cube"`
- `"euclidean__args__dimensions=10_-_space=gaussian_cube"`
- `"euclidean__args__dimensions=3_-_space=uniform_cube"`
- `"euclidean__args__dimensions=10_-_space=uniform_cube"`
- `"single_peaked_conitzer"` - Single-peaked preferences (Conitzer)
- `"single_peaked_walsh"` - Single-peaked preferences (Walsh)
- `"stratification__args__weight=0.5"` - Stratified model
- `"mixed"` - Mixed distribution
- `"real_world"` - Generated from real-world voting data (requires `rwd_folder`)
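The simplest of these models is Impartial Culture, where every voter's ranking is an independent uniformly random permutation of the candidates. A sketch of such a generator (not the module's actual implementation, which relies on its own profile representation):

```python
import random


def generate_ic_profile(n_voters, m, rng=None):
    """Impartial Culture: each voter's ranking is an independent,
    uniformly random permutation of the m candidates (labeled 0..m-1)."""
    rng = rng or random.Random()
    return [tuple(rng.sample(range(m), m)) for _ in range(n_voters)]
```

Each returned tuple lists candidates from most to least preferred for one voter.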
The script generates CSV files with the following naming convention:
```
n_profiles={n}-num_voters={v}-varied_voters={vv}-voters_std_dev={std}-m={m}-committee_size={c}-pref_dist={model}-axioms={axioms}-{TYPE}.csv
```
Where {TYPE} is either TRAIN or TEST.
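The convention above can be reproduced with a small helper, useful when locating a dataset programmatically (a sketch; the script builds these names internally):

```python
def dataset_filename(n, v, vv, std, m, c, model, axioms, split):
    """Build a dataset filename following the convention above.

    `split` is either "TRAIN" or "TEST".
    """
    return (
        f"n_profiles={n}-num_voters={v}-varied_voters={vv}-"
        f"voters_std_dev={std}-m={m}-committee_size={c}-"
        f"pref_dist={model}-axioms={axioms}-{split}.csv"
    )
```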
Each row represents one preference profile with columns including:
- `Profile`: The preference profile as rankings
- `n_voters`: Number of voters in this profile
- `min_violations-committee`: Committee that minimizes axiom violations
- `min_violations`: Number of violations for the optimal committee
- `max_violations-committee`: Committee that maximizes axiom violations
- `max_violations`: Number of violations for the worst committee
- Feature columns: Candidate pair matrices, binary pairs, rank matrices (normalized)
- Rule winner columns (TEST data only): Winners for each voting rule
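One plausible construction for the candidate-pair features mentioned above records, for each ordered pair (a, b), the fraction of voters who rank a above b. This is a sketch of that idea, not necessarily the exact encoding the module uses:

```python
def candidate_pair_matrix(profile, m):
    """Return an m x m matrix whose entry [a][b] is the fraction of
    voters ranking candidate a above candidate b (diagonal stays 0)."""
    n = len(profile)
    matrix = [[0.0] * m for _ in range(m)]
    for ranking in profile:
        # position[c] = index of candidate c in this voter's ranking
        position = {cand: pos for pos, cand in enumerate(ranking)}
        for a in range(m):
            for b in range(m):
                if a != b and position[a] < position[b]:
                    matrix[a][b] += 1.0 / n
    return matrix
```

Rows and columns are indexed by candidate, so the matrix flattens naturally into a fixed-length feature vector.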
Generate IC data with 1000 profiles:
```shell
python -m network_ops.generate_data n_profiles=1000 prefs_per_profile=50 m=5 num_winners=3 pref_model="IC" axioms="all" out_folder="results"
```

Generate varied voter data:

```shell
python -m network_ops.generate_data n_profiles=500 prefs_per_profile=50 varied_voters=True voters_std_dev=10 m=6 num_winners=2 pref_model="URN-R" axioms="both" out_folder="data"
```

Generate real-world based data:

```shell
python -m network_ops.generate_data n_profiles=100 prefs_per_profile=20 m=5 num_winners=2 pref_model="real_world" rwd_folder="data/real_world_data" axioms="all" out_folder="rwd_results"
```

- The script automatically generates both TRAIN and TEST datasets
- Large datasets are saved incrementally to prevent data loss
- Real-world preference generation requires existing real-world voting data in the specified folder
- Preference models with `__args__` use the specified parameters (e.g., dimensions for euclidean models)
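A model string such as `euclidean__args__dimensions=3_-_space=gaussian_ball` can be split into a base name and its parameters. This sketch assumes the separators visible in the list above (`__args__` between name and arguments, `_-_` between arguments); the module's own parsing may differ:

```python
def parse_pref_model(model):
    """Split a preference-model string into (base_name, params dict)."""
    name, sep, arg_str = model.partition("__args__")
    params = {}
    if sep:  # only models carrying __args__ have parameters
        for part in arg_str.split("_-_"):
            key, _, value = part.partition("=")
            params[key] = value
    return name, params
```

Parameter values are returned as strings; callers can convert them (e.g., `int(params["dimensions"])`) as needed.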
The network_ops.train_networks module trains neural networks to learn multi-winner voting rules from generated data. It trains multiple networks using different feature sets and loss functions for robust comparison.
```shell
python -m network_ops.train_networks n_profiles=10000 n_voters=50 m=5 num_winners=3 data_path="data" varied_voters=True out_folder="trained_networks"
```

Required arguments:

- `n_profiles` (int): Number of training profiles to use
- `n_voters` (int): Number of voters per profile (must match training data)
- `m` (int): Number of candidates (must match training data)
- `num_winners` (int): Committee size (must match training data)
- `data_path` (str): Directory containing training data (generated by `generate_data`)
- `varied_voters` (bool): Whether training data has varied voters (must match training data)
- `out_folder` (str): Directory to save trained networks
Optional arguments:

- `axioms` (str): Axiom set used in training data (`"all"`, `"root"`, `"both"`). Default: `"all"`
- `pref_dist` (str): Specific preference distribution to train on. If not provided, trains on all available distributions
- `voters_std_dev` (int): Standard deviation for varied voters (must match training data if applicable)
The script automatically:
- Loads training data from the specified data_path
- Trains multiple networks per parameter set using different:
  - Feature sets (candidate pairs, binary pairs, rank matrices)
  - Loss functions (MSE, etc.)
- Saves trained models with descriptive filenames
- Records training losses for analysis
Default network configuration:

- Hidden layers: 5
- Hidden nodes: 256 per layer
- Epochs: 50
- Early stopping: Enabled with patience
- Optimizer: Adam
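The early stopping listed above can be sketched as a small tracker that halts training once validation loss stops improving. The patience value here is an illustrative assumption, not the training script's actual setting:

```python
class EarlyStopping:
    """Stop training when validation loss hasn't improved for
    `patience` consecutive epochs."""

    def __init__(self, patience=5):
        self.patience = patience
        self.best_loss = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's validation loss; return True to stop."""
        if val_loss < self.best_loss:
            self.best_loss = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience
```

In a training loop this is called once per epoch: `if stopper.step(val_loss): break`.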
Trained networks are saved with naming convention:
```
NN-num_voters={n}-m={m}-num_winners={k}-pref_dist={dist}-axioms={axioms}-features={features}-loss={loss}-idx={idx}-.pt
```
Train networks for IC distribution:
```shell
python -m network_ops.train_networks n_profiles=10000 n_voters=50 m=5 num_winners=3 data_path="data" varied_voters=False pref_dist="IC" out_folder="networks"
```

Train on all distributions with varied voters:

```shell
python -m network_ops.train_networks n_profiles=5000 n_voters=50 m=6 num_winners=2 data_path="data" varied_voters=True voters_std_dev=10 out_folder="trained_networks"
```

The network_ops.evaluate_networks module evaluates trained neural networks against test data and compares their performance with existing multi-winner voting rules.
```shell
python -m network_ops.evaluate_networks n_profiles=1000 n_voters=50 varied_voters=True voters_std_dev=10 m=5 num_winners=3 data_path="data" network_path="trained_networks" out_folder="evaluation_results"
```

Required arguments:

- `n_profiles` (int): Number of test profiles to evaluate on
- `n_voters` (int): Number of voters per profile (must match test data)
- `varied_voters` (bool): Whether test data has varied voters (must match test data)
- `voters_std_dev` (int): Standard deviation for varied voters (must match test data)
- `m` (int): Number of candidates (must match test data)
- `num_winners` (int): Committee size (must match test data)
- `data_path` (str): Directory containing test data
- `network_path` (str): Directory containing trained networks
- `out_folder` (str): Directory to save evaluation results
Optional arguments:

- `axioms` (str): Axiom set to evaluate (`"all"`, `"root"`, `"both"`). Default: `"all"`
- `pref_dist` (str): Specific preference distribution to evaluate. If not provided, evaluates all available distributions
The script:
- Loads test data and trained networks
- Makes predictions using neural networks
- Compares against existing rules:
  - Borda, STV, PAV, Monroe, etc.
  - Random choice baseline
  - Min/Max violation committees
- Computes distances between all rule pairs
- Counts axiom violations for each rule
- Saves comprehensive results
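The distance between two rules' outputs on a profile can be measured, for example, as the Hamming distance between committee membership vectors. A sketch of that metric (the evaluation script's exact distance may differ):

```python
def committee_distance(committee_a, committee_b, m):
    """Hamming distance between two committees over m candidates,
    treating each committee as a 0/1 membership vector of length m."""
    a, b = set(committee_a), set(committee_b)
    return sum((c in a) != (c in b) for c in range(m))
```

Two identical committees have distance 0; two disjoint committees of size k have distance 2k. Averaging over all test profiles gives one entry of the rule-distance matrix.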
Generated files include:
- Axiom violation results: `axiom_violation_results-{params}.csv`
- Rule distance matrices: `rule_distances-{params}.csv`
- Performance comparisons: Accuracy and violation statistics
Evaluate networks on IC test data:
```shell
python -m network_ops.evaluate_networks n_profiles=1000 n_voters=50 varied_voters=False voters_std_dev=0 m=5 num_winners=3 data_path="data" network_path="trained_networks" pref_dist="IC" out_folder="results"
```

Comprehensive evaluation across all distributions:

```shell
python -m network_ops.evaluate_networks n_profiles=5000 n_voters=50 varied_voters=True voters_std_dev=10 m=6 num_winners=2 data_path="data" network_path="trained_networks" out_folder="evaluation_results"
```

- Test data parameters must exactly match training data parameters
- Networks must exist for the specified parameters
- Evaluation compares neural networks against 15+ established voting rules
- Results enable analysis of when neural networks perform well vs. poorly
The optimize_interpretable_rules.py script uses simulated annealing to optimize positional scoring rules, searching for scoring vectors that minimize violations of the specified axioms on sampled preference profiles.
```shell
python optimize_interpretable_rules.py num_winners=[1,2,3] axioms_to_optimize="reduced" num_profiles_to_sample=5000 n_annealing_steps=10000
```

- `num_winners` (list): List of committee sizes to optimize for (e.g., `[1,2,3]`)
- `axioms_to_optimize` (str or list): Which axioms to optimize for
  - `"reduced"` - Reduced axiom set (default)
  - `"all"` - All available axioms
  - Custom list (e.g., `["majority_winner", "condorcet_winner"]`)
- `num_profiles_to_sample` (int): Number of preference profiles to sample for optimization (default: 5000)
- `n_annealing_steps` (int): Number of simulated annealing steps (default: 0 = no annealing)
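A positional scoring rule gives each candidate the score of its position in every voter's ranking and elects the k highest-scoring candidates; Borda, for instance, uses the vector (m-1, m-2, ..., 0). A sketch of applying such a rule (tie-breaking by candidate index is an assumption here):

```python
def elect_committee(profile, scoring_vector, num_winners):
    """Elect the num_winners candidates with the highest total
    positional score; ties are broken by lower candidate index."""
    m = len(scoring_vector)
    totals = [0.0] * m
    for ranking in profile:
        for pos, cand in enumerate(ranking):
            totals[cand] += scoring_vector[pos]
    order = sorted(range(m), key=lambda c: (-totals[c], c))
    return sorted(order[:num_winners])
```

This is what makes the optimized rules "interpretable": the entire rule is summarized by one scoring vector.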
- Sampling: Generates random preference profiles from mixed distributions
- Optimization: Uses simulated annealing to find scoring vectors that minimize axiom violations
- Evaluation: Tests optimized rules against sampled profiles for each axiom
- Results: Outputs optimized scoring vectors and violation rates
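The annealing step can be sketched as follows. Here `count_violations` stands in for the real axiom-violation objective, and the perturbation move and linear cooling schedule are illustrative assumptions, not the script's actual implementation:

```python
import math
import random


def anneal_scoring_vector(count_violations, m, n_steps, seed=0):
    """Simulated annealing over normalized, non-increasing scoring
    vectors of length m. `count_violations(vector)` is the objective
    to minimize (a stand-in for counting axiom violations)."""
    rng = random.Random(seed)
    current = [(m - 1 - i) / (m - 1) for i in range(m)]  # start from Borda
    cost = count_violations(current)
    best, best_cost = list(current), cost
    for step in range(n_steps):
        temp = max(1e-3, 1.0 - step / n_steps)  # linear cooling
        candidate = list(current)
        i = rng.randrange(1, m - 1)  # keep endpoints fixed at 1 and 0
        lo, hi = candidate[i + 1], candidate[i - 1]
        candidate[i] = rng.uniform(lo, hi)  # preserves non-increasing order
        new_cost = count_violations(candidate)
        # accept improvements always; accept worse moves with a
        # temperature-dependent probability
        if new_cost <= cost or rng.random() < math.exp((cost - new_cost) / temp):
            current, cost = candidate, new_cost
            if cost < best_cost:
                best, best_cost = list(current), cost
    return best, best_cost
```

With `n_annealing_steps=0` the loop never runs and the starting vector is returned unchanged, matching the script's default of no annealing.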
Optimize for single winner with annealing:
```shell
python optimize_interpretable_rules.py num_winners=[1] axioms_to_optimize="reduced" n_annealing_steps=5000
```

Optimize for multiple committee sizes:

```shell
python optimize_interpretable_rules.py num_winners=[1,2,3,4] axioms_to_optimize="all" num_profiles_to_sample=10000
```

The experiment_both_axiom_sets/arXiv/ directory contains scripts to generate comprehensive appendix materials including tables, plots, and LaTeX documentation.
Creates pairwise distance matrices between voting rules:
```shell
cd experiment_both_axiom_sets/arXiv/
python make_distance_tables.py
```

- Output: LaTeX tables in `distance_tex_tables/`
- Process: Computes Hamming distances between rule outputs across all experimental conditions
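Emitting a distance matrix as a LaTeX `tabular` can be sketched roughly as below; the actual layout of the tables in distance_tex_tables/ may well differ:

```python
def matrix_to_latex(labels, matrix):
    """Render a square distance matrix as a LaTeX tabular string,
    with one header row and one label column."""
    cols = "l" + "c" * len(labels)
    lines = [f"\\begin{{tabular}}{{{cols}}}"]
    lines.append(" & " + " & ".join(labels) + r" \\ \hline")
    for label, row in zip(labels, matrix):
        cells = " & ".join(f"{x:.2f}" for x in row)
        lines.append(f"{label} & {cells} \\\\")
    lines.append(r"\end{tabular}")
    return "\n".join(lines)
```

The resulting string can be written to a `.tex` file and `\input` into the appendix document.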
Creates performance summary tables for all rules and distributions:
```shell
python make_summary_table.py
```

- Output: Formatted tables in `summary_tables/`
- Content: Axiom violation rates, rule rankings, statistical summaries
Creates visualization plots for experimental results:
```shell
python plot_experiment_data.py
```

- Output: PNG plots in `plots/` subdirectories
- Types: Axiom violation heatmaps, distribution comparisons, rule performance charts
Combines all materials into a single LaTeX appendix:
```shell
python make_appendix.py
```

- Output: `combined_appendix.tex`
- Content: Integrated tables, plots, and structured documentation
```
experiment_both_axiom_sets/arXiv/
├── distance_heatmaps/     # Distance visualization plots
├── distance_tex_tables/   # LaTeX distance tables
├── summary_tables/        # Performance summary tables
├── plots/                 # Experimental result plots
└── combined_appendix.tex  # Final appendix document
```
- `make_distance_tables.py`: Generates rule similarity matrices
- `make_summary_table.py`: Creates performance comparison tables
- `plot_experiment_data.py`: Generates all experimental plots
- `make_appendix.py`: Compiles everything into LaTeX format
- Scripts expect experimental results to be available in the appropriate data directories
- LaTeX compilation requires the generated `.tex` files and associated plot images
- The workflow processes data for m=5, 6, and 7 candidates across all preference distributions