Run the evaluation after collecting your latents with the EvalSampler via:
python MeasureResults.py --eval-dir /Comfy/output/eval_latents --stratify-std
It expects the baseline latents to be in a sub-folder called "BF16". It then compares against every other subfolder. Results will be printed in terminal, and saved as various formats.