Hi,
Thanks for the amazing work!
I wonder if there's a dataset class implementation for the ContraStyles dataset.
It seems like LAIONDedup might be the one, but there are some mismatches between the train.parquet file and the code.
(e.g., what are dedup_info and joined.cache?)
Thanks,