Hi kvablack,
Thanks for your great work! I'm using dlimp to read the OpenX dataset and train a model with the Hugging Face Trainer for multi-GPU acceleration. In multi-GPU training, a serialization error occurs when the dataset is loaded and training starts. The following snippet reproduces the error by pickling the dataset directly:
train_dataset = build_datasets(
data_args,
)
# TEST: dump the dataset using pickle
import pickle
with open("outputs/dataset.obj", "wb") as filehandler:
    pickle.dump(train_dataset, filehandler)
# [error]
pickle.PicklingError: Can't pickle <class 'dlimp.dataset.DLataset'>: it's not the same object as dlimp.dataset.DLataset
I believe the issue lies in the serialization of the dlimp Dataset or tf Dataset. Could you provide any solution or suggestions?
Many Thanks! 🤗
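For context on the error message itself: as far as I understand, pickle stores a class by its module and name, and raises "it's not the same object as ..." when looking up that name returns a different class object than the one being pickled. The sketch below reproduces that message in plain Python, without dlimp or TensorFlow. `Record` and `try_pickle` are hypothetical names made up for illustration, and whether this mechanism is what actually happens inside dlimp is an assumption I have not verified.

```python
import pickle


class Record:
    """Hypothetical stand-in class; not part of dlimp."""

    def __init__(self, value):
        self.value = value


# Create an instance and keep a reference to the original class object.
item = Record(1)
original_class = Record


# Rebind the same module-level name to a new class object.
# Wrapping or re-creating a class can have the same effect.
class Record:  # noqa: F811
    def __init__(self, value):
        self.value = value


def try_pickle(obj):
    """Return None if pickling succeeds, else the error message."""
    try:
        pickle.dumps(obj)
    except pickle.PicklingError as exc:
        return str(exc)
    return None


# `item` still belongs to the original class, but the name `Record`
# now resolves to the new class, so pickle refuses to serialize it.
error_message = try_pickle(item)
print(error_message)
```

If something similar applies here, the class object held by `train_dataset` and the one reachable as `dlimp.dataset.DLataset` would differ, which could be checked with `type(train_dataset) is dlimp.dataset.DLataset`.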