Skip to content

Add PretokenizedWebDataset to data/__init__.py#101

Open
rand0musername wants to merge 1 commit intobytedance:mainfrom
rand0musername:patch-1
Open

Add PretokenizedWebDataset to data/__init__.py#101
rand0musername wants to merge 1 commit intobytedance:mainfrom
rand0musername:patch-1

Conversation

@rand0musername
Copy link
Copy Markdown

Fix needed to run the code snippet from README_RAR.md, as importing from train_utils.py executes from data import SimpleImageDataset, PretoeknizedDataSetJSONL, PretokenizedWebDataset which crashes as PretokenizedWebDataset is missing from __init__.py of data/.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant