-
Notifications
You must be signed in to change notification settings - Fork 21
Open
Description
Hello,
Tensor assertion error is raised if you try to train the model. It starts with the following:
0%| | 0/10 [00:00<?, ?it/s]../aten/src/ATen/native/cuda/Indexing.cu:1289: indexSelectLargeIndex: block: [56,0,0], thread: [64,0,0] Assertion `srcIndex < srcSelectDimSize` failed.It's possible to avoid this error by resizing embeddings:
model.resize_token_embeddings(len(tokenizer))Unfortunately, it might affect the performance. Could you please review the finetune.py script and fix this problem?
Thank you in advance.
BR,
Mehti
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels