Error(s) in loading state_dict for TransformerModel:
size mismatch for encoder.embed_tokens.weight: copying a param with shape torch.Size([28168, 512]) from checkpoint, the shape in current model is torch.Size([26346, 512]).
size mismatch for decoder.embed_out: copying a param with shape torch.Size([28160, 512]) from checkpoint, the shape in current model is torch.Size([26346, 512]).
size mismatch for decoder.embed_tokens.weight: copying a param with shape torch.Size([28160, 512]) from checkpoint, the shape in current model is torch.Size([26346, 512]).
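The shapes in the message suggest the checkpoint was trained against a larger vocabulary (about 28k entries) than the dictionary used to build the current model (26346 entries) — typically the result of preprocessing the data with a different dictionary or BPE/subword model than the one used at training time. A minimal sketch of how this error arises (sizes taken from the message above; the embedding layers here are stand-ins, not the actual fairseq modules):

```python
import torch.nn as nn

# Hypothetical reproduction: the checkpoint's embedding table was built from a
# 28168-entry vocabulary, but the current model's dictionary has 26346 entries.
ckpt_embed = nn.Embedding(28168, 512)  # vocab size at training time
cur_embed = nn.Embedding(26346, 512)   # vocab size from the current dictionary

msg = ""
try:
    # Copying a [28168, 512] tensor into a [26346, 512] parameter fails.
    cur_embed.load_state_dict(ckpt_embed.state_dict())
except RuntimeError as e:
    msg = str(e)

print(msg.splitlines()[0])
```

The fix is usually not to relax loading (`strict=False` would still fail here, since the shapes, not the keys, disagree) but to rebuild the model with the same dictionary files the checkpoint was trained with.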