Overriding the new default of 'sdpa' by ParthaEth · Pull Request #108 · MeiGen-AI/MultiTalk

ParthaEth · 2025-07-23T08:27:07Z

The default value of the attn_implementation argument in Wav2Vec2Model.from_pretrained() depends on the underlying version of Transformers (Hugging Face) you’re using.

As of Transformers v4.38+, the attn_implementation argument can be one of the following:
"eager" — standard PyTorch attention (default prior to v4.37)
"flash_attention_2" — uses FlashAttention v2 if available (faster and more memory-efficient)
"sdpa" — uses PyTorch’s scaled dot-product attention if available

I have not tested with - flash_attention_2. It would be great if that can speed things up.

VRWarg and others added 5 commits July 23, 2025 10:24

Overriding the newdefault of 'sdpa'

34446ab

Building CLI interface so it can be triggered from hubspot

61ed007

real number of steps

705e428

So the checkpoint is not picked from the deve version

915ccbc

minor

d018b00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overriding the new default of 'sdpa'#108

Overriding the new default of 'sdpa'#108
ParthaEth wants to merge 5 commits intoMeiGen-AI:mainfrom
ParthaEth:main

ParthaEth commented Jul 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ParthaEth commented Jul 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants