
Conversation

@xiaoyu-work
Collaborator

Describe your changes

This pull request introduces compatibility updates for Hugging Face Transformers 5.0 and improves handling of dynamic cache and input formats in Olive's ONNX conversion and training utilities. It also updates tests and requirements to reflect these changes and ensure robust model export and training workflows.

Transformers 5.0 Compatibility

  • Added patching and conversion utilities for DynamicLayer.lazy_initialization, past_key_values, and dynamic shapes to support the new DynamicCache format in Transformers >= 5.0. This ensures that models using a dynamic cache export correctly with torch.export.
  • Updated the _export_pytorch_model logic to apply the new patches and conversions only for Transformers >= 5.0, while keeping the legacy path for older versions (see the sketch after this list).
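
A minimal sketch of the version-gated conversion described above, not the PR's actual implementation. It assumes DynamicCache() can still be constructed empty and filled through the standard Cache.update(key, value, layer_idx) API; helper names are illustrative:

from packaging import version

import transformers
from transformers.cache_utils import DynamicCache

IS_TRANSFORMERS_5 = version.parse(transformers.__version__) >= version.parse("5.0")


def past_kv_to_dynamic_cache(past_key_values):
    """Convert legacy ((key, value), ...) tuples into a DynamicCache."""
    cache = DynamicCache()
    for layer_idx, (key, value) in enumerate(past_key_values):
        cache.update(key, value, layer_idx)
    return cache


def prepare_dummy_kwargs(dummy_kwargs: dict) -> dict:
    # Per the PR, Transformers >= 5.0 models take a DynamicCache for
    # past_key_values; older versions keep the legacy tuple format unchanged.
    past = dummy_kwargs.get("past_key_values")
    if IS_TRANSFORMERS_5 and isinstance(past, tuple):
        dummy_kwargs = {**dummy_kwargs, "past_key_values": past_kv_to_dynamic_cache(past)}
    return dummy_kwargs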

Training Argument Handling

  • Improved the filtering of training arguments in create_training_args to remove fields that are no longer valid in Transformers 5.0 and to exclude None values, allowing Transformers to use its own defaults (see the sketch below).
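
A hedged sketch of that filtering idea (the example field names are illustrative, not the actual arguments removed in the PR): keep only real TrainingArguments init fields and drop None values so Transformers falls back to its own defaults.

import dataclasses

from transformers import TrainingArguments


def create_training_args_sketch(raw_args: dict) -> TrainingArguments:
    # Keep only names that are genuine TrainingArguments init fields and skip
    # None values so Transformers applies its own defaults for them.
    valid = {f.name for f in dataclasses.fields(TrainingArguments) if f.init}
    filtered = {k: v for k, v in raw_args.items() if k in valid and v is not None}
    return TrainingArguments(**filtered)


# Example: the unknown field and the None value are dropped before construction.
args = create_training_args_sketch(
    {"output_dir": "out", "learning_rate": 2e-5, "unknown_field": 1, "warmup_steps": None}
)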

Test Suite Updates

  • Modified model loading and metadata tests to remove the trust_remote_code parameter and to update expected file counts and tokenizer types for Transformers 5.0.
  • Updated the model output comparison in the rotation tests to cast logits to float before comparison, ensuring consistency across dtypes (see the sketch after this list).
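
The dtype fix in the rotation tests boils down to something like this sketch (tensor names and tolerances are illustrative):

import torch


def assert_logits_close(expected: torch.Tensor, actual: torch.Tensor) -> None:
    # Cast both sides to float32 so fp16/bf16 and fp32 runs compare consistently.
    torch.testing.assert_close(actual.float(), expected.float(), rtol=1e-3, atol=1e-3)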

Requirements Adjustments

  • Restricted the onnxscript version to <0.6.1 and removed the Transformers version pin, reflecting confidence that the test suite is compatible with Transformers 5.0.

Checklist before requesting a review

  • Add unit tests for this change.
  • Make sure all tests can pass.
  • Update documents if necessary.
  • Lint and apply fixes to your code by running lintrunner -a
  • Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

(Optional) Issue link

# Apply patches for DynamicCache / past_key_values compatibility
if version.parse(transformers.__version__) >= version.parse("5.0"):
    # transformers >= 5.0: DynamicCache refactored to use DynamicLayer
    from transformers.integrations.executorch import register_dynamic_cache_export_support
Contributor

register_dynamic_cache_export_support does not have the right ordering for kv caches. I would just use the same patch regardless of the transformers version.

Collaborator Author

Do you mean _patch_model_if_necessary? Transformers 5.0 updated DynamicCache, and I got the error "AttributeError: 'DynamicCache' object has no attribute 'to_legacy_cache'".

Contributor

@justinchuby Feb 10, 2026

We can update the patch code (_patch_model_if_necessary) so that it works universally. There is no need to call to_legacy_cache. The executorch integration is not reliable for our usage.
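
For reference, one version-agnostic shape such a patch could take, as a sketch only (the attribute names are assumptions: newer DynamicCache builds expose per-layer objects carrying keys/values, while older ones expose key_cache/value_cache lists):

def dynamic_cache_to_tuples(cache):
    """Extract ((key, value), ...) per layer without calling to_legacy_cache."""
    if hasattr(cache, "layers"):  # assumed transformers >= 5.0 layout (DynamicLayer objects)
        return tuple((layer.keys, layer.values) for layer in cache.layers)
    # assumed legacy layout (transformers < 5.0)
    return tuple(zip(cache.key_cache, cache.value_cache))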

Contributor

@justinchuby Feb 10, 2026

@titaiwangms suggestions on what the patch logic should be?

@titaiwangms requested a review from xadupre February 10, 2026 22:44
# transformers >= 5.0: DynamicCache refactored to use DynamicLayer
from transformers.integrations.executorch import register_dynamic_cache_export_support

register_dynamic_cache_export_support()
Contributor

@xadupre Do you have any suggestions for avoiding the executorch function? Per Justin's reminder, it produces the wrong kv cache order.

logger.debug("Patched DynamicLayer.lazy_initialization for torch.export compatibility.")


def _convert_past_key_values_to_dynamic_cache(dummy_kwargs: dict) -> dict:
Member

With transformers 5+, the update mechanism is not defined by the DynamicCache class but by the class of each layer. This code only works for a DynamicCache that uses DynamicLayer; it won't work for a DynamicCache mixing DynamicLayer and DynamicSlidingWindowLayer. The code is fine, but it is better to keep that in mind for other models that use sliding windows.
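
Building on this caveat, a defensive check could look like the sketch below (the import path and exact layer class name are assumptions based on this thread):

from transformers.cache_utils import DynamicCache, DynamicLayer  # assumed import path


def uses_only_dynamic_layers(cache: DynamicCache) -> bool:
    # The conversion assumes plain DynamicLayer entries; bail out (or fall back)
    # when a model mixes in sliding-window layers.
    return all(type(layer) is DynamicLayer for layer in cache.layers)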

soundfile
tabulate
torchvision
# Remove version pin when the tests are fixed
Member

Should you add transformers>=5, unless there are tests checking multiple versions of transformers?
