feat(transformers): upgrade modeling_utils/RoPE/optimization/cache_utils etc to v4.50
fix cache_utils
Add beam_search & Add ForCausalLMLoss
fix beam_search
* examples/codellama
* test llama
* update transformers version to 4.50
* update transformers version 4.51
* download checkpoints readme
* correct load name
* move files
* revert pyproject
fix cache_utils bugs
Align v450
Revert "Align v450"
* feat(transformers): upgrade modeling_utils/RoPE/optimization/cache_utils etc to v4.50
* precommit
* fix cache_util
* update generation_util, logit_processor, stopping_criteria, candidate_generator
* fix cache_utils
* Add beam_search&For ForCausalLMLoss loss
* update modeling_outputs
* update beam_search
* pre-commit check
* add support for dynamic input
* add beam search py
* fix tie_weight dtype mismatch in pynative mode for albert
* fix beam_search
* fix bug of dynamic input
* add _supports_dynamic_input to PretrainedModel
* fix pynative synchronize bug for albert[temporal]
* support multimodal for init_static_cache[hack implementation]
* fix cache_utils bugs
* fix cumsum does not support int64

---------

Co-authored-by: Chaoran Wei <77485245+wcrzlh@users.noreply.github.com>
Co-authored-by: Mark-ZhouWX <zhouwuxing000@gmail.com>
Co-authored-by: liuchuting <liuchuting1@huawei.com>
What does this PR do?
Fixes # (issue)
Adds # (feature)
Before submitting
What's New. Here are the documentation guidelines
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@xxx