Skip to content

[None][fix] Enable LoRA in PRAD speculative decoding#13007

Draft
Funatiq wants to merge 2 commits intoNVIDIA:mainfrom
Funatiq:dev/fix/prad_lora
Draft

[None][fix] Enable LoRA in PRAD speculative decoding#13007
Funatiq wants to merge 2 commits intoNVIDIA:mainfrom
Funatiq:dev/fix/prad_lora

Conversation

@Funatiq
Copy link
Copy Markdown
Collaborator

@Funatiq Funatiq commented Apr 13, 2026

@coderabbitai summary

Description

Test Coverage

PR Checklist

Please review the following before submitting your PR:

  • PR description clearly explains what and why. If using CodeRabbit's summary, please make sure it makes sense.

  • PR Follows TRT-LLM CODING GUIDELINES to the best of your knowledge.

  • Test cases are provided for new code paths (see test instructions)

  • Any new dependencies have been scanned for license and vulnerabilities

  • CODEOWNERS updated if ownership changes

  • Documentation updated as needed

  • Update tava architecture diagram if there is a significant design change in PR.

  • The reviewers assigned automatically/manually are appropriate for the PR.

  • Please check this after reviewing the above items as appropriate for this PR.

GitHub Bot Help

To see a list of available CI bot commands, please comment /bot help.

- Add integration test for LoRA in PRAD speculative decoding with and without CUDA graph.

Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
@Funatiq Funatiq force-pushed the dev/fix/prad_lora branch from 32f5156 to 0a62078 Compare April 14, 2026 14:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant