Skip to content

Expose the context parameter during creation of TensorRT engines#1367

Open
jakstein wants to merge 2 commits intomcmonkeyprojects:masterfrom
jakstein:master
Open

Expose the context parameter during creation of TensorRT engines#1367
jakstein wants to merge 2 commits intomcmonkeyprojects:masterfrom
jakstein:master

Conversation

@jakstein
Copy link
Copy Markdown
Contributor

@jakstein jakstein commented May 3, 2026

A simple tweak that exposes previously hardcoded context parameter (was hardcoded to 1) when creating TensorRT engines. Now users will get to actually select how big context support they want in the dialog, along with other params.
This allows the created engines to actually work with prompts exceeding 75 tokens.
If user inputs 75 in dialog, the value is set to 1, if 150 then 2, so on.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant