OLS-2673: Add query mode field to support different response styles#2807
OLS-2673: Add query mode field to support different response styles#2807onmete wants to merge 2 commits intoopenshift:mainfrom
Conversation
|
@onmete: This pull request references OLS-2673 which is a valid jira issue. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
Skipping CI for Draft Pull Request. |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
Although proposed approach will work, I think we need additional discussion on this.
With this in place OLS will be fully configurable by users for the set of modes/promptes with NO changes to the OLS code. |
Made-with: Cursor
| provider=llm_request.provider, | ||
| model=llm_request.model, | ||
| system_prompt=llm_request.system_prompt, | ||
| mode=llm_request.mode, |
There was a problem hiding this comment.
Why do we need this in response?
There was a problem hiding this comment.
You mean in response or DocsSummarizer (where is this comment attached)?
There was a problem hiding this comment.
It is not in the response. It is in ProcessedRequest, not the one we return (LLMResponse).
| """ | ||
|
|
||
| TROUBLESHOOTING_SYSTEM_INSTRUCTION = "NOT IMPLEMENTED" | ||
|
|
There was a problem hiding this comment.
There is a certain asymmetry here. Default prompt (ask) is overwritable (can be configured through CR), and troubleshooting one?
|
/retest |
|
@onmete: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
/retest |
Description
Add a
modefield to the OLS query API to support different response styles. Two modes are defined in aQueryModeenum:The mode is accepted in
LLMRequest, threaded throughProcessedRequest, and the only runtime effect is which system promptQueryHelperselects. All other pipeline behavior (RAG, tool calling, caching, streaming, quota) is unchanged.The mode is also recorded in transcript metadata for observability.
Type of change
Related Tickets & Documents
Checklist before requesting a review
Testing
test_models.py): default mode, explicit values, string coercion, invalid valuestest_query_helper.py): ASK mode, TROUBLESHOOTING mode, config override precedence, default behaviortest_ols.py): verifies mode reachesDocsSummarizer.__init__test_store_transcriptupdated to assertmodeappears in transcript metadatauv run pytestMade with Cursor