implementing mcp invocation approvals by blublinsky · Pull Request #2769 · openshift/lightspeed-service

blublinsky · 2026-02-24T14:43:05Z

Description

Implementing MCP approvals for streaming query only

Type of change

Related Tickets & Documents

Related Issue #
OLS-2659
Closes #
OLS-2659

Checklist before requesting a review

I have performed a self-review of my code.
PR has passed all pre-merge test jobs.
If it is a core feature, I have added thorough tests.

Testing

Please provide detailed steps to perform tests related to this code change.
How were the fix/results from this change verified? Please provide relevant screenshots or results.

openshift-ci · 2026-02-24T14:44:09Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign blublinsky for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

onmete · 2026-02-24T20:07:17Z

This seems like multiple PRs bundled together
PR 1 docs_summarizer.iterate_with_tools method refactoring
PR 2 Tool error retry improvement (is retriable), timeout and "do not retry" messages
PR 3 Batch-to-streaming rewrite of tool execution
PR 4 Approval feature
PR 5 Adding approval endpoints (changes not in this PR)

Regarding the approval design - is the approval gate in the right layer? Currently, it lives inside the tool execution engine (tools.py), where each worker independently checks policy, emits events, and blocks waiting for a decision. But the orchestrator (_process_tool_calls_for_round) already makes other pre-execution decisions at this level (token budget checks, duplicate tool filtering). Approval is the same kind of gate. Proposed flow:

_process_tool_calls_for_round():
  1. Resolve tool calls → ToolCallDefinitions
  2. For each definition, check need_validation()            <- POLICY
  3. For tools needing approval:
     a. yield StreamedChunk(type="approval_required", ...)     <- EMIT
     b. await get_approval_decision()                          <- WAIT
     c. If rejected, add synthetic ToolMessage, skip execution
  4. Pass only approved tools to execute_tool_calls_stream()  <- EXECUTE
  5. Process results

In other words, just collect all tools needing approval, emit them together, wait for all decisions (or timeout and exit).
The contract is that the user must approve or reject every tool before the round proceeds.

blublinsky · 2026-02-24T20:59:21Z

Regarding the approval design - is the approval gate in the right layer? Currently, it lives inside the tool execution engine (tools.py), where each worker independently checks policy, emits events, and blocks waiting for a decision. But the orchestrator (_process_tool_calls_for_round) already makes other pre-execution decisions at this level (token budget checks, duplicate tool filtering). Approval is the same kind of gate. Proposed flow:
_process_tool_calls_for_round():
  1. Resolve tool calls → ToolCallDefinitions
  2. For each definition, check need_validation()            <- POLICY
  3. For tools needing approval:
     a. yield StreamedChunk(type="approval_required", ...)     <- EMIT
     b. await get_approval_decision()                          <- WAIT
     c. If rejected, add synthetic ToolMessage, skip execution
  4. Pass only approved tools to execute_tool_calls_stream()  <- EXECUTE
  5. Process results
In other words, just collect all tools needing approval, emit them together, wait for all decisions (or timeout and exit). The contract is that the user must approve or reject every tool before the round proceeds.

Why. What is implemented is cleaner and faster.
We execute all tools in parallel and emit an approval request asap or process the tool. Because I am using AsyncGenerator, the whole code is very straightforward.
I do not see a reason for collecting approval requests; it is simpler and faster.

What you are suggesting will require an extra loop to collect approval requests. That will make the overall logic slower and more complex

onmete · 2026-02-25T10:09:10Z

Regarding the approval design - is the approval gate in the right layer? Currently, it lives inside the tool execution engine (tools.py), where each worker independently checks policy, emits events, and blocks waiting for a decision. But the orchestrator (_process_tool_calls_for_round) already makes other pre-execution decisions at this level (token budget checks, duplicate tool filtering). Approval is the same kind of gate. Proposed flow:
_process_tool_calls_for_round():
  1. Resolve tool calls → ToolCallDefinitions
  2. For each definition, check need_validation()            <- POLICY
  3. For tools needing approval:
     a. yield StreamedChunk(type="approval_required", ...)     <- EMIT
     b. await get_approval_decision()                          <- WAIT
     c. If rejected, add synthetic ToolMessage, skip execution
  4. Pass only approved tools to execute_tool_calls_stream()  <- EXECUTE
  5. Process results
In other words, just collect all tools needing approval, emit them together, wait for all decisions (or timeout and exit). The contract is that the user must approve or reject every tool before the round proceeds.
Why. What is implemented is cleaner and faster. We execute all tools in parallel and emit an approval request asap or process the tool. Because I am using AsyncGenerator, the whole code is very straightforward. I do not see a reason for collecting approval requests; it is simpler and faster.

What you are suggesting will require an extra loop to collect approval requests. That will make the overall logic slower and more complex

Faster = this executes tools without approval need right away. That is the only speed difference. The turn is blocked either way on user approvals. It can be done in docsummarizer layer too. It is also using polling and not asyncio.Event.wait(), which is debatable if it is really faster.

Cleaner = approval logic lives in the tool execution engine, which was stateless (tool in, result out). It now reads config.tools_approval and config.pending_approval_store, generates UUIDs, and polls for decisions. Also streaming_ols.py now has to inspect item.data.get("event") != "approval_required" to filter approval events from stored tool calls - that's approval semantics leaking into a layer that shouldn't need to know about it. Can we at least isolate the config dependency?

blublinsky · 2026-02-25T15:32:32Z

Faster = this executes tools without approval need right away. That is the only speed difference. The turn is blocked either way on user approvals. It can be done in docsummarizer layer too. It is also using polling and not asyncio.Event.wait(), which is debatable if it is really faster.

It also delivers approval requests faster and in parallel. Doing this in doc-summarizer seems quite wrong - approval is part of tools invocation, thats why it is done the way it is

blublinsky · 2026-02-25T15:34:57Z

Cleaner = approval logic lives in the tool execution engine, which was stateless (tool in, result out). It now reads config.tools_approval and config.pending_approval_store, generates UUIDs, and polls for decisions. Also streaming_ols.py now has to inspect item.data.get("event") != "approval_required" to filter approval events from stored tool calls - that's approval semantics leaking into a layer that shouldn't need to know about it. Can we at least isolate the config dependency?

This is a different issue. Fixed most of it by introducing an additional type. It's cleaner because approval is done where it belongs - in the tools invocation. Except for yield, the code looks similar to sequential code, which is quite simple. Tools execution is still stateless, its just longer.

blublinsky · 2026-02-26T18:57:54Z

/test "ci/prow/verify"

blublinsky · 2026-02-26T18:58:41Z

/test verify

blublinsky · 2026-02-27T11:22:28Z

/retest

blublinsky · 2026-02-27T20:31:17Z

/retest

blublinsky · 2026-03-02T15:58:28Z

/retest

blublinsky · 2026-03-16T10:05:44Z

/retest

Made-with: Cursor

openshift-ci · 2026-03-17T22:17:43Z

@blublinsky: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

openshift-ci bot requested review from raptorsun and tisnik February 24, 2026 14:43

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 25, 2026

blublinsky force-pushed the approval_implementation branch from 11336f4 to 35804a2 Compare February 25, 2026 11:30

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 25, 2026

blublinsky force-pushed the approval_implementation branch 2 times, most recently from 145ecdc to fe25c38 Compare February 26, 2026 12:21

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 26, 2026

blublinsky force-pushed the approval_implementation branch from fe25c38 to 8a43dac Compare February 26, 2026 12:40

openshift-merge-robot added needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. and removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. labels Feb 26, 2026

blublinsky force-pushed the approval_implementation branch from 8a43dac to 6464e1f Compare February 26, 2026 17:29

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 26, 2026

blublinsky force-pushed the approval_implementation branch 2 times, most recently from 2abe6b9 to 32ff351 Compare February 27, 2026 08:02

blublinsky force-pushed the approval_implementation branch from 32ff351 to 02d9b6e Compare February 27, 2026 12:56

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 27, 2026

blublinsky force-pushed the approval_implementation branch from 02d9b6e to 5d04729 Compare February 27, 2026 19:38

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 27, 2026

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 2, 2026

blublinsky force-pushed the approval_implementation branch from 5d04729 to b792350 Compare March 3, 2026 18:13

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 3, 2026

blublinsky force-pushed the approval_implementation branch 8 times, most recently from 8b532ae to 769ff88 Compare March 14, 2026 19:42

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 14, 2026

blublinsky force-pushed the approval_implementation branch from 769ff88 to 6d6004c Compare March 14, 2026 20:08

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 14, 2026

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 16, 2026

blublinsky force-pushed the approval_implementation branch from 6d6004c to f3bc8d9 Compare March 16, 2026 11:14

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 16, 2026

blublinsky force-pushed the approval_implementation branch from f3bc8d9 to b78d564 Compare March 16, 2026 13:14

implementing mcp invocation approvals

fcb8a62

Made-with: Cursor

blublinsky force-pushed the approval_implementation branch from b78d564 to fcb8a62 Compare March 17, 2026 20:34

Conversation

blublinsky commented Feb 24, 2026

Description

Type of change

Related Tickets & Documents

Checklist before requesting a review

Testing

Uh oh!

openshift-ci bot commented Feb 24, 2026

Uh oh!

onmete commented Feb 24, 2026

Uh oh!

blublinsky commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

onmete commented Feb 25, 2026

Uh oh!

blublinsky commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

blublinsky commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

blublinsky commented Feb 26, 2026

Uh oh!

blublinsky commented Feb 26, 2026

Uh oh!

blublinsky commented Feb 27, 2026

Uh oh!

blublinsky commented Feb 27, 2026

Uh oh!

blublinsky commented Mar 2, 2026

Uh oh!

blublinsky commented Mar 16, 2026

Uh oh!

openshift-ci bot commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

blublinsky commented Feb 24, 2026 •

edited

Loading

blublinsky commented Feb 25, 2026 •

edited

Loading

blublinsky commented Feb 25, 2026 •

edited

Loading