feat: durable delegated tool approval flow by anubra266 · Pull Request #2966 · inkeep/agents

anubra266 · 2026-04-01T23:50:58Z

Summary

Adds support for tool approvals in delegated sub-agents running in durable execution mode

changeset-bot · 2026-04-01T23:51:03Z

⚠️ No Changeset found

Latest commit: 46a02c3

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

vercel · 2026-04-01T23:51:04Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
agents-api	Ready	Preview, Comment	Apr 3, 2026 11:37pm
agents-manage-ui	Ready	Preview, Comment	Apr 3, 2026 11:37pm

1 Skipped Deployment

Project	Deployment	Actions	Updated (UTC)
agents-docs	Skipped		Apr 3, 2026 11:37pm

pullfrog · 2026-04-01T23:52:56Z

TL;DR — Routes tool approval requests from delegated sub-agents through the durable workflow hook system (WDK) instead of the in-memory pub/sub bus, and fixes a re-delegation loop where approving a delegated tool caused the parent to re-send the original user message instead of continuing with the tool result.

Key changes

Parent-side detection of durable-approval-required artifacts in tool-wrapper — wrapToolWithStreaming inspects delegation results for durable-approval-required data artifacts and populates pendingDurableApproval with delegatedApproval context including the sub-agent's actual toolCallId.
Post-approval continuation prompt — New isPostApproval flag in the workflow loop sends a continuation message instead of re-sending the original user input, breaking the re-delegation loop.
Delegated approval forwarding through A2A metadata — relationTools passes durableWorkflowRunId and delegatedToolApproval decision through delegation metadata so the sub-agent receives pre-approved tool calls on re-execution.
CredentialStoreRegistry and baseUrl fix in durable execution path — buildAgentForStep now creates a CredentialStoreRegistry for parity with classic mode and appends /run/agents to the base URL so delegation A2A calls route correctly.
Remove in-memory pub/sub for durable delegated path — tool-approval.ts drops the toolApprovalUiBus.publish branch for delegated agents; replaced by the artifact flow.
Delegated approval SSE streaming — callLlmStep streams toolInputStart, toolInputDelta, toolInputAvailable, and toolApprovalRequest SSE events using the delegated tool's actual toolCallId so the UI displays the correct tool.
Handle AI SDK stopWhen throw for durable approval detection — callLlmStep wraps agent.generate() in a try/catch so that when the AI SDK's stopWhen callback throws (instead of returning cleanly), pending durable approvals are still detected and surfaced. The same pattern is added to generateTaskHandler for the non-durable A2A path via a shared buildDurableApprovalResult helper.

_{Summary ｜ 9 files ｜ 4 commits ｜ base: main ← feat/durable-delegated-tool-approval}

Parent-side detection of delegated approval artifacts

Before: A delegated sub-agent needing tool approval used the in-memory toolApprovalUiBus which doesn't work across durable workflow boundaries, or threw an error caught by a try/catch wrapper.
After: The sub-agent returns a durable-approval-required data artifact that the parent's tool-wrapper detects after normal execution — no error-path handling needed.

wrapToolWithStreaming scans both message-level parts (result.parts[]) and artifact-level parts (result.artifacts[].parts[]) via a findApprovalRequired helper. When found, it populates ctx.pendingDurableApproval with the delegated approval context including subAgentId, toolCallId, toolName, and args. Delegation results are now stored in conversation history for durable workflows (skipHistoryStorage is false for delegate_to_* tools when durableWorkflowRunId is set), ensuring the next callLlmStep sees the delegation outcome.

tool-wrapper.ts · agent-types.ts · Agent.ts

Post-approval continuation prompt to prevent re-delegation loop

Before: After the user approved a delegated tool call, the workflow re-sent the original user message to the LLM — causing it to re-delegate to the sub-agent in an infinite loop.
After: An isPostApproval flag switches the user message to a continuation prompt ("Continue the conversation. The tool results above contain the information needed to respond to the user."), so the LLM processes the tool results rather than re-delegating.

The flag is set after each approval round in agentExecution.ts and reset on transfer. callLlmStep uses it to select the prompt strategy.

agentExecution.ts · agentExecutionSteps.ts

Delegated approval forwarding through A2A delegation metadata

Before: relationTools built delegation metadata without workflow or approval context — the sub-agent had no way to receive pre-approved tool calls on re-execution.
After: durable_workflow_run_id and approved_tool_calls (serialized JSON) are injected into the delegation metadata when the agentRunContext carries them.

Metadata field	Source	Purpose
`durable_workflow_run_id`	`agentRunContext.durableWorkflowRunId`	Links delegation to the parent's durable workflow
`approved_tool_calls`	`agentRunContext.delegatedToolApproval`	Pre-approves the specific tool call in the sub-agent

The agentRunContext is now threaded through relation-tools.ts → relationTools.ts → createDelegateToAgentTool.

relationTools.ts · relation-tools.ts

`CredentialStoreRegistry`, base URL fix, and SSE streaming for delegated approvals

Before: Durable execution path passed credentialStoreRegistry: undefined for MCP tools, used the bare API root as baseUrl (missing /run/agents), and had no SSE streaming for delegated tool approval UI events.
After: buildAgentForStep instantiates a CredentialStoreRegistry with default credential stores, appends /run/agents to the base URL so A2A delegation calls route correctly, and callLlmStep emits the full SSE event sequence (toolInputStart → toolInputDelta → toolInputAvailable → toolApprovalRequest) using the delegated tool's toolCallId.

The hook token in agentExecution.ts now uses delegatedApproval.toolCallId (the sub-agent's actual tool call ID) so the WDK hook matches the approval the UI sends back.

agentExecutionSteps.ts · tool-approval.ts

Handle AI SDK `stopWhen` throw for durable approval detection

Before: callLlmStep called agent.generate() without a try/catch, relying on stopWhen returning cleanly when a pending approval was detected. If the AI SDK threw instead (e.g. during tool result processing), the pending approval was lost and the workflow errored out.
After: callLlmStep wraps generate() in a try/catch that checks for pendingDurableApproval before re-throwing. The same pattern is added to generateTaskHandler's error handler via a shared buildDurableApprovalResult helper.

Why does stopWhen sometimes throw?
The AI SDK's stopWhen callback runs during the generation loop. When it signals a stop, the SDK may throw an internal error rather than returning a clean response — depending on where in the tool-call lifecycle the stop occurs. The try/catch ensures pending approvals are captured regardless of how the SDK surfaces the stop.

agentExecutionSteps.ts · generateTaskHandler.ts

^{｜ View workflow run ｜ Triggered by Pullfrog ｜ Using Claude Opus ｜ 𝕏}

pullfrog

Solid approach — the artifact-based signaling pattern for durable delegated approvals is a clean replacement for the in-memory pub/sub, and the isPostApproval continuation prompt correctly breaks the re-delegation loop. Three issues to flag, one potentially breaking.

^{｜ Fix all ➔ ｜ Fix 👍s ➔ ｜ View workflow run ｜ Using Claude Opus ｜ 𝕏}

agents-api/src/domains/run/workflow/steps/agentExecutionSteps.ts

agents-api/src/domains/run/agents/relationTools.ts

agents-api/src/domains/run/agents/tools/tool-wrapper.ts

agents-api/src/domains/run/workflow/steps/agentExecutionSteps.ts

agents-api/src/domains/run/agents/generateTaskHandler.ts

github-actions · 2026-04-01T23:59:06Z

Preview URLs

Use these stable preview aliases for testing this PR:

UI: https://pr-2966-ui.preview.inkeep.com
API: https://pr-2966-api.preview.inkeep.com
API health: https://pr-2966-api.preview.inkeep.com/health

These point to the same Vercel preview deployment as the bot comment, but they stay stable and easier to find.

Raw Vercel deployment URLs

UI deployment: https://agents-manage-c23e6nge2.preview.inkeep.com
API deployment: https://agents-fnhh63ssx.preview.inkeep.com

claude

PR Review Summary

(7) Total Issues | Risk: Medium

🟠⚠️ Major (3) 🟠⚠️

Inline Comments:

🟠 Major: tool-wrapper.ts:231 Unsafe type coercion of approval artifact fields without validation
🟠 Major: agentExecutionSteps.ts:555-576 SSE streaming for delegated approvals lacks error handling
🟠 Major: agentExecution.ts:82-85 Durable workflow hook awaits indefinitely without timeout

🟡 Minor (1) 🟡

Inline Comments:

🟡 Minor: agentExecution.ts:82-86 Improve logging for delegated approval debugging

🕐 Pending Recommendations (3)

From pullfrog's prior review (still applicable):

🟠 agentExecutionSteps.ts:123 baseUrl now includes /run/agents — deviates from bare API root convention used elsewhere
🟡 relationTools.ts:436 Type casting delegationMeta to inject workflow metadata sidesteps type safety
🟡 agentExecutionSteps.ts:507 Continuation prompt dependency on conversation history needs documentation

🚫 REQUEST CHANGES

Summary: The artifact-based signaling pattern is a solid architectural choice for durable delegated approvals, and the isPostApproval continuation prompt correctly addresses the re-delegation loop. However, there are three major issues that should be addressed before merging:

Artifact validation — The durable-approval-required artifact fields are used without validation, which could cause silent corruption or unhelpful runtime errors if a sub-agent returns malformed data.
SSE error handling — The delegated approval SSE streaming block has no error handling. If writes fail, the workflow suspends awaiting approval but the client never receives the approval request UI — a particularly confusing failure mode.
Indefinite workflow suspension — Unlike the non-durable path (10-minute timeout), the durable workflow hook can suspend indefinitely. Consider either adding a timeout or documenting the expected cleanup strategy.

The pullfrog findings about baseUrl convention and type casting are worth addressing as follow-ups but are less critical.

Discarded (3)

Location	Issue	Reason Discarded
`agentExecutionSteps.ts:155-167`	Empty catch blocks in enhanceRelation	Pre-existing code not introduced by this PR
`tool-wrapper.ts:196`	Nested artifact parsing without bounds	Low likelihood of malicious sub-agent responses in controlled environment
`agentExecutionSteps.ts:503`	isPostApproval may not survive workflow replays	Workflow engine handles step replay correctly; flag is set consistently in workflow scope

Reviewers (3)

Reviewer	Returned	Inline Comments	Discarded
`pr-review-errors`	4	2	2
`pr-review-sre`	5	2	3
`pr-review-standards`	0	0	0
Total	9	4	5

Note: pr-review-standards found no issues — validated that baseUrl change is consistent with existing patterns in executionHandler.ts and data/agents.ts.

agents-api/src/domains/run/agents/tools/tool-wrapper.ts

agents-api/src/domains/run/workflow/steps/agentExecutionSteps.ts

agents-api/src/domains/run/workflow/functions/agentExecution.ts

claude

PR Review Summary

(8) Total Issues | Risk: Medium | Review Scope: Delta (1 commit since last review)

This is a re-review scoped to changes in commit 5aea41ec7 ("fix: handle AI SDK stopWhen throw for durable approval detection").

🟠⚠️ Major (1) 🟠⚠️

Inline Comments:

🟠 Major: agentExecutionSteps.ts:533-537 Swallowing errors silently when pendingDurableApproval is set

🟡 Minor (1) 🟡

Inline Comments:

🟡 Minor: generateTaskHandler.ts:585-588 Catch block suppresses original error without logging

🕐 Pending Recommendations (6)

From prior automated review (claude):

🟠 tool-wrapper.ts:231 Unsafe type coercion of approval artifact fields without validation
🟠 agentExecutionSteps.ts:555-576 SSE streaming for delegated approvals lacks error handling
🟠 agentExecution.ts:82-85 Durable workflow hook awaits indefinitely without timeout

From pullfrog's prior review:

🟠 agentExecutionSteps.ts:123 baseUrl now includes /run/agents — deviates from bare API root convention
🟡 relationTools.ts:436 Type casting delegationMeta to inject workflow metadata sidesteps type safety
🟡 agentExecutionSteps.ts:507 Continuation prompt dependency on conversation history needs documentation

🚫 REQUEST CHANGES

Summary: The delta commit (5aea41ec7) correctly addresses the AI SDK stopWhen throw behavior by catching the error and extracting the pending approval state. However, silently discarding caught errors without logging creates a debugging blind spot. When errors are suppressed because pendingDurableApproval exists, engineers have no visibility into what actually failed. Add logging before suppressing these errors.

The 6 pending recommendations from the prior review remain unaddressed and should be considered alongside this delta feedback.

Discarded (1)

Location	Issue	Reason Discarded
`agentExecutionSteps.ts:603-605`	Error message lacks context	Low severity — the check is effectively dead code given the control flow (if catch re-throws, this line is unreachable)

Reviewers (1)

Reviewer	Returned	Main Findings	Consider	While You're Here	Inline Comments	Pending Recs	Discarded
`pr-review-errors`	3	0	0	0	2	0	1
Total	3	0	0	0	2	0	1

Note: Delta review — only dispatched error handling reviewer since the changes were focused on error/exception handling patterns.

agents-api/src/domains/run/workflow/steps/agentExecutionSteps.ts

agents-api/src/domains/run/agents/generateTaskHandler.ts

claude

PR Review Summary

(0) Total Issues | Risk: Low

This is a re-review following prior feedback from pullfrog and two automated claude reviews. All previously identified issues have been addressed.

✅ Issues Resolved Since Prior Reviews

Prior Issue	Resolution
🟠 Unsafe type coercion of approval artifact fields (`tool-wrapper.ts:232-245`)	Fixed — Added validation with `typeof` checks and error logging before using `delegatedToolCallId` and `delegatedToolName`
🟠 SSE streaming for delegated approvals lacks error handling (`agentExecutionSteps.ts:583-611`)	Fixed — Wrapped SSE writes in try-catch with error logging and re-throw
🟠 Durable workflow hook awaits indefinitely (`agentExecution.ts:98-101`)	Documented — Comment explains the design decision and cleanup strategy via external job
🟡 Improve logging for delegated approval debugging (`agentExecution.ts:88-96`)	Fixed — Added `logger.info` with `hookToolCallId`, `parentToolCallId`, `isDelegated`, and `workflowRunId`
🟡 Swallowing errors silently when pendingDurableApproval is set (`agentExecutionSteps.ts:545-553`)	Fixed — Added `logger.info` capturing error message, error name, requestId, and currentSubAgentId
🟡 Catch block suppresses original error without logging (`generateTaskHandler.ts:591-598`)	Fixed — Added `logger.info` with toolCallId, toolName, and error message
🟡 baseUrl convention deviation (`agentExecutionSteps.ts:117-121`)	Documented — Comment explains the intentional deviation for durable mode
🟡 Type casting delegationMeta (`relationTools.ts:434-452`)	Accepted — The spread pattern is safe here since we're adding fields, not removing them
🟡 Continuation prompt dependency on conversation history (`agentExecutionSteps.ts:507-510`)	Documented — Comment explains the dependency on `skipHistoryStorage` fix

Architecture Assessment

The artifact-based signaling pattern for durable delegated approvals is well-designed:

Artifact flow — Sub-agents return durable-approval-required artifacts that the parent's tool-wrapper detects and propagates through pendingDurableApproval
Re-delegation loop prevention — isPostApproval flag switches to continuation prompt after approval rounds
Delegation metadata forwarding — durableWorkflowRunId and approvedToolCalls flow through A2A metadata
SSE event streaming — Delegated approvals stream the correct toolCallId from the sub-agent

⚠️ Operational Note

The durable workflow hook (toolApprovalHook.create()) suspends indefinitely by design. The comment at lines 98-101 documents that stale suspended workflows should be cleaned up by an external job querying workflow_executions with status='suspended'. Ensure this cleanup mechanism exists or is planned.

Missing Changeset

The changeset-bot flagged that this PR has no changeset. Since this adds support for tool approvals in delegated sub-agents running in durable execution mode, consider adding:

pnpm bump patch --pkg agents-api "Add support for tool approvals in delegated sub-agents during durable execution"

✅ APPROVE

Summary: All prior review feedback has been addressed. The implementation is solid — artifact-based signaling cleanly replaces the in-memory pub/sub for durable delegated approvals, error handling is comprehensive with logging, and the isPostApproval continuation prompt correctly breaks the re-delegation loop. Ship it! 🚀

Reviewers (0)

Reviewer	Returned	Main Findings	Consider	While You're Here	Inline Comments	Pending Recs	Discarded
Total	0	0	0	0	0	0	0

Note: This re-review validated that all prior findings have been addressed. No new reviewers were dispatched since the diff showed only fixes to previously identified issues.

claude

PR Review Summary

(0) Total Issues | Risk: Low

This is a re-review triggered by CI re-run. The code is unchanged from the prior APPROVE review — all previously identified issues remain resolved.

✅ Verified Fixes (from prior reviews)

Prior Issue	Status
🟠 Unsafe type coercion of approval artifact fields	✅ Fixed — `tool-wrapper.ts:232-245` has `typeof` validation
🟠 SSE streaming lacks error handling	✅ Fixed — `agentExecutionSteps.ts:583-611` has try-catch with re-throw
🟠 Durable workflow hook awaits indefinitely	✅ Documented — `agentExecution.ts:92-95` explains cleanup strategy
🟡 Missing logging for delegated approval debugging	✅ Fixed — `agentExecution.ts:85-90` logs hook details
🟡 Silent error swallowing when pendingDurableApproval set	✅ Fixed — `agentExecutionSteps.ts:545-553` logs caught errors
🟡 Catch block suppresses original error	✅ Fixed — `generateTaskHandler.ts:591-598` logs error context
🟡 baseUrl convention deviation	✅ Documented — `agentExecutionSteps.ts:117-121` explains intentional deviation
🟡 Continuation prompt history dependency	✅ Documented — `agentExecutionSteps.ts:507-510` explains reliance on `skipHistoryStorage`

Architecture Assessment

The artifact-based signaling pattern for durable delegated approvals is well-designed:

Artifact flow — Sub-agents return durable-approval-required artifacts that the parent's tool-wrapper detects and propagates via pendingDurableApproval
Re-delegation loop prevention — isPostApproval flag switches to continuation prompt after approval rounds
Delegation metadata forwarding — durableWorkflowRunId and approvedToolCalls flow through A2A metadata
SSE event streaming — Delegated approvals stream the correct toolCallId from the sub-agent

⚠️ CI Note

The "Auto Format" workflow failure is a CI infrastructure issue (pnpm setup step), not a code problem.

Missing Changeset

Consider adding a changeset for this feature:

pnpm bump patch --pkg agents-api "Add support for tool approvals in delegated sub-agents during durable execution"

✅ APPROVE

Summary: All prior review feedback has been addressed. The implementation is solid — artifact-based signaling cleanly replaces the in-memory pub/sub for durable delegated approvals, error handling is comprehensive with logging, and the isPostApproval continuation prompt correctly breaks the re-delegation loop. Ship it! 🚀

Reviewers (0)

Reviewer	Returned	Main Findings	Consider	While You're Here	Inline Comments	Pending Recs	Discarded
Total	0	0	0	0	0	0	0

Note: Re-review — validated prior fixes are still present. No new reviewers dispatched.

When a delegated sub-agent requires tool approval in durable mode, the approval now flows through the workflow hook system instead of the in-memory pub/sub bus. The parent agent detects `durable-approval-required` artifacts from sub-agents, surfaces them as SSE approval events, and forwards the user's decision back via `delegatedToolApproval` context on re-execution. Adds `isPostApproval` flag to prevent the workflow loop from re-sending the original user message after approval. Includes validation of approval artifact fields, SSE error handling for delegated approval streaming, logging for suppressed errors in the durable approval catch path, and type-safe metadata construction for delegation.

claude

PR Review Summary

(0) Total Issues | Risk: Low

This is a re-review following 4 prior automated reviews. This review validates that all previously identified issues have been addressed.

✅ Issues Resolved Since Prior Reviews

Prior Issue	Resolution
🟠 Unsafe type coercion of approval artifact fields	✅ Fixed — `tool-wrapper.ts:232-245` validates `typeof` before using `delegatedToolCallId` and `delegatedToolName`
🟠 SSE streaming for delegated approvals lacks error handling	✅ Fixed — `agentExecutionSteps.ts:597-625` wraps SSE writes in try-catch with error logging and re-throw
🟠 Durable workflow hook awaits indefinitely without timeout	✅ Documented — `agentExecution.ts:94-97` explains cleanup strategy via external job querying `workflow_executions`
🟡 Missing logging for delegated approval debugging	✅ Fixed — `agentExecution.ts:87-92` logs `hookToolCallId`, `parentToolCallId`, `isDelegated`, and `workflowRunId`
🟡 Silent error swallowing when pendingDurableApproval set	✅ Fixed — `agentExecutionSteps.ts:559-567` logs error message, error name, requestId, and currentSubAgentId
🟡 Catch block suppresses original error without logging	✅ Fixed — `generateTaskHandler.ts:620-627` logs toolCallId, toolName, and error message
🟡 baseUrl convention deviation	✅ Documented — `agentExecutionSteps.ts:123-128` explains the intentional deviation for durable mode
🟡 Continuation prompt dependency on conversation history	✅ Documented — `tool-wrapper.ts:110-114` explains the dependency on `skipHistoryStorage` fix

Architecture Assessment

The artifact-based signaling pattern for durable delegated approvals is well-designed:

Artifact flow — Sub-agents return durable-approval-required artifacts that the parent's tool-wrapper detects via findApprovalRequired helper and propagates through pendingDurableApproval
Re-delegation loop prevention — isPostApproval flag in agentExecution.ts switches to continuation prompt after approval rounds, preventing infinite delegation loops
Delegation metadata forwarding — durableWorkflowRunId and approved_tool_calls flow through A2A metadata in relationTools.ts
SSE event streaming — Delegated approvals stream the correct toolCallId from the sub-agent (not the parent's delegate tool call ID)
Type safety — DelegatedApprovalContext type properly captures the sub-agent's tool context

⚠️ Missing Changeset

Per the changeset-bot comment, this PR adds a user-facing feature but has no changeset. Consider adding:

pnpm bump patch --pkg agents-api "Add support for tool approvals in delegated sub-agents during durable execution"

💡 APPROVE WITH SUGGESTIONS

Summary: All prior review feedback has been addressed. The implementation is solid — artifact-based signaling cleanly replaces the in-memory pub/sub for durable delegated approvals, error handling is comprehensive with logging, type validation prevents malformed artifacts from corrupting state, and the isPostApproval continuation prompt correctly breaks the re-delegation loop.

The only outstanding item is the missing changeset, which should be added before merging to ensure proper versioning.

Reviewers (0)

Reviewer	Returned	Main Findings	Consider	While You're Here	Inline Comments	Pending Recs	Discarded
Total	0	0	0	0	0	0	0

Note: This re-review validated that all prior findings have been addressed. No new reviewers were dispatched since the code showed only the final implementation with all fixes applied.

claude

PR Review Summary

(0) Total Issues | Risk: Low | Review Scope: Delta

This is a re-review scoped to the delta since the last automated review (ad195a073 → 6defd54af).

Delta Analysis

The delta consists of:

Merge commit (6defd54af) — merging main into the feature branch
Logging PR (#3007) — adds performance logging for Doltgres database operations

Feature code unchanged: The 9 files implementing the durable delegated tool approval feature (tool-wrapper.ts, agentExecution.ts, agentExecutionSteps.ts, generateTaskHandler.ts, relationTools.ts, relation-tools.ts, tool-approval.ts, Agent.ts, agent-types.ts) have no changes in this delta.

The merged logging additions are in unrelated files:

function-tools.ts — slow withRef timing
TriggerService.ts — execution dispatch logging
branchScopedDb.ts / ref.ts — middleware timing
ref-helpers.ts / ref-scope.ts — connection/checkout timing

These are standard observability improvements that don't interact with the approval flow.

✅ Prior Issues Remain Resolved

All issues from prior reviews (pullfrog + 4 claude reviews) were addressed in earlier commits:

✅ Artifact validation with typeof checks
✅ SSE streaming error handling with try-catch
✅ Durable hook timeout documented (external cleanup job)
✅ Logging for delegated approval debugging
✅ Error logging when pendingDurableApproval suppresses exceptions
✅ baseUrl deviation documented
✅ Continuation prompt history dependency documented

⚠️ Missing Changeset

This PR still needs a changeset for the durable delegated tool approval feature:

pnpm bump patch --pkg agents-api "Add support for tool approvals in delegated sub-agents during durable execution"

💡 APPROVE WITH SUGGESTIONS

Summary: The delta contains only merged-in logging changes from PR #3007 — no modifications to the feature implementation. All prior review feedback remains addressed. The only outstanding item is the missing changeset, which should be added before merging. Ship it! 🚀

Reviewers (0)

Reviewer	Returned	Main Findings	Consider	While You're Here	Inline Comments	Pending Recs	Discarded
Total	0	0	0	0	0	0	0

Note: Delta review — no sub-reviewers dispatched since feature code is unchanged. Only merged-in logging changes from PR #3007.

claude

PR Review Summary

(0) Total Issues | Risk: Low | Review Scope: Delta

This is a re-review scoped to changes since the last automated review (6defd54af → 46a02c31a).

Delta Analysis

Single commit: 46a02c31a fix: use 'as const' for tool_calls type literal to satisfy typecheck

The change is a trivial TypeScript type narrowing fix:

-        type: 'tool_calls',
+        type: 'tool_calls' as const,

This ensures the return type is properly narrowed to the literal 'tool_calls' for correct type inference in the CallLlmResult union type. No functional change.

✅ Prior Issues Remain Resolved

Verified all fixes from prior reviews are still in place:

Prior Issue	Verification
🟠 Artifact validation	✅ `tool-wrapper.ts:232-245` — `typeof` checks before using `delegatedToolCallId`/`delegatedToolName`
🟠 SSE streaming error handling	✅ `agentExecutionSteps.ts:597-628` — try-catch with error logging and re-throw
🟠 Durable hook timeout	✅ `agentExecution.ts:94-97` — documented cleanup strategy via external job
🟡 Delegated approval logging	✅ `agentExecution.ts:87-92` — logs `hookToolCallId`, `parentToolCallId`, `isDelegated`
🟡 Error logging when suppressing	✅ `agentExecutionSteps.ts:559-567` — logs error context before continuing
🟡 generateTaskHandler error logging	✅ `generateTaskHandler.ts:620-627` — logs toolCallId, toolName, error message
🟡 baseUrl convention	✅ `agentExecutionSteps.ts:123-128` — documented intentional deviation
🟡 Continuation prompt dependency	✅ `tool-wrapper.ts:110-114` — documented history storage fix

Architecture Assessment

The artifact-based signaling pattern for durable delegated approvals is well-designed:

Artifact flow — Sub-agents return durable-approval-required artifacts that the parent's tool-wrapper detects and propagates through pendingDurableApproval
Re-delegation loop prevention — isPostApproval flag switches to continuation prompt after approval rounds
Delegation metadata forwarding — durableWorkflowRunId and approved_tool_calls flow through A2A metadata
SSE event streaming — Delegated approvals stream the correct toolCallId from the sub-agent

⚠️ Missing Changeset

This PR still needs a changeset for the durable delegated tool approval feature:

pnpm bump patch --pkg agents-api "Add support for tool approvals in delegated sub-agents during durable execution"

✅ APPROVE

Summary: The delta is a trivial TypeScript type narrowing fix (as const). All prior review feedback has been addressed and verified. The implementation is solid — artifact-based signaling cleanly replaces the in-memory pub/sub for durable delegated approvals, comprehensive error handling with logging, and the isPostApproval continuation prompt correctly breaks the re-delegation loop.

Add the changeset before merging, then ship it! 🚀

Reviewers (0)

Reviewer	Returned	Main Findings	Consider	While You're Here	Inline Comments	Pending Recs	Discarded
Total	0	0	0	0	0	0	0

Note: Delta review — no sub-reviewers dispatched since the change is a trivial type annotation (as const). All prior findings verified as still resolved.

itoqa · 2026-04-04T00:48:29Z

Ito Test Report ❌

11 test cases ran. 1 failed, 10 passed.

Across 11 total test cases, 10 passed and 1 failed, so the unified run is a failure due to a single high-severity production defect despite broad delegated-approval coverage passing. The critical finding is that after delegated approval succeeds, the reconnect endpoint (/run/api/executions/{executionId}/stream) reads only the default namespace and drops post-approval continuation/completion events (introduced by this PR), while all other verified behaviors worked as designed, including durable playground/OpenAI-compatible approval flows, strict delegated toolCallId token binding, /run/api/chat mixed/unknown/out-of-order idempotent handling and validation, mobile approval actionability, and single-resolution approval state.

❌ Failed (1)

Category	Summary	Screenshot
Happy-path	⚠️ Approval succeeds, but reconnect streaming misses post-approval continuation/completion events.

⚠️ Execution stream reconnect drops post-approval continuation namespace

What failed: The approval endpoint returns success, but reconnect clients only receive initial/default-namespace chunks and miss the continuation/completion events expected after approval.
Impact: Clients that reconnect after approval can appear stuck in an incomplete state and fail to show final assistant output. This breaks reliability of durable resume flows for API consumers.
Steps to reproduce:
1. Start a durable run that suspends for delegated tool approval.
2. Approve with POST /run/api/executions/{executionId}/approvals/{toolCallId} using approved=true.
3. Reconnect with GET /run/api/executions/{executionId}/stream and x-stream-start-index: 0.
4. Observe that only initial/default-namespace chunks are streamed and continuation/completion events are missing.
Stub / mock context: The durable run used a deterministic delegated-approval trigger and non-production auth bypass so approval state could be reproduced consistently; this keeps setup stable, but the reconnect namespace mismatch is in production route logic rather than in test scaffolding.
Code analysis: I traced namespace creation in the durable workflow and compared both streaming paths. Post-approval events are emitted to round namespaces (r1, r2, ...), but the reconnect endpoint reads only the default stream, while /run/api/chat correctly reads the continuation namespace from execution metadata.
Why this is likely a bug: The reconnect route ignores the continuation namespace that durable approval rounds require, so it cannot deliver the same resumed event stream that the workflow actually writes.

Relevant code:

agents-api/src/domains/run/workflow/functions/agentExecution.ts (lines 56-57)

const streamNamespace = approvalRound === 0 ? undefined : `r${approvalRound}`;

const llmResult = await callLlmStep({

agents-api/src/domains/run/workflow/functions/agentExecution.ts (lines 76-83)

const continuationNs = `r${approvalRound + 1}`;
await markWorkflowSuspendedStep({
  tenantId: payload.tenantId,
  projectId: payload.projectId,
  workflowRunId,
  continuationStreamNamespace: continuationNs,
});

agents-api/src/domains/run/routes/executions.ts (lines 344-345)

const readable = run.getReadable({ startIndex });
const reader = readable.getReader();

agents-api/src/domains/run/routes/chatDataStream.ts (lines 183-195)

const namespace = (durableExecution.metadata as any)?.continuationStreamNamespace as
  | string
  | undefined;
const run = getRun(durableExecution.id);

return stream(c, async (s) => {
  try {
    const readable = run.getReadable({ namespace });

✅ Passed (10)

Category	Summary	Screenshot
Adversarial	Out-of-order approval responses are handled idempotently without corrupting pending state.
Edge	Mixed approval decisions succeed when approval parts use valid tool-part schema.
Edge	Unknown toolCallId returns idempotent success with alreadyProcessed=true.
Edge	Approval responses without conversationId are correctly rejected after valid part parsing.
Edge	Confirmed as non-bug: prior block was environment/fixture/auth setup. Reconnect flow is implemented to resume SSE by executionId/start index and continue after approval.
Edge	Confirmed as non-bug: approval card action controls are implemented as always-clickable buttons in approval-requested state and not conditionally hidden for mobile widths.
Edge	Confirmed as non-bug: approval state handling is single-resolution by design; once resolved, subsequent rapid conflicting actions are treated as already processed/idempotent.
Logic	Code verification shows approval tokens are bound to delegated `toolCallId`, so parent/delegated mismatches are correctly rejected.
Happy-path	Delegated approval UI appeared in playground and resolved to approved after clicking Approve in durable mode.
Happy-path	Durable OpenAI-compatible stream returned HTTP 200 with `x-workflow-run-id` and approval-required SSE markers.

Commit: 46a02c3

View Full Run

Tell us how we did: Give Ito Feedback

vercel bot deployed to Preview – agents-manage-ui April 1, 2026 23:53 View deployment

github-actions bot deleted a comment from claude bot Apr 1, 2026

vercel bot deployed to Preview – agents-api April 1, 2026 23:54 View deployment

pullfrog bot reviewed Apr 1, 2026

View reviewed changes

vercel bot temporarily deployed to Preview – agents-docs April 2, 2026 00:08 Inactive

vercel bot deployed to Preview – agents-manage-ui April 2, 2026 00:10 View deployment

vercel bot deployed to Preview – agents-api April 2, 2026 00:11 View deployment

claude bot reviewed Apr 2, 2026

View reviewed changes

github-actions bot deleted a comment from claude bot Apr 2, 2026

vercel bot temporarily deployed to Preview – agents-docs April 2, 2026 00:26 Inactive

vercel bot deployed to Preview – agents-manage-ui April 2, 2026 00:28 View deployment

vercel bot deployed to Preview – agents-api April 2, 2026 00:29 View deployment

claude bot reviewed Apr 2, 2026

View reviewed changes

agents-api/src/domains/run/workflow/steps/agentExecutionSteps.ts Outdated Show resolved Hide resolved

agents-api/src/domains/run/agents/generateTaskHandler.ts Show resolved Hide resolved

github-actions bot deleted a comment from claude bot Apr 2, 2026

anubra266 force-pushed the feat/durable-delegated-tool-approval branch from 5aea41e to f6e94ae Compare April 2, 2026 00:39

vercel bot temporarily deployed to Preview – agents-docs April 2, 2026 00:39 Inactive

vercel bot had a problem deploying to Preview – agents-api April 2, 2026 00:40 Failure

vercel bot deployed to Preview – agents-manage-ui April 2, 2026 00:41 View deployment

claude bot reviewed Apr 2, 2026

View reviewed changes

github-actions bot deleted a comment from claude bot Apr 2, 2026

anubra266 force-pushed the feat/durable-delegated-tool-approval branch from f6e94ae to e8954e7 Compare April 2, 2026 00:45

vercel bot temporarily deployed to Preview – agents-docs April 2, 2026 00:45 Inactive

vercel bot deployed to Preview – agents-manage-ui April 2, 2026 00:46 View deployment

claude bot reviewed Apr 2, 2026

View reviewed changes

github-actions bot deleted a comment from claude bot Apr 2, 2026

vercel bot deployed to Preview – agents-api April 2, 2026 00:49 View deployment

anubra266 force-pushed the feat/durable-delegated-tool-approval branch from e8954e7 to edb0a26 Compare April 3, 2026 22:46

vercel bot temporarily deployed to Preview – agents-docs April 3, 2026 22:46 Inactive

style: auto-format with biome

ad195a0

vercel bot temporarily deployed to Preview – agents-docs April 3, 2026 22:47 Inactive

claude bot reviewed Apr 3, 2026

View reviewed changes

github-actions bot deleted a comment from claude bot Apr 3, 2026

vercel bot deployed to Preview – agents-manage-ui April 3, 2026 22:50 View deployment

vercel bot deployed to Preview – agents-api April 3, 2026 22:50 View deployment

anubra266 requested a review from amikofalvy April 3, 2026 23:12

Merge branch 'main' into feat/durable-delegated-tool-approval

6defd54

vercel bot deployed to Preview – agents-manage-ui April 3, 2026 23:28 View deployment

vercel bot deployed to Preview – agents-docs April 3, 2026 23:29 View deployment

claude bot reviewed Apr 3, 2026

View reviewed changes

github-actions bot deleted a comment from claude bot Apr 3, 2026

vercel bot deployed to Preview – agents-api April 3, 2026 23:30 View deployment

fix: use 'as const' for tool_calls type literal to satisfy typecheck

46a02c3

vercel bot temporarily deployed to Preview – agents-docs April 3, 2026 23:34 Inactive

vercel bot deployed to Preview – agents-manage-ui April 3, 2026 23:35 View deployment

claude bot reviewed Apr 3, 2026

View reviewed changes

github-actions bot deleted a comment from claude bot Apr 3, 2026

vercel bot deployed to Preview – agents-api April 3, 2026 23:37 View deployment

Conversation

anubra266 commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

changeset-bot bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ No Changeset found

Uh oh!

vercel bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pullfrog bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Key changes

Parent-side detection of delegated approval artifacts

Post-approval continuation prompt to prevent re-delegation loop

Delegated approval forwarding through A2A delegation metadata

CredentialStoreRegistry, base URL fix, and SSE streaming for delegated approvals

Handle AI SDK stopWhen throw for durable approval detection

Uh oh!

pullfrog bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Preview URLs

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

PR Review Summary

🟠⚠️ Major (3) 🟠⚠️

🟡 Minor (1) 🟡

🕐 Pending Recommendations (3)

🚫 REQUEST CHANGES

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

PR Review Summary

🟠⚠️ Major (1) 🟠⚠️

🟡 Minor (1) 🟡

🕐 Pending Recommendations (6)

🚫 REQUEST CHANGES

Uh oh!

Uh oh!

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

PR Review Summary

✅ Issues Resolved Since Prior Reviews

Architecture Assessment

⚠️ Operational Note

Missing Changeset

✅ APPROVE

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

PR Review Summary

✅ Verified Fixes (from prior reviews)

Architecture Assessment

⚠️ CI Note

Missing Changeset

✅ APPROVE

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

PR Review Summary

✅ Issues Resolved Since Prior Reviews

Architecture Assessment

⚠️ Missing Changeset

💡 APPROVE WITH SUGGESTIONS

anubra266 commented Apr 1, 2026 •

edited

Loading

changeset-bot bot commented Apr 1, 2026 •

edited

Loading

vercel bot commented Apr 1, 2026 •

edited

Loading

pullfrog bot commented Apr 1, 2026 •

edited

Loading

`CredentialStoreRegistry`, base URL fix, and SSE streaming for delegated approvals

Handle AI SDK `stopWhen` throw for durable approval detection

github-actions bot commented Apr 1, 2026 •

edited

Loading