fix: always close tool executors on conversation close by csmith49 · Pull Request #3157 · OpenHands/software-agent-sdk

csmith49 · 2026-05-08T17:48:18Z

Summary

Tool executor cleanup in LocalConversation.close() was gated behind if self.delete_on_close. When delete_on_close=False — used by the agent-server during conversation forks — tool executors (terminal sessions, browser processes, MCP connections, task managers) were never released, causing resource leaks.

Root cause

local_conversation.py:996 — the executor cleanup loop was inside an if self.delete_on_close: block. The delete_on_close flag controls whether persisted data should be removed, but runtime resources (subprocesses, connections) must always be released.

Fix

Moved the executor cleanup loop outside the if self.delete_on_close: guard. Executors are now always closed when a conversation closes, regardless of whether conversation data is preserved.

Test

Added test_conversation_close_calls_executor_close_without_delete — verifies that executor.close() is called even when delete_on_close=False. All 7 existing cleanup tests continue to pass.

Fixes #3149

This PR was created by an AI agent (OpenHands) on behalf of @csmith49.

Agent Server images for this PR

• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server

Variants & Base Images

Variant	Architectures	Base Image	Docs / Tags
java	amd64, arm64	`eclipse-temurin:17-jdk`	Link
python	amd64, arm64	`nikolaik/python-nodejs:python3.13-nodejs22-slim`	Link
golang	amd64, arm64	`golang:1.21-bookworm`	Link

Pull (multi-arch manifest)

# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:ac3adfc-python

Run

docker run -it --rm \
  -p 8000:8000 \
  --name agent-server-ac3adfc-python \
  ghcr.io/openhands/agent-server:ac3adfc-python

All tags pushed for this build

ghcr.io/openhands/agent-server:ac3adfc-golang-amd64
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-golang-amd64
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-golang-amd64
ghcr.io/openhands/agent-server:ac3adfc-golang_tag_1.21-bookworm-amd64
ghcr.io/openhands/agent-server:ac3adfc-golang-arm64
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-golang-arm64
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-golang-arm64
ghcr.io/openhands/agent-server:ac3adfc-golang_tag_1.21-bookworm-arm64
ghcr.io/openhands/agent-server:ac3adfc-java-amd64
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-java-amd64
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-java-amd64
ghcr.io/openhands/agent-server:ac3adfc-eclipse-temurin_tag_17-jdk-amd64
ghcr.io/openhands/agent-server:ac3adfc-java-arm64
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-java-arm64
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-java-arm64
ghcr.io/openhands/agent-server:ac3adfc-eclipse-temurin_tag_17-jdk-arm64
ghcr.io/openhands/agent-server:ac3adfc-python-amd64
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-python-amd64
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-python-amd64
ghcr.io/openhands/agent-server:ac3adfc-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim-amd64
ghcr.io/openhands/agent-server:ac3adfc-python-arm64
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-python-arm64
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-python-arm64
ghcr.io/openhands/agent-server:ac3adfc-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim-arm64
ghcr.io/openhands/agent-server:ac3adfc-golang
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-golang
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-golang
ghcr.io/openhands/agent-server:ac3adfc-golang_tag_1.21-bookworm
ghcr.io/openhands/agent-server:ac3adfc-java
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-java
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-java
ghcr.io/openhands/agent-server:ac3adfc-eclipse-temurin_tag_17-jdk
ghcr.io/openhands/agent-server:ac3adfc-python
ghcr.io/openhands/agent-server:ac3adfc996b081fefee658ac5bccc8d7682f0817-python
ghcr.io/openhands/agent-server:fix-3149-tool-executor-close-on-conversation-close-python
ghcr.io/openhands/agent-server:ac3adfc-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim

About Multi-Architecture Support

Each variant tag (e.g., ac3adfc-python) is a multi-arch manifest supporting both amd64 and arm64
Docker automatically pulls the correct architecture for your platform
Individual architecture tags (e.g., ac3adfc-python-amd64) are also available if needed

Tool executor cleanup was gated behind `if self.delete_on_close`, meaning executors (terminal sessions, browser processes, MCP connections) were never released when delete_on_close=False — e.g. during agent-server conversation forks. Move the cleanup outside the guard so runtime resources are always released regardless of whether conversation data is preserved. Fixes #3149 Co-authored-by: openhands <openhands@all-hands.dev>

all-hands-bot

Taste Rating: 🟢 Good taste - Clean bug fix with clear intent and proper test coverage.

Assessment

This is a well-implemented fix for a real resource leak. The root cause analysis is spot-on: runtime resources (executors) must be released regardless of whether persisted data is preserved.

Code Quality: Excellent

Clear separation of concerns: delete_on_close controls data persistence, not resource cleanup
Helpful explanatory comments
Proper error handling (catches exceptions, logs warnings, continues cleanup)
New test specifically validates the fixed behavior

Logic Correctness: ✅

Executors hold runtime resources (subprocesses, connections) that must always be released
The fix correctly moves cleanup outside the delete_on_close guard
Exception handling ensures one failing executor doesn't prevent others from closing

Eval-Risk Notification

Per repository guidelines: This PR changes tool executor lifecycle behavior (cleanup phase). While the change improves reliability by fixing a resource leak, and I don't anticipate any negative benchmark impact, I'm flagging for maintainer awareness since it touches executor lifecycle.

My assessment: This fix makes cleanup more reliable and should improve system stability, particularly for agent-server conversation forks. The risk of negative benchmark impact is low.

[RISK ASSESSMENT]

[Overall PR] ⚠️ Risk Assessment: 🟡 MEDIUM

Changes core executor cleanup logic in the conversation lifecycle, affecting all tool types (terminal, browser, MCP, task managers). However, the change fixes a clear bug (resource leak) and makes cleanup more reliable. The risk is that if any benchmark workflow inadvertently relied on executors NOT being closed when delete_on_close=False, that behavior will change. Given this is a bug fix for a resource leak used in production (agent-server forks), the change is justified and likely to improve stability.

VERDICT:
✅ Worth merging: Solid bug fix with proper test coverage. Maintainers should verify no unexpected benchmark impact due to executor lifecycle change, but the improvement to resource management is clear.

KEY INSIGHT:
The fix correctly distinguishes between data lifecycle (controlled by delete_on_close) and resource lifecycle (always cleanup). This is the right architectural separation.

all-hands-bot

✅ QA Report: PASS

This PR successfully fixes the resource leak where tool executors were not being closed when delete_on_close=False.

Does this PR achieve its stated goal?

Yes. I verified end-to-end that the PR fixes the bug described in #3149. On the main branch, tool executors are NOT closed when delete_on_close=False, causing resource leaks. On this PR branch, executors ARE properly closed regardless of the delete_on_close setting. The fix correctly separates the concern of "releasing runtime resources" (executors) from "deleting persisted data" (delete_on_close).

Phase	Result
Environment Setup	✅ Build successful
CI Status	⚠️ Some unrelated checks failing (API breakage, PR artifacts); core `sdk-tests` passed
Functional Verification	✅ Bug reproduced on main, fix verified on PR branch

Functional Verification

Test 1: Reproduce the bug on main branch (before the fix)

Step 1 — Reproduce the resource leak (main branch):

Switched to main branch and ran a test script that creates a conversation with delete_on_close=False, initializes a terminal executor, and tracks whether executor.close() is called:

$ git checkout main
$ uv run python /tmp/test_executor_cleanup.py

Output:

================================================================================
Testing executor cleanup with delete_on_close=False
================================================================================
...
Closing conversation...

================================================================================
❌ FAILURE: Executor was NOT closed (resource leak!)

This confirms the bug exists: when delete_on_close=False, the executor cleanup loop was skipped entirely, leaving terminal sessions, browser processes, MCP connections, and other runtime resources unreleased.

Step 2 — Apply the PR's changes:

Switched back to the PR branch:

$ git checkout fix/3149-tool-executor-close-on-conversation-close

Step 3 — Re-run with the fix in place:

Ran the same test script on the PR branch:

$ uv run python /tmp/test_executor_cleanup.py

Output:

================================================================================
Testing executor cleanup with delete_on_close=False
================================================================================
...
Closing conversation...
✓ terminal_executor.close() was called!

================================================================================
✅ SUCCESS: Executor was properly closed even with delete_on_close=False

This shows the fix works: the executor's close() method is now called even when delete_on_close=False.

Test 2: Verify the new test and all existing tests pass

Ran the full test suite for conversation cleanup:

$ uv run pytest tests/tools/terminal/test_conversation_cleanup.py -v

Output:

tests/tools/terminal/test_conversation_cleanup.py::test_conversation_close_calls_executor_close PASSED [ 14%]
tests/tools/terminal/test_conversation_cleanup.py::test_conversation_close_calls_executor_close_without_delete PASSED [ 28%]
tests/tools/terminal/test_conversation_cleanup.py::test_conversation_del_calls_close PASSED [ 42%]
tests/tools/terminal/test_conversation_cleanup.py::test_conversation_close_handles_executor_exceptions PASSED [ 57%]
tests/tools/terminal/test_conversation_cleanup.py::test_conversation_close_skips_none_executors PASSED [ 71%]
tests/tools/terminal/test_conversation_cleanup.py::test_terminal_executor_close_calls_session_close PASSED [ 85%]
tests/tools/terminal/test_conversation_cleanup.py::test_terminal_executor_close_handles_missing_session PASSED [100%]

7 passed, 10 warnings in 7.92s

All 7 tests pass, including the new test_conversation_close_calls_executor_close_without_delete which specifically verifies this fix. No regressions detected.

Issues Found

None.

This QA report was created by an AI agent (OpenHands) on behalf of the user.

github-actions · 2026-05-08T17:55:19Z

Coverage Report •

File	Stmts	Miss	Cover	Missing
openhands-sdk/openhands/sdk/conversation/impl
local_conversation.py	443	23	94%	299, 304, 435, 481, 518, 534, 599, 818–819, 822, 982, 990, 992, 996–997, 1008–1009, 1034, 1229, 1233, 1303, 1310–1311
TOTAL	26832	7932	70%

github-actions · 2026-05-11T16:22:30Z

📁 PR Artifacts Notice

This PR contains a .pr/ directory with PR-specific documents. This directory will be automatically removed when the PR is approved.

For fork PRs: Manual removal is required before merging.

all-hands-bot

Taste Rating: 🟢 Good taste - Clean bug fix with clear intent and proper test coverage.

This is a well-implemented fix for a real resource leak. The root cause analysis is spot-on: runtime resources (executors) must be released regardless of whether persisted data is preserved.

[RISK ASSESSMENT]

[Overall PR] ⚠️ Risk Assessment: 🟢 LOW

Targeted bug fix that moves cleanup logic outside a conditional guard. No architectural changes, no API changes, no impact on agent decision-making or benchmark results. The fix properly separates concerns: delete_on_close controls data persistence, not resource cleanup.

VERDICT:
✅ Worth merging: Core logic is sound, proper test coverage provided.

KEY INSIGHT:
The separation of concerns between data lifecycle (delete_on_close) and resource cleanup (executor closing) eliminates a subtle but significant resource leak in conversation forking scenarios.

all-hands-bot

✅ QA Report: PASS

The PR successfully fixes the resource leak where tool executors were not being closed when delete_on_close=False.

Does this PR achieve its stated goal?

Yes. The PR fixes the resource leak bug by moving the executor cleanup loop outside the if self.delete_on_close: guard. I verified this by:

Reproducing the bug on the parent commit (executor.close() was NOT called when delete_on_close=False)
Confirming the fix on the PR branch (executor.close() IS called even when delete_on_close=False)
Verifying all 7 cleanup tests pass, including the new test specifically for this scenario

The fix ensures that runtime resources (terminal sessions, browser processes, MCP connections, task managers) are always released when conversations close, regardless of whether conversation data is preserved.

Phase	Result
Environment Setup	✅ Build completed successfully with `make build`
CI Status	⚠️ Core tests pass (sdk-tests, tools-tests, workspace-tests, agent-server-tests, pre-commit). Two unrelated checks failing (Python API, REST API OpenAPI)
Functional Verification	✅ Bug reproduced on parent commit, fix verified on PR branch, all tests pass

Functional Verification

Test 1: Reproduce the bug (parent commit)

Step 1 — Reproduce the bug (without the fix):

Checked out parent commit 549e2012 and created a test script that:

Creates a Conversation with delete_on_close=False
Uses a TerminalTool with a mocked executor
Calls conversation.close()
Checks if executor.close() was called

Ran the test:

git checkout HEAD~1
python test_executor_cleanup_bug.py

Output:

❌ FAILURE: executor.close() was NOT called
   This means resources (subprocesses, connections, etc.) are leaking!

This confirms the bug exists: when delete_on_close=False, the executor cleanup code was skipped entirely, causing resource leaks.

Step 2 — Apply the PR's changes:

Switched back to the PR branch:

git checkout fix/3149-tool-executor-close-on-conversation-close

Step 3 — Re-run with the fix in place:

Ran the same test script:

python test_executor_cleanup_bug.py

Output:

✅ SUCCESS: executor.close() was called
   Close was called 1 time(s)

This shows the fix works: executor cleanup now happens unconditionally, regardless of the delete_on_close flag.

Test 2: Verify all cleanup tests pass

Ran the full cleanup test suite:

pytest tests/tools/terminal/test_conversation_cleanup.py -v

Results:

test_conversation_close_calls_executor_close PASSED
test_conversation_close_calls_executor_close_without_delete PASSED (new test)
test_conversation_del_calls_close PASSED
test_conversation_close_handles_executor_exceptions PASSED
test_conversation_close_skips_none_executors PASSED
test_terminal_executor_close_calls_session_close PASSED
test_terminal_executor_close_handles_missing_session PASSED

1 passed, 6 warnings in 1.60s

All 7 tests pass, including the new test that specifically verifies executor cleanup when delete_on_close=False.

Test 3: Verify the fix works with multiple tools

Created a test with multiple tools (terminal + file_editor) to ensure the fix works comprehensively:

python test_multiple_executors.py

Output:

Initialized 4 tools:
  - terminal: has_executor=True
  - file_editor: has_executor=True
  - finish: has_executor=True
  - think: has_executor=True
✅ SUCCESS: Terminal executor.close() was called 1 time(s)
   The fix properly closes executors even when delete_on_close=False

This confirms the fix works for conversations with multiple tools.

Code Review

Examined the actual code change in local_conversation.py line 996-1012:

Before:

if self.delete_on_close:
    try:
        tools_map = self.agent.tools_map
    except (AttributeError, RuntimeError):
        return
    for tool in tools_map.values():
        try:
            executable_tool = tool.as_executable()
            executable_tool.executor.close()
        ...

After:

# Always close tool executors — they hold runtime resources
# (subprocesses, connections, etc.) that must be released regardless
# of whether the conversation data is preserved (delete_on_close).
try:
    tools_map = self.agent.tools_map
except (AttributeError, RuntimeError):
    return
for tool in tools_map.values():
    try:
        executable_tool = tool.as_executable()
        executable_tool.executor.close()
    ...

The fix is clean and surgical: it simply removes the if self.delete_on_close: guard while preserving all error handling logic.

Issues Found

None. The fix is correct, well-tested, and achieves its stated goal without breaking existing functionality.

This review was created by an AI agent (OpenHands) on behalf of the OpenHands team.

github-actions · 2026-05-11T16:48:56Z

Python API breakage checks — ✅ PASSED

Result: ✅ PASSED

Action log

github-actions · 2026-05-11T16:49:11Z

REST API breakage checks (OpenAPI) — ✅ PASSED

Result: ✅ PASSED

Action log

…-close

Replace try/except patterns with contextlib.suppress for cleaner exception suppression in _close_resources(). Co-authored-by: openhands <openhands@all-hands.dev>

…-close

all-hands-bot reviewed May 8, 2026

View reviewed changes

csmith49 requested a review from all-hands-bot May 11, 2026 16:40

all-hands-bot approved these changes May 11, 2026

View reviewed changes

all-hands-bot reviewed May 11, 2026

View reviewed changes

Merge branch 'main' into fix/3149-tool-executor-close-on-conversation…

18db34a

…-close

csmith49 requested a review from VascoSch92 May 11, 2026 22:15

VascoSch92 approved these changes May 12, 2026

View reviewed changes

Comment thread openhands-sdk/openhands/sdk/conversation/impl/local_conversation.py Outdated

Comment thread openhands-sdk/openhands/sdk/conversation/impl/local_conversation.py Outdated

csmith49 and others added 2 commits May 12, 2026 07:18

refactor: use contextlib.suppress for cleanup exception handling

b0b7c75

Replace try/except patterns with contextlib.suppress for cleaner exception suppression in _close_resources(). Co-authored-by: openhands <openhands@all-hands.dev>

Merge branch 'main' into fix/3149-tool-executor-close-on-conversation…

ac3adfc

…-close

csmith49 merged commit e7b4393 into main May 12, 2026
26 checks passed

csmith49 deleted the fix/3149-tool-executor-close-on-conversation-close branch May 12, 2026 13:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: always close tool executors on conversation close#3157

fix: always close tool executors on conversation close#3157
csmith49 merged 4 commits into
mainfrom
fix/3149-tool-executor-close-on-conversation-close

csmith49 commented May 8, 2026 •

edited by github-actions Bot

Loading

Uh oh!

all-hands-bot left a comment

Uh oh!

all-hands-bot left a comment

Uh oh!

github-actions Bot commented May 8, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 11, 2026

Uh oh!

all-hands-bot left a comment

Uh oh!

all-hands-bot left a comment

Uh oh!

github-actions Bot commented May 11, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 11, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

csmith49 commented May 8, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Root cause

Fix

Test

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Assessment

Eval-Risk Notification

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

✅ QA Report: PASS

Does this PR achieve its stated goal?

Test 1: Reproduce the bug on main branch (before the fix)

Test 2: Verify the new test and all existing tests pass

Issues Found

Uh oh!

github-actions Bot commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented May 11, 2026

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

✅ QA Report: PASS

Does this PR achieve its stated goal?

Test 1: Reproduce the bug (parent commit)

Test 2: Verify all cleanup tests pass

Test 3: Verify the fix works with multiple tools

Code Review

Issues Found

Uh oh!

github-actions Bot commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Python API breakage checks — ✅ PASSED

Uh oh!

github-actions Bot commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

REST API breakage checks (OpenAPI) — ✅ PASSED

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

csmith49 commented May 8, 2026 •

edited by github-actions Bot

Loading

github-actions Bot commented May 8, 2026 •

edited

Loading

github-actions Bot commented May 11, 2026 •

edited

Loading

github-actions Bot commented May 11, 2026 •

edited

Loading