Preserve meaningful last_error in LibevConnection close paths by dkropachev · Pull Request #700 · scylladb/python-driver

dkropachev · 2026-02-14T12:01:44Z

Summary

Improves error reporting in LibevConnection so that users see a clear ConnectionShutdown message instead of a confusing [Errno 9] Bad file descriptor when connections close during node restarts.

The libev reactor was already safe from the EBADF race — defunct() returns early when is_closed is true, and the single-threaded event loop + _loop_will_run() design prevents watchers from firing on closed fds. The actual problem was that last_error wasn't set to a meaningful value in two close paths, so the stale EBADF string leaked up to factory() and confused users.

Changes

Set last_error in close() when connected_event is not yet set, so factory() reports ConnectionShutdown instead of whatever stale error was on the connection
Set last_error on server-initiated close (EOF) in handle_read() before calling close()

Test plan

tests/unit/io/test_libevreactor.py passes

Refs #614

LibevConnection.close() closes the socket immediately while watchers are stopped asynchronously in the next event loop iteration via _loop_will_run(). This creates a race window where handle_read() or handle_write() can operate on a closed socket fd, producing EBADF errors that surface as ConnectionShutdown. - Add is_closed/is_defunct guards in handle_read() and handle_write() error paths to silently exit during shutdown instead of calling defunct() with EBADF - Set last_error in close() when connected_event is not yet set to prevent factory() from returning a dead connection - Set last_error on server-initiated close (EOF) in handle_read() before calling close()

Lorak-mmk · 2026-02-16T15:34:57Z

cassandra/io/libevreactor.py

+            self.last_error = ConnectionShutdown(
+                "Connection to %s was closed by server" % self.endpoint)
            self.close()


Why do you set it unconditionally here, but guard it with if not self.connected_event.is_set(): in the other place?

in close, it guards against overwriting last_error when connection was closer before it was properly initialized, in handle_read it is just channeling error reason back to close, connection is already initialized at this point.

Sorry, but I don't get it :(
connected_event can be set in 4 places (for libevreactor at least, I didn't loo at others):

defunct in connection.py, which calls close before setting the event.

_handle_startup_response in case of ReadyMessage - successfull connection creation.

_handle_auth_response in case of AuthSuccessMessage - successfull connection creation.

close in libevreactor.py.

You said that you want to guard against overwriting error when connection was closed before properly initialized, but it looks to me like you are doing the opposite.
You are overwriting error only if event was not yet set. If it was not yet set then we didn't receive ReadyMessage or AuthSuccessMessage. We also can't be in defunct, because we are in if not self.is_defunct:, and also defunct calls close before setting event.

@sylwiaszunejko since you approved the PR, maybe you understand this?

Maybe I am wrong, but I understand it that way that if it was set by defunct or close we will not call handle_read anymore, so there is no other error that we would possible want to have here. In close if it was not yet set then we didn't receive ReadyMessage or AuthSuccessMessage, so the conn was not properly initialized as Dmitry said, or no?
I am now confused tbh

Ok I think I get it. If the event is set in this check, it was set by either ReadyMessage (_handle_startup_response), or by AuthSuccessMessage (_handle_auth_response). In both of those cases the connection is properly initialized.
So the check looks correct - it will only set error if connection was not initialized yet.
I would love to better understand the paths that can trigger this, to verify that they don't set last_error already, but I won't hold this PR over that.

dkropachev marked this pull request as draft February 14, 2026 16:40

dkropachev force-pushed the fix/libev-close-race-614 branch from 8543c1f to 6aabcee Compare February 16, 2026 13:02

dkropachev force-pushed the fix/libev-close-race-614 branch from 6aabcee to 96add52 Compare February 16, 2026 13:09

dkropachev changed the title ~~Fix LibevConnection close() race causing EBADF errors~~ Preserve meaningful last_error in LibevConnection close paths Feb 16, 2026

dkropachev self-assigned this Feb 16, 2026

dkropachev marked this pull request as ready for review February 16, 2026 13:26

dkropachev requested review from Lorak-mmk and sylwiaszunejko February 16, 2026 13:26

Lorak-mmk reviewed Feb 16, 2026

View reviewed changes

sylwiaszunejko approved these changes Mar 4, 2026

View reviewed changes

Lorak-mmk mentioned this pull request Mar 4, 2026

geventreactor: fix close() race causing EBADF errors #701

Open

Lorak-mmk approved these changes Mar 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preserve meaningful last_error in LibevConnection close paths#700

Preserve meaningful last_error in LibevConnection close paths#700
dkropachev wants to merge 1 commit intomasterfrom
fix/libev-close-race-614

dkropachev commented Feb 14, 2026 •

edited

Loading

Uh oh!

Lorak-mmk Feb 16, 2026

Uh oh!

dkropachev Feb 17, 2026

Uh oh!

Lorak-mmk Mar 4, 2026

Uh oh!

Lorak-mmk Mar 4, 2026

Uh oh!

sylwiaszunejko Mar 4, 2026

Uh oh!

Lorak-mmk Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

dkropachev commented Feb 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Test plan

Uh oh!

Lorak-mmk Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

dkropachev Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Lorak-mmk Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

Lorak-mmk Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

sylwiaszunejko Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

Lorak-mmk Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dkropachev commented Feb 14, 2026 •

edited

Loading