Switch to the next host on connection error #2330

arnaudmorin · 2025-07-08T15:53:36Z

When a connection error occurs, such as a PRECONDITION_FAILED caused by a missing replica of a stream queue in RabbitMQ, we can safely cycle through the list of hosts given in the connection string instead of retrying on the same one.

Signed-off-by: Arnaud Morin <[email protected]>

auvipy

Same, can you add a test?

Copilot

Pull request overview

This PR updates Kombu’s connection retry behavior so that when an operation wrapped by Connection.ensure() encounters a recoverable connection error, the connection will rotate to the next configured host (failover URL) instead of repeatedly retrying the same host.

Changes:

Switch to the next failover host when ensure() catches a recoverable connection error during the wrapped operation.


Copilot · 2026-02-12T09:39:42Z

kombu/connection.py

                    except conn_errors as exc:
+                        self.maybe_switch_next()  # select next host
                        if got_connection and not has_modern_errors:
                            # transport can not distinguish between


maybe_switch_next() is called before the early-exit raise paths (got_connection and not has_modern_errors and max_retries exceeded). That means callers can see the Connection mutated to the next host even though the operation ultimately fails and raises, which is a surprising side effect and can make diagnostics inconsistent (errback/logging will refer to the previous exception but self now points at a different host). Consider only switching hosts when you’re actually going to retry (e.g., move the switch after the raise checks and right before _ensure_connection).
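The ordering suggested above can be sketched with a small self-contained model. This is not Kombu's actual implementation: `FailoverConnection`, `RecoverableError`, and the simplified `ensure()` signature are hypothetical stand-ins, and only the position of `maybe_switch_next()` relative to the raise check is the point.

```python
import itertools


class RecoverableError(Exception):
    """Hypothetical stand-in for a recoverable connection error."""


class FailoverConnection:
    """Toy model of a connection with a round-robin list of failover hosts."""

    def __init__(self, hosts):
        self.hosts = list(hosts)
        self._cycle = itertools.cycle(self.hosts)
        self.host = next(self._cycle)

    def maybe_switch_next(self):
        # Rotate only if there is more than one host to choose from.
        if len(self.hosts) > 1:
            self.host = next(self._cycle)

    def ensure(self, fun, max_retries=None):
        """Wrap fun so it is retried, rotating hosts only when retrying."""
        def wrapped(*args, **kwargs):
            for retries in itertools.count():
                try:
                    return fun(self.host, *args, **kwargs)
                except RecoverableError:
                    # Raise check first: if we give up, the host is untouched.
                    if max_retries is not None and retries >= max_retries:
                        raise
                    # A retry is now certain, so it is safe to switch hosts.
                    self.maybe_switch_next()
        return wrapped
```

With this ordering, a call that exhausts its retries leaves `conn.host` pointing at the host that produced the final exception, so logging and errbacks stay consistent with the connection state.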

Copilot · 2026-02-12T09:39:42Z

kombu/connection.py

                        self._debug('ensure retry policy error: %r',
                                    exc, exc_info=1)
                    except conn_errors as exc:
+                        self.maybe_switch_next()  # select next host


This change alters retry/failover behavior in ensure() by switching to the next host when fun() raises a recoverable connection error, but there doesn’t appear to be a unit test covering that ensure() actually advances the host list in this scenario (as opposed to only _ensure_connection doing it). Adding a test that sets up a multi-host connection, forces a recoverable_connection_errors exception from fun(), and asserts the connection params switched would help prevent regressions.
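A rough shape for such a test is sketched below. To keep it runnable on its own it exercises a hypothetical `FakeConnection` rather than Kombu's `Connection`; the class, the `FakeRecoverableError` exception, and the simplified `ensure()` signature are assumptions for illustration. A real test would build a multi-host connection with Kombu's own test helpers and raise one of the transport's recoverable connection errors.

```python
import itertools
from unittest import mock


class FakeRecoverableError(Exception):
    """Hypothetical stand-in for a recoverable connection error."""


class FakeConnection:
    """Hypothetical stand-in that models only the behavior under test."""

    def __init__(self, hosts):
        self._cycle = itertools.cycle(hosts)
        self.hostname = next(self._cycle)

    def maybe_switch_next(self):
        self.hostname = next(self._cycle)

    def ensure(self, fun, max_retries=None):
        def wrapped(*args, **kwargs):
            for retries in itertools.count():
                try:
                    return fun(*args, **kwargs)
                except FakeRecoverableError:
                    if max_retries is not None and retries >= max_retries:
                        raise
                    self.maybe_switch_next()
        return wrapped


def test_ensure_switches_host_on_recoverable_error():
    conn = FakeConnection(["host1", "host2"])
    # fun() fails once with a recoverable error, then succeeds.
    fun = mock.Mock(
        side_effect=[FakeRecoverableError("PRECONDITION_FAILED"), "ok"]
    )

    # Wrap the real method so calls are recorded but still take effect.
    with mock.patch.object(
        conn, "maybe_switch_next", wraps=conn.maybe_switch_next
    ) as switch:
        result = conn.ensure(fun, max_retries=3)()

    assert result == "ok"
    assert fun.call_count == 2
    switch.assert_called_once_with()
    assert conn.hostname == "host2"
```

The assertions check both that the switch was triggered from `ensure()` itself and that the visible connection parameter changed, which is the regression the comment above is concerned with.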

auvipy requested changes Jul 8, 2025


auvipy added this to the 5.7.0 milestone Jul 19, 2025

auvipy requested a review from Copilot February 12, 2026 09:35

Copilot started reviewing on behalf of auvipy February 12, 2026 09:36

Copilot AI reviewed Feb 12, 2026


Merge branch 'main' into dev/am/switch1

cee2157
