High Load and "Node has not completed discovery" after Controller Re-initialization #931

@FranzSchi

Description

Hello! I recently migrated my coordinator from an EM21 to an EM24. The initial migration went smoothly, but I encountered significant performance issues after deciding to "re-initialize" the controller to ensure a clean setup.

The Problem

After triggering the re-initialization, I observed the following:

  • System Load: The load average jumped to roughly 2.x, compared to the usual 0.2–0.3.

  • Discovery Issues: A large number of devices (primarily battery-powered) remained in the status: "node has not completed discovery".

  • Unresponsiveness: The entire system became unresponsive for approximately 2 hours.

  • Resolution: I eventually resolved this by restoring the userdata directory from a backup, which returned everything to a stable state.

Questions & Feedback

I have a few questions regarding the "Node has not completed discovery" state:

Technical Meaning: What exactly does this state imply for battery-powered devices during a re-initialization?

Next Steps: In this situation, is the best practice to manually trigger a "scan," or is it better to wait for the devices to wake up naturally?

Suggested Improvement

To improve the user experience, would it be possible to update the error message?

If the system knows a device is waiting for a wake-up interval, perhaps the message could provide a hint on how to resolve it (e.g., "Node has not completed discovery. Please wake the device manually or wait for the next check-in."). This would help users understand if the system is hanging or simply waiting for battery-operated nodes to check in.
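To illustrate the idea, here is a minimal sketch in Python of how the message could be chosen. It is not based on the project's actual code: the `Node` record, its field names, and the `discovery_status_message` function are all hypothetical, and I am assuming the system can already tell whether a node is battery-powered.

```python
from dataclasses import dataclass
from typing import Optional

BASE_MESSAGE = "Node has not completed discovery."
BATTERY_HINT = "Please wake the device manually or wait for the next check-in."


@dataclass
class Node:
    """Hypothetical node record; the field names are illustrative only."""
    discovery_complete: bool
    battery_powered: bool


def discovery_status_message(node: Node) -> Optional[str]:
    """Return the status text for a node, or None once discovery is done.

    Battery-powered nodes get an extra hint, because they only answer
    discovery requests while they are awake.
    """
    if node.discovery_complete:
        return None
    if node.battery_powered:
        return f"{BASE_MESSAGE} {BATTERY_HINT}"
    return BASE_MESSAGE
```

With something along these lines, a mains-powered node stuck in discovery would still show the plain message, which would make it easier to tell a real problem apart from a sleeping battery device.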

Thank you for your hard work on this project!
