Draft
Conversation
e9fb8e1 to
3368f61
Compare
Contributor
|
Could you please explain the challenge and the approach? |
Contributor
Author
|
The challenge is that the workers are trying to restart and they can't because the port is already in use.
The approach here is that the workers will First, check the ports to see if they're available and if they are then to proceed but otherwise back off and wait until the time out of 30 seconds and if it doesn't hit 30 seconds without them being available then they will just exit.
Upon further diagnosis, I'm not 100% sure this is the entire problem, but it does seem at least 90% effective.
The remaining may be due to the IPC server restarting on exit. I need to confirm this on a develop branch.
If this is the case, then a more simple solution is to add a second check, which is more of a band-aid, and then if the check fails, that the IPC server is still running, then to just stop it. But the better approach would be to figure out why it's restarting and fix that.
Again, I need to verify that this is the case if it wasn't just something that happens on my server after my these changes in this PR.
If this hat is the case then it would be interesting to know if this is the node worker or the master process initiating the IPC server restart on shutdown.
If you want to verify this, it would be adding logs to the IPC server and worker start/stop and adding a log for checking when an interrupt for the server happens.
The other thing you'll need to look for is the IPC server saying that it's listening. I know that on mine I saw that several of my cores would restart but not necessarily start listening before the shutdown was complete.
May be easier to diagnose this on or replicate this on a larger server with more cores, but I know that my 9950X was seeing this issue, and if this was the case, then it's not isolated to larger servers.
…-------- Original Message --------
On Thursday, 12/18/25 at 23:21 Black Swan ***@***.***> wrote:
blacks1ne left a comment [(QuilibriumNetwork/monorepo#498)](#498 (comment))
Could you please explain the challenge and the approach?
—
Reply to this email directly, [view it on GitHub](#498 (comment)), or [unsubscribe](https://github.com/notifications/unsubscribe-auth/ANGGZJ5LK5F42EXIBUHYGAL4COYQLAVCNFSM6AAAAACPQKFJLGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTMNZUGA2TQMZVGU).
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
Contributor
|
Thanks for the clarifications.
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.