Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a timeout argument to progressWorkerEvent #30

Merged

Conversation

pentschev
Copy link
Member

Instead of blocking indefinitely until an evens occurs, allows the user to specify a timeout the epoll wait operation, causing it to return.

In some cases using blocking mode in the progress thread may cause the event to never wake, allowing it to break periodically fixes hangs in those cases. It's not clear whether this is an abuse of blocking progress mode in UCX, but for now this resolves that issue.

Instead of blocking indefinitely until an evens occurs, allows the user
to specify a timeout the epoll wait operation, causing it to return.

In some cases using blocking mode in the progress thread may cause the
event to never wake, allowing it to break periodically fixes hangs in
those cases. It's not clear whether this is an abuse of blocking
progress mode in UCX, but for now this resolves that issue.
@pentschev pentschev added bug Something isn't working non-breaking Introduces a non-breaking change labels Apr 14, 2023
@pentschev pentschev requested review from a team as code owners April 14, 2023 15:46
Copy link
Contributor

@wence- wence- left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One style suggestions, otherwise LGTM.

cpp/src/worker.cpp Outdated Show resolved Hide resolved
@pentschev pentschev changed the base branch from branch-0.31 to branch-0.32 April 26, 2023 21:16
@pentschev
Copy link
Member Author

Thanks @wence- !

@pentschev
Copy link
Member Author

And btw, the Python benchmark timed out several times here, I now opened #37 to increase the timeout for now.

@pentschev
Copy link
Member Author

/merge

@rapids-bot rapids-bot bot merged commit 4b8d835 into rapidsai:branch-0.32 Apr 27, 2023
@pentschev pentschev mentioned this pull request Apr 27, 2023
4 tasks
@pentschev pentschev deleted the progress_worker_event-timeout branch May 2, 2023 08:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working non-breaking Introduces a non-breaking change
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants