Add workaround for MPI/UCX environment #196
In today's community meeting you agreed to do some more investigation into this. Let me know when that's done; I do have some ideas for improvements to this documentation change, but I want to first make sure we know what we want to recommend.
@panda1100 Reminder, this is waiting on you.
Thank you, @DrDaveD -san! I got access to an MPI/UCX environment yesterday. I'll keep posting updates here.
@DrDaveD @gmkurtzer The issue that users (including me) faced is related to the posix transport (a shared-memory-based transport). I replicated the issue with the following two commands and got the same error. The difference between them is whether the posix transport is used explicitly.
and
Rootless container environment, somehow,
There is another transport called "sysv transport". This works as well.
Next, I will run a quick performance test with the following cases:
To be continued...
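For readers unfamiliar with UCX transport selection, the comparison described above (posix versus sysv shared memory) can be sketched roughly as follows. This is only an illustration, not the commands from this thread: it assumes transports are selected through UCX's `UCX_TLS` environment variable as a comma-separated list, and the helper name `ucx_env` and the transport combinations are hypothetical.

```python
import os


def ucx_env(transports, base=None):
    """Return a copy of an environment with UCX_TLS set to the given transports.

    Hypothetical helper for illustration; `base` defaults to the current
    process environment.
    """
    env = dict(os.environ if base is None else base)
    # UCX_TLS takes a comma-separated list of transport names.
    env["UCX_TLS"] = ",".join(transports)
    return env


# Assumed combinations: loopback ("self") plus one shared-memory transport.
posix_env = ucx_env(["self", "posix"], base={})
sysv_env = ucx_env(["self", "sysv"], base={})

print(posix_env["UCX_TLS"])
print(sysv_env["UCX_TLS"])
```

An environment built this way could be passed as `env=` to `subprocess.run` when launching the MPI job, so each transport is exercised without changing the shell's global settings.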
@DrDaveD -san, I have updated the workaround based on the investigation results. I am keeping it as simple as possible for now, and I wrote up the details in apptainer/apptainer#769 (comment).
Signed-off-by: Dave Dykstra <[email protected]>
Description of the Pull Request (PR):
Add workaround for MPI/UCX environment.
See
This fixes or addresses the following GitHub issues: