[Graphbolt] [NFC] Remove default backend torch.distributed.init_process_group calls #7870

Open. tpopp wants to merge 2 commits into master.

tpopp commented Feb 22, 2025

With newer PyTorch versions, torch.distributed.init_process_group calls that do not specify a backend behave differently: they used to initialize both nccl and gloo, and now initialize only nccl.

With this PR, every call site specifies its desired backends explicitly. This keeps the expected torch<=2.4 behavior on torch>=2.6. The changed test relies on both backends existing and would otherwise fail on newer torch versions.
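For illustration only, here is a minimal sketch of what "explicitly specify the backends" could look like. It is not the diff from this PR: the helper names `explicit_backend` and `init_distributed` are hypothetical, and it assumes the `device:backend` string form (`"cpu:gloo,cuda:nccl"`) accepted by `torch.distributed.init_process_group`.

```python
def explicit_backend(cuda_available: bool) -> str:
    """Return a backend string that names every desired backend.

    Hypothetical helper, not part of DGL or PyTorch.
    """
    # With CUDA, request gloo for CPU tensors and nccl for CUDA tensors
    # rather than relying on the version-dependent default.
    return "cpu:gloo,cuda:nccl" if cuda_available else "gloo"


def init_distributed(rank: int, world_size: int, init_method: str) -> None:
    """Initialize the default process group with an explicit backend."""
    # Imported lazily so explicit_backend() can be used without torch.
    import torch
    import torch.distributed as dist

    dist.init_process_group(
        backend=explicit_backend(torch.cuda.is_available()),
        init_method=init_method,
        rank=rank,
        world_size=world_size,
    )
```

Passing the backend string on every call means the set of initialized backends no longer depends on which default the installed torch version picks.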

See pytorch/pytorch#147631

tpopp and others added 2 commits February 22, 2025 17:36
See pytorch/pytorch#147631

With newer torch versions, the behavior in these locations was changed
from initializing nccl+gloo to only initializing gloo.

The changed test relies on both backends existing and would fail on newer
torch versions.