
Why does NCCL not support one GPU multiple NICs in all2all collective communication? #804

Closed
zhangmenghao opened this issue Mar 17, 2023 · 5 comments


@zhangmenghao

I was testing NCCL between two nodes; each node has one GPU and two NICs on the same NUMA node, connected directly through one switch. Each NIC is assigned a different IP address.
I found that in all_reduce, NCCL already supports one GPU with two NICs, and the bandwidth of both NICs can be fully utilized.
However, in all2all, only one NIC is used, and the other NIC is left unused.

Therefore, my question is: why does NCCL not support one GPU with two NICs in alltoall? Is there some difficulty that prevents you from supporting such a feature?

@sjeaugey
Member

What version are you using? We had that issue on old versions but it was supposed to be fixed now.

@zhangmenghao
Author

zhangmenghao commented Mar 17, 2023

We used NCCL 2.14.3. In which version is that problem supposed to be fixed? @sjeaugey

@zhangmenghao
Author

Hi, Sylvain, could you please answer this question for us? @sjeaugey

@sjeaugey
Member

Sorry for the delay. It turns out I was wrong and it wasn't fixed. We're working on a fix right now; we'll post updates here when ready.

@zhangmenghao
Author

zhangmenghao commented Mar 22, 2023

Really great news! We are looking forward to your new NCCL version.

In NCCL 2.14.3, we hacked the function ncclTopoGetLocalNet() in topo.cc and finally made alltoall support one GPU with two NICs. Specifically, we assigned netDev based not only on round-robin order but also on channelId. However, this change only uses half of the channels, i.e., half of the channels are split equally across the two NICs, and we are confused by that behavior. Could you please explain briefly how the channels set up at init time are used in the subsequent all2all communication?

sjeaugey added a commit that referenced this issue Apr 19, 2023
Add support for IB SHARP to NVLS (NVLink SHARP algorithm).
Add NVLS+Tree algorithm.
Add support for memory management using cuMem* functions.
Use all NICs for Send/Receive operations on systems with more than
one NIC per GPU (#804).
Add ncclCommSplit primitive, with resource sharing option in config.
Fix alltoallv hang (#788).
Increase number of channels on H100 when we're not limited by NVLink.
Improve error reporting in case of IB failure, printing local and
remote ID (#779).
Add build option to allow compilation against RDMA includes instead
of dynamically loading IB verbs symbols (#802).
Fix context creation for progress thread (#803).
NET/IB: add option to use multiple QPs in round-robin mode.
Fix tree performance issue when NVB is disabled on HCM topologies.