Hi, I've been trying out nccl-tests on g6e nodes, and throughput maxes out at 3 GB/s. I see that GPUDirect RDMA is not enabled on g6e nodes when using the AWS NCCL plugin, with EFA enabled on 4 network interfaces. NVIDIA states that those GPUs do support GPUDirect RDMA.
Is it just a matter of providing a topology file, or is something more needed?
I'd love to contribute support if given some pointers.
Abhishek8394 changed the title from "RDMA supoort for g6e nodes" to "RDMA support for g6e nodes" on Aug 28, 2024