Hi there,
sorry for my huge delay i had some issues with the production cluster so i was not able to test some stuff again.
i tried to get rdma working with the a new cluster (4 Node Supermicro X11 , Mellanox ConnectX-5 100G).
RDMA is still not working in my environment as soon as i installed server 2022.
With Server 2019 everything works fine.
So here are my settings:
- Verify if RDMA is enabled, the first one check if it's enabled on the server; the second one checks if it's enabled on the network adapters.
Get-NetOffloadGlobalSetting
ReceiveSideScaling : Enabled
ReceiveSegmentCoalescing : Enabled
Chimney : Disabled
TaskOffload : Enabled
NetworkDirect : Enabled
NetworkDirectAcrossIPSubnets : Blocked
PacketCoalescingFilter : Disabled
PS C:\Windows\system32> Get-NetAdapterRdma
Name InterfaceDescription Enabled Operational PFC ETS
Ethernet 4 Intel(R) Ethernet Connection X722 for... True False False False
Ethernet 2 Intel(R) Ethernet Connection X722 for... True False False False
Mellanox_anksinc151 Mellanox ConnectX-5 Adapter #2 True True True True
vEthernet (Verwaltung) 2 Hyper-V Virtual Ethernet Adapter #4 False False NA NA
vEthernet (Storage1) Hyper-V Virtual Ethernet Adapter #5 True False NA NA
Mellanox_anksinc150 Mellanox ConnectX-5 Adapter True True True True
vEthernet (Storage2) Hyper-V Virtual Ethernet Adapter #6 True False NA NA
- If the network adapter supports RoCE, we also need to configure the Switches to manage bandwidth(DCB/PFC);
Get-NetAdapterAdvancedProperty
Mellanox_anksinc150 NetworkDirect Technology RoCE *NetworkDire... {3}
Mellanox_anksinc150 DcbxMode Host in Charge DcbxMode {0}
PFC Settings
Set-NetQosDcbxSetting -Willing 0 -Confirm:$false
Enable-NetQosFlowControl -Priority 3
Disable-NetQosFlowControl 0,1,2,4,5,6,7
New-NetQosPolicy -Name "SMB" -NetDirectPortMatchCondition 445 -PriorityValue8021Action 3
New-NetQosPolicy -Name "Cluster" -Cluster -PriorityValue8021Action 7
New-NetQosPolicy -Name "Default" -Default -PriorityValue8021Action 0
New-NetQosTrafficClass "SMB" -Priority 3 -BandwidthPercentage 50 -Algorithm ETS
New-NetQosTrafficClass „Cluster" -Priority 7 -BandwidthPercentage 1 -Algorithm ETS
- For the OS, we need to use Server 2012 or higher with SMB3.0 along with SMB multichannel enabled;
The OS is Server 2022 Datacenter (GUI)
Get-SmbClientConfiguration
EnableMultiChannel : True
- For the failover cluster, please make sure the RDMA NICs are used for client access;
the storage network is configured as client access
- RDMA doesn't work with NIC teaming or Virtual Switch.
as i remember it is available for Switch embedded Teaming (SET) and all of my Server 2019 Cluster are configured with SET and RDMA is working for the S2D.
- Please also ensure you install the latest NIC drivers and latest firmware.
I installed the last Firmware and Drivers form Mellanox.
Driverversion: 2.70.24728.0
FW: 16.31.1014
In addition, please also run Cluster Validation Wizard to check if the cluster passed all tests.
passed all tests except the driver signature test because of the Intel VROC driver is not signed.