An Azure service that is used to provision Windows and Linux virtual machines.
Accelerated networking does not work in kernel 4.4.277 where hyperv sriov related functions have been migrated.
mlx5's sriov related irq is not registered. So the callback call for async does not work.
The code I migrated added code to use mlx5vf, including pci-hyperv, netvsc, hv-vmbus, and so on.
The dmesg output is below:
Jun 24 09:12:01 kernel: [ 526.016040] mlx5_core 0001:00:02.0: 0001:00:02.0:wait_func:792:(pid 2824): CREATE_EQ(0x301) timeout. Will cause a leak of a command resource
Jun 24 09:12:01 kernel: [ 526.016050] mlx5_core 0001:00:02.0: 0001:00:02.0:mlx5_start_eqs:489:(pid 2824): failed to create async EQ -110
Jun 24 09:12:01 kernel: [ 526.024106] mlx5_core 0001:00:02.0: Failed to start pages and async EQs
Jun 24 09:13:02 kernel: [ 587.124035] mlx5_core 0001:00:02.0: 0001:00:02.0:wait_func:792:(pid 2824): MANAGE_PAGES(0x108) timeout. Will cause a leak of a command resource
Jun 24 09:13:02 kernel: [ 587.124045] mlx5_core 0001:00:02.0: 0001:00:02.0:reclaim_pages:413:(pid 2824): failed reclaiming pages
Jun 24 09:13:02 kernel: [ 587.124047] mlx5_core 0001:00:02.0: 0001:00:02.0:mlx5_reclaim_startup_pages:537:(pid 2824): failed reclaiming pages (-110)
bash-3.1# Jun 24 09:13:02 kernel: [ 587.309118] mlx5_core 0001:00:02.0: mlx5_load_one failed with error code -110
Jun 24 09:13:02 kernel: [ 587.310278] mlx5_core: probe of 0001:00:02.0 failed with error -110