How to fix the error "failed to create shim task" after submitting a job to compute cluster?
As I submitted a job, it failed and gave me the following messages:
Service Error:
Failed to execute command group with error Docker responded with status code 500: {"message":"failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: initialization error: nvml error: driver not loaded: unknown"}
Warning:
AzureMLCompute job failed. OrchestrateJobError: Failed to execute command group with error Docker responded with status code 500: {"message":"failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: err