How to fix the error "failed to create shim task" after submitting a job to a compute cluster?

Lu 66 Reputation points
2023-06-22T13:13:24.69+00:00

When I submitted a job, it failed with the following messages:

Service Error:

Failed to execute command group with error Docker responded with status code 500: {"message":"failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: initialization error: nvml error: driver not loaded: unknown"}

Warning:

AzureMLCompute job failed. OrchestrateJobError: Failed to execute command group with error Docker responded with status code 500: {"message":"failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: err
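
The Docker message in the Service Error is a chain of nested errors, and the part that matters sits in the hook's stderr at the very end. As a rough sketch only (the helper name `hook_stderr_lines` is made up for illustration, and the payload is copied from the Service Error above), the stderr lines can be pulled out with Python's standard library:

```python
import json

# JSON payload copied verbatim from the Service Error above.
# A raw string keeps the literal "\n" so json.loads can decode it.
payload = r'''{"message":"failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: initialization error: nvml error: driver not loaded: unknown"}'''


def hook_stderr_lines(raw):
    """Return the non-empty stderr lines embedded in the Docker error payload.

    Hypothetical helper: assumes the message contains a 'stderr: ' marker,
    as it does in the payload above.
    """
    message = json.loads(raw)["message"]
    # Everything after the first 'stderr: ' marker is the hook's stderr output.
    stderr = message.split("stderr: ", 1)[-1]
    return [line.strip() for line in stderr.splitlines() if line.strip()]


lines = hook_stderr_lines(payload)
for line in lines:
    print(line)

# The last stderr line is itself a ': '-separated chain; split it for readability.
cause_chain = lines[-1].split(": ")
print(cause_chain)
```

Read this way, the last stderr line is `nvidia-container-cli` reporting an NVML error, "driver not loaded", which is the part of the message I need help with.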

Azure Machine Learning
An Azure machine learning service for building and deploying models.