Error while deploying Whisper model in batch pipeline
Arie Youlus
21 reputation points
I'm trying to deploy the OpenAI Whisper model with a batch pipeline, following the example notebook: https://github.com/Azure/MachineLearningNotebooks/blob/master/how-to-use-azureml/machine-learning-pipelines/parallel-run/file-dataset-image-inference-mnist.ipynb
I'm running it on a STANDARD_NC6S_V3 machine.
I keep getting the following error:
- Error '/azureml-envs/azureml_2b0a8ce0115582fe46e2aa65a9665d55/lib/python3.9/site-packages/torch/lib/../../nvidia/cublas/lib/libcublas.so.11: undefined symbol: cublasLtGetStatusString, version libcublasLt.so.11
  File "/mnt/azureml/cr/j/3f034c6e7a1b4166b24b196339e7b655/exe/wd/whisper_transcribe.py", line 3, in <module>
    import whisper
  File "/azureml-envs/azureml_2b0a8ce0115582fe46e2aa65a9665d55/lib/python3.9/site-packages/whisper/__init__.py", line 8, in <module>
    import torch
  File "/azureml-envs/azureml_2b0a8ce0115582fe46e2aa65a9665d55/lib/python3.9/site-packages/torch/__init__.py", line 191, in <module>
    _load_global_deps()
  File "/azureml-envs/azureml_2b0a8ce0115582fe46e2aa65a9665d55/lib/python3.9/site-packages/torch/__init__.py", line 153, in _load_global_deps
    ctypes.CDLL(lib_path, mode=ctypes.RTLD_GLOBAL)
  File "/azureml-envs/azureml_2b0a8ce0115582fe46e2aa65a9665d55/lib/python3.9/ctypes/__init__.py", line 382, in __init__
    self._handle = _dlopen(self._name, mode)'
I can't find the cause of the error, but I suspect it is related to the machine I'm deploying on, because the error did not appear when I used a different machine.
Any help would be much appreciated.
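One way to test the machine hypothesis is to list every copy of libcublas that is visible on each compute target and compare the results. The sketch below is only a diagnostic under stated assumptions, not a fix: it assumes the failure comes from a pip-bundled `libcublas.so.11` (the `nvidia/cublas/lib` directory from the traceback) being paired with a different `libcublasLt.so.11` found through `LD_LIBRARY_PATH`, which has not been confirmed here. The helper names `find_cublas_libs` and `ld_library_dirs` are hypothetical and use only the Python standard library.

```python
import os
import sys
from pathlib import Path


def ld_library_dirs(env=None):
    """Return the non-empty directories listed in LD_LIBRARY_PATH."""
    env = os.environ if env is None else env
    return [d for d in env.get("LD_LIBRARY_PATH", "").split(":") if d]


def find_cublas_libs(search_dirs):
    """Return sorted paths of libcublas*.so* files found directly in search_dirs."""
    found = []
    for d in search_dirs:
        p = Path(d)
        if p.is_dir():  # silently skip directories that do not exist
            found.extend(str(f) for f in p.glob("libcublas*.so*"))
    return sorted(found)


if __name__ == "__main__":
    # Candidate locations: LD_LIBRARY_PATH entries, plus the pip-installed
    # nvidia/cublas/lib directory under each sys.path entry (the layout
    # seen in the traceback; assumed, not guaranteed, on other images).
    candidates = ld_library_dirs() + [
        os.path.join(p, "nvidia", "cublas", "lib") for p in sys.path if p
    ]
    for lib in find_cublas_libs(candidates):
        print(lib)
```

Running this at the top of the scoring script, before `import whisper`, on both the failing and the working machine would show whether the two environments expose different sets of cuBLAS libraries.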