Error creating endpoint from mlflow model (tensorflow job)

Question

Error creating endpoint from mlflow model (tensorflow job)

Juna Salviati 11 MVP

Hello everybody,
I am trying to deploy a realtime endpoint from a registered mlflow model obtained from a tensorflow training job.
In this repository, you will find the training scripts:

https://github.com/antigones/py-hands-ml-tf/tree/main/azure_ml/job_script

The job outputs a MLFlow model with its conda environment yml file.

When I try to deploy the model to a realtime endpoint, I get the following error:

257528-azure-ml-deploy-error.txt

It seems to be an error related to protobuf, when loading the model:

 File "/opt/miniconda/envs/userenv/lib/python3.8/site-packages/google/protobuf/descriptor.py", line 560, in __new__  
    _message.Message._CheckCalledFromGeneratedFile()  
TypeError: Descriptors cannot not be created directly.  
If this call came from a _pb2.py file, your generated code is out of date and must be regenerated with protoc >= 3.19.0.  
If you cannot immediately regenerate your protos, some other possible workarounds are:  
 1. Downgrade the protobuf package to 3.20.x or lower.  
 2. Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python (but this will use pure-Python parsing and will be much slower).  
  
More information: https://developers.google.com/protocol-buffers/docs/news/2022-05-06#python-updates

The environment is deployed automatically (the scoring script is also generated).
I have also tried different images, with different python versions (3.7) and Tensorflow versions (2.4) with no luck.

How can I solve this issue?

Thank you in advance for your support.

2 answers

Your answer

Answer 1

Juna Salviati 11 MVP

Seems like the problem is in azureml-inference-server-http package, where there is a mismatch with protobuf version.

As a workaround, I created a custom managed online deployment via CLI, specifing the following environment variable:

environment_variables:  
  "PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION": "python"

and then I was able to publish the endpoint.

Alberto Gallo 16 Reputation points Microsoft Employee

2022-11-09T13:55:17.8+00:00
I am trying to deploy a wide And Deep Reccomender from the Azure studio by creating the online endpoint from the job details but i get the same error:

TypeError: Descriptors cannot not be created directly.
If this call came from a _pb2.py file, your generated code is out of date and must be regenerated with protoc >= 3.19.0.
If you cannot immediately regenerate your protos, some other possible workarounds are:

Downgrade the protobuf package to 3.20.x or lower.

Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python (but this will use pure-Python parsing and will be much slower).

Both using the Azure Container both if I create a Kubernetes cluster inference instance.

Is there a way to set the env variable in such scenario to overcome the problem? @Juna Salviati
Alberto Gallo 16 Reputation points Microsoft Employee

2022-11-09T17:10:31.387+00:00

I actually overcome the problem by creating my custom Environment and adding the override pipi install:

However the next error is that from the score.py the "azureml" module cannot be loaded:

Someone from Microsoft in other threads suggested to include this but doens't make sense as the other core modules etc form azure are already there.
pip install azureml

sooo...what next??

Answer 2

Juna Salviati 11 MVP

I have a Keras model and I had to develop and upload my own score.py to override the init() function in order to load the model using load_model() for Keras models, instead of using joblib.load(model_path) as it was by default.
You probably also have to override the run() function to customize the inference.

Share via

Error creating endpoint from mlflow model (tensorflow job)

2 answers

Your answer