Synapse Analytics Auto ML Predict No module named 'azureml.automl'

Question

Barth, Thilo 32

ref: https://learn.microsoft.com/en-us/answers/questions/788637/azure-synapse-ml-predict-errno-20-not-a-directory.html

I get the following error with Apache Spark version 3.1: ModuleNotFoundError: No module named 'azureml.automl'

with version 2.4

Answer 1

Barth, Thilo 32

I solved it. In my case it works best like this:

PRADEEPCHEEKATLA 90,641 Reputation points Moderator

2022-04-04T04:23:07.613+00:00

Hello @Anonymous,

Glad to know that your issue has been resolved, and thanks for sharing the solution, which may help other community members reading this thread.

Answer 2

Hello @Anonymous,

Thanks for the question and using MS Q&A platform.

If you run import azureml.automl on the Apache Spark 3.1 runtime, you will get the error No module named 'azureml.automl'.

As mentioned in the official documentation, could you please try using from notebookutils.mssparkutils import azureML instead? It should work as expected.
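Before changing any imports, it can help to confirm which of the modules named in this thread the Spark pool can actually locate. The following is only a minimal diagnostic sketch using the Python standard library; the helper name module_available is hypothetical and is not part of any Azure or Synapse package.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if the module `name` can be located on this interpreter."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # find_spec imports parent packages first, so a missing parent
        # (for example `azureml`) surfaces as ModuleNotFoundError here.
        return False


# Module names taken from this thread; run this in a notebook cell on the pool.
for name in (
    "azureml.automl",
    "notebookutils.mssparkutils",
    "azure.synapse.ml.predict",
):
    status = "available" if module_available(name) else "missing"
    print(f"{name}: {status}")
```

If azureml.automl is reported as missing while the other two are available, the pool matches the situation described above.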

Here is the sample notebook for "Score machine learning models with PREDICT in serverless Apache Spark pools":

#!/usr/bin/env python
# coding: utf-8

# ## Azure_Synapse_ML_predict

# In[Cell-1]:

from notebookutils.mssparkutils import azureML

# In[Cell-2]:

# Get the Azure ML workspace through the linked service named "AzureMLService".
ws = azureML.getWorkspace("AzureMLService")

# In[Cell-3]:

from azureml.core import Workspace, Model

# Download the registered model (name:version) to the current directory.
model = Model(ws, id="linear_regression:1")
model.download('./')

# In[Cell-4]:

from pyspark.sql.functions import col, pandas_udf, udf, lit
from notebookutils.mssparkutils import azureML
from azureml.core import Workspace, Model
from azureml.core.authentication import ServicePrincipalAuthentication
import azure.synapse.ml.predict as pcontext
import azure.synapse.ml.predict.utils._logger as synapse_predict_logger

# Enable PREDICT for this Spark session.
spark.conf.set("spark.synapse.ml.predict.enabled", "true")

# In[Cell-5]:

AML_MODEL_URI_SKLEARN = "aml://linear_regression:1"

# In[Cell-6]:

# Bind the model so it can be referenced by its alias in Spark SQL.
model = pcontext.bind_model(
    return_types="Array<float>",
    runtime="mlflow",
    model_alias="linear_regression:1",
    model_uri=AML_MODEL_URI_SKLEARN,
    aml_workspace=ws
).register()

# In[Cell-7]:

DATA_FILE = "abfss://******@cheprasynapse.dfs.core.windows.net/AML/LengthOfStay_cooked_small.csv"
df = spark.read.format("csv").option("header", "true").csv(DATA_FILE, inferSchema=True)
df.createOrReplaceTempView('data')
df.show(10)

# In[Cell-8]:

# Call PREDICT using the Spark SQL API
predictions = spark.sql(
    """
    SELECT PREDICT('linear_regression:1',
        hematocrit, neutrophils, sodium, glucose, bloodureanitro, creatinine, bmi, pulse, respiration)
    AS predict FROM data
    """
)
# show() returns None, so keep the DataFrame and display it separately.
predictions.show()

Hope this helps. Please let us know if you have any further queries.

Barth, Thilo 32 Reputation points

2022-03-29T09:38:26.303+00:00

Same error

PRADEEPCHEEKATLA 90,641 Reputation points Moderator

2022-03-29T09:44:11.123+00:00

Hello @Anonymous,

Could you please share the Apache Spark pool runtime details?

Barth, Thilo 32 Reputation points

2022-03-29T10:00:01.507+00:00

Where can I find them?

PRADEEPCHEEKATLA 90,641 Reputation points Moderator

2022-03-30T05:27:09.207+00:00

Hello @Anonymous,

Yes, you are using the Apache Spark 3.1 runtime.

In my repro, the code you shared above works as expected, and I don't see the error message you are experiencing.

I also tested the same code on a newly created Apache Spark 3.1 runtime, and it works as expected.

Could you please create a new cluster and check whether you are able to run the above code?

Hope this helps. Please let us know if you have any further queries.

Barth, Thilo 32 Reputation points

2022-03-30T06:30:55.907+00:00

I created a new Spark pool with the same settings as yours. I still get this error.

Maybe my model is corrupt?

Conda.yaml - MLmodel - requirements.txt
https://pastebin.com/vf8pzE3C

This is how I register the model after training:
