Azure OpenAI service doesn't show up for Azure AI Search import and vectorize data wizard

Question

Nihit Mody 55

Hi,

I am trying to use the Import and Vectorize Data Wizard for AI Search Service and I cannot select my OpenAI service that contains the model text-embedding-ada-002 for embedding. The data I have is a CSV in Blob Storage. I have ensured the Search Service has Cognitive Services OpenAI User access yet it still doesn't show up. You can see the issue here:

User's image

How can I resolve this?

Thanks

Laxman Reddy Revuri 5,475 Reputation points Microsoft External Staff Moderator

2025-03-24T09:35:12.4533333+00:00

Hi @Nihit Mody
Please enable the system-assigned managed identity on the AI Search service.

Then go to the OpenAI resource's IAM settings and assign the Cognitive Services OpenAI Contributor role to the AI Search resource's managed identity.

Make sure that the model you are trying to use (text-embedding-ada-002) is deployed and available in your Azure OpenAI resource.
Nihit Mody 55 Reputation points

2025-03-24T10:11:08.31+00:00

Hi Laxman,

Thanks for your response, but as my original post states, I have already done the steps you describe. Could you please let me know the next steps to resolve this?

I followed these instructions:

https://learn.microsoft.com/en-us/azure/search/search-get-started-portal-import-vectors?tabs=sample-data-storage%2Cmodel-aoai%2Cconnect-data-storage#connect-to-azure-openai

Best
Anonymous

2025-03-25T00:39:25.7933333+00:00
Hi Nihit Mody,

Thanks for confirming that you have verified all the steps above. It is still good practice to cross-check each point:

Ensure that your Azure OpenAI service was created in the Azure portal and not in the Azure AI Foundry portal, as only the former is compatible with the Azure OpenAI Embedding skill integration.

If Azure AI Search can't connect, it might be due to virtual network or firewall rules. In such cases, consider scripted or programmatic approaches instead.

Ensure that the embedding permission step is followed correctly. If you are using a virtual network, you may need to modify the firewall settings and add further allowlist entries.

Sometimes services do not show up immediately due to caching or synchronization issues. Try refreshing the services list or restarting the Azure AI Search service to see if the OpenAI service appears.

https://learn.microsoft.com/en-us/azure/search/search-get-started-portal-import-vectors?tabs=sample-data-storage%2Cmodel-aoai%2Cconnect-data-storage#connect-to-azure-openai
Laxman Reddy Revuri 5,475 Reputation points Microsoft External Staff Moderator

2025-03-25T04:37:41.28+00:00

Hi @Nihit Mody

Could you kindly share a screenshot of where you encountered the issue?
Sampath 3,875 Reputation points Microsoft External Staff Moderator

2025-03-25T08:35:44.0166667+00:00

Hi @Nihit Mody, we still have not heard back from you. Could you kindly share a screenshot of where you encountered the issue?
Nihit Mody 55 Reputation points

2025-03-26T04:21:06.47+00:00

Hi Sampath,

Here is the screenshot. Nothing appears in the dropdown, despite the ada-002 embedding model existing in this subscription with IAM enabled.

Thanks
Sampath 3,875 Reputation points Microsoft External Staff Moderator

2025-03-26T06:22:30.9466667+00:00

Hello @Nihit Mody,

Have you assigned the Storage Blob Data Reader role on the storage account? In the OpenAI resource, have you added the Cognitive Services OpenAI User role? Have you included the sample "gpt-4" model before adding "text-embedding-ada-002", using this doc?

Refer to this doc for using azure-search-documents and the Azure SDK for Python to apply data chunking and vectorization in an indexer pipeline.
Nihit Mody 55 Reputation points

2025-03-27T07:18:03.8533333+00:00

Hi Sampath,

Yes, I have assigned Storage Blob Data Reader on my storage account to the Search Service. I have also assigned the Cognitive Services OpenAI User role on the OpenAI resource to the Search Service. I followed this tutorial verbatim from start to finish:

https://learn.microsoft.com/en-us/azure/search/tutorial-rag-build-solution-models

I have these models in my Azure Foundry under my Azure Subscription 1 but none of them appear in the Import data and vectorize wizard as shown in my original post:

Please let me know how to proceed.

Thanks
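The role checks discussed above can be scripted rather than eyeballed in the portal. The following is a minimal sketch, assuming you have exported the search service identity's role assignments as JSON (for example with `az role assignment list --assignee <principalId> --all -o json`, where each entry carries a `roleDefinitionName` field). The helper name `missing_roles` and the sample data are hypothetical, and the required role names are the ones mentioned in this thread.

```python
import json

# Role names taken from this thread; adjust to whatever your setup requires.
REQUIRED_ROLES = {"Cognitive Services OpenAI User", "Storage Blob Data Reader"}


def missing_roles(assignments_json: str, required=REQUIRED_ROLES) -> set:
    """Return the required role names that do not appear in the exported assignments."""
    assignments = json.loads(assignments_json)
    granted = {entry.get("roleDefinitionName") for entry in assignments}
    return set(required) - granted


if __name__ == "__main__":
    # Hypothetical export for the search service's managed identity.
    sample = json.dumps([
        {"roleDefinitionName": "Storage Blob Data Reader", "scope": "/subscriptions/..."},
    ])
    print(missing_roles(sample))
```

An empty set means both roles are present. As this thread shows, that alone does not guarantee the resource appears in the wizard.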
Sampath 3,875 Reputation points Microsoft External Staff Moderator

2025-03-27T07:55:35.8566667+00:00

Hello @Nihit Mody , Could you please check your private message and respond with the details there? Thank you.
Vinayak Gupta 0 Reputation points

2025-03-27T10:28:36.9233333+00:00
I also faced a similar problem. In my case, the OpenAI instance was being deployed with a regional endpoint that has the suffix cognitive.microsoft.com, but the vectorizer supports only OpenAI instances with the suffix openai.azure.com.

There are two ways to get this suffix:

Via Bicep: Add the customSubDomainName property to the Cognitive Services resource in Bicep and set its value to the name of the OpenAI instance. Note that you will have to create a new instance, as the custom subdomain name cannot be changed for an existing OpenAI instance.

Via Portal: I first tried without Bicep, and when I created the OpenAI instance in the portal, it came with the openai.azure.com suffix by default.

Once you have an OpenAI instance deployed with this suffix, you should be able to see it in the list.

Hope this helps!
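To check an existing resource against the suffix rule described above, a small script can inspect the endpoint host. This is a minimal sketch: the suffix requirement is taken from this answer rather than from official documentation, and the function names, example endpoints, API version, and SKU are assumptions for illustration. The second helper builds an ARM-style resource body (the JSON equivalent of the Bicep change) with customSubDomainName set to the account name.

```python
from urllib.parse import urlparse

# Suffix reported in this thread as the one the wizard lists (assumption from the answer above).
SUPPORTED_SUFFIX = ".openai.azure.com"


def has_custom_subdomain(endpoint: str) -> bool:
    """Return True if the endpoint's host ends with the openai.azure.com suffix."""
    host = urlparse(endpoint).hostname or ""
    return host.lower().endswith(SUPPORTED_SUFFIX)


def openai_account_resource(name: str, location: str) -> dict:
    """Build an ARM-style resource body that sets customSubDomainName to the account name."""
    return {
        "type": "Microsoft.CognitiveServices/accounts",
        "apiVersion": "2023-05-01",  # assumed API version; use the one your templates target
        "name": name,
        "location": location,
        "kind": "OpenAI",
        "sku": {"name": "S0"},  # assumed SKU
        "properties": {"customSubDomainName": name},
    }


if __name__ == "__main__":
    # Hypothetical endpoints: a custom-subdomain endpoint and a regional one.
    print(has_custom_subdomain("https://my-openai.openai.azure.com/"))
    print(has_custom_subdomain("https://eastus.api.cognitive.microsoft.com/"))
```

Pass the full endpoint URL including the scheme. Without `https://`, `urlparse` finds no host and the check returns False.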

Answer accepted by question author

Answer 1

Hello @Nihit Mody ,

You need an Azure OpenAI resource and a supported model.

Check the supported providers and models here.

Since you already have the models and embedding endpoints within AI services, you can use either the SDK or the REST API.

Below is a sample that creates an indexer over JSON array data via the Python SDK:

from azure.core.credentials import AzureKeyCredential
from azure.search.documents.indexes import SearchIndexClient, SearchIndexerClient
from azure.search.documents.indexes.models import (
    SearchIndexer,
    IndexingParameters,
    IndexingParametersConfiguration,
    SearchIndexerDataSourceConnection,
    SearchIndexerDataContainer,
    SearchIndexerSkillset,
    FieldMapping,
    InputFieldMappingEntry,
    OutputFieldMappingEntry,
    AzureOpenAIEmbeddingSkill,
)
 
# Azure Search Service Configuration
service_name = "ai_search_name"
index_name = "index_name"
api_key = "your-azure-search-api-key"
endpoint = f"https://{service_name}.search.windows.net/"
 
# Azure Blob Storage Configuration
blob_connection_string = "your-blob-connection-string"
container_name = "aicsvdata"
 
# Azure OpenAI Configuration
azure_openai_service = "your-openai-service-name"
azure_openai_api_key = "your-openai-api-key"
azure_openai_embedding_deployment = "text-embedding-ada-002"
 
def create_indexer_with_skillset(index_client, indexer_client):
    """Creates an indexer with a skillset to vectorize the 'plot' field."""
    data_source_name = "blob-datasource"
    indexer_name = "blob-indexer-with-vector"
    skillset_name = "plot-vectorization-skillset"
 
    # 1. Data Source
    container = SearchIndexerDataContainer(name=container_name)
    data_source = SearchIndexerDataSourceConnection(
        name=data_source_name,
        connection_string=blob_connection_string,
        container=container,
        type="azureblob",
    )
 
    # 2. Skillset - Embedding Generation
    embedding_skill = AzureOpenAIEmbeddingSkill(
        name="plot-embedding",
        description="Generates vector embeddings for plot field",
        context="/document",
        resource_url=f"https://{azure_openai_service}.openai.azure.com/",
        api_key=azure_openai_api_key,
        deployment_name=azure_openai_embedding_deployment,
        model_name="text-embedding-ada-002",
        inputs=[InputFieldMappingEntry(name="text", source="/document/plot")],
        outputs=[OutputFieldMappingEntry(name="embedding", target_name="PlotVector")],
    )
 
    skillset = SearchIndexerSkillset(
        name=skillset_name,
        description="Skillset to vectorize plot",
        skills=[embedding_skill],
    )
 
    # 3. Field Mappings
    field_mappings = [
        FieldMapping(source_field_name="/document/PlotVector", target_field_name="PlotVector"),
    ]

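    # Indexing parameters. query_timeout is defined on the configuration object, not on
    # IndexingParameters; it is cleared here because it applies to SQL-style data sources.
    indexing_parameters = IndexingParameters(
        configuration=IndexingParametersConfiguration(
            parsing_mode="jsonArray",  # Options: 'default', 'delimitedText' (CSV), 'json', 'jsonArray', 'jsonLines', 'text', 'markdown'
            data_to_extract="contentAndMetadata",  # Options: 'allMetadata', 'contentAndMetadata', 'storageMetadata'
            query_timeout=None,
        )
    )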
 
    # 4. Indexer
    indexer = SearchIndexer(
        name=indexer_name,
        data_source_name=data_source_name,
        target_index_name=index_name,
        skillset_name=skillset_name,
        parameters=indexing_parameters,
        output_field_mappings=field_mappings,
    )
 
    # Create skillset and data source
    indexer_client.create_or_update_skillset(skillset)
    indexer_client.create_or_update_data_source_connection(data_source)
 
    # Create and run the indexer
    result = indexer_client.create_or_update_indexer(indexer)
    print(f"Indexer '{result.name}' created.")
    indexer_client.run_indexer(indexer_name)
 
def main():
    try:
        credential = AzureKeyCredential(api_key)
        index_client = SearchIndexClient(endpoint, credential)
        indexer_client = SearchIndexerClient(endpoint, credential)
 
        create_indexer_with_skillset(index_client, indexer_client)
 
    except Exception as ex:
        print(f"An error occurred: {ex}")
 
if __name__ == "__main__":
    main()
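The sample above writes to an existing index and assumes it already has a PlotVector vector field. As a minimal sketch of what such an index could look like, the snippet below builds an index definition as a plain REST-style JSON payload. The `id` key field and the profile and algorithm names are hypothetical; `plot` and `PlotVector` come from the sample, and 1536 is the output size of text-embedding-ada-002.

```python
import json

ADA_002_DIMENSIONS = 1536  # embedding size produced by text-embedding-ada-002


def build_index_payload(index_name: str) -> dict:
    """Build an index definition with a text field and a matching vector field."""
    return {
        "name": index_name,
        "fields": [
            # Hypothetical key field; blob indexers often map a storage path here instead.
            {"name": "id", "type": "Edm.String", "key": True, "filterable": True},
            {"name": "plot", "type": "Edm.String", "searchable": True},
            {
                "name": "PlotVector",
                "type": "Collection(Edm.Single)",
                "searchable": True,
                "dimensions": ADA_002_DIMENSIONS,
                "vectorSearchProfile": "plot-vector-profile",
            },
        ],
        "vectorSearch": {
            "algorithms": [{"name": "plot-hnsw", "kind": "hnsw"}],
            "profiles": [{"name": "plot-vector-profile", "algorithm": "plot-hnsw"}],
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_index_payload("index_name"), indent=2))
```

The vector field's dimensions must match the embedding model's output size, and the profile named on the field must exist under vectorSearch. Otherwise index creation or indexing is likely to fail.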

Please do accept the solution and give feedback by clicking on yes.

Thank you
