"We Couldn't Connect Your Data"

The AI Consulting Firm 25 Reputation points
2024-01-14T16:04:12.12+00:00

I'm unable to get the Bring Your Own Data to preprocess a Blob container for a vector/text embedding search. The Blob container has successfully deployed several times for the non-vector BYOD search, but is has been unsuccessful in my attempts to deploy in two regions.

West US - Attempted to connect the Blob container. The preprocessing status bar shows 0 of 15 documents preprocessed before coming back with an error that states "We couldn't connect your data. One or more preprocessing jobs failed." What's strange here is that that status bar shows 0 of 15 documents preprocessed, but the preprocessing appears to be completed in the Blob as there is no a separate container with all of the documents chunked.

East US - After several failed attempts in West US. I noticed there is no pricing posted for the text-embedding model in West US (though a post from Microsoft this month states it is available). As a result, I moved to the East US region where pricing is posted. The preprocessing is still failing in the East. This time, the status bar shows progress in the processing, but it is consistently stopping at 7 out 15 documents and then providing an error message that there is a Gateway Timeout Upstream.

Preferably, we want to see this tool up in the West US region. Can you please help me troubleshoot? s

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,106 questions
{count} vote

1 answer

Sort by: Most helpful
  1. navba-MSFT 27,550 Reputation points Microsoft Employee Moderator
    2024-01-15T03:32:44.96+00:00

    @The AI Consulting Firm Thanks for using Microsoft Q&A Platform.

    Could you please confirm if you are getting an error like below ?User's image

    If Yes, we had provided a fix for the error We couldn't connect your data. One or more preprocessing jobs failed. last month ( Dec 2023). See this thread.
    .
    Are you using the Azure Blob Storage as the data source or Upload files option ?
    User's image

    Plan 1:
    Please check if you have provided Storage Blob Data Reader for data read access in Blob Storage to your Azure Search service.
    .

    Plan 2:
    Also, please refer to this documentation to check if you are missing anything like turning on CORS.
    User's image

    .User's image

    . Plan 3:
    Generally, the Azure OpenAI on your data supports the following filetypes: .txt, .md, .html, Microsoft Word files, Microsoft PowerPoint files, and PDF: https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/use-your-data#data-formats-and-file-types Ensure you are using right file types. .
    Plan 4:

    • Try to login again to the studio or use a different browser to check if the issue occurs while upload.
    • Check the network connectivity to ensure that there are no issues with the network or firewall that could be blocking the connection.
    • Try Add your data from a completely different network to avoid any firewall / proxy in your network.
    • Instead of uploading all 15 documents at once, Try uploading smaller documents ( one by one and check which document is failing) or breaking down the input documents into smaller documents and try again.

    .**
    On a side note:**
    If none of the above plans work, please please collect a network trace while reproducing the error using How to collect a network trace | Microsoft Learn to check if errors are getting logged in the logs and share these traces with me.
    .
    Also is it feasible to share those documents with me, so that I can test it at my end ?
    .
    Awaiting your reply.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.