can't view custom speech model data

Victoria 11 Reputation points
2022-10-06T16:12:28.1+00:00

I am trying to create and test a custom speech model. I'm able to go through all of the steps to upload data, train the model, and test the model. However, I can't view the contents of files after I upload them (for example, a plain text file has a large gray box where it should display the text the file contains) or the transcripts produced by the model as part of a text (again, a gray box, though I can listen to the audio file it should have transcribed). Any attempt to download the file or test results to view the contents will fail. The upload of the file and the running of the test are listed as being successful, it's just my ability to view the results that isn't working.

I'm guessing this is coming from a permissions error somewhere. I am a Cognitive Services Speech Contributor for this resource, which should give me full access for reading data for custom speech models. Is the training data stored somewhere else, so I need to be a Cognitive Services Speech Contributor for that location as well? If so, where can I find the storage location? Or does the role Cognitive Services Speech Contributor not contain the permissions I need to read the data I upload? If so, what role would allow me to do this?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,948 questions
{count} votes

2 answers

Sort by: Most helpful
  1. Victoria 11 Reputation points
    2023-02-03T16:21:57.9866667+00:00

    I already had the Cognitive Services Contributor role.

    The solution ended up being to enable Resource Sharing (CORS) on the BYOS storage account connected to the project.

    It required an origin of https://speech.microsoft.com , GET and OPTIONS for allowed methods, * for allowed and exposed headers, and 1000 for the max age.

    1 person found this answer helpful.
    0 comments No comments

  2. Victoria 11 Reputation points
    2022-10-07T15:01:26.003+00:00

    I'm talking about two screens. When I say I can't view the contents of an uploaded file, I mean the Speech Datasets page, when I click to view more of a successfully uploaded plain text file:
    248525-image.png

    When I say I can't view the results of a test, I do mean the Test Models page, but specifically when I click on one test to view the custom-model-created transcripts for the audio files used in that test.
    248429-image.png

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.