Error with uploaded JSONL dataset: The specified file reference must point to a completed file import.

Teerasej Jiraphatchandej 25 Reputation points
2023-02-20T11:08:22.42+00:00

I found an error with creating a customized model process: The specified file reference must point to a completed file import.

Noted:

  • Both validation.jsonl and training.jsonl can be downloaded as a zip file here.
  • both JSONL file, validation.jsonl, and training.jsonl, uploaded via Azure Data Studio. But I discovered that using it via Azure Blob Storage gave the same error.
  • This error also happened with ada and babbage model

Step to reproduce:

  1. Open Azure OpenAI Studio > Management > Models
  2. Click Create Customized Model
    2023-02-20_17-30-06
  3. Select the base model type and provide the model's suffix
    2023-02-20_17-30-58
  4. In the training data section, click Local file tab
    2023-02-20_17-31-09
      1. Click browse a file, and I choose the training.jsonl file in my MacOS, then click Upload file. The windows showed the selected training file as green, and I clicked the Next button to proceed to the next step.
        2023-02-20_17-31-29
        1. I did the same process for validation.jsonl in the validation data section

2023-02-20_17-31-55

    1. I leave the advanced options as the default, Click next to proceed

2023-02-20_17-32-04

    1. The summary section showed, then I clicked save a close.

2023-02-20_17-32-11

  1. In 3 minutes, I got the error from Azure OpenAI Studio within 5 seconds

2023-02-20_17-32-59

and if I tried to upload both JSONL files from File management menu, it will show Failed for each:

2023-02-20_17-33-22

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,349 questions
{count} vote

Accepted answer
  1. romungi-MSFT 47,236 Reputation points Microsoft Employee
    2023-02-22T06:06:44.08+00:00

    teerasej Have you tried to use the OpenAI CLI data preperation tool to check if there is any reformatting required before uploading the file to the studio and trying again?

    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

    1 person found this answer helpful.
    0 comments No comments

1 additional answer

Sort by: Most helpful
  1. TOMOIAGA Ciprian 1 Reputation point
    2023-11-23T15:06:03.6066667+00:00

    I can no longer find a CLI data prep tool, but in my case, the problem was that the JSONL file did not have each instance on a row, rather the JSONs were formatted and indented. Once I removed the indenting, it was accepted by the tool

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.