Fine Tuning - Json File

Question

Fine Tuning - Json File

Irina Sopas 80

Hello. I am trying to Fine Tuning my OpenAi Model.

I would like to know how many json files I can add. I need to put all the information in one json file or I can have it divided by themes in various Json files.

Best regards,

IS

Saideep Anchuri 9,425 Reputation points Microsoft External Staff Moderator

2025-04-13T09:45:31.4633333+00:00

Hi Irina Sopa

You can use multiple JSON files for fine-tuning your OpenAI model. It is not necessary to combine all your information into a single JSON file. You can divide your data by themes across various JSON files. Ensure that each file is formatted correctly as a JSON Lines (JSONL) document and adheres to the required specifications for training and validation data.

Kindly refer below link: training-and-validation-data

Thank You.
Irina Sopas 80 Reputation points

2025-04-14T09:02:55.3366667+00:00
Hello. I ran a test and used exactly the same code in the website you gave me, it gave me the errors, said json wasn't the extension.

it says json files and the extension is jsonl, where I create jsonl?

I really don't know what to do and I need to fine tune my IA. I have another question, I tell my bot to writer me a text, but it keeps me sending a short one. I tell him X characters, words, lines, it does not get me... what azure openia service understands? I am a writer I need specific size text's.

Best regards, IS
Deleted

This comment has been deleted due to a violation of our Code of Conduct. The comment was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.
Saideep Anchuri 9,425 Reputation points Microsoft External Staff Moderator

2025-04-14T17:13:45.96+00:00

Hi Irina Sopa

Just checking in to see if the below answer provided by @Suhas M helped.

Thank You.

Accepted answer

1 additional answer

Your answer

Saideep Anchuri 9,425 Reputation points Microsoft External Staff Moderator

2025-04-13T09:45:31.4633333+00:00

Hi Irina Sopa

You can use multiple JSON files for fine-tuning your OpenAI model. It is not necessary to combine all your information into a single JSON file. You can divide your data by themes across various JSON files. Ensure that each file is formatted correctly as a JSON Lines (JSONL) document and adheres to the required specifications for training and validation data.

Kindly refer below link: training-and-validation-data

Thank You.
Irina Sopas 80 Reputation points

2025-04-14T09:02:55.3366667+00:00

Hello. I ran a test and used exactly the same code in the website you gave me, it gave me the errors, said json wasn't the extension.

it says json files and the extension is jsonl, where I create jsonl?

I really don't know what to do and I need to fine tune my IA. I have another question, I tell my bot to writer me a text, but it keeps me sending a short one. I tell him X characters, words, lines, it does not get me... what azure openia service understands? I am a writer I need specific size text's.

Best regards, IS
Deleted

This comment has been deleted due to a violation of our Code of Conduct. The comment was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.
Saideep Anchuri 9,425 Reputation points Microsoft External Staff Moderator

2025-04-14T17:13:45.96+00:00

Hi Irina Sopa

Just checking in to see if the below answer provided by @Suhas M helped.

Thank You.

Answer 1

Hello IS! Great to hear you’re fine-tuning your model with Azure OpenAI Service.

You can split your data into multiple JSONL files.

Each file must follow the required format, and during the fine-tuning upload and training process, Azure OpenAI supports multiple files as long as they are properly formatted.

Format Reminder:

Each file must be in .jsonl (JSON Lines) format, meaning:

{"prompt": "input text", "completion": "desired response"}

Multiple Files – Best Practice:

You can and often should organize your training data by theme into different .jsonl files (e.g., customer_service.jsonl, technical_docs.jsonl, sales_pitch.jsonl). This helps you:

Maintain your data more easily

Debug or update specific topics later

Ensure better control over how different data segments influence the model

Then, when creating the fine-tuned model, you can upload and use them together.

Notes:

You can combine files during upload or before fine-tuning depending on the method you're using (e.g., CLI, API).

Azure has some size and token limits (e.g., each file max 100 MB and total dataset should remain within token limits).

Be mindful of balance and duplication across datasets to avoid model bias.

Would you like help with how to format or combine your files, or a command example for uploading in Azure?Hello IS! Great to hear you’re fine-tuning your model with Azure OpenAI Service.

You can split your data into multiple JSONL files.

Each file must follow the required format, and during the fine-tuning upload and training process, Azure OpenAI supports multiple files as long as they are properly formatted.

Format Reminder:

Each file must be in .jsonl (JSON Lines) format, meaning:

{"prompt": "input text", "completion": "desired response"}

🗂 Multiple Files – Best Practice:

You can and often should organize your training data by theme into different .jsonl files (e.g., customer_service.jsonl, technical_docs.jsonl, sales_pitch.jsonl). This helps you:

Maintain your data more easily

Debug or update specific topics later

Ensure better control over how different data segments influence the model

Then, when creating the fine-tuned model, you can upload and use them together.

Notes:

You can combine files during upload or before fine-tuning depending on the method you're using (e.g., CLI, API).
Azure has some size and token limits (e.g., each file max 100 MB and total dataset should remain within token limits).
Be mindful of balance and duplication across datasets to avoid model bias.

Irina Sopas 80 Reputation points

2025-04-14T18:23:59.77+00:00

Hello, thanks for the formating help, but in which program/software, I create my jsonl files?

And can you give some explicit examples of:

{"prompt": "input text", "completion": "desired response"}

Best regards, IS

Answer 2

Irina Sopas 80

Hello. Thanks for the answer. Can you tell me which software I can use to create a jsonl file? And a specific sample for

{"prompt": "input text", "completion": "desired response"}

Share via

Fine Tuning - Json File

1 additional answer

Your answer