Hello @James,
I understand that your fine-tuning job has been stuck in "Training" status for over 24 hours. There could be several reasons for this. Here are some potential causes and steps you can take:
Review the job status in the Fine-tuning section of the Azure AI Studio portal. Fine-tuning jobs are sometimes queued due to high demand or limited resources. Try refreshing the portal to check for any recent updates.
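If you prefer to check the status from code rather than the portal, here is a minimal sketch of a polling loop. It assumes you are using the `openai` Python package and pass in an already configured client. The helper name `wait_for_job`, the polling interval, and the set of terminal statuses are my own assumptions for illustration, so please adjust them to what your resource actually reports.

```python
import time

# Assumption: statuses after which the job will no longer change.
TERMINAL_STATUSES = {"succeeded", "failed", "cancelled"}


def wait_for_job(client, job_id, poll_seconds=60, max_polls=10, sleep=time.sleep):
    """Poll a fine-tuning job and return the last status that was observed.

    `client` is expected to expose `fine_tuning.jobs.retrieve(job_id)`,
    as the `openai` package's client objects do.
    """
    status = None
    for attempt in range(max_polls):
        job = client.fine_tuning.jobs.retrieve(job_id)
        status = job.status
        print(f"poll {attempt + 1}: status={status}")
        if status in TERMINAL_STATUSES:
            break
        # Wait before the next poll, but not after the final one.
        if attempt < max_polls - 1:
            sleep(poll_seconds)
    return status
```

To use it, you would create the client with `AzureOpenAI(azure_endpoint=..., api_key=..., api_version=...)` from the `openai` package, using your own resource values, and call `wait_for_job(client, "<your job ID>")`. If the status never leaves "running", `client.fine_tuning.jobs.list_events(fine_tuning_job_id=...)` may show whether any training events were recorded at all.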
Although you mentioned that the dataset is small, it's important to ensure that it meets the minimum requirements for fine-tuning. If the dataset is too small or not well-structured, it might lead to unexpected behavior.
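To rule out dataset problems before resubmitting, a quick local check can help. The sketch below assumes your training file is chat-format JSONL (one JSON object per line with a `messages` list) without tool calls, and it assumes a minimum of 10 examples. Please verify both against the requirements for your specific model. The function name `validate_chat_jsonl` is hypothetical.

```python
import json

# Assumption: minimum number of training examples; confirm for your model.
MIN_EXAMPLES = 10


def validate_chat_jsonl(lines, min_examples=MIN_EXAMPLES):
    """Return a list of problems found in chat-format JSONL training lines."""
    problems = []
    count = 0
    for lineno, raw in enumerate(lines, start=1):
        raw = raw.strip()
        if not raw:
            continue  # ignore blank lines
        try:
            record = json.loads(raw)
        except json.JSONDecodeError as exc:
            problems.append(f"line {lineno}: invalid JSON ({exc.msg})")
            continue
        count += 1
        messages = record.get("messages") if isinstance(record, dict) else None
        if not isinstance(messages, list) or not messages:
            problems.append(f"line {lineno}: missing or empty 'messages' list")
            continue
        # Every message needs a role and content in this simplified check.
        if not all(
            isinstance(m, dict) and "role" in m and "content" in m
            for m in messages
        ):
            problems.append(f"line {lineno}: a message lacks 'role' or 'content'")
            continue
        # The model learns from assistant turns, so each example needs one.
        if not any(m["role"] == "assistant" for m in messages):
            problems.append(f"line {lineno}: no 'assistant' message")
    if count < min_examples:
        problems.append(f"only {count} valid JSON examples; expected at least {min_examples}")
    return problems
```

You can run it on your file with `validate_chat_jsonl(open("training.jsonl", encoding="utf-8"))`, where `training.jsonl` stands in for your own file name. An empty list means none of these particular checks failed; it does not guarantee the service will accept the file.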
If the job continues to hang with no progress, consider canceling and resubmitting it. Since you've already canceled a previous long-running job, resubmitting with validated data and parameters might help rule out edge-case issues.
Azure’s compute resources for fine-tuning are shared across tenants and may not be immediately available. Check the Azure Status Page to rule out regional delays or service incidents that might be impacting availability.
You can also refer to "Check the status of your custom model" and "Troubleshooting for Azure OpenAI fine-tuning".
I hope this helps. Do let me know if you have any further queries.
Thank you!