I would like to know if there is a way to train my own pdf invoices using OpenAI and use that model for further predictions?
To answer your query, As per my understanding the models will be able to provide you the information available in the document and I'm not sure if it will be able to provide the predictions.
There is an upload limit, and there are some caveats about document structure and how it might affect the quality of responses from the model:
- The model provides the best citation titles from markdown (
.md
) files. - If a document is a PDF file, the text contents are extracted as a preprocessing step (unless you're connecting your own Azure Cognitive Search index). If your document contains images, graphs, or other visual content, the model's response quality depends on the quality of the text that can be extracted from them.
- If you're converting data from an unsupported format into a supported format, make sure the conversion:
- Doesn't lead to significant data loss.
- Doesn't add unexpected noise to your data.
- Doesn't lead to significant data loss.
Please see Azure OpenAI on your data for more details.
Hope this helps.,