How to use PDF documents to create a custom model and deploy it in Azure OpenAI Studio?

Katriya Bhavik Babubhai 10 Reputation points
2023-08-28T09:13:16.67+00:00

Hi,

I have a set of PDF documents that I need to use for fine-tuning a base model (curie) in Azure OpenAI Studio, but it currently accepts only .jsonl files.

How do I bridge the gap between .pdf and .jsonl?

Thanks,

Bhavik

Azure OpenAI Service
Azure AI services

1 answer

  1. Ramr-msft 17,826 Reputation points
    2023-08-30T05:14:15.3266667+00:00

    @Katriya Bhavik Babubhai Yes. Instead of fine-tuning, you can use Azure OpenAI Service on your data, which runs OpenAI models directly on your data and eliminates the need for training or fine-tuning.
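
    If fine-tuning is still the goal, the .jsonl file mentioned in the question is plain text with one JSON object per line. The sketch below assumes the text has already been extracted from the PDFs and turned into prompt/completion pairs by hand; the pairs, the function name `to_jsonl`, and the leading space in each completion are illustrative assumptions, not something the thread specifies.

```python
import json

def to_jsonl(pairs):
    """Serialize (prompt, completion) pairs as JSON Lines: one JSON object per line."""
    lines = []
    for prompt, completion in pairs:
        record = {"prompt": prompt, "completion": completion}
        # ensure_ascii=False keeps non-ASCII characters from the PDFs readable.
        lines.append(json.dumps(record, ensure_ascii=False))
    return "".join(line + "\n" for line in lines)

# Hypothetical pairs written from text that was already extracted from a PDF.
example_pairs = [
    ("What is the notice period?", " 30 days."),
    ("Who approves leave requests?", " The line manager."),
]
jsonl_text = to_jsonl(example_pairs)
```

    Writing `jsonl_text` to a file with a .jsonl extension would give one training example per line. Extracting text from the PDFs and deciding what the prompts and completions should be is the larger part of the work and is not covered here.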

    • Retrieval Augmented Generation or “RAG” is one of the most popular architectural patterns for building data-infused LLM applications. Azure OpenAI Service on your data automates many of the components of this architecture (ingestion, chunking, deployment), allowing customers to rapidly build use cases involving enterprise search or knowledge retrieval.
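
    To make the pattern concrete, here is a toy sketch of the chunk, retrieve, and prompt steps. It is not how Azure OpenAI on your data is implemented: word overlap stands in for a real search index, and every name in it (`chunk_text`, `retrieve`, `build_prompt`, the sample document) is hypothetical.

```python
import re

def chunk_text(text, max_words=50):
    """Split text into chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, chunks, top_k=2):
    """Rank chunks by how many tokens they share with the question."""
    q_tokens = tokenize(question)
    ranked = sorted(chunks, key=lambda c: len(q_tokens & tokenize(c)), reverse=True)
    return ranked[:top_k]

def build_prompt(question, context_chunks):
    """Combine the retrieved chunks and the question into one prompt string."""
    context = "\n---\n".join(context_chunks)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )

# Hypothetical text standing in for content extracted from a PDF.
document = (
    "Employees accrue 20 days of annual leave per year. "
    "Expense reports are due by the fifth day of each month. "
    "Remote work requires approval from the line manager."
)
chunks = chunk_text(document, max_words=10)
question = "How many days of annual leave do employees accrue?"
top_chunks = retrieve(question, chunks, top_k=1)
prompt = build_prompt(question, top_chunks)
```

    In a real deployment the resulting prompt would be sent to a chat model such as gpt-35-turbo, and retrieval would be handled by a search service rather than word overlap.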

    How do I get started?

    To begin utilizing Azure OpenAI Service on your data, you need to have an approved Azure OpenAI Service application and an Azure OpenAI Service resource with either the gpt-35-turbo or the gpt-4 models deployed. Once you meet the prerequisites, follow these steps:

    • Connect your data source: Use Azure AI Studio to connect your data source, which can be an Azure Cognitive Search index, a Blob storage container, or files uploaded from your local machine.
    • Ask questions and chat on your data: Once your data source is connected, you can start asking questions and conversing with the OpenAI models through Azure AI Studio. This enables you to gain valuable insights and make informed business decisions.

    Sample code is also available for a simple web chat experience targeting ChatGPT through Azure OpenAI (AOAI).

    Azure OpenAI on your data supports the following file types:

    • .txt
    • .md
    • .html
    • Microsoft Word files
    • Microsoft PowerPoint files
    • PDF
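
    Before uploading a folder of documents, it can help to check which files fall under the list above. A minimal sketch follows; the `.docx` and `.pptx` extensions are assumptions, since the list names Word and PowerPoint files without giving extensions, and `split_by_support` is a hypothetical helper.

```python
from pathlib import Path

# Assumption: ".docx" and ".pptx" stand in for "Microsoft Word files" and
# "Microsoft PowerPoint files"; the list above does not give extensions.
SUPPORTED_EXTENSIONS = {".txt", ".md", ".html", ".docx", ".pptx", ".pdf"}

def split_by_support(filenames):
    """Return (supported, unsupported) lists based on each file's extension."""
    supported, unsupported = [], []
    for name in filenames:
        # Compare case-insensitively so "policy.PDF" counts as a PDF.
        if Path(name).suffix.lower() in SUPPORTED_EXTENSIONS:
            supported.append(name)
        else:
            unsupported.append(name)
    return supported, unsupported

ok, skipped = split_by_support(["policy.PDF", "notes.md", "train.jsonl", "deck.pptx"])
```

    Here `ok` holds the files that match the list and `skipped` holds the rest, so PDFs can be uploaded as they are without converting them to .jsonl.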