Upload PDF and Word files to a vector database via API

Hessel Wellema 256 Reputation points
2024-03-15T16:53:38.9933333+00:00

I need a simple way to let a customer upload documents to a vector database. A chatbot will use this database to ground its prompts and answer questions based on reliable and actual data.
I see a lot of development in this area from Microsoft, but I am a bit lost about what is available and what is not.

What I need is to be able to do the same thing you can do in Azure OpenAI Studio (Bring Your Own Data):
1. Upload a PDF or some other supported document
2. Chunk it
3. Create embeddings
4. Upload it to a vector database
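
To make the chunking and embedding steps concrete, here is a minimal Python sketch. It is not what Azure OpenAI Studio does internally: the fixed-size character windows, the `chunk_text` and `build_documents` helpers, the `embed` callable, and the `id` / `content` / `contentVector` field names are all assumptions chosen for illustration. Text extraction from the PDF or Word file is assumed to have happened already.

```python
from typing import Callable, Dict, List


def chunk_text(text: str, size: int = 1000, overlap: int = 200) -> List[str]:
    """Split text into fixed-size character windows that overlap.

    Hypothetical helper: real pipelines often split on sentences or tokens.
    """
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    step = size - overlap
    chunks: List[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        # Stop once the current window already reaches the end of the text.
        if start + size >= len(text):
            break
    return chunks


def build_documents(
    doc_id: str,
    chunks: List[str],
    embed: Callable[[str], List[float]],
) -> List[Dict[str, object]]:
    """Pair each chunk with its embedding.

    `embed` is supplied by the caller (for example a wrapper around an
    embeddings endpoint); the field names are illustrative assumptions.
    """
    return [
        {"id": f"{doc_id}-{i}", "content": chunk, "contentVector": embed(chunk)}
        for i, chunk in enumerate(chunks)
    ]
```

The resulting list of dictionaries is the shape that would then be uploaded to the vector database in the last step.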

I know there is an API in preview that does steps 2, 3, and 4 (https://learn.microsoft.com/en-us/azure/search/vector-search-integrated-vectorization).
But is there also an API that does steps 1, 2, and 3? That would save me a lot of time.

Thank you

Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

1 answer

  1. VenkateshDodda-MSFT 24,951 Reputation points Microsoft Employee Moderator
    2024-03-18T11:17:56.23+00:00

    @Hessel Wellema Thanks for reaching out to Microsoft Q&A, and apologies for any inconvenience this has caused.

    Based on the information you shared, I understand that you want to create an index in the Azure AI Search service and upload a PDF document to it using the REST API.

    If my understanding is correct, you can go through this documentation on how to create the index using the REST API.
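
As a rough illustration of creating an index over REST, the Python sketch below only builds the HTTP request and leaves sending it to the caller. The linked documentation is not reproduced here, so the details are assumptions: the `2023-11-01` API version, the `default-hnsw` and `default-profile` names, the `id` / `content` / `contentVector` fields, and the helper functions themselves are all hypothetical choices.

```python
import json
import urllib.request

# Assumed API version; check which version your search service supports.
API_VERSION = "2023-11-01"


def index_definition(name: str, dimensions: int) -> dict:
    """Describe an index with a key, a text field, and one vector field."""
    return {
        "name": name,
        "fields": [
            {"name": "id", "type": "Edm.String", "key": True},
            {"name": "content", "type": "Edm.String", "searchable": True},
            {
                "name": "contentVector",
                "type": "Collection(Edm.Single)",
                "searchable": True,
                # Must match the output size of the embedding model in use.
                "dimensions": dimensions,
                "vectorSearchProfile": "default-profile",
            },
        ],
        "vectorSearch": {
            "algorithms": [{"name": "default-hnsw", "kind": "hnsw"}],
            "profiles": [{"name": "default-profile", "algorithm": "default-hnsw"}],
        },
    }


def create_index_request(
    service: str, api_key: str, name: str, dimensions: int
) -> urllib.request.Request:
    """Build (but do not send) the PUT request that creates the index."""
    url = (
        f"https://{service}.search.windows.net/indexes/{name}"
        f"?api-version={API_VERSION}"
    )
    body = json.dumps(index_definition(name, dimensions)).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json", "api-key": api_key},
    )
```

Passing the returned request to `urllib.request.urlopen` would send it. Note that an index created this way stores vectors you supply; it does not extract text from a PDF or chunk it for you.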

    I hope this helps. Let me know if you have any further questions.
