How to best create a dataset for FAQs

ryota.matsuda 140 Reputation points
2023-10-06T02:10:29.5766667+00:00

We are considering creating an index of FAQs.

The dataset will pair each question with exactly one answer, but we are still researching whether this is the best approach.

My concern is that, in this case, each query would return only a single matching answer.

Would it be better to avoid a one-to-one format so that a single question can also surface relevant background and other similar answers?

Note that a vector search is used.

Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

Accepted answer
  1. ajkuma 22,766 Reputation points Microsoft Employee
    2023-10-10T03:08:37.9366667+00:00

    @ryota.matsuda , Apologies for the delayed response.

    Based on your requirement, you can return multiple results with citations, so the user can pick a second or third option when the top result is not what they are looking for.

    I would also suggest enabling semantic ranking, which adds an extra ranking layer for semantically related content and should give better results.
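
To illustrate why a one-to-one FAQ dataset does not limit a query to one hit, here is a minimal sketch of top-k vector retrieval in plain Python. It does not call Azure AI Search. The FAQ entries, the three-dimensional vectors, and the helper names `cosine` and `top_k` are all hypothetical, chosen only to show that the number of results depends on how many nearest neighbors the query asks for, not on how the dataset is shaped.

```python
import math

# Hypothetical FAQ entries: one answer per question. The toy 3-dimensional
# vectors stand in for embeddings that a real embedding model would produce.
FAQ = [
    {"id": "faq-1", "question": "How do I reset my password?",
     "answer": "Use the reset link on the sign-in page.", "vector": [0.9, 0.1, 0.0]},
    {"id": "faq-2", "question": "How do I unlock a locked account?",
     "answer": "Wait 15 minutes or contact support.", "vector": [0.8, 0.3, 0.1]},
    {"id": "faq-3", "question": "How do I update my billing details?",
     "answer": "Open Settings and choose Billing.", "vector": [0.1, 0.9, 0.2]},
    {"id": "faq-4", "question": "How do I export my data?",
     "answer": "Use the Export button on the dashboard.", "vector": [0.0, 0.2, 0.9]},
]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(query_vector, entries, k=3):
    """Return the k entries most similar to the query, best first.

    Each result keeps the entry id so the caller can show it as a citation.
    """
    scored = [(cosine(query_vector, e["vector"]), e) for e in entries]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [
        {"id": e["id"], "answer": e["answer"], "score": score}
        for score, e in scored[:k]
    ]


if __name__ == "__main__":
    # A query vector close to the "password" and "locked account" entries.
    for hit in top_k([1.0, 0.2, 0.0], FAQ, k=3):
        print(hit["id"], round(hit["score"], 3), hit["answer"])
```

With `k=3`, the query returns the closest FAQ entry plus the next two most similar ones, each with its `id` available as a citation. A search service would do the nearest-neighbor lookup for you, and a semantic ranker could then reorder those candidates.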
