Importing from data sources
A knowledge base consists of question and answer pairs brought in by public URLs and files.
Note
The QnA Maker service is being retired on the 31st of March, 2025. A newer version of the question and answering capability is now available as part of Azure AI Language. For question answering capabilities within the Language Service, see question answering. Starting 1st October, 2022 you won’t be able to create new QnA Maker resources. For information on migrating existing QnA Maker knowledge bases to question answering, consult the migration guide.
Data source locations
Content is brought into a knowledge base from a data source. Data source locations are public URLs or files, which do not require authentication.
SharePoint files, secured with authentication, are the exception. SharePoint resources must be files, not web pages.
QnA Maker supports public URLs ending with a .ASPX web extension which are not secured with authentication.
Chit-chat content
The chit-chat content set is offered as a complete content data source in several languages and conversational styles. This can be a starting point for your bot's personality, and it will save you the time and cost of writing them from scratch. Learn how to add chit-chat content to your knowledge base.
Structured data format through import
Importing a knowledge base replaces the content of the existing knowledge base. Import requires a structured .tsv
file that contains questions and answers. This information helps QnA Maker group the question-answer pairs and attribute them to a particular data source.
Question | Answer | Source | Metadata (1 key: 1 value) |
---|---|---|---|
Question1 | Answer1 | Url1 | Key1:Value1 | Key2:Value2 |
Question2 | Answer2 | Editorial | Key:Value |
Structured multi-turn format through import
You can create the multi-turn conversations in a .tsv
file format. The format provides you with the ability to create the multi-turn conversations by analyzing previous chat logs (with other processes, not using QnA Maker), then create the .tsv
file through automation. Import the file to replace the existing knowledge base.
The column for a multi-turn .tsv
, specific to multi-turn is Prompts. An example .tsv
, shown in Excel, show the information to include to define the multi-turn children:
[
{"displayOrder":0,"qnaId":2,"displayText":"Level 2 Question A"},
{"displayOrder":0,"qnaId":3,"displayText":"Level 2 - Question B"}
]
The displayOrder is numeric and the displayText is text that shouldn't include markdown.
Export as example
If you are unsure how to represent your QnA pair in the .tsv
file:
- Use this downloadable example from GitHub
- Or create the pair in the QnA Maker portal, save, then export the knowledge base for an example of how to represent the pair.
Unstructured data format
You can also create a knowledge base based on unstructured content imported via a file. Currently this functionality is available only via document upload for documents that are in any of the supported file formats.
Important
The support for unstructured content via file upload is available only in question answering.
Content types of documents you can add to a knowledge base
Content types include many standard structured documents such as PDF, DOC, and TXT.
File and URL data types
The table below summarizes the types of content and file formats that are supported by QnA Maker.
Source Type | Content Type | Examples |
---|---|---|
URL | FAQs (Flat, with sections or with a topics homepage) Support pages (Single page how-to articles, troubleshooting articles etc.) |
Plain FAQ, FAQ with links, FAQ with topics homepage Support article |
PDF / DOC | FAQs, Product Manual, Brochures, Paper, Flyer Policy, Support guide, Structured QnA, etc. |
Without Multi-turn Structured QnA.docx, Sample Product Manual.pdf, Sample semi-structured.docx, Sample white paper.pdf, Unstructured blog.pdf, Unstructured white paper.pdf Multi-turn: Surface Pro (docx) Contoso Benefits (docx) Contoso Benefits (pdf) |
*Excel | Structured QnA file (including RTF, HTML support) |
Without Multi-turn: Sample QnA FAQ.xls Multi-turn: Structured simple FAQ.xls Surface laptop FAQ.xls |
*TXT/TSV | Structured QnA file | Sample chit-chat.tsv |
If you need authentication for your data source, consider the following methods to get that content into QnA Maker:
- Download the file manually and import into QnA Maker
- Add the file from authenticated SharePoint location
URL content
Two types of documents can be imported via URL in QnA Maker:
- FAQ URLs
- Support URLs
Each type indicates an expected format.
File-based content
You can add files to a knowledge base from a public source, or your local file system, in the QnA Maker portal.
Content format guidelines
Learn more about the format guidelines for the different files.
Next steps
Learn how to edit QnAs.