Semantic Kernel Vector Store code samples (Preview)

Article
11/11/2024

Warning

The Semantic Kernel Vector Store functionality is in preview, and improvements that require breaking changes may still occur in limited circumstances before release.

End to end RAG sample with Vector Stores

This example is a standalone console application that demonstrates RAG using Semantic Kernel. The sample has the following characteristics:

Allows a choice of chat and embedding services
Allows a choice of vector databases
Reads the contents of one or more PDF files and creates a chunk for each section
Generates embeddings for each text chunk and upserts it to the chosen vector database
Registers the Vector Store as a Text Search plugin with the kernel
Invokes the plugin to augment the prompt provided to the AI model with more context

End to end RAG demo
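The steps listed above can be sketched in plain Python. This is a minimal illustration, not the sample's code, and it does not use any Semantic Kernel API. It rests on several assumptions: `embed` is a toy word-hashing function standing in for a real embedding service, `InMemoryStore` is a hypothetical stand-in for a vector database, and the document text is hard coded instead of being read from a PDF.

```python
import math
import re
from dataclasses import dataclass, field

DIM = 64  # size of the toy embedding vectors (arbitrary choice)


def embed(text: str) -> list[float]:
    """Toy embedding: hash each word into a bucket, then L2-normalize.
    A real application would call an embedding service here."""
    vec = [0.0] * DIM
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        vec[sum(word.encode()) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


@dataclass
class InMemoryStore:
    """Hypothetical stand-in for a vector database collection."""
    records: dict[str, tuple[str, list[float]]] = field(default_factory=dict)

    def upsert(self, key: str, text: str) -> None:
        # Generate the embedding and store it next to the text chunk.
        self.records[key] = (text, embed(text))

    def search(self, query: str, top: int = 2) -> list[str]:
        # Rank stored chunks by dot product with the query embedding.
        q = embed(query)
        scored = sorted(
            self.records.values(),
            key=lambda rec: sum(a * b for a, b in zip(q, rec[1])),
            reverse=True,
        )
        return [text for text, _ in scored[:top]]


def chunk_by_section(document: str) -> list[str]:
    """Split on blank lines, producing one chunk per section."""
    return [part.strip() for part in document.split("\n\n") if part.strip()]


def build_prompt(store: InMemoryStore, question: str) -> str:
    """Augment the user's question with the best matching chunks."""
    context = "\n".join(f"- {c}" for c in store.search(question))
    return f"Context:\n{context}\n\nQuestion: {question}"


document = (
    "Qdrant supports Guid and ulong keys.\n\n"
    "Redis supports string keys.\n\n"
    "Paging uses Top and Skip."
)
store = InMemoryStore()
for i, chunk in enumerate(chunk_by_section(document)):
    store.upsert(f"chunk-{i}", chunk)

prompt = build_prompt(store, "Which keys does Redis support?")
```

The resulting `prompt` string is what would be sent to the chat model. In the actual sample, the search step is exposed to the kernel as a Text Search plugin instead of being called directly.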

Simple Data Ingestion and Vector Search

For two very simple examples of how to do data ingestion into a vector store and do vector search, check out these two examples, which use Qdrant and InMemory vector stores to demonstrate their usage.

Common code with multiple stores

Vector stores may differ in certain aspects, e.g. with regard to the types of their keys or the types of fields each supports. Even so, it is possible to write code that is agnostic to these differences.

For a data ingestion sample that demonstrates this, see:

MultiStore Data Ingestion
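One way to keep ingestion code agnostic to the key type is to pass in a key factory, so that only the caller knows which kind of key the store needs. The following is a minimal sketch of that idea under stated assumptions: `Record`, `DictCollection`, and `ingest` are hypothetical names invented for this illustration and are not part of Semantic Kernel.

```python
import uuid
from dataclasses import dataclass
from typing import Callable, Generic, TypeVar

TKey = TypeVar("TKey")


@dataclass
class Record(Generic[TKey]):
    key: TKey
    text: str


class DictCollection(Generic[TKey]):
    """Hypothetical in-memory collection; a real connector would talk to a database."""

    def __init__(self) -> None:
        self._items: dict[TKey, Record[TKey]] = {}

    def upsert(self, record: Record[TKey]) -> TKey:
        self._items[record.key] = record
        return record.key

    def get(self, key: TKey) -> Record[TKey]:
        return self._items[key]


def ingest(
    collection: DictCollection[TKey],
    texts: list[str],
    key_factory: Callable[[], TKey],
) -> list[TKey]:
    """Store-agnostic ingestion: only the key factory knows the key type."""
    return [collection.upsert(Record(key_factory(), t)) for t in texts]


texts = ["first paragraph", "second paragraph"]

# A store that needs UUID keys.
uuid_keys = ingest(DictCollection[uuid.UUID](), texts, uuid.uuid4)

# A store that needs string keys.
string_keys = ingest(DictCollection[str](), texts, lambda: str(uuid.uuid4()))
```

The `ingest` function is identical for both stores; only the collection and the key factory passed to it change.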

For a vector search sample demonstrating the same concept, see the following samples. Each of these samples references the same common code; they differ only in the type of vector store they create to use with that common code.

Supporting multiple vectors in the same record

The Vector Store abstractions support multiple vectors in the same record, for vector databases that support this. The following sample shows how to create some records with multiple vectors, and pick the desired target vector when doing a vector search.

Choosing a vector for search on a record with multiple vectors
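To illustrate the idea of picking a target vector, here is a minimal sketch in plain Python. It is an assumption-laden toy, not the linked sample: the `Hotel` model, its two vector fields, and the `search` helper are hypothetical, and the vectors are hand-written two-dimensional values instead of real embeddings.

```python
import math
from dataclasses import dataclass


@dataclass
class Hotel:
    """Hypothetical record with two vectors."""
    name: str
    description_vector: list[float]
    reviews_vector: list[float]


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(records, query_vector, vector_property, top=1):
    """Rank records by similarity on the chosen vector property."""
    ranked = sorted(
        records,
        key=lambda r: cosine(getattr(r, vector_property), query_vector),
        reverse=True,
    )
    return ranked[:top]


hotels = [
    Hotel("Seaside Inn", [1.0, 0.0], [0.0, 1.0]),
    Hotel("City Lodge", [0.0, 1.0], [1.0, 0.0]),
]
query = [1.0, 0.1]

# The same query returns a different record depending on the target vector.
by_description = search(hotels, query, "description_vector")[0].name  # "Seaside Inn"
by_reviews = search(hotels, query, "reviews_vector")[0].name          # "City Lodge"
```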

Vector search with paging

When doing vector search with the Vector Store abstractions, it's possible to use the Top and Skip parameters to support paging, e.g. when you need to build a service that responds with a small set of results per request.

Vector search with paging

Warning

Not all vector databases support Skip functionality natively for vector searches, so some connectors may have to fetch Skip + Top records and skip on the client side to simulate this behavior.
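The client-side simulation described above can be sketched as follows. This is an illustration only, with hypothetical function names: `vector_search` stands in for a database that supports only a result limit, and `paged_search` fetches Skip + Top results and discards the first Skip of them.

```python
def vector_search(scored_items, top):
    """Stand-in for a database that only supports a 'top N' limit.
    `scored_items` is a list of (item, score) pairs."""
    return sorted(scored_items, key=lambda pair: pair[1], reverse=True)[:top]


def paged_search(scored_items, top, skip=0):
    """Simulate Skip on the client: fetch skip + top results, drop the first skip."""
    fetched = vector_search(scored_items, top=skip + top)
    return fetched[skip:]


data = [(f"doc-{i}", 1.0 - i / 10) for i in range(10)]  # doc-0 scores highest

page_size = 3
pages = [
    [item for item, _ in paged_search(data, top=page_size, skip=n * page_size)]
    for n in range(4)
]
# pages[0] is ["doc-0", "doc-1", "doc-2"]; the last page holds only "doc-9".
```

Note that with this approach every page re-fetches all earlier results, so the cost of a request grows with the Skip value.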

Using the generic data model vs using a custom data model

It's possible to use the Vector Store abstractions without defining a data model, by defining your schema via a record definition instead. This example shows how you can create a vector store using a custom model and read using the generic data model, or vice versa.

Generic data model interop
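The relationship between a custom data model and a schema-driven generic record can be sketched in plain Python. Everything here is an assumption made for illustration: the shape of `RECORD_DEFINITION`, the `Article` model, and the `to_generic` and `from_generic` helpers are hypothetical and do not mirror the Semantic Kernel types.

```python
from dataclasses import asdict, dataclass

# Hypothetical record definition: the schema expressed as data instead of a class.
RECORD_DEFINITION = {
    "key": "id",
    "data": ["title"],
    "vectors": ["embedding"],
}


@dataclass
class Article:
    """Custom data model with the same schema as RECORD_DEFINITION."""
    id: str
    title: str
    embedding: list[float]


def to_generic(record, definition):
    """Convert a custom model instance to a schema-driven generic record."""
    fields = asdict(record)
    return {
        "key": fields[definition["key"]],
        "data": {name: fields[name] for name in definition["data"]},
        "vectors": {name: fields[name] for name in definition["vectors"]},
    }


def from_generic(generic, definition, model):
    """Rebuild a custom model instance from a generic record."""
    values = {
        definition["key"]: generic["key"],
        **generic["data"],
        **generic["vectors"],
    }
    return model(**values)


storage = {}  # stand-in for a collection

# Write with the custom model, then read back as a generic record.
article = Article("a1", "Vector stores", [0.1, 0.2])
storage[article.id] = to_generic(article, RECORD_DEFINITION)
generic = storage["a1"]

# The reverse direction: rebuild the custom model from the generic record.
round_tripped = from_generic(generic, RECORD_DEFINITION, Article)
```

Because both representations are driven by the same schema, data written through one can be read through the other.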

Tip

For more information about using the generic data model, refer to using Vector Store abstractions without defining your own data model.

Using collections that were created and ingested using Langchain

It's possible to use the Vector Store abstractions to access collections that were created and ingested using a different system, e.g. Langchain. There are various approaches that can be followed to make the interop work correctly, for example:

Creating a data model that matches the storage schema that the Langchain implementation used.
Using a custom mapper to map between the storage schema and data model.
Using a record definition with special storage property names for fields.

In the following sample, we show how to use these approaches to construct Langchain compatible Vector Store implementations.

VectorStore Langchain Interop
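As a rough illustration of the custom mapper approach, the sketch below converts between an application data model and a storage schema. It is not taken from the linked sample. The storage field names (`id`, `page_content`, `metadata`) are assumptions chosen for illustration, so check the actual schema of your collection, and the `Document` model and both mapping functions are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Document:
    """Data model the application wants to work with."""
    key: str
    content: str
    source: str


def to_storage(doc: Document) -> dict:
    """Map the data model to the (assumed) storage schema."""
    return {
        "id": doc.key,
        "page_content": doc.content,
        "metadata": {"source": doc.source},
    }


def from_storage(stored: dict) -> Document:
    """Map a stored record back to the data model."""
    return Document(
        key=stored["id"],
        content=stored["page_content"],
        source=stored.get("metadata", {}).get("source", ""),
    )


stored = {"id": "42", "page_content": "Hello", "metadata": {"source": "a.pdf"}}
doc = from_storage(stored)
```

Keeping both directions in one place means the rest of the application never needs to know the storage field names.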

For each vector store, there is a factory class that shows how to construct the Langchain compatible Vector Store. See e.g.

In this sample, we also demonstrate a technique for having a single unified data model across different Vector Stores, where each Vector Store supports different key types and may require different storage schemas.

We use a decorator class MappingVectorStoreRecordCollection that allows converting data models and key types. E.g. Qdrant only supports Guid and ulong key types, and Langchain uses the Guid key type when creating a collection. Azure AI Search, Pinecone and Redis all support string keys. In the sample, we use the MappingVectorStoreRecordCollection to expose the Qdrant Vector Store with a string key containing a guid instead of the key being a Guid type. This allows us to easily use all databases with one data model. Note that supplying string keys that do not contain guids to the decorated Qdrant Vector Store will not work, since the underlying database still requires Guid keys.
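The key-converting decorator idea can be sketched in a few lines of Python. This is not the implementation of MappingVectorStoreRecordCollection; `GuidKeyedCollection` and `StringKeyDecorator` are hypothetical classes that only illustrate the pattern, with `uuid.UUID` standing in for the Guid type.

```python
import uuid


class GuidKeyedCollection:
    """Stand-in for a collection that only accepts uuid.UUID keys."""

    def __init__(self):
        self._items = {}

    def upsert(self, key, value):
        if not isinstance(key, uuid.UUID):
            raise TypeError("key must be a UUID")
        self._items[key] = value

    def get(self, key):
        if not isinstance(key, uuid.UUID):
            raise TypeError("key must be a UUID")
        return self._items.get(key)


class StringKeyDecorator:
    """Exposes a UUID-keyed collection through string keys.
    The strings must still contain valid UUIDs."""

    def __init__(self, inner):
        self._inner = inner

    def upsert(self, key: str, value):
        # uuid.UUID raises ValueError if the string is not a valid UUID.
        self._inner.upsert(uuid.UUID(key), value)

    def get(self, key: str):
        return self._inner.get(uuid.UUID(key))


collection = StringKeyDecorator(GuidKeyedCollection())
key = str(uuid.uuid4())
collection.upsert(key, "some record")

# A string that does not contain a UUID is rejected during key conversion.
try:
    collection.upsert("not-a-guid", "another record")
    rejected = False
except ValueError:
    rejected = True
```

The decorator changes only the key type seen by the caller. The underlying store's restriction still applies, which is why arbitrary strings fail.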

End to end RAG sample with Azure AI Search Vector Store

This example is a set of two scripts, the first showing the basics of setting up the Azure AI Search Vector Store and the second showing how to create a plugin from it and use that to perform RAG.

Simple Data Ingestion and Vector Search

We also have a sample that shows the basics, from creating the collection, to adding records, to finally doing a search. It can be run with different vector stores.

Simple Vector Search

Simple Data Ingestion and Vector Search

For simple examples of how to do data ingestion into a vector store and do vector search, check out these examples, which make use of Azure AI Search, JDBC with PostgreSQL, Redis and In Memory vector stores.
