Microsoft.Extensions.DataIngestion.Chunkers Namespace

Classes

Name Description
DocumentTokenChunker

Processes a document by tokenizing its content and dividing it into overlapping chunks of tokens.

SectionChunker

Treats each IngestionDocumentSection in an IngestionDocument's Sections as a separate entity.

SemanticSimilarityChunker

Splits an IngestionDocument into chunks based on the semantic similarity between its elements, measured by the cosine distance of their embeddings.
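
The two algorithmic strategies described above (overlapping token windows and splitting on embedding distance) can be illustrated with a small language-neutral sketch. This is not the library's implementation or API: the function names `chunk_tokens` and `split_by_similarity`, the parameters `max_tokens`, `overlap`, and `threshold`, and the use of a fixed distance threshold are all assumptions made for illustration only.

```python
import math


def chunk_tokens(tokens, max_tokens, overlap):
    """Split a token list into windows of at most max_tokens tokens.

    Each window repeats the last `overlap` tokens of the previous one.
    (Hypothetical helper; parameter names are assumptions.)
    """
    if max_tokens <= 0 or not 0 <= overlap < max_tokens:
        raise ValueError("need max_tokens > 0 and 0 <= overlap < max_tokens")
    step = max_tokens - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + max_tokens])
        # Stop once a window has reached the end of the token list.
        if start + max_tokens >= len(tokens):
            break
    return chunks


def cosine_distance(a, b):
    """Return 1 - cosine similarity; assumes both vectors are non-zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def split_by_similarity(elements, embeddings, threshold):
    """Group consecutive elements, starting a new chunk whenever the cosine
    distance between neighbouring embeddings exceeds `threshold`.

    (Hypothetical helper; a fixed threshold is an assumption of this sketch.)
    """
    chunks = []
    for i, element in enumerate(elements):
        if i == 0 or cosine_distance(embeddings[i - 1], embeddings[i]) > threshold:
            chunks.append([element])
        else:
            chunks[-1].append(element)
    return chunks


if __name__ == "__main__":
    print(chunk_tokens(list(range(10)), max_tokens=4, overlap=1))
    print(split_by_similarity(["a", "b", "c"], [[1, 0], [1, 0], [0, 1]], 0.5))
```

In this sketch the overlap gives neighbouring chunks shared context at their boundaries, while the similarity split keeps adjacent elements together until their embeddings diverge. The actual classes may choose boundaries differently; consult each class's reference page for its real constructor parameters and behavior.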