Azure.AI.Inference Namespace
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Classes
AIInferenceModelFactory |
Model factory for models. |
AzureAIInferenceClientOptions |
Client options for Azure.AI.Inference library clients. |
ChatChoice |
The representation of a single prompt completion as part of an overall chat completions request.
Generally, |
ChatCompletions |
Representation of the response data from a chat completions request. Completions support a wide variety of tasks and generate text that continues from or "completes" provided prompt data. |
ChatCompletionsClient |
The ChatCompletions service client. |
ChatCompletionsNamedToolChoice |
A tool selection of a specific, named function tool that will limit chat completions to using the named function. |
ChatCompletionsNamedToolChoiceFunction |
A tool selection of a specific, named function tool that will limit chat completions to using the named function. |
ChatCompletionsOptions |
The configuration information for a chat completions request. Completions support a wide variety of tasks and generate text that continues from or "completes" provided prompt data. |
ChatCompletionsResponseFormat |
Represents the format that the model must output. Use this to enable JSON mode instead of the default text mode. Note that to enable JSON mode, some AI models may also require you to instruct the model to produce JSON via a system or user message. Please note ChatCompletionsResponseFormat is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatCompletionsResponseFormatJSON and ChatCompletionsResponseFormatText. |
ChatCompletionsResponseFormatJSON |
A response format for Chat Completions that restricts responses to emitting valid JSON objects. Note that to enable JSON mode, some AI models may also require you to instruct the model to produce JSON via a system or user message. |
ChatCompletionsResponseFormatText |
A response format for Chat Completions that emits text responses. This is the default response format. |
ChatCompletionsToolCall |
A function tool call requested by the AI model. |
ChatCompletionsToolChoice |
Represents an optional control that specifies which, if any, tools may be called by the model while processing a chat completions request. |
ChatCompletionsToolDefinition |
The definition of a chat completions tool that can call a function. |
ChatMessageContentItem |
An abstract representation of a structured content item within a chat message. Please note ChatMessageContentItem is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatMessageImageContentItem and ChatMessageTextContentItem. |
ChatMessageImageContentItem |
A structured chat content item containing an image reference. |
ChatMessageTextContentItem |
A structured chat content item containing plain text. |
ChatRequestAssistantMessage |
A request chat message representing response or action from the assistant. |
ChatRequestMessage |
An abstract representation of a chat message as provided in a request. Please note ChatRequestMessage is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatRequestAssistantMessage, ChatRequestSystemMessage, ChatRequestToolMessage and ChatRequestUserMessage. |
ChatRequestSystemMessage |
A request chat message containing system instructions that influence how the model will generate a chat completions response. |
ChatRequestToolMessage |
A request chat message representing requested output from a configured tool. |
ChatRequestUserMessage |
A request chat message representing user input to the assistant. |
ChatResponseMessage |
A representation of a chat message as received in a response. |
CompletionsUsage |
Representation of the token counts processed for a completions request. Counts consider all tokens across prompts, choices, choice alternates, best_of generations, and other consumers. |
EmbeddingItem |
Representation of a single embeddings relatedness comparison. |
EmbeddingsClient |
The Embeddings service client. |
EmbeddingsOptions |
The configuration information for an embeddings request. |
EmbeddingsResult |
Representation of the response data from an embeddings request. Embeddings measure the relatedness of text strings and are commonly used for search, clustering, recommendations, and other similar scenarios. |
EmbeddingsUsage |
Measurement of the amount of tokens used in this request and response. |
FunctionCall |
The name and arguments of a function that should be called, as generated by the model. |
FunctionDefinition |
The definition of a caller-specified function that chat completions may invoke in response to matching user input. |
ModelInfo |
Represents some basic information about the AI model. |
StreamingChatChoiceUpdate |
Represents an update to a single prompt completion when the service is streaming updates
using Server Sent Events (SSE).
Generally, |
StreamingChatCompletionsUpdate |
Represents an incremental update to a streamed Chat Completions response. |
StreamingChatResponseMessageUpdate |
A representation of a chat message update as received in a streaming response. |
StreamingChatResponseToolCallUpdate |
An update to the function tool call information requested by the AI model. |
StreamingFunctionToolCallUpdate |
Represents an incremental update to a streaming function tool call that is part of a streaming chat completions choice. |
StreamingResponse<T> |
Represents an operation response with streaming content that can be deserialized and enumerated while the response is still being received. |
StreamingToolCallUpdate |
Represents an incremental update to a streaming tool call that is part of a streaming chat completions choice. |
Structs
ChatCompletionsNamedToolChoiceType |
The ChatCompletionsNamedToolChoice_type. |
ChatCompletionsToolCallType |
The ChatCompletionsToolCall_type. |
ChatCompletionsToolChoicePreset |
Represents a generic policy for how a chat completions tool may be selected. |
ChatCompletionsToolDefinitionType |
The ChatCompletionsToolDefinition_type. |
ChatMessageImageDetailLevel |
A representation of the possible image detail levels for image-based chat completions message content. |
ChatRole |
A description of the intended purpose of a message within a chat completions interaction. |
CompletionsFinishReason |
Representation of the manner in which a completions response concluded. |
EmbeddingEncodingFormat |
The format of the embeddings result. Returns a 422 error if the model doesn't support the value or parameter. |
EmbeddingInputType |
Represents the input types used for embedding search. |
ExtraParameters |
Controls what happens if extra parameters, undefined by the REST API, are passed in the JSON request payload. |
ModelType |
The type of AI model. |
Enums
AzureAIInferenceClientOptions.ServiceVersion |
The version of the service to use. |