エージェントミドルウェア

Agent Framework のミドルウェアは、実行のさまざまな段階でエージェントの対話をインターセプト、変更、および強化する強力な方法を提供します。ミドルウェアを使用すると、コアエージェントや関数ロジックを変更することなく、ログ記録、セキュリティ検証、エラー処理、結果変換などの横断的な問題を実装できます。

Agent Framework は、次の 3 種類のミドルウェアを使用してカスタマイズできます。

エージェント実行ミドルウェア: 必要に応じて入力と出力を検査したり変更したりできるように、すべてのエージェント実行のインターセプトを許可します。
関数呼び出しミドルウェア: エージェントによって実行されるすべての関数呼び出しのインターセプトを許可します。これにより、入力と出力を必要に応じて検査および変更できます。
IChatClient ミドルウェア: IChatClient 実装への呼び出しのインターセプトを許可します。この場合、エージェントは推論呼び出しに IChatClient を使用します (たとえば、 ChatClientAgentを使用する場合)。

すべての種類のミドルウェアは関数コールバックを介して実装され、同じ型の複数のミドルウェアインスタンスが登録されるとチェーンを形成し、各ミドルウェアインスタンスは、提供された nextFuncを介してチェーン内の次のインスタンスを呼び出す必要があります。

エージェントの実行と関数呼び出しのミドルウェアの種類は、エージェントビルダーと既存のエージェントオブジェクトを使用して、エージェントに登録できます。

var middlewareEnabledAgent = originalAgent
    .AsBuilder()
        .Use(runFunc: CustomAgentRunMiddleware, runStreamingFunc: CustomAgentRunStreamingMiddleware)
        .Use(CustomFunctionCallingMiddleware)
    .Build();

Important

理想的には、 runFunc と runStreamingFunc の両方を提供する必要があります。非ストリーミングミドルウェアのみを提供する場合、エージェントは、ストリーミングと非ストリーミングの両方の呼び出しに使用します。ストリーミングは、ミドルウェアの期待に応えるために、非ストリーミングモードでのみ実行されます。

注

Use(sharedFunc: ...)追加のオーバーロードがあり、ストリーミングをブロックすることなく、非ストリーミングとストリーミングに同じミドルウェアを提供できます。ただし、共有ミドルウェアは出力をインターセプトまたはオーバーライドできません。このオーバーロードは、エージェントに到達する前に入力を検査または変更するだけで済むシナリオに使用する必要があります。

IChatClientミドルウェアは、チャットクライアントビルダーパターンを使用して、IChatClientで使用する前に、ChatClientAgentに登録できます。

var chatClient = new AzureOpenAIClient(new Uri("https://<myresource>.openai.azure.com"), new AzureCliCredential())
    .GetChatClient(deploymentName)
    .AsIChatClient();

var middlewareEnabledChatClient = chatClient
    .AsBuilder()
        .Use(getResponseFunc: CustomChatClientMiddleware, getStreamingResponseFunc: null)
    .Build();

var agent = new ChatClientAgent(middlewareEnabledChatClient, instructions: "You are a helpful assistant.");

IChatClient SDK クライアントのヘルパーメソッドの 1 つを使用してエージェントを構築するときに、ファクトリメソッドを使用してミドルウェアを登録することもできます。

var agent = new AzureOpenAIClient(new Uri(endpoint), new AzureCliCredential())
    .GetChatClient(deploymentName)
    .CreateAIAgent("You are a helpful assistant.", clientFactory: (chatClient) => chatClient
        .AsBuilder()
            .Use(getResponseFunc: CustomChatClientMiddleware, getStreamingResponseFunc: null)
        .Build());

エージェント実行ミドルウェア

エージェント実行のミドルウェアの例を次に示します。このミドルウェアは、エージェント実行からの入力と出力を検査または変更できます。

async Task<AgentRunResponse> CustomAgentRunMiddleware(
    IEnumerable<ChatMessage> messages,
    AgentThread? thread,
    AgentRunOptions? options,
    AIAgent innerAgent,
    CancellationToken cancellationToken)
{
    Console.WriteLine(messages.Count());
    var response = await innerAgent.RunAsync(messages, thread, options, cancellationToken).ConfigureAwait(false);
    Console.WriteLine(response.Messages.Count);
    return response;
}

エージェント実行ストリーミングミドルウェア

エージェントのストリーミング実行からの入力と出力を検査または変更できるエージェント実行ストリーミングミドルウェアの例を次に示します。

async IAsyncEnumerable<AgentRunResponseUpdate> CustomAgentRunStreamingMiddleware(
    IEnumerable<ChatMessage> messages,
    AgentThread? thread,
    AgentRunOptions? options,
    AIAgent innerAgent,
    [EnumeratorCancellation] CancellationToken cancellationToken)
{
    Console.WriteLine(messages.Count());
    List<AgentRunResponseUpdate> updates = [];
    await foreach (var update in innerAgent.RunStreamingAsync(messages, thread, options, cancellationToken))
    {
        updates.Add(update);
        yield return update;
    }

    Console.WriteLine(updates.ToAgentRunResponse().Messages.Count);
}

関数呼び出しミドルウェア

注

現在、関数呼び出しミドルウェアは、AIAgentなどのFunctionInvokingChatClientを使用するChatClientAgentでのみサポートされています。

呼び出される関数を検査したり変更したりできる関数呼び出しミドルウェアの例と、関数呼び出しの結果を次に示します。

async ValueTask<object?> CustomFunctionCallingMiddleware(
    AIAgent agent,
    FunctionInvocationContext context,
    Func<FunctionInvocationContext, CancellationToken, ValueTask<object?>> next,
    CancellationToken cancellationToken)
{
    Console.WriteLine($"Function Name: {context!.Function.Name}");
    var result = await next(context, cancellationToken);
    Console.WriteLine($"Function Call Result: {result}");

    return result;
}

指定された FunctionInvocationContext.Terminate を true に設定することで、関数呼び出しミドルウェアを使用して関数呼び出しループを終了できます。これにより、関数呼び出しループは、関数呼び出し後に関数呼び出し結果を含む推論サービスに要求を発行できなくなります。このイテレーション中に呼び出しに使用できる関数が複数ある場合は、残りの関数が実行されない可能性もあります。

Warnung

関数呼び出しループを終了すると、たとえば、関数の結果コンテンツのない関数呼び出しコンテンツが含まれるなど、スレッドが不整合な状態のままになる可能性があります。これにより、スレッドがそれ以降の実行で使用できなくなる可能性があります。

IChatClient ミドルウェア

チャットクライアントが提供する推論サービスへの要求の入力と出力を検査または変更できるチャットクライアントミドルウェアの例を次に示します。

async Task<ChatResponse> CustomChatClientMiddleware(
    IEnumerable<ChatMessage> messages,
    ChatOptions? options,
    IChatClient innerChatClient,
    CancellationToken cancellationToken)
{
    Console.WriteLine(messages.Count());
    var response = await innerChatClient.GetResponseAsync(messages, options, cancellationToken);
    Console.WriteLine(response.Messages.Count);

    return response;
}

注

IChatClient ミドルウェアの詳細については、「Custom IChatClient ミドルウェア」を参照してください。

Function-Based ミドルウェア

関数ベースのミドルウェアは、非同期関数を使用してミドルウェアを実装する最も簡単な方法です。このアプローチはステートレス操作に最適であり、一般的なミドルウェアシナリオに適した軽量ソリューションを提供します。

エージェントミドルウェア

エージェントミドルウェアは、エージェントの実行をインターセプトして変更します。次を含む AgentRunContext を使用します。

agent: 呼び出されるエージェント
messages: 会話内のチャットメッセージの一覧
is_streaming: 応答がストリーミングされているかどうかを示すブール値
metadata: ミドルウェア間で追加のデータを格納するためのディクショナリ
result: エージェントの応答 (変更可能)
terminate: それ以降の処理を停止するフラグ
kwargs: エージェント実行メソッドに渡される追加のキーワード引数

呼び出し可能な next は、ミドルウェアチェーンを続行するか、最後のミドルウェアである場合はエージェントを実行します。

呼び出し可能なロジックの前後 next 単純なログの例を次に示します。

async def logging_agent_middleware(
    context: AgentRunContext,
    next: Callable[[AgentRunContext], Awaitable[None]],
) -> None:
    """Agent middleware that logs execution timing."""
    # Pre-processing: Log before agent execution
    print("[Agent] Starting execution")

    # Continue to next middleware or agent execution
    await next(context)

    # Post-processing: Log after agent execution
    print("[Agent] Execution completed")

関数ミドルウェア

関数ミドルウェアは、エージェント内の関数呼び出しをインターセプトします。次を含む FunctionInvocationContext を使用します。

function: 呼び出される関数
arguments: 関数の検証済み引数
metadata: ミドルウェア間で追加のデータを格納するためのディクショナリ
result: 関数の戻り値 (変更可能)
terminate: それ以降の処理を停止するフラグ
kwargs: この関数を呼び出したチャットメソッドに渡される追加のキーワード引数

呼び出し可能な next は、次のミドルウェアに進むか、実際の関数を実行します。

呼び出し可能なロジックの前後 next 単純なログの例を次に示します。

async def logging_function_middleware(
    context: FunctionInvocationContext,
    next: Callable[[FunctionInvocationContext], Awaitable[None]],
) -> None:
    """Function middleware that logs function execution."""
    # Pre-processing: Log before function execution
    print(f"[Function] Calling {context.function.name}")

    # Continue to next middleware or function execution
    await next(context)

    # Post-processing: Log after function execution
    print(f"[Function] {context.function.name} completed")

チャットミドルウェア

チャットミドルウェアは、AI モデルに送信されたチャット要求をインターセプトします。次を含む ChatContext を使用します。

chat_client: 呼び出されるチャットクライアント
messages: AI サービスに送信されるメッセージの一覧
chat_options: チャット要求のオプション
is_streaming: これがストリーミング呼び出しであるかどうかを示すブール値
metadata: ミドルウェア間で追加のデータを格納するためのディクショナリ
result: AI からのチャット応答 (変更可能)
terminate: それ以降の処理を停止するフラグ
kwargs: チャットクライアントに渡される追加のキーワード引数

呼び出し可能な next は、次のミドルウェアに続くか、AI サービスに要求を送信します。

呼び出し可能なロジックの前後 next 単純なログの例を次に示します。

async def logging_chat_middleware(
    context: ChatContext,
    next: Callable[[ChatContext], Awaitable[None]],
) -> None:
    """Chat middleware that logs AI interactions."""
    # Pre-processing: Log before AI call
    print(f"[Chat] Sending {len(context.messages)} messages to AI")

    # Continue to next middleware or AI service
    await next(context)

    # Post-processing: Log after AI response
    print("[Chat] AI response received")

関数ミドルウェアデコレーター

デコレーターは、型注釈を必要とせずに、明示的なミドルウェア型宣言を提供します。次の場合に役立ちます。

型注釈を使用しない
明示的なミドルウェア型宣言が必要です
型の不一致を防ぐ

from agent_framework import agent_middleware, function_middleware, chat_middleware

@agent_middleware  # Explicitly marks as agent middleware
async def simple_agent_middleware(context, next):
    """Agent middleware with decorator - types are inferred."""
    print("Before agent execution")
    await next(context)
    print("After agent execution")

@function_middleware  # Explicitly marks as function middleware
async def simple_function_middleware(context, next):
    """Function middleware with decorator - types are inferred."""
    print(f"Calling function: {context.function.name}")
    await next(context)
    print("Function call completed")

@chat_middleware  # Explicitly marks as chat middleware
async def simple_chat_middleware(context, next):
    """Chat middleware with decorator - types are inferred."""
    print(f"Processing {len(context.messages)} chat messages")
    await next(context)
    print("Chat processing completed")

Class-Based ミドルウェア

クラスベースのミドルウェアは、ステートフルな操作や、オブジェクト指向の設計パターンからメリットを得る複雑なロジックに役立ちます。

エージェントミドルウェアクラス

クラスベースのエージェントミドルウェアは、関数ベースのミドルウェアと同じシグネチャと動作を持つ process メソッドを使用します。 process メソッドは、同じcontextパラメーターとnext パラメーターを受け取り、まったく同じ方法で呼び出されます。

from agent_framework import AgentMiddleware, AgentRunContext

class LoggingAgentMiddleware(AgentMiddleware):
    """Agent middleware that logs execution."""

    async def process(
        self,
        context: AgentRunContext,
        next: Callable[[AgentRunContext], Awaitable[None]],
    ) -> None:
        # Pre-processing: Log before agent execution
        print("[Agent Class] Starting execution")

        # Continue to next middleware or agent execution
        await next(context)

        # Post-processing: Log after agent execution
        print("[Agent Class] Execution completed")

関数ミドルウェアクラス

クラスベースの関数ミドルウェアでは、関数ベースのミドルウェアと同じシグネチャと動作を持つ process メソッドも使用されます。メソッドは、同じ context および next パラメーターを受け取ります。

from agent_framework import FunctionMiddleware, FunctionInvocationContext

class LoggingFunctionMiddleware(FunctionMiddleware):
    """Function middleware that logs function execution."""

    async def process(
        self,
        context: FunctionInvocationContext,
        next: Callable[[FunctionInvocationContext], Awaitable[None]],
    ) -> None:
        # Pre-processing: Log before function execution
        print(f"[Function Class] Calling {context.function.name}")

        # Continue to next middleware or function execution
        await next(context)

        # Post-processing: Log after function execution
        print(f"[Function Class] {context.function.name} completed")

チャットミドルウェアクラス

クラスベースのチャットミドルウェアは、関数ベースのチャットミドルウェアと同じシグネチャと動作を持つ process メソッドと同じパターンに従います。

from agent_framework import ChatMiddleware, ChatContext

class LoggingChatMiddleware(ChatMiddleware):
    """Chat middleware that logs AI interactions."""

    async def process(
        self,
        context: ChatContext,
        next: Callable[[ChatContext], Awaitable[None]],
    ) -> None:
        # Pre-processing: Log before AI call
        print(f"[Chat Class] Sending {len(context.messages)} messages to AI")

        # Continue to next middleware or AI service
        await next(context)

        # Post-processing: Log after AI response
        print("[Chat Class] AI response received")

ミドルウェアの登録

ミドルウェアは、スコープと動作が異なる 2 つのレベルで登録できます。

Agent-Level ミドルウェアと Run-Level ミドルウェア

from agent_framework.azure import AzureAIAgentClient
from azure.identity.aio import AzureCliCredential

# Agent-level middleware: Applied to ALL runs of the agent
async with AzureAIAgentClient(async_credential=credential).create_agent(
    name="WeatherAgent",
    instructions="You are a helpful weather assistant.",
    tools=get_weather,
    middleware=[
        SecurityAgentMiddleware(),  # Applies to all runs
        TimingFunctionMiddleware(),  # Applies to all runs
    ],
) as agent:

    # This run uses agent-level middleware only
    result1 = await agent.run("What's the weather in Seattle?")

    # This run uses agent-level + run-level middleware
    result2 = await agent.run(
        "What's the weather in Portland?",
        middleware=[  # Run-level middleware (this run only)
            logging_chat_middleware,
        ]
    )

    # This run uses agent-level middleware only (no run-level)
    result3 = await agent.run("What's the weather in Vancouver?")

主な違い:

エージェントレベル: エージェントの作成時に 1 回構成されたすべての実行で永続的
実行レベル: 特定の実行にのみ適用され、要求ごとのカスタマイズが可能
実行順序: エージェントミドルウェア (最も外側) → 実行ミドルウェア (最も内側) → エージェントの実行

ミドルウェアの終了

ミドルウェアは、 context.terminateを使用して早期に実行を終了できます。これは、セキュリティチェック、レート制限、または検証エラーに役立ちます。

async def blocking_middleware(
    context: AgentRunContext,
    next: Callable[[AgentRunContext], Awaitable[None]],
) -> None:
    """Middleware that blocks execution based on conditions."""
    # Check for blocked content
    last_message = context.messages[-1] if context.messages else None
    if last_message and last_message.text:
        if "blocked" in last_message.text.lower():
            print("Request blocked by middleware")
            context.terminate = True
            return

    # If no issues, continue normally
    await next(context)

終了とは次のことを意味します。

処理 context.terminate = True 停止するシグナルを設定する
終了する前にカスタム結果を提供して、ユーザーにフィードバックを提供できます
ミドルウェアの終了時にエージェントの実行が完全にスキップされる

ミドルウェアの結果のオーバーライド

ミドルウェアは、非ストリーミングシナリオとストリーミングシナリオの両方で結果をオーバーライドできるため、エージェントの応答を変更または完全に置き換えることができます。

context.resultの結果の種類は、エージェントの呼び出しがストリーミングか非ストリーミングかによって異なります。

非ストリーミング: context.result には、完全な応答を含む AgentRunResponse が含まれています
ストリーミング: context.result には、 AgentRunResponseUpdate チャンクを生成する非同期ジェネレーターが含まれています

context.is_streamingを使用して、これらのシナリオを区別し、結果のオーバーライドを適切に処理できます。

async def weather_override_middleware(
    context: AgentRunContext,
    next: Callable[[AgentRunContext], Awaitable[None]]
) -> None:
    """Middleware that overrides weather results for both streaming and non-streaming."""

    # Execute the original agent logic
    await next(context)

    # Override results if present
    if context.result is not None:
        custom_message_parts = [
            "Weather Override: ",
            "Perfect weather everywhere today! ",
            "22°C with gentle breezes. ",
            "Great day for outdoor activities!"
        ]

        if context.is_streaming:
            # Streaming override
            async def override_stream() -> AsyncIterable[AgentRunResponseUpdate]:
                for chunk in custom_message_parts:
                    yield AgentRunResponseUpdate(contents=[TextContent(text=chunk)])

            context.result = override_stream()
        else:
            # Non-streaming override
            custom_message = "".join(custom_message_parts)
            context.result = AgentRunResponse(
                messages=[ChatMessage(role=Role.ASSISTANT, text=custom_message)]
            )

このミドルウェアアプローチを使用すると、高度な応答変換、コンテンツフィルター処理、結果の強化、ストリーミングのカスタマイズを実装しながら、エージェントロジックをクリーンで集中させ続けます。

次のステップ

エージェントのバックグラウンド応答

フィードバック

このページはお役に立ちましたか?

Last updated on 2025-12-08

次の方法で共有

エージェント ミドルウェア

エージェント実行ミドルウェア

エージェント実行ストリーミング ミドルウェア

関数呼び出しミドルウェア

IChatClient ミドルウェア

Function-Based ミドルウェア

エージェント ミドルウェア

関数ミドルウェア

チャット ミドルウェア

関数ミドルウェア デコレーター

Class-Based ミドルウェア

エージェント ミドルウェア クラス

関数ミドルウェア クラス

チャット ミドルウェア クラス

ミドルウェアの登録

Agent-Level ミドルウェアと Run-Level ミドルウェア

ミドルウェアの終了

ミドルウェアの結果のオーバーライド

次のステップ

フィードバック

その他のリソース

エージェントミドルウェア

エージェント実行ストリーミングミドルウェア

エージェントミドルウェア

チャットミドルウェア

関数ミドルウェアデコレーター

エージェントミドルウェアクラス

関数ミドルウェアクラス

チャットミドルウェアクラス