Ollama

Ollama lets you run open-source models locally and use them with Agent Framework. This is ideal for development, testing, and scenarios where data must stay on your machine.

The following example shows how to create an agent using Ollama:

using System;
using Microsoft.Agents.AI;
using Microsoft.Extensions.AI;

// Create an Ollama agent using Microsoft.Extensions.AI.Ollama
// Requires: dotnet add package Microsoft.Extensions.AI.Ollama --prerelease
var chatClient = new OllamaChatClient(
    new Uri("http://localhost:11434"),
    modelId: "llama3.2");

AIAgent agent = chatClient.AsAIAgent(
    instructions: "You are a helpful assistant running locally via Ollama.");

Console.WriteLine(await agent.RunAsync("What is the largest city in France?"));

Prerequisites

Make sure Ollama is installed and running locally, with a model downloaded, before you run the examples:

ollama pull llama3.2
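
The prerequisite check can also be scripted. The following is a minimal sketch, not part of Agent Framework: it assumes Ollama's REST endpoint `/api/tags` (which lists locally downloaded models) is reachable at the default address, and the helper names `has_model` and `check_ollama` are hypothetical.

```python
import json
import urllib.error
import urllib.request

DEFAULT_HOST = "http://localhost:11434"  # Ollama's default local address


def has_model(tags_payload: dict, model: str) -> bool:
    """Return True if `model` appears in an /api/tags response payload."""
    names = [entry.get("name", "") for entry in tags_payload.get("models", [])]
    if ":" in model:
        # An explicit tag such as "qwen3:4b" must match exactly.
        return model in names
    # A bare name such as "llama3.2" matches any downloaded tag of that model.
    return any(name.split(":", 1)[0] == model for name in names)


def check_ollama(model: str, host: str = DEFAULT_HOST, timeout: float = 3.0) -> bool:
    """Return True if the server answers and the model is downloaded."""
    try:
        with urllib.request.urlopen(f"{host}/api/tags", timeout=timeout) as response:
            payload = json.load(response)
    except (urllib.error.URLError, OSError, ValueError):
        # Server not running, unreachable, or the response was not valid JSON.
        return False
    return has_model(payload, model)
```

Calling `check_ollama("llama3.2")` before creating an agent gives an early, readable failure instead of a connection error in the middle of a run.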

Note

Not all models support function calling. For tool use, try llama3.2 or qwen3:4b.

pip install agent-framework-ollama --pre

pip install agent-framework

Configuration

Native Ollama
OpenAI Compatible

OLLAMA_MODEL="llama3.2"

The native client connects to http://localhost:11434 by default. You can override this by passing host to the client.

OLLAMA_ENDPOINT="http://localhost:11434/v1/"
OLLAMA_MODEL="llama3.2"
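
The two configurations differ mainly in how the endpoint is written: the native client uses the host root, while the OpenAI-compatible client needs the `/v1/` path. The sketch below resolves both values from the environment with local fallbacks. The helpers `openai_compatible_url` and `resolve_settings` are hypothetical, and falling back to `llama3.2` when `OLLAMA_MODEL` is unset is an assumption based on the sample values above.

```python
import os
from typing import Mapping

DEFAULT_HOST = "http://localhost:11434"  # default address of a local Ollama server


def openai_compatible_url(host: str) -> str:
    """Append the /v1/ path used by Ollama's OpenAI-compatible endpoint."""
    return host.rstrip("/") + "/v1/"


def resolve_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Read OLLAMA_MODEL and OLLAMA_ENDPOINT, falling back to local defaults."""
    return {
        # Assumption: default to the model name used in this article.
        "model": env.get("OLLAMA_MODEL", "llama3.2"),
        "endpoint": env.get("OLLAMA_ENDPOINT", openai_compatible_url(DEFAULT_HOST)),
    }
```

Reading settings this way avoids the `KeyError` that `os.environ["OLLAMA_ENDPOINT"]` raises when the variable is not set.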

Create Ollama agents

Native Ollama
OpenAI Compatible

OllamaChatClient provides a native Ollama integration with full support for function tools and streaming.

import asyncio
from agent_framework.ollama import OllamaChatClient

async def main():
    agent = OllamaChatClient().as_agent(
        name="HelpfulAssistant",
        instructions="You are a helpful assistant running locally via Ollama.",
    )
    result = await agent.run("What is the largest city in France?")
    print(result)

asyncio.run(main())

You can also use OpenAIChatClient with a custom base URL that points to your Ollama instance.

import asyncio
import os
from agent_framework.openai import OpenAIChatClient

async def main():
    agent = OpenAIChatClient(
        api_key="ollama",  # Placeholder, Ollama doesn't require an API key
        base_url=os.environ["OLLAMA_ENDPOINT"],
        model=os.environ["OLLAMA_MODEL"],
    ).as_agent(
        name="HelpfulAssistant",
        instructions="You are a helpful assistant running locally via Ollama.",
    )
    result = await agent.run("What is the largest city in France?")
    print(result)

asyncio.run(main())

Tools

The Python Ollama clients (OllamaChatClient, and OpenAIChatClient pointed at Ollama's OpenAI-compatible endpoint) support tools that are invoked locally. Hosted tool types are not available, because Ollama is a local model runtime.

Tool	Status	Notes
Function tools	✅	Plain Python callables or `@ai_function`. Whether the selected model can actually call them depends on the model itself.
Tool approval	✅	Provided by the framework's function-invoking chat client layer; works with any function tool call.
Code interpreter	❌	No hosted code interpreter.
File search	❌	No hosted file search.
Web search	❌	No hosted web search.
Hosted MCP tools	❌	Ollama does not expose hosted MCP.
Local MCP tools	✅	Run in your process and work with any chat client.

Function tools

Native Ollama
OpenAI Compatible

import asyncio
from datetime import datetime
from agent_framework.ollama import OllamaChatClient

def get_time(location: str) -> str:
    """Get the current time."""
    return f"The current time in {location} is {datetime.now().strftime('%I:%M %p')}."

async def main():
    agent = OllamaChatClient().as_agent(
        name="TimeAgent",
        instructions="You are a helpful time agent.",
        tools=get_time,
    )
    result = await agent.run("What time is it in Seattle?")
    print(result)

asyncio.run(main())

import asyncio
import os
from datetime import datetime
from agent_framework.openai import OpenAIChatClient

def get_time(location: str) -> str:
    """Get the current time."""
    return f"The current time in {location} is {datetime.now().strftime('%I:%M %p')}."

async def main():
    agent = OpenAIChatClient(
        api_key="ollama",
        base_url=os.environ["OLLAMA_ENDPOINT"],
        model=os.environ["OLLAMA_MODEL"],
    ).as_agent(
        name="TimeAgent",
        instructions="You are a helpful time agent.",
        tools=get_time,
    )
    result = await agent.run("What time is it in Seattle?")
    print(result)

asyncio.run(main())
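
Both examples pass a plain function, so what the model sees about the tool comes from the function's name, docstring, and type hints. The sketch below illustrates that idea using only the standard library. It is a simplified illustration, not Agent Framework's actual schema generation, and `describe_tool` is a hypothetical helper.

```python
import inspect
from datetime import datetime

# Minimal mapping from Python annotations to JSON Schema type names.
_JSON_TYPES = {str: "string", int: "integer", float: "number", bool: "boolean"}


def describe_tool(func) -> dict:
    """Build a simplified tool description from a function's signature."""
    signature = inspect.signature(func)
    properties = {}
    required = []
    for name, param in signature.parameters.items():
        # Unannotated or unknown types fall back to "string" in this sketch.
        properties[name] = {"type": _JSON_TYPES.get(param.annotation, "string")}
        if param.default is inspect.Parameter.empty:
            required.append(name)
    return {
        "name": func.__name__,
        "description": inspect.getdoc(func) or "",
        "parameters": {"type": "object", "properties": properties, "required": required},
    }


def get_time(location: str) -> str:
    """Get the current time."""
    return f"The current time in {location} is {datetime.now().strftime('%I:%M %p')}."
```

Inspecting `describe_tool(get_time)` shows why a clear docstring and annotated parameters matter: they are the only description of the tool available to the model.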

Streaming

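import asyncio
from agent_framework.ollama import OllamaChatClient

async def streaming_example():
    agent = OllamaChatClient().as_agent(
        instructions="You are a helpful assistant.",
    )
    print("Agent: ", end="", flush=True)
    # Print each text chunk as soon as the model produces it
    async for chunk in agent.run("Tell me about Python.", stream=True):
        if chunk.text:
            print(chunk.text, end="", flush=True)
    print()

asyncio.run(streaming_example())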
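
When streaming, it is often useful to print chunks as they arrive and still keep the complete reply. The sketch below shows that pattern with a stand-in async generator so it runs without Ollama. `Chunk`, `fake_stream`, and `collect_stream` are hypothetical names, and the only assumption about real streamed updates is the `text` attribute used in the example above.

```python
import asyncio
from dataclasses import dataclass
from typing import AsyncIterator, Optional


@dataclass
class Chunk:
    """Stand-in for a streamed update; only the `text` attribute is modeled."""
    text: Optional[str]


async def fake_stream() -> AsyncIterator[Chunk]:
    """Yield a few chunks, including an empty one, as a model stream might."""
    for piece in ["Python ", None, "is a ", "programming language."]:
        await asyncio.sleep(0)  # hand control back to the event loop
        yield Chunk(piece)


async def collect_stream(stream: AsyncIterator[Chunk]) -> str:
    """Print each non-empty chunk as it arrives and return the full text."""
    parts = []
    async for chunk in stream:
        if chunk.text:
            print(chunk.text, end="", flush=True)
            parts.append(chunk.text)
    print()
    return "".join(parts)
```

With a real agent, the analogous call would pass the agent's streamed run to `collect_stream` in place of `fake_stream()`, assuming its updates expose `text` as shown earlier.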

Next steps

GitHub Copilot


Last updated on 2026-05-26