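Endpoints for Microsoft Foundry Models

Microsoft Foundry Models lets you access the most powerful models from leading model providers through a single endpoint and a single set of credentials. This capability lets you switch between models and use them in your application without changing any code.

This article explains how Foundry services organize models and how to use the inference endpoint to access them.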

Important

The Azure AI Inference beta SDK is deprecated and will be retired on May 30, 2026. Switch to the generally available OpenAI/v1 API with a stable OpenAI SDK. Follow the migration guide to move to OpenAI/v1 by using the SDK for your preferred programming language.

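Deployments

Foundry uses deployments to make models available. A deployment gives a model a name and sets a specific configuration. To access a model, use its deployment name in your requests.

A deployment includes:

Model name
Model version
Provisioning or capacity type¹
Content filtering configuration¹
Rate limiting configuration¹

¹ These configurations can change depending on the selected model.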
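
As a rough illustration, the settings listed above can be pictured as a small record. The `Deployment` class and its field names are hypothetical and are not part of any Foundry SDK; they only mirror the list, with the footnoted settings left optional because they depend on the selected model. The example values are placeholders.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Deployment:
    """Hypothetical record mirroring the deployment settings listed above."""

    name: str  # Deployment name, the value used in requests.
    model_name: str
    model_version: str
    # The remaining settings depend on the selected model, so they are optional here.
    capacity_type: Optional[str] = None
    content_filter: Optional[str] = None
    rate_limit_per_minute: Optional[int] = None


# Placeholder values for illustration only; they are not real defaults.
deployment = Deployment(
    name="deepseek-v3-0324",
    model_name="DeepSeek-V3-0324",
    model_version="1",
)
print(deployment.name)
```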

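A Foundry resource can have many model deployments. You pay only for the inference performed on model deployments. Deployments are Azure resources, so they're subject to Azure policies.

For more information about creating deployments, see Add and configure model deployments.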

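Azure OpenAI inference endpoint

The Azure OpenAI API exposes the full capabilities of OpenAI models and supports more features, such as assistants, threads, files, and batch inference. You can also access non-OpenAI models through this route.

Azure OpenAI endpoints, usually of the form https://<resource-name>.openai.azure.com, work at the deployment level, and each deployment has its own associated URL. However, you can use the same authentication mechanism for all deployments. For more information, see the Azure OpenAI API reference page.

Each deployment has a URL that is formed by concatenating the Azure OpenAI base URL and the route /deployments/<model-deployment-name>.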
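
The concatenation described above can be sketched in a few lines of Python. This is a minimal sketch: the helper names and the resource name `my-resource` are hypothetical, and the `/openai` path prefix is taken from the REST request shown later in this article.

```python
def deployment_url(resource: str, deployment: str) -> str:
    """Join the Azure OpenAI base URL and the /deployments/<name> route."""
    base = f"https://{resource}.openai.azure.com/openai"
    return f"{base}/deployments/{deployment}"


def chat_completions_url(resource: str, deployment: str, api_version: str) -> str:
    """Append the chat completions route and the api-version query parameter."""
    return (
        f"{deployment_url(resource, deployment)}"
        f"/chat/completions?api-version={api_version}"
    )


# "my-resource" is a placeholder resource name.
url = chat_completions_url("my-resource", "deepseek-v3-0324", "2024-10-21")
print(url)
```

Switching to another model then only means passing a different deployment name; the rest of the URL stays the same.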

Install the openai package by using a package manager, like pip:

pip install openai --upgrade

Then, use the package to consume the model. The following example shows how to create a client to consume chat completions:

import os
from openai import AzureOpenAI

# Create a client for the Foundry resource; the API key is read from the environment.
client = AzureOpenAI(
    azure_endpoint="https://<resource>.services.ai.azure.com",
    api_key=os.getenv("AZURE_INFERENCE_CREDENTIAL"),
    api_version="2024-10-21",
)

Install the openai package by using npm:

npm install openai

Then, use the package to consume the model. The following example shows how to create a client to consume chat completions:

import { AzureOpenAI } from "openai";

const endpoint = "https://<resource>.services.ai.azure.com";
// The API key is read from the environment.
const apiKey = process.env.AZURE_INFERENCE_CREDENTIAL;
const apiVersion = "2024-10-21";
const deployment = "deepseek-v3-0324"; // Your model deployment name.

const client = new AzureOpenAI({
    endpoint,
    apiKey,
    apiVersion,
    deployment,
});

Here, deepseek-v3-0324 is the name of a model deployment in the Microsoft Foundry resource.

Install the OpenAI library with the following command:

dotnet add package Azure.AI.OpenAI --prerelease

Then, use the package to consume the model. The following example shows how to create a client to consume chat completions:

using Azure.AI.OpenAI;
using OpenAI.Chat;
using System.ClientModel;

AzureOpenAIClient client = new(
    new Uri("https://<resource>.services.ai.azure.com"),
    new ApiKeyCredential(Environment.GetEnvironmentVariable("AZURE_INFERENCE_CREDENTIAL"))
);

Add the package to your project:

<dependency>
    <groupId>com.azure</groupId>
    <artifactId>azure-ai-openai</artifactId>
    <version>1.0.0-beta.16</version>
</dependency>

Then, use the package to consume the model. The following example shows how to create a client to consume chat completions:

OpenAIClient client = new OpenAIClientBuilder()
    .credential(new AzureKeyCredential("{key}"))
    .endpoint("https://<resource>.services.ai.azure.com")
    .buildClient();

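Use the reference section to explore the API design and the parameters that are available. For example, the reference section for chat completions details how to use the route /chat/completions to generate predictions based on chat-formatted instructions.

Request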

POST https://<resource>.services.ai.azure.com/openai/deployments/deepseek-v3-0324/chat/completions?api-version=2024-10-21
api-key: <api-key>
Content-Type: application/json

Here, deepseek-v3-0324 is the name of a model deployment in the Foundry resource.

response = client.chat.completions.create(
    model="deepseek-v3-0324", # Replace with your model deployment name.
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain Riemann's conjecture in 1 paragraph"}
    ]
)

print(response.model_dump_json(indent=2))

var messages = [
    { role: "system", content: "You are a helpful assistant" },
    { role: "user", content: "Explain Riemann's conjecture in 1 paragraph" },
];

const response = await client.chat.completions.create({ messages, model: "deepseek-v3-0324" });

console.log(response.choices[0].message.content)

// Get a chat client bound to the model deployment.
ChatClient chatClient = client.GetChatClient("deepseek-v3-0324");

ChatCompletion response = chatClient.CompleteChat(
    [
        new SystemChatMessage("You are a helpful assistant."),
        new UserChatMessage("Explain Riemann's conjecture in 1 paragraph"),
    ]);

Console.WriteLine($"{response.Role}: {response.Content[0].Text}");

List<ChatRequestMessage> chatMessages = new ArrayList<>();
chatMessages.add(new ChatRequestSystemMessage("You are a helpful assistant"));
chatMessages.add(new ChatRequestUserMessage("Explain Riemann's conjecture in 1 paragraph"));

ChatCompletions chatCompletions = client.getChatCompletions("deepseek-v3-0324",
    new ChatCompletionsOptions(chatMessages));

System.out.printf("Model ID=%s is created at %s.%n", chatCompletions.getId(), chatCompletions.getCreatedAt());
for (ChatChoice choice : chatCompletions.getChoices()) {
    ChatResponseMessage message = choice.getMessage();
    System.out.printf("Index: %d, Chat Role: %s.%n", choice.getIndex(), message.getRole());
    System.out.println("Message:");
    System.out.println(message.getContent());
}

Here, deepseek-v3-0324 is the name of a model deployment in the Microsoft Foundry resource.

Request

POST https://<resource>.services.ai.azure.com/openai/deployments/deepseek-v3-0324/chat/completions?api-version=2024-10-21
api-key: <api-key>
Content-Type: application/json

{
    "messages": [
        {
            "role": "system",
            "content": "You are a helpful assistant"
        },
        {
            "role": "user",
            "content": "Explain Riemann's conjecture in 1 paragraph"
        }
    ]
}

Here, deepseek-v3-0324 is the name of a model deployment in the Foundry resource.
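
For readers who prefer to see the raw REST request assembled programmatically, the following sketch builds the same request with only the Python standard library. It is an illustration under assumptions: `my-resource` is a placeholder resource name, the key is read from the `AZURE_INFERENCE_CREDENTIAL` environment variable used in the SDK examples, and the network call is left commented out so the sketch runs without a live resource.

```python
import json
import os
import urllib.request

# Placeholder resource name; the deployment name and api-version match the request above.
url = (
    "https://my-resource.services.ai.azure.com/openai/deployments/"
    "deepseek-v3-0324/chat/completions?api-version=2024-10-21"
)

body = {
    "messages": [
        {"role": "system", "content": "You are a helpful assistant"},
        {"role": "user", "content": "Explain Riemann's conjecture in 1 paragraph"},
    ]
}

request = urllib.request.Request(
    url,
    data=json.dumps(body).encode("utf-8"),
    headers={
        # Falls back to a placeholder when the environment variable is not set.
        "api-key": os.getenv("AZURE_INFERENCE_CREDENTIAL", "<api-key>"),
        "Content-Type": "application/json",
    },
    method="POST",
)

# Sending is commented out so the sketch runs without a live resource:
# with urllib.request.urlopen(request) as response:
#     print(json.load(response)["choices"][0]["message"]["content"])
print(request.get_method(), request.full_url)
```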

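For more information about how to use the Azure OpenAI endpoint, see the Azure OpenAI in Foundry Models documentation.

Keyless authentication

Models deployed to Foundry Models in Foundry Tools support keyless authorization by using Microsoft Entra ID. Keyless authorization enhances security, simplifies the user experience, reduces operational complexity, and provides robust compliance support for modern development. These benefits make keyless authorization a strong choice for organizations that are adopting secure and scalable identity management solutions.

To use keyless authentication, configure your resource and grant users access so that they can perform inference. After you configure the resource and grant access, authenticate as follows: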

Install the OpenAI SDK by using a package manager, like pip:

pip install openai

For Microsoft Entra ID authentication, also install:

pip install azure-identity

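Use the package to consume the model. The following example shows how to create a client that consumes chat completions with Microsoft Entra ID, and how to make a test call to the chat completions endpoint with your model deployment.

Replace <resource> with your Foundry resource name. Find it in the Azure portal or by running az cognitiveservices account list. Replace DeepSeek-V3.1 with your actual deployment name.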

from openai import OpenAI
from azure.identity import DefaultAzureCredential, get_bearer_token_provider

token_provider = get_bearer_token_provider(
    DefaultAzureCredential(), 
    "https://ai.azure.com/.default"
)

client = OpenAI(
    base_url="https://<resource>.openai.azure.com/openai/v1/",
    api_key=token_provider,
)

completion = client.chat.completions.create(
    model="DeepSeek-V3.1",  # Required: your deployment name
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is Azure AI?"}
    ]
)

print(completion.choices[0].message.content)

Expected output:

Azure AI is a comprehensive suite of artificial intelligence services and tools from Microsoft that enables developers to build intelligent applications. It includes services for natural language processing, computer vision, speech recognition, and machine learning capabilities.

Reference: OpenAI Python SDK and the DefaultAzureCredential class.

Install the OpenAI SDK:

dotnet add package OpenAI

For Microsoft Entra ID authentication, also install the Azure.Identity package:

dotnet add package Azure.Identity

Import the following namespaces:

using Azure.Identity;
using OpenAI;
using OpenAI.Chat;
using System.ClientModel.Primitives;

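Then, use the package to consume the model. The following example shows how to create a client that consumes chat completions with Microsoft Entra ID, and how to make a test call to the chat completions endpoint with your model deployment.

Replace <resource> with your Foundry resource name, which you can find in the Azure portal. Replace gpt-4o-mini with your actual deployment name.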

#pragma warning disable OPENAI001

BearerTokenPolicy tokenPolicy = new(
    new DefaultAzureCredential(),
    "https://ai.azure.com/.default"
);

ChatClient client = new(
    model: "gpt-4o-mini", // Your deployment name
    authenticationPolicy: tokenPolicy,
    options: new OpenAIClientOptions() {
        Endpoint = new Uri("https://<resource>.openai.azure.com/openai/v1/")
    }
);

ChatCompletion completion = client.CompleteChat(
    new SystemChatMessage("You are a helpful assistant."),
    new UserChatMessage("What is Azure AI?")
);

Console.WriteLine(completion.Content[0].Text);

Expected output:

Azure AI is a comprehensive suite of artificial intelligence services and tools from Microsoft that enables developers to build intelligent applications. It includes services for natural language processing, computer vision, speech recognition, and machine learning capabilities.

Reference: OpenAI .NET SDK and the DefaultAzureCredential class.

Install the OpenAI SDK by using npm:

npm install openai

For Microsoft Entra ID authentication, also install:

npm install @azure/identity

Replace <resource> with your Foundry resource name. Find it in the Azure portal or by running az cognitiveservices account list. Replace DeepSeek-V3.1 with your actual deployment name.

import { DefaultAzureCredential, getBearerTokenProvider } from "@azure/identity";
import { OpenAI } from "openai";

const tokenProvider = getBearerTokenProvider(
    new DefaultAzureCredential(),
    'https://ai.azure.com/.default'
);

const client = new OpenAI({
    baseURL: "https://<resource>.openai.azure.com/openai/v1/",
    apiKey: tokenProvider
});

const completion = await client.chat.completions.create({
    model: "DeepSeek-V3.1", // Required: your deployment name
    messages: [
        { role: "system", content: "You are a helpful assistant." },
        { role: "user", content: "What is Azure AI?" }
    ]
});

console.log(completion.choices[0].message.content);

Expected output:

Azure AI is a comprehensive suite of artificial intelligence services and tools from Microsoft that enables developers to build intelligent applications. It includes services for natural language processing, computer vision, speech recognition, and machine learning capabilities.

Reference: OpenAI Node.js SDK and the DefaultAzureCredential class.

Add the OpenAI SDK to your project. Check the OpenAI Java GitHub repository for the latest version and installation instructions.

For Microsoft Entra ID authentication, also add:

<dependency>
    <groupId>com.azure</groupId>
    <artifactId>azure-identity</artifactId>
    <version>1.18.0</version>
</dependency>

Replace <resource> with your Foundry resource name, which you can find in the Azure portal. Replace DeepSeek-V3.1 with your actual deployment name.

import com.azure.identity.AuthenticationUtil;
import com.azure.identity.DefaultAzureCredential;
import com.azure.identity.DefaultAzureCredentialBuilder;
import com.openai.client.OpenAIClient;
import com.openai.client.okhttp.OpenAIOkHttpClient;
import com.openai.credential.BearerTokenCredential;
import com.openai.models.chat.completions.*;

DefaultAzureCredential tokenCredential = new DefaultAzureCredentialBuilder().build();

OpenAIClient client = OpenAIOkHttpClient.builder()
    .baseUrl("https://<resource>.openai.azure.com/openai/v1/")
    .credential(BearerTokenCredential.create(
        AuthenticationUtil.getBearerTokenSupplier(
            tokenCredential, 
            "https://ai.azure.com/.default"
        )
    ))
    .build();

ChatCompletionCreateParams params = ChatCompletionCreateParams.builder()
    .addSystemMessage("You are a helpful assistant.")
    .addUserMessage("What is Azure AI?")
    .model("DeepSeek-V3.1") // Required: your deployment name
    .build();

ChatCompletion completion = client.chat().completions().create(params);
System.out.println(completion.choices().get(0).message().content());

Expected output:

Azure AI is a comprehensive suite of artificial intelligence services and tools from Microsoft that enables developers to build intelligent applications. It includes services for natural language processing, computer vision, speech recognition, and machine learning capabilities.

Reference: OpenAI Java SDK and the DefaultAzureCredential class.

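Explore the API design in the reference section to see which parameters are available. Pass the authentication token in the Authorization header. For example, the reference section for chat completions details how to use the route /chat/completions to generate predictions based on chat-formatted instructions. The path /models is included in the root of the URL.

Request

Replace <resource> with your Foundry resource name. Find it in the Azure portal or by running az cognitiveservices account list. Replace MAI-DS-R1 with your actual deployment name.

The base_url accepts both the https://<resource>.openai.azure.com/openai/v1/ and the https://<resource>.services.ai.azure.com/openai/v1/ formats.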
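
The two accepted base URL forms can be sketched as a tiny helper. The function name `v1_base_url` and the resource name `my-resource` are hypothetical; only the two host suffixes and the /openai/v1/ path come from this article.

```python
# The two host suffixes that this article lists as accepted.
ALLOWED_HOST_SUFFIXES = ("openai.azure.com", "services.ai.azure.com")


def v1_base_url(resource: str, host_suffix: str = "openai.azure.com") -> str:
    """Build an OpenAI/v1 base URL for a Foundry resource (hypothetical helper)."""
    if host_suffix not in ALLOWED_HOST_SUFFIXES:
        raise ValueError(f"Unsupported host suffix: {host_suffix}")
    return f"https://{resource}.{host_suffix}/openai/v1/"


# "my-resource" is a placeholder resource name.
for suffix in ALLOWED_HOST_SUFFIXES:
    print(v1_base_url("my-resource", suffix))
```

Either value can then be passed as the base URL when creating the OpenAI client.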

curl -X POST https://<resource>.openai.azure.com/openai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $AZURE_OPENAI_AUTH_TOKEN" \
  -d '{
      "model": "MAI-DS-R1",
      "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant."
      },
      {
        "role": "user",
        "content": "Explain what the bitter lesson is?"
      }
    ]
  }'

Response

If authentication succeeds, you receive a 200 OK response with the chat completion results in the response body:

{
  "id": "chatcmpl-...",
  "object": "chat.completion",
  "created": 1738368234,
  "model": "MAI-DS-R1",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The bitter lesson refers to a key insight in AI research that emphasizes the importance of general-purpose learning methods that leverage computation, rather than human-designed domain-specific approaches. It suggests that methods which scale with increased computation tend to be more effective in the long run."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 28,
    "completion_tokens": 52,
    "total_tokens": 80
  }
}
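
To show how a client might read this response, the following sketch parses a shortened copy of the JSON above with the Python standard library. The message content is abbreviated for space, and the variable names are illustrative only.

```python
import json

# Shortened copy of the response body shown above (message content abbreviated).
raw = """
{
  "id": "chatcmpl-...",
  "object": "chat.completion",
  "model": "MAI-DS-R1",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "The bitter lesson refers to ..."},
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 28, "completion_tokens": 52, "total_tokens": 80}
}
"""

data = json.loads(raw)
choice = data["choices"][0]
answer = choice["message"]["content"]
usage = data["usage"]

# A finish_reason other than "stop" suggests the answer may be incomplete.
if choice["finish_reason"] != "stop":
    print(f"Generation ended early: {choice['finish_reason']}")

print(answer)
print(f"Tokens used: {usage['total_tokens']}")
```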

Tokens must be issued with the scope https://ai.azure.com/.default.

For testing purposes, the easiest way to get a valid token for your user account is to use the Azure CLI. In a console, run the following Azure CLI command:

az account get-access-token --resource https://cognitiveservices.azure.com --query "accessToken" --output tsv

This command outputs an access token that you can store in the $AZURE_OPENAI_AUTH_TOKEN environment variable.
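
Once the token is stored, a script can turn it into the Authorization header used in the curl request. The following is a minimal sketch; the helper name `auth_headers` is hypothetical, and the placeholder token is set only so that the sketch runs without the Azure CLI.

```python
import os


def auth_headers() -> dict:
    """Build request headers from the token stored in AZURE_OPENAI_AUTH_TOKEN."""
    token = os.environ.get("AZURE_OPENAI_AUTH_TOKEN")
    if not token:
        raise RuntimeError(
            "Set AZURE_OPENAI_AUTH_TOKEN to the token printed by the Azure CLI."
        )
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }


# Placeholder so the sketch runs standalone; a real token comes from the CLI command.
os.environ.setdefault("AZURE_OPENAI_AUTH_TOKEN", "<token>")
print(sorted(auth_headers()))
```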

Reference: Chat Completions API

Last updated on 2026-03-11
