An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
Hi Axel Vázquez,
What you are seeing usually means the request is reaching the service, but the backend is routing it to the wrong internal embedding's handler. The key signal is that you get HTTP 200 with an empty body and a grpc message saying an unrecognized internal embeddings path, even though you called the /models/embeddings route. That pattern points to a service side routing regression or an endpoint format mismatch rather than a payload change on your side. [clemenssiebler.com]
How to verify you are using the correct embeddings endpoint format:
Azure currently has more than one valid way to call embeddings, and mixing endpoint styles can lead to confusing results. The safest way to verify is to align your call with the official embedding's patterns documented for Azure OpenAI in Foundry Models. [microsoftl....github.io]
Option 1 is the OpenAI v1 style embeddings route under your Azure OpenAI endpoint. The documented pattern is POST https://YOUR RESOURCE NAME.openai.azure.com/openai/v1/embeddings?api-version=preview and the body must include model and input.
Option 2 is the deployment style embeddings route POST https://YOUR RESOURCE NAME.openai.azure.com/openai/deployments/YOUR DEPLOYMENT NAME/embeddings?api-version=2025-04-01-preview and the body must include input.
If you are calling the newer services.ai.azure.com models embeddings endpoint and you see grpc message Unrecognized endpoint, that is a strong hint the backend is not mapping your call to the expected embeddings handler correctly.
Key troubleshooting steps for this embeddings issue:
Step 1 Use the documented embeddings endpoint as a baseline test Try the OpenAI v1 embeddings endpoint shown in the official embeddings documentation and confirm you get a non empty response. This is the fastest way to separate a service routing issue from a request formatting issue.
Step 2 Confirm authentication matches what the embeddings API supports The Azure OpenAI embeddings v1 API does not currently support Microsoft Entra ID authentication. The documentation recommends using API key authentication for embeddings calls. If you are using Entra ID with the new inference endpoint, testing with API key on the v1 embeddings endpoint helps confirm whether auth style is part of the issue.
Step 3 Double check the request body fields For embeddings, input must be present and non empty. For the v1 embeddings route, model is also required. If model or input are missing or mismatched, you can get unexpected behavior depending on the gateway.
Step 4 Validate limits to avoid silent failures The latest embedding models have input length limits and request size limits. Keeping the test input small and simple for the first verification helps rule out limit related behavior. The documentation calls out maximum input length guidance and other best practices for embedding requests.
Step 5 Compare results across endpoints using the exact same input Run the same input text through the services.ai.azure.com models embeddings endpoint that is returning 200 with empty body and the openai v1 embeddings endpoint If the openai v1 endpoint returns a normal embedding vector while the services.ai.azure.com endpoint returns empty output with grpc unrecognized endpoint, that strongly supports a backend routing issue specific to that endpoint.
In short: First confirm embeddings work using the documented openai v1 embeddings endpoint and API key. Then compare with the services.ai.azure.com models embeddings endpoint. If only the services.ai.azure.com route produces 200 with an empty body and grpc unrecognized endpoint, it is most likely a service side routing regression on that new endpoint rather than a change you made.
I Hope this helps. Do let me know if you have any further queries.
Thankyou!