Missing x-ratelimit-remaining-requests in response for embeddings generation

Roman Egel 0 Reputation points
2025-05-06T19:55:49.3133333+00:00

when calling https://<endpoint>.openai.azure.com/openai/deployments/<deployment>/embeddings?api-version=2023-05-15

there's no x-ratelimit-remaining-requests in response headers, only x-ratelimit-remaining-tokens whereas before we received both rpm and tpm tokens.

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,409 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.