Hello Teodor Constantin,
Thank you for posting your question in the Microsoft Q&A forum.
The 403 "temporarily blocked" error with Azure's GPT-4.5-preview model typically stems from automated traffic patterns (even unintentional ones) triggering Azure’s abuse prevention systems, especially if your requests include rapid retries, high-frequency polling, or repetitive prompts. While your prompts may comply with content policies, Azure’s rate-limiting algorithms might flag sudden spikes in usage or unusual deployment churn (e.g., frequent model re-deployments) as suspicious behavior.
To resolve this, first audit your request patterns: space out API calls, implement exponential backoff for retries, and avoid redundant deployments. If your prompts include NSFW fallbacks, ensure they don’t inadvertently trigger content filters; test with simplified prompts to isolate the issue. For persistent blocks, contact Azure Support and include the Apim-request-id from the failing response so they can run a manual review or adjust your quota.
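As a rough illustration of the retry advice above, here is a minimal Python sketch of exponential backoff with jitter. It is not tied to any Azure SDK: `TransientAPIError`, `backoff_delays`, `call_with_backoff`, and the `send` callable are hypothetical names invented for this example, and which status codes you treat as retryable is an assumption you should adapt to your own client.

```python
import random
import time


class TransientAPIError(Exception):
    """Hypothetical error standing in for a throttled or temporarily blocked response."""

    def __init__(self, status, retry_after=None):
        super().__init__(f"HTTP {status}")
        self.status = status
        self.retry_after = retry_after  # seconds, if the server supplied a hint


def backoff_delays(max_retries, base=1.0, cap=60.0):
    """Upper bound of the wait before each retry: base * 2**attempt, capped at cap."""
    return [min(cap, base * 2 ** attempt) for attempt in range(max_retries)]


def call_with_backoff(send, max_retries=5, base=1.0, cap=60.0,
                      sleep=time.sleep, rng=random.random):
    """Call send(); on TransientAPIError, wait with exponential backoff and retry."""
    delays = backoff_delays(max_retries, base, cap)
    for attempt in range(max_retries + 1):
        try:
            return send()
        except TransientAPIError as err:
            if attempt == max_retries:
                raise  # out of retries: surface the error instead of hammering the API
            if err.retry_after is not None:
                delay = err.retry_after  # prefer the server's own hint when present
            else:
                delay = rng() * delays[attempt]  # random jitter up to the capped bound
            sleep(delay)
```

To use it, wrap your existing request in a function that raises `TransientAPIError` for the responses you consider retryable, then pass that function as `send`. The jitter keeps several clients from retrying in lockstep, which is one of the traffic patterns that can look automated.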
Long-term, migrate to a dedicated Azure OpenAI Service resource (not shared tiers) for higher rate limits and stability. Monitor usage via Azure Metrics to preemptively identify throttling risks. If the issue persists, Microsoft may need to whitelist your workload; provide concrete use-case details to expedite resolution.
If the above answer helped, please do not forget to "Accept Answer", as this may help other community members facing a similar issue find this information. Your contribution to the Microsoft Q&A community is highly appreciated.