You can implement retry logic and error handling in your code. That way, a failed request is retried instead of immediately being written off as wasted tokens.
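A minimal sketch of such a retry helper with exponential backoff is shown below; `with_retries` and `flaky_request` are illustrative names, not part of any particular SDK:

```python
import time

def with_retries(request_fn, max_attempts=3, base_delay=1.0):
    """Call request_fn, retrying with exponential backoff on any exception."""
    for attempt in range(max_attempts):
        try:
            return request_fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of retries; let the caller see the error
            # back off base_delay * 1, 2, 4, ... seconds between attempts
            time.sleep(base_delay * 2 ** attempt)

# Hypothetical request that fails twice before succeeding
calls = {"count": 0}
def flaky_request():
    calls["count"] += 1
    if calls["count"] < 3:
        raise ConnectionError("transient network error")
    return "response"

result = with_retries(flaky_request, max_attempts=5, base_delay=0)
print(result)  # → response
```

In a real client you would catch only transient errors (timeouts, rate limits) rather than every exception, so that permanent failures surface immediately.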
Or, when dealing with large prompts, you can:
- Break the prompt into smaller chunks and process them individually, if possible.
- Use the `stream` parameter to receive responses as they are generated, reducing the risk of hitting token limits.
- Simplify or condense your prompts so they stay within token limits while still conveying the necessary information.