Word count limit for system message in gpt-4o-realtime API? Can gpt-4o-realtime API handle video?

GenixPRO 116 Reputation points
2024-12-12T10:36:14.5366667+00:00

Hi Team,

  1. Is there a word count limit for system message in gpt-4o-realtime API? When we pass a system message with about 2500 words -> API responds and user can have a conversation. However, when system message word count exceeds 6500 words -> API does not initiate a conversation. We don't receive any error message so don't know what may be the issue and how to troubleshoot.
  2. Can gpt-4o-realtime API handle video?
  3. We've tried audio and it works intermittently. How can we troubleshoot to find error message and the AI does not converse?

Thanks & appreciate your help.

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,098 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Pavankumar Purilla 8,570 Reputation points Microsoft External Staff Moderator
    2024-12-13T00:01:45.9833333+00:00

    Hi GenixPRO,
    Greetings & Welcome to Microsoft Q&A forum! Thanks for posting your query!

    I understand that you are facing an issue with the gpt-4o-realtime API where system messages exceeding a certain word count (~6500 words) fail to initiate a conversation, and you're also exploring if the API can handle video.

    The gpt-4o-realtime API has a token limit of 8,192 or 32,768 tokens, which includes the system message, user inputs, and model responses. If your system message exceeds this limit, the API may silently fail. For a system message exceeding ~6500 words (likely over 32,000 tokens), consider reducing its length or splitting the conversation into smaller parts. Use tools like OpenAI's Tokenizer to calculate token usage and ensure you're within limits.

    Regarding video, the API does not support direct video processing. However, you can use Azure services like Video Indexer to extract metadata and transcription or Speech Services for audio-to-text conversion, then pass the processed text to the API. For intermittent audio issues, ensure you log errors, verify rate limits, and test in different Azure regions to troubleshoot latency or service degradation. Let us know if you need further assistance!

    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.