Need help optimizing GPT-3.5 Turbo Chatbot Responses for Multilingual Document Retrieval

Question

Need help optimizing GPT-3.5 Turbo Chatbot Responses for Multilingual Document Retrieval

Dana 5

Our company is planning to use GPT-3.5 Turbo to create our own custom chatbot that is supposed to help the users with their meditation journey. Currently our server is hosted by Microsoft, so we are using their OpenAI Studio to get things going faster. We have a document in Korean with basic information on meditation, like how to start, how often to practice meditation, etc. and we are using hybrid semantic search with text-embedding-ada-002 for vectorizing. When testing out the AI search, the information being retrieved is relevant enough. However the problem occurs when the data is being uploaded to the chat playground and the chat keeps responding with “This question is out of the scope of the retrieved documents. Therefore, I cannot provide an answer based on the documents.” message in English to a prompt written in Korean (same language as the uploaded data). I tried reflecting this in a system message, asking GPT to respond according to its own knowledge to questions for which no information was found in the uploaded document, however that did not work. At certain values of temperature and top P it gives out a relatively good answer, now considering results of the semantic search, but later with the same configurations for both temperature and top P and the exact same prompt, it again responds with “no relevant information was found”. I am not sure what to do with this issue, what exactly it is related to whether its language related or I have set some configurations wrong, so if you could please give any advice I would most certainly appreciate it. In the screenshots provided as you can see same parameters are being used, however one time it responded perfectly while in another case the chatbot apologizes for lack of information. Screenshot 2024-03-18 132852 Screenshot 2024-03-18 133126

navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-21T06:06:00.8066667+00:00

@Dana Welcome to Microsoft Q&A Forum, Thank you for posting your query here!

Since your requirement is to answer the questions outside your input data, Please follow the below action plan:

Plan 1:

After you add your own data and the ingestion completes, Expand the Advanced Settings and then please uncheck the Limit responses to your data content option shown below:

Then test the functionality in the chat playground before you deploy your code.

Plan 2:

Please update / append your System message with the below:

Please answer using retrieved documents and combine with your internal knowledge to answer the question. Please feel free to add facts on top of information extracted from retrieved documents. Please do not forget to cite the retrieved documents for information you extract from documents. Think step by step about which information needs to be added or is missing from retrieved documents

Then click on Apply Changes as shown below:

Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help.
Dana 5 Reputation points

2024-03-22T01:22:00.5666667+00:00

@navba-MSFT Thanks for the suggestion! I have tried it out and I also had to change the temperature setting gradually in increments of 0.1 , until the model responded something except "Sorry, I can't help with that." after the value was pretty much high at 0.7. However after answering the first question (very poorly I have to say, repeating same sentence twice), the model was replying with the same prompt again to the following three questions. I had to restart the chat, but again at the same value of temperature I was getting a response saying 죄송합니다. 이 요청은 저의 업무 범위를 벗어납니다. (I'm sorry, this request is beyond my scope of work.)

I was thinking that maybe data retrieval was not working properly, so I checked the AI Search resource and the index that I was using (semantic+hybrid), I went to search explorer tab and tried to do a search using the example prompts, and the chunks were being retrieved just fine. Even with the instructions stating clearly to answer outside the scope of the document, the bot was still not answering properly again.

If you need any additional information about the settings or the model being deployed please let me know!
Harsh Sharma 5 Reputation points

2024-03-23T17:26:53.07+00:00

Hi @navba-MSFT I am also facing the similar issue. Few days back the chat playground was able to show perfect answers on the basis of the indexed data in Azure Cognitive Search. But from last 5-6 days it stopped giving correct responses and on asking any question it is showing this output: "The requested information is not found in the retrieved data. Please try another query or topic.". I also tried all the solutions which you mentioned in this message thread but none of them worked.

I searched about this error more then I got to know that few days back some other person also posted the similar issue on OpenAI Query Form: https://community.openai.com/t/azure-openai-issue-with-chat-response-generation-using-data-retrieval/688
I am not sure but it seems there is some problem happening with gpt-35-turbo from last 5-6 days.
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-24T05:01:07.5766667+00:00

@Dana Thanks for getting back. I am unable to reproduce this issue at my end with gpt 3.5 turbo. It worked fine at my end.

I believe the behavior is related to your chat playground and not the API itself. Please try to invoke the Azure Open AI chat completion REST API as shown here from postman and check the behavior:

https://learn.microsoft.com/en-us/azure/ai-services/openai/reference#azure-ai-search

If you want more details about the headers used in the API refer this thread.

Please remember to set the "inScope": false in your request.

Hope this helps.

Dana 5

@navba-MSFT I tried using the REST API as follows:

curl -i -X POST {}/openai/deployments/{}/extensions/chat/completions?api-version=2023-06-01-preview
-H "Content-Type: application/json" \
-H "api-key: key" \
-d \
'
{
    "temperature": 0,
    "max_tokens": 1000,
    "top_p": 1.0,
    "dataSources": [
        {
            "type": "AzureCognitiveSearch",
            "parameters": {
                "endpoint": "endpoint",
                "key": "key",
                "indexName": "index",
                "inScope": false
            }
        }
    ],
    "messages": [
        {
            "role": "system",
            "content":"Please answer using retrieved documents and combine with your internal knowledge to answer the question. Please feel free to add facts on top of information extracted from retrieved documents. Please do not forget to cite the retrieved documents for information you extract from documents. Think step by step about which information needs to be added or is missing from retrieved documents"
        },
        {
            "role": "user",
            "content": "명상은 얼마나 자주 하는게 좋은가요?"
        }
    ]
}
'

This is the response that I have received

{
    "id": "id",
    "model": "gpt-35-turbo",
    "created": 1711333604,
    "object": "chat.completion",
    "choices": [
        {
            "index": 0,
            "messages": [
                {
                    "index": 0,
                    "role": "tool",
                    "content": "{\"citations\": [], \"intent\": \"[\\\"How often should I meditate?\\\", \\\"What is the ideal frequency for meditation?\\\"]\"}",
                    "end_turn": false
                },
                {
                    "index": 1,
                    "role": "assistant",
                    "content": "I'm sorry, I can't provide a response to that question as it is out of the scope of the provided documents. If you have any other questions related to the content of the documents, feel free to ask!",
                    "end_turn": true
                }
            ]
        }
    ],
    "usage": {
        "prompt_tokens": 2587,
        "completion_tokens": 62,
        "total_tokens": 2649
    },
    "system_fingerprint": "fp"
}

The intent is being recognized perfectly, however for some reason there's no information being retrieved, I wonder if I should add something for language configuration as well in my request?

Dana 5

@navba-MSFT

I have tried using the REST API as follows:

curl -i -X POST {}/openai/deployments/{}/extensions/chat/completions?api-version=2023-06-01-preview
-H "Content-Type: application/json" \
-H "api-key: key" \
-d \
'
{
    "temperature": 0,
    "max_tokens": 1000,
    "top_p": 1.0,
    "dataSources": [
        {
            "type": "AzureCognitiveSearch",
            "parameters": {
                "endpoint": "endpoint",
                "key": "key",
                "indexName": "index",
                "inScope": false
            }
        }
    ],
    "messages": [
        {
            "role": "system",
            "content":"Please answer using retrieved documents and combine with your internal knowledge to answer the question. Please feel free to add facts on top of information extracted from retrieved documents. Please do not forget to cite the retrieved documents for information you extract from documents. Think step by step about which information needs to be added or is missing from retrieved documents"
        },
        {
            "role": "user",
            "content": "명상은 얼마나 자주 하는게 좋은가요?"
        }
    ]
}
'

I have received the following response:

{
    "id": "",
    "model": "gpt-35-turbo",
    "created": ,
    "object": "chat.completion",
    "choices": [
        {
            "index": 0,
            "messages": [
                {
                    "index": 0,
                    "role": "tool",
                    "content": "{\"citations\": [], \"intent\": \"[\\\"How often should I meditate?\\\", \\\"What is the ideal frequency for meditation?\\\"]\"}",
                    "end_turn": false
                },
                {
                    "index": 1,
                    "role": "assistant",
                    "content": "I'm sorry, I can't provide a response to that question as it is out of the scope of the provided documents. If you have any other questions related to the content of the documents, feel free to ask!",
                    "end_turn": true
                }
            ]
        }
    ],
    "usage": {
        "prompt_tokens": 2587,
        "completion_tokens": 62,
        "total_tokens": 2649
    },
    "system_fingerprint": "fp"
}

The intent is being recognized perfectly, however I don't seem to receive any information chunks related to the question. I'm wondering if I should specify the language configuration in the request? I have added the system message as per your instruction as well.

navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T05:05:49.5466667+00:00

@Dana Thanks for getting back. Could you please try deploying it to your existing webapp as shown below and then test the behavior from there to isolate any issues with the chat playground ?

Awaiting your reply.
Dana 5 Reputation points

2024-03-26T06:32:23.0833333+00:00

@navba-MSFT I don't have an existing webapp at the moment, and I'll have to create a new service to try this out, is there a reason why this should be any different from using the API directly? I am looking to use the OpenAI service in my app through API calls.
Dana 5 Reputation points

2024-03-26T06:33:02.9233333+00:00

@navba-MSFT thanks for getting back so fast!
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T06:36:46.7566667+00:00

@Dana Thanks for your reply. We are just isolating this issue if this is API related or chat playground related in your case. Because the same is working fine at my end without any issues. So by deploying it to the new webapp we can check if that helps.

Please don't forget to uncheck the below option before deploying:

Please let me know how it goes.
Dana 5 Reputation points

2024-03-26T06:49:32.75+00:00

@navba-MSFT I have tried it out considering every suggestion you have made, however the response is still the same
Dana 5 Reputation points

2024-03-26T06:51:46.89+00:00

When I ask the same question in English though, I was able to get an answer this time. The document is fully in Korean however
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T06:55:41.7066667+00:00

@Dana Thanks for confirmation. Can you add the relevant system prompt or mention in your question saying if you want the response to be in English or Korean and check if that helps ? Awaiting your reply.
Dana 5 Reputation points

2024-03-26T07:02:59.61+00:00

@navba-MSFT I have updated the system prompt asking to answer in Korean. Now its answering the same thing as in the first case but in Korean instead
Dana 5 Reputation points

2024-03-26T07:08:08.9366667+00:00

The answer is again generated (even though with no relation to the citations found) when the prompt is in English. However when I added a request to answer in Korean to my Korean prompt it worked as well
Dana 5 Reputation points

2024-03-26T07:09:10.77+00:00

Please let me know if I should test something else as well!
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T07:10:38.25+00:00

@Dana I have sent you a private message in this thread. Please reply once you get a chance.

2 answers

Your answer

navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-21T06:06:00.8066667+00:00

@Dana Welcome to Microsoft Q&A Forum, Thank you for posting your query here!

Since your requirement is to answer the questions outside your input data, Please follow the below action plan:

Plan 1:

After you add your own data and the ingestion completes, Expand the Advanced Settings and then please uncheck the Limit responses to your data content option shown below:

Then test the functionality in the chat playground before you deploy your code.

Plan 2:

Please update / append your System message with the below:

Please answer using retrieved documents and combine with your internal knowledge to answer the question. Please feel free to add facts on top of information extracted from retrieved documents. Please do not forget to cite the retrieved documents for information you extract from documents. Think step by step about which information needs to be added or is missing from retrieved documents

Then click on Apply Changes as shown below:

Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help.
Dana 5 Reputation points

2024-03-22T01:22:00.5666667+00:00

@navba-MSFT Thanks for the suggestion! I have tried it out and I also had to change the temperature setting gradually in increments of 0.1 , until the model responded something except "Sorry, I can't help with that." after the value was pretty much high at 0.7. However after answering the first question (very poorly I have to say, repeating same sentence twice), the model was replying with the same prompt again to the following three questions. I had to restart the chat, but again at the same value of temperature I was getting a response saying 죄송합니다. 이 요청은 저의 업무 범위를 벗어납니다. (I'm sorry, this request is beyond my scope of work.)

I was thinking that maybe data retrieval was not working properly, so I checked the AI Search resource and the index that I was using (semantic+hybrid), I went to search explorer tab and tried to do a search using the example prompts, and the chunks were being retrieved just fine. Even with the instructions stating clearly to answer outside the scope of the document, the bot was still not answering properly again.

If you need any additional information about the settings or the model being deployed please let me know!
Harsh Sharma 5 Reputation points

2024-03-23T17:26:53.07+00:00

Hi @navba-MSFT I am also facing the similar issue. Few days back the chat playground was able to show perfect answers on the basis of the indexed data in Azure Cognitive Search. But from last 5-6 days it stopped giving correct responses and on asking any question it is showing this output: "The requested information is not found in the retrieved data. Please try another query or topic.". I also tried all the solutions which you mentioned in this message thread but none of them worked.

I searched about this error more then I got to know that few days back some other person also posted the similar issue on OpenAI Query Form: https://community.openai.com/t/azure-openai-issue-with-chat-response-generation-using-data-retrieval/688
I am not sure but it seems there is some problem happening with gpt-35-turbo from last 5-6 days.
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-24T05:01:07.5766667+00:00

@Dana Thanks for getting back. I am unable to reproduce this issue at my end with gpt 3.5 turbo. It worked fine at my end.

I believe the behavior is related to your chat playground and not the API itself. Please try to invoke the Azure Open AI chat completion REST API as shown here from postman and check the behavior:

https://learn.microsoft.com/en-us/azure/ai-services/openai/reference#azure-ai-search

If you want more details about the headers used in the API refer this thread.

Please remember to set the "inScope": false in your request.

Hope this helps.
Dana 5 Reputation points

2024-03-25T02:35:19.8133333+00:00

@navba-MSFT I tried using the REST API as follows:

curl -i -X POST {}/openai/deployments/{}/extensions/chat/completions?api-version=2023-06-01-preview -H "Content-Type: application/json" \ -H "api-key: key" \ -d \ ' { "temperature": 0, "max_tokens": 1000, "top_p": 1.0, "dataSources": [ { "type": "AzureCognitiveSearch", "parameters": { "endpoint": "endpoint", "key": "key", "indexName": "index", "inScope": false } } ], "messages": [ { "role": "system", "content":"Please answer using retrieved documents and combine with your internal knowledge to answer the question. Please feel free to add facts on top of information extracted from retrieved documents. Please do not forget to cite the retrieved documents for information you extract from documents. Think step by step about which information needs to be added or is missing from retrieved documents" }, { "role": "user", "content": "명상은 얼마나 자주 하는게 좋은가요?" } ] } '

This is the response that I have received

{ "id": "id", "model": "gpt-35-turbo", "created": 1711333604, "object": "chat.completion", "choices": [ { "index": 0, "messages": [ { "index": 0, "role": "tool", "content": "{\"citations\": [], \"intent\": \"[\\\"How often should I meditate?\\\", \\\"What is the ideal frequency for meditation?\\\"]\"}", "end_turn": false }, { "index": 1, "role": "assistant", "content": "I'm sorry, I can't provide a response to that question as it is out of the scope of the provided documents. If you have any other questions related to the content of the documents, feel free to ask!", "end_turn": true } ] } ], "usage": { "prompt_tokens": 2587, "completion_tokens": 62, "total_tokens": 2649 }, "system_fingerprint": "fp" }

The intent is being recognized perfectly, however for some reason there's no information being retrieved, I wonder if I should add something for language configuration as well in my request?
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T05:05:49.5466667+00:00

@Dana Thanks for getting back. Could you please try deploying it to your existing webapp as shown below and then test the behavior from there to isolate any issues with the chat playground ?

Awaiting your reply.
Dana 5 Reputation points

2024-03-26T06:32:23.0833333+00:00

@navba-MSFT I don't have an existing webapp at the moment, and I'll have to create a new service to try this out, is there a reason why this should be any different from using the API directly? I am looking to use the OpenAI service in my app through API calls.
Dana 5 Reputation points

2024-03-26T06:33:02.9233333+00:00

@navba-MSFT thanks for getting back so fast!
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T06:36:46.7566667+00:00

@Dana Thanks for your reply. We are just isolating this issue if this is API related or chat playground related in your case. Because the same is working fine at my end without any issues. So by deploying it to the new webapp we can check if that helps.

Please don't forget to uncheck the below option before deploying:

Please let me know how it goes.
Dana 5 Reputation points

2024-03-26T06:49:32.75+00:00

@navba-MSFT I have tried it out considering every suggestion you have made, however the response is still the same
Dana 5 Reputation points

2024-03-26T06:51:46.89+00:00

When I ask the same question in English though, I was able to get an answer this time. The document is fully in Korean however
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T06:55:41.7066667+00:00

@Dana Thanks for confirmation. Can you add the relevant system prompt or mention in your question saying if you want the response to be in English or Korean and check if that helps ? Awaiting your reply.
Dana 5 Reputation points

2024-03-26T07:02:59.61+00:00

@navba-MSFT I have updated the system prompt asking to answer in Korean. Now its answering the same thing as in the first case but in Korean instead
Dana 5 Reputation points

2024-03-26T07:08:08.9366667+00:00

The answer is again generated (even though with no relation to the citations found) when the prompt is in English. However when I added a request to answer in Korean to my Korean prompt it worked as well
Dana 5 Reputation points

2024-03-26T07:09:10.77+00:00

Please let me know if I should test something else as well!
navba-MSFT 27,540 Reputation points Microsoft Employee Moderator

2024-03-26T07:10:38.25+00:00

@Dana I have sent you a private message in this thread. Please reply once you get a chance.

Answer 1

Charlie Wei 3,335

Hello Dana,

You may refer to this Microsoft Learn document to adjust the "Strictness" setting. Based on your situation, it is advisable to start by setting it to the lowest level and then gradually increase it to a level you find acceptable.

Strictness determines the system's aggressiveness in filtering search documents based on their similarity scores. Setting strictness to 5 indicates that the system will aggressively filter out documents, applying a very high similarity threshold. Semantic search can be helpful in this scenario because the ranking models do a better job of inferring the intent of the query. Lower levels of strictness produce more verbose answers, but might also include information that isn't in your index. This is set to 3 by default.

Best regards,
Charlie

If you find my response helpful, please consider accepting this answer and voting 'yes' to support the community. Thank you!

Charlie Wei 3,335 Reputation points

2024-03-20T15:09:35.1533333+00:00

@Dana, just checking in to see if above information was helpful. Please let us know if you would like further assistance.
Dana 5 Reputation points

2024-03-21T00:47:39.0933333+00:00

I have tried adjusting the strictness, but that doesn't seem to have any effect, as the results stay the same, should I try changing some other parameters in addition as a combination with strictness?
Dirk Broenink 125 Reputation points

2024-05-03T10:25:17.2033333+00:00

I have a similar issue and also strictness does not seem to have any effect.

Answer 2

@Dana As discussed in the private chat, I have enabled a one-time, courtesy free support flag on your subscription ‘a1bXXX-XXX-XXX-XXXX-XXXXXcb84d’ for a quick and immediate assistance. Using this you can work with the Microsoft Support professional to activate your subscription. I have also emailed you the steps to create a support ticket.

Once you have a resolution for your issue, please post the fix in this thread, so that it benefits the community audience who encounter the same issue.

**

Please do not forget to "Accept the answer” and “up-vote” if this helps.

Share via

Need help optimizing GPT-3.5 Turbo Chatbot Responses for Multilingual Document Retrieval

2 answers

Your answer