Understanding Internal ChatGPT Deployment

Jai-6363 245 Reputation points
2023-10-17T01:26:34.8066667+00:00

Does the ChatGPT web application behind the AI assistant have no view of the model it is running on? The completion from the chatbot is "As an AI assistant, I'm based on OpenAI GPT-3." The team deployed an internal ChatGPT for employees and used the GPT-4 model after deploying the web application from the wizard in Azure OpenAI Studio.

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

Accepted answer
  1. Ramr-msft 17,826 Reputation points
    2023-10-25T04:56:13.6333333+00:00

    Thanks. Ultimately, the model is performing next-token prediction in response to your question. It has no native ability to query which model version is currently being run to answer you. To answer this question yourself, go to Azure OpenAI Studio > Management > Deployments and consult the model name column to confirm which model is currently associated with a given deployment name.
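    As a rough sketch of what that lookup gives you, here is how an app could map deployment names to their underlying models. The JSON below is an illustrative example of the kind of data the Deployments blade (or `az cognitiveservices account deployment list`) returns; the deployment and model names are assumptions, not real resources.

    ```python
    import json

    # Hypothetical deployment-list output for an Azure OpenAI resource.
    # The deployment name ("chat") is what the app targets; the model
    # ("gpt-4") is what actually serves the requests.
    sample = json.loads("""
    [
      {"name": "chat", "properties": {"model": {"name": "gpt-4", "version": "0613"}}},
      {"name": "embeddings", "properties": {"model": {"name": "text-embedding-ada-002", "version": "2"}}}
    ]
    """)

    def deployment_models(deployments):
        """Map each deployment name to the underlying model it serves."""
        return {d["name"]: d["properties"]["model"]["name"] for d in deployments}

    print(deployment_models(sample))
    # → {'chat': 'gpt-4', 'embeddings': 'text-embedding-ada-002'}
    ```

    The point is that the authoritative model name lives in the deployment metadata, not in anything the model itself can report.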

    The questions, "What model are you running?" or "What is the latest model from OpenAI?" produce similar quality results to asking the model what the weather will be today. It might return the correct result, but purely by chance. On its own, the model has no real-world information other than what was part of its training/training data. In the case of GPT-4, as of August 2023 the underlying training data goes only up to September 2021. GPT-4 was not released until March 2023, so barring OpenAI releasing a new version with updated training data, or a new version that is fine-tuned to answer those specific questions, it's expected behavior for GPT-4 to respond that GPT-3 is the latest model release from OpenAI.

    If you wanted to help a GPT-based model accurately respond to the question "What model are you running?", you would need to provide that information to the model yourself. Techniques include prompt engineering of the model's system message; Retrieval Augmented Generation (RAG), the technique used by Azure OpenAI On Your Data, where up-to-date information is injected into the system message at query time; or fine-tuning, where you could fine-tune specific versions of the model to answer that question in a certain way based on model version.


1 additional answer

  1. Ramr-msft 17,826 Reputation points
    2023-10-18T03:26:37.7466667+00:00

    @Jai-6363 Thanks for the question. You can ground your models with vector databases, web APIs, and function calling, as well as establish metaprompts (system messages) that mitigate this.

    Here is a blog post on the same topic.
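    As an illustration of the function-calling option mentioned above, the app can expose a function the model may call to fetch real deployment information instead of guessing. The function name, tool schema, and hard-coded values below are illustrative assumptions, not part of any official API.

    ```python
    # Tool definition advertised to the model; the name and schema are hypothetical.
    tools = [
        {
            "type": "function",
            "function": {
                "name": "get_deployment_info",
                "description": "Return the Azure OpenAI model behind this deployment.",
                "parameters": {"type": "object", "properties": {}},
            },
        }
    ]

    def get_deployment_info() -> dict:
        """Return the real deployment details.

        In production this would query the management API or app config;
        the values are hard-coded here for the sketch.
        """
        return {"deployment": "chat", "model": "gpt-4", "version": "0613"}

    # When the model emits a tool call for "get_deployment_info", the app runs
    # the function and feeds the result back to the model as a "tool" message.
    tool_result = get_deployment_info()
    ```

    This way the answer to "What model are you running?" comes from live application data rather than the model's training set.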

