Is it possible to include Reinforce Learning with Human Feedback into Azure prompt flow?

Kecheng 10 Reputation points
2024-07-23T08:14:03.8666667+00:00

As the title suggests, can RLHF be included in the prompt flow to improve the quality of outputs?

If not, what other ways can I improve on the quality of outputs? (other than modifying the prompts)

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,022 questions
Azure App Service
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
7,742 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,829 questions
{count} votes

1 answer

Sort by: Most helpful
  1. AshokPeddakotla-MSFT 33,746 Reputation points
    2024-07-23T12:07:24.02+00:00

    Kecheng Greetings!

    As the title suggests, can RLHF be included in the prompt flow to improve the quality of outputs? If not, what other ways can I improve on the quality of outputs? (other than modifying the prompts)

    I haven't found any information that specifically addresses your question about including Reinforcement Learning with Human Feedback (RLHF) into Azure prompt flow. However, please check below information and let me know if that helps.

    One way to improve the quality of outputs is to use Few-shot, one-shot and zero-shot approaches for in-context learning. These approaches vary based on the amount of task-specific data that is given to the model. Few-shot approach is where a user includes several examples in the call prompt that demonstrate the expected answer format and content. One-shot approach is where a user includes a single example in the call prompt that demonstrates the expected answer format and content. Zero-shot approach is where a user provides no examples in the call prompt, and the model generates a response based on its training data.

    Please see Prompt engineering techniques, Get started with prompt flow and Tune prompts using variants for more details.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.