Share via


Allow image input from users, and image analysis (preview)

[This article is prerelease documentation and is subject to change.]

You can allow users of your agent to upload images, which your agent can then analyze and use to provide responses.

Important

This article contains Microsoft Copilot Studio preview documentation and is subject to change.

Preview features aren't meant for production use and may have restricted functionality. These features are available before an official release so that you can get early access and provide feedback.

If you're building a production-ready agent, see Microsoft Copilot Studio Overview.

When to allow image input

The following are the two primary scenarios for allowing users to upload images to elicit agent responses:

  • Questions that require information from your configured knowledge sources
  • Questions that can be answered from an image

When you allow users to upload images and your agent to generate responses based on those images, it's important to follow best practices for prompt engineering.

To get the most out of this feature, review the following tips:

  • Upload images that are clear and high-quality. Agents might struggle to interpret grainy, poorly lit, or pixelated images.

  • Only upload supported image file types, including JPG, PNG, WebP, or non-animated GIF files.

  • If an image triggers content moderation filtering, restart the conversation. The agent uses the current conversation history to generate answers and continues to return content moderation filtering errors if objectionable content is part of the conversation history.

  • Don't exceed a total of 15 MB for all uploaded images in a single conversation. Any images uploaded after the 15 MB limit can't be used to generate answers.

If you're having issues generating high-quality responses from images, turn on Allow the AI to use its own general knowledge on the Knowledge page. This setting is required if there are no relevant knowledge sources configured. For more information, see Allow the agent to use general knowledge.

Turn on image input

Makers can allow users of their agent to upload images during chat interactions with their agent.

  1. Go to the Settings page for your agent, and select Generative AI.

  2. Select Generative.

  3. (Optional) Select your preferred level of strictness for content moderation.

  4. Turn on Image input.

    Screenshot of the Generative AI settings highlighting the Image input setting being turned on.

  5. Select Save.

Test your agent's image analysis

After you turn on Image input, you can test your agent's capability by uploading images and verifying the agent's responses. To upload an image, select the paper clip icon in the Test your agent chat box. Thoroughly test your agent's accuracy by using various images and queries to validate the agent's responses before you publish.

Screenshot of the Test your agent pane, highlighting the paper clip icon used to upload images.

Responsible AI

Microsoft is committed to strong responsible AI practices. Please review the full responsible AI guidelines for Copilot Studio before using any AI features. Like all of generative answers, content is checked twice: first during user input and again before the agent generates an output response. If the system finds harmful, offensive, or malicious content at either the input or output stage, the agent won't respond.