Use Content Safety in Azure AI Foundry portal

Article
29/01/2025

Azure AI Foundry includes a Content Safety try it out page that lets you use the core detection models and other content safety features.

Prerequisites

An Azure account. If you don't have one, you can create one for free.
An Azure AI resource.

Setup

Follow these steps to use the Content Safety try it out page:

Go to Azure AI Foundry and navigate to your project/hub. Then select the Safety+ Security tab on the left nav and select the Try it out tab.
On the Try it out page, you can experiment with various content safety features such as text and image content, using adjustable thresholds to filter for inappropriate or harmful content.

Screenshot of the try it out page for content safety.

Analyze text

Select the Moderate text content panel.
Add text to the input field, or select sample text from the panels on the page.
Select Run test. The service returns all the categories that were detected, with the severity level for each: 0-Safe, 2-Low, 4-Medium, 6-High. It also returns a binary Accepted/Rejected result, based on the filters you configure. Use the matrix in the Configure filters tab to set your allowed/prohibited severity levels for each category. Then you can run the text again to see how the filter works.

Use a blocklist

The Use blocklist tab lets you create, edit, and add a blocklist to the moderation workflow. If you have a blocklist enabled when you run the test, you get a Blocklist detection panel under Results. It reports any matches with the blocklist.

Screenshot of the Use blocklist panel.

Analyze images

The Moderate image page provides capability for you to quickly try out image moderation.

Select the Moderate image content panel.
Select a sample image from the panels on the page, or upload your own image.
Select Run test. The service returns all the categories that were detected, with the severity level for each: 0-Safe, 2-Low, 4-Medium, 6-High. It also returns a binary Accepted/Rejected result, based on the filters you configure. Use the matrix in the Configure filters tab on the right to set your allowed/prohibited severity levels for each category. Then you can run the text again to see how the filter works.

View and export code

You can use the View Code feature in either the Analyze text content or Analyze image content pages to view and copy the sample code, which includes configuration for severity filtering, blocklists, and moderation functions. You can then deploy the code on your end.

Screenshot of the View code button.

Use Prompt Shields

The Prompt Shields panel lets you try out user input risk detection. Detect User Prompts designed to provoke the Generative AI model into exhibiting behaviors it was trained to avoid or break the rules set in the System Message. These attacks can vary from intricate role-play to subtle subversion of the safety objective.

Select the Prompt Shields panel.
Select a sample text on the page, or input your own content for testing.
Select Run test. The service returns the risk flag and type for each sample.

For more information, see the Prompt Shields conceptual guide.

Use Groundedness detection

The Groundedness detection panel lets you detect whether the text responses of large language models (LLMs) are grounded in the source materials provided by the users.

Select the Groundedness detection panel.
Select a sample content set on the page, or input your own for testing.
Optionally, enable the reasoning feature and select your Azure OpenAI resource from the dropdown.
Select Run test. The service returns the groundedness detection result.

For more information, see the Groundedness detection conceptual guide.

Use Protected material detection

This feature scans AI-generated text for known text content (for example, song lyrics, articles, recipes, selected web content).

Select the Protected material detection for text or Protected material detection for code panel.
Select a sample text on the page, or input your own for testing.
Select Run test. The service returns the protected content result.

For more information, see the Protected material conceptual guide.

Use custom categories

This feature lets you create and train your own custom content categories and scan text for matches.

Select the Custom categories panel.
Select Add a new category to open a dialog box. Enter your category name and a text description, and connect a blob storage container with text training data. Select Create and train.
Select a category and enter your sample input text, and select Run test. The service returns the custom category result.

For more information, see the Custom categories conceptual guide.

Next step

To use Azure AI Content Safety features with your Generative AI models, see the Content filtering guide.

Additional resources

Documentation

Use blocklists in Azure AI Foundry portal - Azure AI Foundry

Learn how to create custom blocklists in Azure AI Foundry portal as part of your content filtering configurations.
Content Safety in Azure AI Foundry portal overview - Azure AI Foundry

Learn how to use Azure AI Content Safety in Azure AI Foundry portal to detect harmful user-generated and AI-generated content in applications and services.
Migrate from Azure AI Content Safety public preview to GA - Azure AI services

Learn how to upgrade your app from the public preview version of Azure AI Content Safety to the GA version.
What is Azure AI Content Safety? - Azure AI services

Learn how to use Content Safety to track, flag, assess, and filter inappropriate material in user-generated content.
Embedded content safety - Azure AI Content Safety - Azure AI services

Embedded content safety is designed for on-device scenarios where cloud connectivity is intermittent or unavailable.
Use blocklists for text moderation - Azure AI services

Learn how to customize text moderation in Azure AI Content Safety by using your own list of blocklistItems.
Content Safety error codes - Azure AI services

See the possible error codes and their corresponding suggestions for the Azure AI Content Safety APIs.
Azure AI Content Safety documentation - Quickstarts, Tutorials, API Reference - Azure AI services

The cloud-based Azure AI Content Safety API provides developers with access to advanced algorithms for processing images and text and flagging content that is potentially offensive, risky, or otherwise undesirable.

Training

Module

Use AI responsibly with Azure AI Content Safety - Training

As the amount of user-generated online content increases, so does the need to ensure harmful material is moderated effectively. Azure AI Content Safety resource includes features to help organizations moderate and manage both user-generated and AI-generated content.

Certification

Microsoft Certified: Azure AI Fundamentals - Certifications

Demonstrate fundamental AI concepts related to the development of software and services of Microsoft Azure to create AI solutions.

Events

Build Intelligent Apps

17 Mar, 21 - 21 Mar, 10

Join the meetup series to build scalable AI solutions based on real-world use cases with fellow developers and experts.

Share via