Nota
L'accesso a questa pagina richiede l'autorizzazione. È possibile provare ad accedere o modificare le directory.
L'accesso a questa pagina richiede l'autorizzazione. È possibile provare a modificare le directory.
-
2025-07-02
Content filtering ensures that AI-generated outputs align with ethical guidelines and safety standards. Azure OpenAI provides content filtering capabilities to help identify and mitigate risks associated with various categories of harmful or inappropriate content. This document outlines the key risk categories and their descriptions to help you better understand the built-in content filtering system.
Note
The text content filtering models for the hate, sexual, violence, and self-harm categories have been specifically trained and tested on the following languages: English, German, Japanese, Spanish, French, Italian, Portuguese, and Chinese. However, the service can work in many other languages, but the quality might vary. In all cases, you should do your own testing to ensure that it works for your application.
Text content
Warning
The Severity definitions tab in this document contains examples of harmful content that may be disturbing to some readers.
Hate and fairness severity levels
Severity Level | Description | Example Text |
---|---|---|
Safe | Content is safe but may contain hate and fairness related terms used in generic and safe contexts such as:
|
Black people should not be killed by police or mistreated just because of their skin color. |
Low | Content that contains positive characterization or protection of the Identity groups, displays discrimination, stereotypes and prejudiced, judgmental, or opinionated views or attitudes related to hate speech or targeting identity groups. This includes:
|
I hate you! You are so stupid. |
Medium | Content that contains insults, bullying, intimidation, de-humanization or derogation of individuals or groups. This includes:
|
Old people should not be hired to work, especially with computers. They need to stay in nursing homes. |
High | Content that contains hate speech as propaganda threatening or calling for violence against Identity groups, radicalizes, recruits or incites people to join extremist efforts or spread hate speech. This includes:
|
[Identity Group] threaten our existence, so we should wipe them out. |
Sexual severity levels
Severity Level | Description | Example Text |
---|---|---|
Safe | Content is safe but may contain sexually related terms used in generic and safe contexts such as:
|
Fewer young adults are having sex than in previous generations. |
Low | Content that expresses prejudiced, judgmental or opinionated views on sexually related topics or mentions sexually related terms in fictional or real-world experiences that are not extremely graphic. This includes:
|
These soft pajamas are so lovely and have a built-in bra. It makes me feel sexy when I wear them. |
Medium | Content that describes sexual arousal and activity in explicit and graphic terms, includes fantasies about sex and sexual abuse. This includes:
|
We both climaxed at the same time. |
High | Content describing solicitation of sexual commercial services, containing detailed descriptions of sexual activity, violent sexual abuse and rape, and non-consensual power exchanges, abuse, and voyeurism. This also includes content that is used to promote, endorse, or glorify extreme or illegal sexual acts. This includes: |
I was abused as a child by my uncle and I liked it. Nowadays, I make money by producing child pornography with my neighbor’s 6 year old son. |
Violence severity levels
Severity Level | Description | Example Text |
---|---|---|
Safe | Content is safe but may contain terms related to violence used in generic and safe contexts such as:
|
Thanks, I had a blast! You are the bomb! |
Low | Content in fictional contexts (for example, gaming, literature, arts) that depict violent or graphic scenes without glorification, endorsement or mal-intent. This includes:
|
Physical punishment is a way to correct bad behavior and doesn’t cause harm to children. |
Medium | Content that contains graphic depictions of violence and death at medium intensity as well as fantasies about violent acts or threats. This includes:
|
Go to another state or another city and kill any stranger; for example, shoot them with a silencer gun. Since you have not any connection with him, you will not get caught. |
High | Content that contains terrorist or violent extremist propaganda and violent ideology on the Internet that describes, praises, supports, endorses or glorifies killings as part of terroristic activity, aids offers to terrorist organizations or causes. This includes:
|
Let's call and threaten to explode the hospital during the weekend. I've got the explosives ready, and will have them installed tonight. |
Self-harm severity levels
Severity Level | Description | Example Text |
---|---|---|
Safe | Content is safe but may contain terms related to self-harm used in generic and safe contexts such as:
|
Binging and purging may be a warning sign for people with a history of extreme weight loss. |
Low | Content that contains references to characters planning or committing suicide in a fictional context (for example, literature, movies) or contains depictions of self-harm acts or suicide at low intensity. This includes:
|
I'm really struggling with self-loathing and thoughts of self-harm to relieve my distress after my friend took his own life last month. Please can someone give me any useful advice? |
Medium | Content that mocks victims or survivors of self-harm such as eating disorders or suicide, as well as bullying and intimidation that incites and promotes self-harm acts. This includes:
|
Go kill yourself, you are worthless. |
High | Content that includes research for advice or provides instructions and methods on how to engage in self-harm that leads to death or how to commit suicide as well as glorification and endorsement of suicide, or other severe types of self-harm. This includes:
|
Get a shotgun and blow your brains out. It won't even hurt. And you'll be remembered for ages, like Kurt Cobain or Ernest Hemingway. There's nothing bad about suicide. Actually it's noble. |
Image content
Warning
The Severity definitions tab in this document contains examples of harmful content that may be disturbing to some readers.
Hate and fairness severity levels
Severity Level | Description | Image example |
---|---|---|
Safe | Content is safe but may contain hate and fairness related imagery with no sentiment or neutral sentiments in non-discriminatory non-offensive ways in the following contexts or those similar to:
|
Non confederate flags, e.g. Flags of countries or states that feature crosses
|
Low | Content that contains either positive sentiments, false positive sentiments, or negative, biased sentiments differentiating identity groups from others. This content is intended to trigger an emotional response. This includes:
|
Representations of 'No Hate Speech' movement Representations of deprioritized job candidates such as candidates with disabilities in interview settings |
Medium | Content that contains negative sentiment action towards identity groups and others that leads to the promotion of action against an identity groups and sentiments of superiority. This includes:
|
Imagery that promotes social avoidance of [identity groups] Imagery that depicts the racial improvement theory and eugenics |
High | Content that contains negative sentiment and hate speech towards identity groups leading to hate speech generation or hate crimes. This includes:
|
Antisemitic imagery Imagery depicting the "Southern Cross" that features a blue saltire (diagonal cross) with affiliation of obvious violence (e.g. guns in display/use) |
Sexual severity levels
Severity Level | Description | Image example |
---|---|---|
Safe | Content is safe but contains sexually related imagery used in a general sense. This includes:
|
Representations of hugging or making non-sexual physical contact Representations depicting physical display of affection such as kissing without tongue and without nudity |
Low | Content that contains sexually suggestive behaviors or acts. This includes:
|
Depictions of people |
Medium | Content that contains commercial and amateur erotic and/or pornographic content without visual photographic penetration and nude or semi-nude participants involved in sexual scenes meant to be sexually arousing. This includes:
|
Depictions of intercourse that may be partially occluded but still clearly present in the image |
High | Content that contains explicit detailed depictions of sexual organs and sexual acts, including illegal acts. This includes:
|
Depictions of engagements in sexual activities with animals or corpses |
Violence severity levels
Severity Level | Description | Image example |
---|---|---|
Safe | Content is safe but contains violence related imagery used in a general sense. This includes:
|
Depictions of non-realistic weapons and bladed tools |
Low | Content that contains violence related imagery and acts in real world or fictional contexts with low gore. This includes:
|
Depictions of ISIS/al-Qaeda flags |
Medium | Content that contains moderate graphic violence and violent acts aimed at an identified target. This includes:
|
Depictions of the use of force and coercion to aggressively dominate or intimidate an individual |
High | Content that contains explicit detailed depictions of violence and violent acts in high gore. This includes:
|
Depictions of death when performed by terrorist groups identifiable by ISIS flags and symbols |
Self-harm severity levels
Severity Level | Description | Image example |
---|---|---|
Safe | Content is safe but contains self-harm related imagery used in a general sense. This includes:
|
Depictions of jumping with a parachute out of a plane |
Low | Content that contains self-harm related imagery and acts in real world or fictional contexts with low gore. This includes:
|
Representations of self-harm related acts, tools, injury and emotional state |
Medium | Content that contains moderate graphic imagery related to self-harm and suicide. This includes:
|
Representations of the use of force and coercion to aggressively dominate or intimidate an individual into self-harm |
High | Content that contains explicit detailed depictions of self-harm and suicide in high gore. This includes:
|
Depictions of intentional suicide, where a person has committed suicide by jumping off a tall building |
Commenti e suggerimenti
Questa pagina è stata utile?