Quite disappointed on text filtering capabilities of 'Content Moderator'

Question

Hi,

(Sorry for my English, I am French and not very fluent in English).

I am currently testing Content Moderator via the API for SMS texts. Quite simple to implement and test.
I am using it in the "West Europe" area. And in English and in French.

I tested rough and sexual expressions in English and French. Here are my results :

in English : these expressions/words are partially detected and isolated ("Terms"=...), but are well categorized,
in French : NO expression detected/isolated, no category (OK, it is said in the documentation, but without any "Terms" found, it is quite not usefull).

Does anybody have any idea to improve the capability to filter such texts ?

BR

Sylvain

Answer

@Sylvain Donnet Thanks for the question. Can you please add more details about the sample input text and that you want to detect to check further on this.
Please follow the Content Moderator Review tool: https://contentmoderator.cognitive.microsoft.com/

CM with their custom developed APIs for custom detection: you should be able combine since CM workflow provides APIs.
Content Moderator Review tool: https://contentmoderator.cognitive.microsoft.com/

There are only limited connectors available so would suggest programmatically accessing other services via SDK/API would be ideal. This way you’ve access to all cognitive services and extend their functionalities.

Answer

Hi,

Thanks for your reply.

I perform several CURL on my endpoint :

in English :

curl -X POST "https://XXXXXXXXXXX.cognitiveservices.azure.com//contentmoderator/moderate/v1.0/ProcessText/Screen?autocorrect=true&PII=True&classify=true&language=eng

with --data-ascii "I sc**** you and I f**** you. You are an idiot and an ass*****."

F word, and ass word have been detected, and Category3 is about 98%

So > OK

in French,
curl -X POST "https://XXXXXXXXXXX.cognitiveservices.azure.com//contentmoderator/moderate/v1.0/ProcessText/Screen?autocorrect=true&PII=True&classify=true&language=fra

same data, with french translations, for F word, ass* word, and so on :
On 4 rough words, none has been detected (in "Terms" results), and, as language=fra (French), I cannot have the categorization.

Quite disappointed on text filtering capabilities of 'Content Moderator'

2 answers