An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
Language Detection Issue
The Azure Cognitive Services Language Detection may misclassify words that exist in multiple languages, such as "Toppen" being detected as Danish instead of Swedish. This is a known limitation where the service tends to return the language with the largest representation in the input text, which can lead to incorrect language detection, especially for short samples or ambiguous words. To improve accuracy, consider providing longer inputs or using the countryHint parameter if you have a specific region in mind.
PII Entity Detection Issue
For PII detection, misclassification can occur if the model does not recognize the context or if the input text does not contain recognizable patterns. For example, a Social Security Number being detected as a phone number indicates a potential issue with the model's ability to accurately categorize entities. To mitigate this, ensure that the input text is clear and contains recognizable PII patterns. Additionally, specifying the piiCategories parameter can help in accurately identifying the types of PII you expect to be detected.
Both issues can significantly impact downstream processes such as translation and compliance, so it is crucial to test and validate the outputs from the API thoroughly.
References: