Choose an Azure Cognitive Services technology

Azure Cognitive Services is a set of cloud-based APIs that you can use in AI applications and data flows. It provides pretrained models that are ready to use in your applications, requiring no data and no model training on your part. The services are developed by the Microsoft AI and Research team and expose the latest deep learning algorithms. They're consumed over HTTP REST interfaces. In addition, SDKs are available for many common application development frameworks.

Key benefits:

  • Minimal development effort for state-of-the-art AI services. Use predefined algorithms or create custom algorithms on top of pre-built libraries.
  • Easy integration into apps via HTTP REST interfaces.
  • Developers and data scientists of all skill levels can easily add AI capabilities to apps.

Considerations:

  • These services are only available over the web. Internet connectivity is generally required. An exception is the Custom Vision service, whose trained model you can export for prediction on devices and at the IoT edge.
  • Although considerable customization is supported, the available services might not suit all predictive analytics requirements.

Categories of Azure cognitive services

Dozens of cognitive services are available in Azure. Here's a list, categorized by the functional area they support:

Service Link to decision guide Description
Language Choose a language service Language cognitive services are services that provide Natural Language Processing (NLP) features for understanding and analyzing text.
Speech Choose a speech service Speech cognitive services are services that provide speech capabilities like speech-to-text, text-to-speech, speech translation, and speaker recognition.
Vision Choose a vision service Vision cognitive services are services that provide image and video recognition capabilities.
Decision services

Applied AI Services
Choose a decision API or applied AI service Decision cognitive services are services that provide NLP features to produce recommendations for informed and efficient decision-making.
Azure OpenAI Service N/A Azure OpenAI Service provides REST API access to powerful OpenAI language models.

Common use cases

The following are some common use cases for Azure Cognitive Services.

Use case Category
Transcribe audible speech into readable, searchable text. Speech
Convert text to lifelike speech for more natural interfaces. Speech
Integrate real-time speech translation into your apps. Speech
Identify and verify the person speaking by using voice characteristics. Speech
Identify commonly used and domain-specific terms. Language
Automatically detect sentiments and opinions in text. Language
Distill information into easy-to-navigate questions and answers. Language
Enable your apps to interact with users through natural language. Language
Translate more than 100 languages and dialects. Language
Identify and analyze content in images and video. Vision
Customize image recognition to fit your business needs. Vision
Identify potential problems early. Decision services / Anomaly Detector
Detect potentially offensive or unwanted content. Decision services / Content Moderator
Create rich, personalized experiences for every user. Decision services / Personalizer
Apply advanced coding and language models to various use cases. Azure OpenAI

Key selection criteria

To narrow down the choices, start by answering these questions:

  • Are you processing something related to spoken language, or are you processing text, images, or documents?

  • Do you have the data to train a model? If yes, consider using the custom services that enable you to train their underlying models with data that you provide. Doing so can improve accuracy and performance.

This flow chart can help you choose the best API service for your use case.

Diagram that shows how to select a Cognitive Services API.

  • If your use case requires speech-to-text, text-to-speech, or speech-to-speech, use a speech API.
  • If your use case requires language analysis, text assessment, or text-to-text, use a language API.
  • If you need to analyze images, video, or text, use a vision API.
  • If you need to make a decision, use a decision API or Applied AI Services.

Deploying services

When you deploy Cognitive Services, you can either deploy services independently or use the Cognitive Services multi-service resource. The multi-service resource deploys decision, language, speech, vision, and applied AI services.

  • Deploy an individual service if you don't need other services or if you want to manage access and billing on a per-service basis.
  • Deploy the multi-service resource if you're using multiple services and want to manage access and billing for all services together.

Note

The resource categories in these API services change frequently. Be sure to check the latest documentation for new categories.

Contributors

This article is maintained by Microsoft. It was originally written by the following contributors.

Principal authors:

Other contributors:

To see non-public LinkedIn profiles, sign in to LinkedIn.

Next steps