Migrate from Media Indexer and Media Indexer 2 to Video Analyzer for Media
Update your Azure Media Services REST API and SDKs to v3 by 29 February 2024. Version 3 of Azure Media Services REST API and client SDKs for .NET and Java offers more capabilities than version 2. We’re retiring version 2 of the Azure Media Services REST API and client SDKs for .NET and Java.
Action Required: To minimize disruption to your workloads, review the migration guide to transition your code from the version 2 API and SDKs to the version 3 API and SDKs before February 29th, 2024. After February 29th, 2024, Azure Media Services will no longer accept traffic on the version 2 REST API, the ARM account management API version 2015-10-01, or from the version 2 .NET client SDKs. This includes any third-party open-source client SDKs that may call the version 2 API. Learn about the latest version, starting with the Media Services v3 Overview.
You should migrate from Indexer v1 and Indexer v2 to using the Media Services v3 AudioAnalyzerPreset Basic mode. The Azure Media Indexer media processor and Azure Media Indexer 2 Preview media processors are being retired. For the retirement dates, see this legacy components topic.
Azure Video Analyzer for Media is built on Azure Media Analytics, Azure Cognitive Search, and Cognitive Services (such as the Face API, Microsoft Translator, the Computer Vision API, and Custom Speech Service). It enables you to extract insights from your videos using Video Analyzer for Media video and audio models. To see which scenarios Video Analyzer for Media can be used in, what features it offers, and how to get started, see Video Analyzer for Media video and audio models.
You can extract insights from your video and audio files by using the Azure Media Services v3 analyzer presets or directly by using the Video Analyzer for Media APIs. Currently, there is an overlap between features offered by the Video Analyzer for Media APIs and the Media Services v3 APIs.
To understand the differences between Video Analyzer for Media and the Media Services analyzer presets, check out the comparison document.
This article discusses the steps for migrating from the Azure Media Indexer and Azure Media Indexer 2 to Video Analyzer for Media.
|If you require|then|
|a solution that provides speech-to-text transcription for any media file format in a closed-caption file format (VTT, SRT, or TTML), as well as additional audio insights such as keywords, topic inferencing, acoustic events, speaker diarization, entity extraction, and translation|update your applications to use the Video Analyzer for Media capabilities through the Video Analyzer for Media v2 REST API or the Azure Media Services v3 Audio Analyzer preset.|
|speech-to-text capabilities|use the Cognitive Services Speech API directly.|
Getting started with Video Analyzer for Media
For relevant links, see How can I get started with Video Analyzer for Media?
Getting started with Media Services v3 APIs
Azure Media Services v3 API enables you to extract insights from your video and audio files through the Azure Media Services v3 analyzer presets.
AudioAnalyzerPreset enables you to extract multiple audio insights from an audio or video file. The output includes a VTT or TTML file for the audio transcript and a JSON file (with all the additional audio insights). The audio insights include keywords, speaker indexing, and speech sentiment analysis. AudioAnalyzerPreset also supports language detection for specific languages. For detailed information, see Transforms.
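As a rough illustration of what a Transform that uses AudioAnalyzerPreset in Basic mode might look like, the following Python sketch builds the URL and JSON body for a Transforms create-or-update request against the Azure Resource Manager endpoint. It only constructs the request and does not send it. The API version string, the helper name `build_audio_analyzer_transform_request`, and all resource names are assumptions made for this example; check the Transforms REST reference for the values that apply to your account.

```python
import json
from urllib.parse import quote

# Assumption: a published Microsoft.Media API version; verify before use.
API_VERSION = "2021-11-01"


def build_audio_analyzer_transform_request(
    subscription_id,
    resource_group,
    account_name,
    transform_name,
    audio_language="en-US",
    mode="Basic",
):
    """Return (url, body) for a PUT that creates an audio-analysis Transform.

    Hypothetical helper for illustration; it performs no network calls.
    """
    if mode not in ("Basic", "Standard"):
        raise ValueError("mode must be 'Basic' or 'Standard'")

    url = (
        "https://management.azure.com"
        f"/subscriptions/{quote(subscription_id, safe='')}"
        f"/resourceGroups/{quote(resource_group, safe='')}"
        "/providers/Microsoft.Media"
        f"/mediaServices/{quote(account_name, safe='')}"
        f"/transforms/{quote(transform_name, safe='')}"
        f"?api-version={API_VERSION}"
    )
    body = {
        "properties": {
            "description": "Speech-to-text transcription (Basic audio analysis)",
            "outputs": [
                {
                    "preset": {
                        "@odata.type": "#Microsoft.Media.AudioAnalyzerPreset",
                        "audioLanguage": audio_language,
                        "mode": mode,
                    }
                }
            ],
        }
    }
    return url, body


if __name__ == "__main__":
    # Placeholder identifiers; replace with your own values.
    url, body = build_audio_analyzer_transform_request(
        "00000000-0000-0000-0000-000000000000",
        "my-resource-group",
        "mymediaaccount",
        "AudioAnalysisBasic",
    )
    print("PUT", url)
    print(json.dumps(body, indent=2))
```

Sending this request would additionally require an Azure AD bearer token in the `Authorization` header. Once the Transform exists, each Job submitted against it applies the preset to its input asset.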
To get started, see:
Getting started with Cognitive Services Speech Services
Azure Cognitive Services provides a speech-to-text service that transcribes audio streams to text in real time that your applications, tools, or devices can consume or display. You can use speech-to-text to customize your own acoustic model, language model, or pronunciation model. For more information, see Cognitive Services speech-to-text.
The speech-to-text service does not accept video file formats and accepts only certain audio formats.
For more information about the speech-to-text service and how to get started, see What is speech-to-text?
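Because the service accepts audio rather than video, the audio track typically has to be extracted (for example, to PCM WAV) before it is submitted. The sketch below shows how a request to the speech-to-text REST API for short audio could be assembled using only the Python standard library. It builds the request object without sending it. The helper name `build_short_audio_request`, the region, and the key are placeholders for this example, and the endpoint and header values should be confirmed against the current Speech service REST documentation.

```python
import urllib.request
from urllib.parse import urlencode


def build_short_audio_request(region, subscription_key, wav_bytes, language="en-US"):
    """Build (but do not send) a speech-to-text request for short WAV audio.

    Hypothetical helper for illustration. `wav_bytes` is assumed to hold
    16 kHz, 16-bit, mono PCM WAV data already extracted from the source media.
    """
    query = urlencode({"language": language, "format": "simple"})
    url = (
        f"https://{region}.stt.speech.microsoft.com"
        f"/speech/recognition/conversation/cognitiveservices/v1?{query}"
    )
    headers = {
        "Ocp-Apim-Subscription-Key": subscription_key,
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
    }
    return urllib.request.Request(url, data=wav_bytes, headers=headers, method="POST")


if __name__ == "__main__":
    # Placeholder values; no network call is made here.
    request = build_short_audio_request("westus", "<your-speech-key>", b"RIFF")
    print(request.get_method(), request.full_url)
```

To actually submit the audio, pass the request to `urllib.request.urlopen` and parse the JSON response. For long recordings or real-time streams, the Speech SDK is likely a better fit than this REST call.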
Known differences from deprecated services
You will find that Video Analyzer for Media, the Azure Media Services v3 AudioAnalyzerPreset, and Cognitive Services Speech Services are more reliable and produce better-quality output than the retired Azure Media Indexer 1 and Azure Media Indexer 2 processors.
Some known differences include:
- Cognitive Services Speech Services does not support keyword extraction. However, Video Analyzer for Media and Media Services v3 AudioAnalyzerPreset both offer a more robust set of keywords in JSON file format.
You can open a support ticket by navigating to New support request.