@larissa kelmer Is your scenario similar to a call center conversation? There is an API called batch transcription which offers something similar but it is not available under the free tier of a speech resource. You would need to move to S0 tier and setup your audio recordings and call the API. There are some samples available to configure your speech resource and the storage locations which can simplify your setup to test the service and check if the required result is available.
If you plan to recognize your speakers in the conversation we would recommend registering the voice profile and use speaker recognition instead.