Confused as to what counts as 1 transaction for computer vision API

Rishi Khemka 60 Reputation points
2024-04-16T09:08:35.3933333+00:00

Hi, I've recently started using the Computer Vision API via the Python SDK. I'm processing images and reading the text out of them for analytical purposes. Can you elaborate on what counts as one transaction here? Is it one transaction for processing one whole image, or more than one for a single image?

Thank you

Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.

Accepted answer
  1. navba-MSFT 17,280 Reputation points Microsoft Employee
    2024-04-16T09:21:42.3833333+00:00

    @Rishi Khemka Welcome to the Microsoft Q&A Forum. Thank you for posting your query here!

    What constitutes a transaction for the Computer Vision API?

    Each feature you select is counted as a transaction. There are a few special cases to note:

    1. Analyze allows you to select multiple features at once. For instance, an Analyze call specifying the Tag, Face and Adult features would count as three transactions.
    2. Read allows you to upload multipage PDF documents. Each page is counted as a feature, and therefore as a transaction. For instance, a 200-page document would count as 200 transactions.
    3. All GET calls to see the results of the async Read and Recognize Text features are counted as transactions but are free of charge.

    In other words, a transaction is one feature applied to one image (or to one page of a multipage document). So, if you’re processing a single image and only reading the text out of it with Read, that counts as one transaction.
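The counting rules above can be sketched as a small Python helper. This is only an illustration of the arithmetic, not part of the Azure SDK: the function names `analyze_transactions` and `read_transactions` are hypothetical, nothing here calls the service, and the pricing page remains the authority on what is actually billed.

```python
# Hypothetical helpers that mirror the counting rules described in the answer.
# They do not call Azure; they only estimate how many transactions a call uses.

def analyze_transactions(features):
    """One transaction per distinct feature requested in a single Analyze call."""
    return len(set(features))


def read_transactions(page_count=1):
    """One transaction per page; a single image is treated as one page."""
    if page_count < 1:
        raise ValueError("page_count must be at least 1")
    return page_count


if __name__ == "__main__":
    # An Analyze call with three features counts as three transactions.
    print(analyze_transactions(["Tag", "Face", "Adult"]))
    # Reading text from one image counts as one transaction.
    print(read_transactions())
    # Reading a 200-page PDF counts as 200 transactions.
    print(read_transactions(200))
```

For the scenario in the question, reading text from N single images with Read would come to roughly N transactions under these rules, since the GET calls that fetch the results are counted but not charged.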

    Here is the pricing page for computer vision:

    https://azure.microsoft.com/en-in/pricing/details/cognitive-services/computer-vision/

    Hope this helps.

    Please do not forget to "Accept the answer" and "up-vote" wherever the information provided helps you; this can be beneficial to other community members.

