OCR Demo on Vision Studio performs better than API call

Question

Danny Zhang 20 Reputation points

I've been testing out Optical Character Recognition using Azure. However, I've noticed that the online demo version performs far better than the API version. In addition, the API model version is stuck on 2023-10-01 instead of the 2024 version shown in the documentation. Can someone explain the discrepancy?

navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-09-02T05:48:38.87+00:00

@Danny Zhang Welcome to the Microsoft Q&A forum, and thank you for posting your query here!

The new Azure AI Vision Image Analysis 4.0 REST API can extract printed or handwritten text from images through a unified, performance-enhanced synchronous API, so you can get all image insights, including OCR results, in a single API operation.

Vision Studio and the 4.0 analysis API both use api-version 2023-10-01.

I also tested with the sample image below, and the response was fast and efficient.

https://learn.microsoft.com/azure/ai-services/computer-vision/media/quickstarts/presentation.png

I used the Image Analysis 4.0 Python SDK sample code given here:

https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/quickstarts-sdk/image-analysis-client-library-40?tabs=visual-studio%2Cwindows&pivots=programming-language-python

This Python SDK sample code also uses api-version 2023-10-01, and it gave good performance in my test.

Please run the same test and let me know whether you still see the issue. Awaiting your reply.
navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-09-04T03:49:00.79+00:00

@Danny Zhang Just following up to check if my suggestion helped. Please let me know if you have any further queries. I would be happy to help.
Danny Zhang 20 Reputation points

2024-09-04T12:28:23.5466667+00:00

Thank you for your response,

I was looking at this page:

https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/quickstarts-sdk/image-analysis-client-library-40?tabs=visual-studio%2Clinux&pivots=programming-language-rest-api

On that page, the JSON response shows the model version as 2024-02-01. I would like to know how I can use that version as well.
navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-09-06T05:19:46.33+00:00

@Danny Zhang Just following up to check if the answer below helped. If it answers your query, please click "Accept the answer", which might be beneficial to other community members reading this thread. If you have any further queries, do let me know. I would be happy to help.

Answer 4

@Danny Zhang Thanks for getting back and clarifying your ask. The 2024-02-01 api-version is supported. However, the SDK has not yet been updated to use it and still uses the older 2023-10-01 version.

So, if you want to use this 2024-02-01 api-version, try directly invoking it from the REST API as shown below:

POST https://XXXX.cognitiveservices.azure.com/computervision/imageanalysis:analyze?features=caption%2Cread&api-version=2024-02-01&gender-neutral-caption=true 
Content-Type: application/json 
Ocp-Apim-Subscription-Key: 337e4XXXXXXX4633cd7f 
Request Body:

{"url":"https://learn.microsoft.com/azure/ai-services/computer-vision/media/quickstarts/presentation.png"}
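If it helps to see how the query string in that request is put together, here is a minimal sketch using only the Python standard library. The XXXX resource name is the same placeholder as in the request above, and the variable names are just for illustration. It shows that the comma in the features list is what becomes %2C in the URL.

```python
from urllib.parse import urlencode

# Placeholder endpoint: XXXX stands for your own resource name.
base_url = "https://XXXX.cognitiveservices.azure.com/computervision/imageanalysis:analyze"

# Same query parameters as the raw request above.
params = {
    "features": "caption,read",
    "api-version": "2024-02-01",
    "gender-neutral-caption": "true",
}

# urlencode percent-encodes the comma, giving features=caption%2Cread.
query = urlencode(params)
full_url = f"{base_url}?{query}"
print(full_url)
```

Changing the api-version value in this dictionary is the only thing needed to target a different version of the service.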

To call the 2024-02-01 api-version from Python instead, send the same REST request with the requests library as shown below:

import requests

# Define the endpoint and parameters
url = "https://XXXX.cognitiveservices.azure.com/computervision/imageanalysis:analyze"
params = {
    'features': 'caption,read',
    'api-version': '2024-02-01',
    'gender-neutral-caption': 'true'
}

# Define the headers
headers = {
    'Content-Type': 'application/json',
    'Ocp-Apim-Subscription-Key': '337e4XXXXXXX4633cd7f'
}

# Define the request body
data = {
    'url': 'https://learn.microsoft.com/azure/ai-services/computer-vision/media/quickstarts/presentation.png'
}

# Make the POST request
response = requests.post(url, headers=headers, params=params, json=data)

# Print the response
print(response.status_code)
print(response.json())
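To confirm which model version actually served the request and to pull out just the OCR text, something like the following sketch may be useful. It assumes the response JSON has a top-level modelVersion field, a captionResult object, and a readResult object containing blocks, each with lines that carry a text field. The summarize_analysis helper and the sample dictionary are hypothetical and written by hand for illustration, not real service output.

```python
def summarize_analysis(result: dict) -> dict:
    """Collect the model version, caption text, and OCR lines from a response dict.

    Assumed layout: result["modelVersion"], result["captionResult"]["text"],
    and result["readResult"]["blocks"][i]["lines"][j]["text"].
    Missing keys are tolerated and yield None or an empty list.
    """
    read_result = result.get("readResult") or {}
    lines = [
        line.get("text", "")
        for block in read_result.get("blocks", [])
        for line in block.get("lines", [])
    ]
    caption = (result.get("captionResult") or {}).get("text")
    return {
        "model_version": result.get("modelVersion"),
        "caption": caption,
        "lines": lines,
    }


# Hypothetical response fragment, hand-written for illustration only.
sample = {
    "modelVersion": "2024-02-01",
    "captionResult": {"text": "example caption"},
    "readResult": {"blocks": [{"lines": [{"text": "Hello"}, {"text": "World"}]}]},
}

summary = summarize_analysis(sample)
print(summary["model_version"])
print(summary["lines"])
```

With the request code above, you would pass response.json() to the helper in place of the sample dictionary and compare the reported model version against the one you see in Vision Studio.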

Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help.

Please do not forget to "Accept the answer" and "up-vote" wherever the information provided helps you; this can be beneficial to other community members.
