ღონისძიებები
Microsoft 365 საზოგადოების კონფერენცია
May 6, 2 PM - May 9, 12 AM
AI-ის ეპოქის უნარი საბოლოო საზოგადოების ხელმძღვანელობით Microsoft 365 ღონისძიებაზე, 6-8 მაისს ლას ვეგასში.
შეიტყვეთ მეტიეს ბრაუზერი აღარ არის მხარდაჭერილი.
გადადით Microsoft Edge-ზე, რათა ისარგებლოთ უახლესი ფუნქციებით, უსაფრთხოების განახლებებითა და ტექნიკური მხარდაჭერით.
Note
Through June 2025, you can try out a limited amount of optical character recognition and other selected Syntex services at no cost if you have pay-as-you-go billing set up. For information and limitations, see Try out Microsoft Syntex and explore its services.
The optical character recognition (OCR) service in Microsoft Syntex lets you extract printed or handwritten text from images and documents. Examples of images include posters, drawings, and product labels. Examples of documents include articles, reports, forms, and invoices.
The text is typically extracted as words, text lines, and paragraphs or text blocks, enabling access to digital version of the scanned text. The extracted information is indexed in search and can be made available for compliance features like data loss prevention (DLP).
For example, you enable the OCR service and then add image files to your document library. Microsoft Syntex automatically scans the image files, extracts the relevant text, and makes the text from the images available for search and indexing. This feature lets you quickly and accurately find the keywords and phrases you're looking for.
Endpoint | Supported file types |
---|---|
SharePoint and OneDrive | .bmp, .png, .jpeg, .jpg, .jfif, .arw, .cr2, .crw, .erf, .gif, .mef, .mrw, .nef, .nrw, .orf, .pef, .raw, .rw2, .rw1, .sr2, .tif, .tiff, .heic, .heif, .ari, .bay, .cap, .cr3, .dcs, .dcr, .drf, .eip, .fff, .iiq, .k25, .kdc, .mef, .mos, .ptx, .pxn, .raf, .rwl, .sr2, .srf, .srw, .x3f, .dng, .tiff, and .pdf |
Teams, Exchange, and Windows devices | .bmp, .png, .jpeg, .jpg, .tiff, and .pdf |
In addition to image-based PDF, Syntex OCR will support hybrid PDF (text plus image PDF) beginning November 2024. After that time, newly uploaded hybrid PDFs will be processed by the OCR service.
Note
When you apply OCR to an image file, the text is stored in the Extracted text metadata column. When you apply OCR to a PDF or TIFF file, the extracted text is indexed in search but not available in the metadata column.
The OCR service supports more than 150 languages.
The OCR service supports multiple solutions, as shown in the following table. For details about compliance solutions, see Supported locations and solutions in Microsoft Purview.
Location | Supported solution |
---|---|
Exchange | Text is available for end-user search and search-driven solutions. Text is available for compliance solutions. |
SharePoint sites | Text is available for end-user search and search-driven solutions. Text is available for compliance solutions. |
OneDrive accounts | Text is available for end-user search and search-driven solutions. Text is available for compliance solutions. |
Teams chat and channel message | Text is available for compliance solutions. |
Devices | Text is available for compliance solutions. |
Images must be less than 50 MB.
Images must be at least 50 x 50 pixels and not larger than 16,000 x 16,000 pixels.
Images uploaded after OCR has been enabled are the only images that are scanned.
Images that are embedded in Office documents aren't supported.
ღონისძიებები
Microsoft 365 საზოგადოების კონფერენცია
May 6, 2 PM - May 9, 12 AM
AI-ის ეპოქის უნარი საბოლოო საზოგადოების ხელმძღვანელობით Microsoft 365 ღონისძიებაზე, 6-8 მაისს ლას ვეგასში.
შეიტყვეთ მეტიტრენინგი
მოდული
Fundamentals of optical character recognition - Training
Use optical character recognition (OCR) to read text with Azure AI Vision.
სერტიფიკაცია
Microsoft Certified: Azure AI Engineer Associate - Certifications
Design and implement an Azure AI solution using Azure AI services, Azure AI Search, and Azure Open AI.