Share via

Azure Document Intelligence crashing when trying to read unknown characters

PrivaC 0 Reputation points
2024-05-08T03:52:55.18+00:00

Hi, I'm trying to parse data from a image that's in Bengali the native language of Bangladesh and while using document intelligence to parse the information I'm getting the following error:

UnicodeEncodeError: 'charmap' codec can't encode characters in position 76-84: character maps to <undefined>

which made me believe that it's not finding the bengali characters, so I tried out with a full english image and it worked fine. The document I want to read has both bengali as well as latin characters. I was wondering if there was a way to ignore unknown bengali characters while parsing. Thank you in advance

Azure Document Intelligence in Foundry Tools

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.