Is there any good nuget package for PDF text scanner in asp.net core ?

Question

Is there any good nuget package for PDF text scanner in asp.net core ?

chandra dev 1

Hi All,

Currently I am using pdfclown nuget package for scanning the text from pdf file in asp.net core project.
https://pdfclown.org/

My requirement is there to read the pdf text and dump in excel file. pdfclown is doing almost everything's but blank space is not reading from pdf file.

could you please suggest any other alternate nuget package to fulfill this requirement ?

2 answers

Your answer

Answer 1

PDF is programing language that draws text and images. the language is a simple stack machine. to help in parsing PDF, tags support was added to help define the document. in postscript the % is the comment character, %% is used to identify a structure tag

sample hello world:

%!PS
/Palatino-Roman 20 selectfont
300 400 moveto
(Hello, World!) show
showpage

how well a PDF file can be parsed depends on how well the ps program was written, did it follow tag conventions used by the parser. most likely in your sample, the table is a text array, and only has 2 rows of data.

note: postscript supports arrays of arrays, so a text table should follow this structure. the data and the code to draw the borders are seperate.

Answer 2

Deleted

This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.

Comments have been turned off. Learn more

Share via

Is there any good nuget package for PDF text scanner in asp.net core ?

2 answers

Your answer