I'll preface by saying that I am not a graphic designer, so this question might be staggeringly obvious. I have several 100+ page PDFs that consist only of tables of text, and I am trying to change the font of all of the text in the entire document; however, this is proving to be more difficult than I anticipated. I can only "Select All" of the text on a single page, which means that I would have to repeat this process page by page, thousands of times, to convert all of the documents. I understand that Acrobat is not a word processor, but I have to imagine that there is a better way to do this. As far as I can find, the native "Actions" macros in Acrobat don't have this functionality, and Mac Automator struggles with the task.

(To give some context, I'm ultimately trying to parse/serialize the data in these PDF tables into JSON for use in a Python program by converting from PDF->XLSX->CSV->JSON. The PDFs use a strange font, so when I use the Acrobat XI Pro "Export to Excel" feature, the text is read with awkward spacing, but when I change the font to simple Arial, it reads correctly. Other methods like simple copy/paste or the Python PDF Miner module also misread the spacing.)

I am working on OS X 10.9 and have access to Acrobat XI Pro and Adobe CS6, as well as basic programming familiarity, if you have any novel solutions that achieve this same goal.
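For reference, the last hop of the PDF->XLSX->CSV->JSON chain mentioned above could look roughly like the sketch below. It assumes Python 3, a CSV whose first row holds the column names, and a hypothetical helper name `csv_to_json`; none of these come from the actual files. It only collapses repeated whitespace in each cell, so it would not repair spaces that the export inserts inside a word.

```python
import csv
import io
import json


def clean(cell):
    """Collapse runs of whitespace in a cell; treat a missing cell as empty."""
    return " ".join((cell or "").split())


def csv_to_json(csv_text):
    """Convert CSV text (header row assumed) into a JSON array of objects."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = []
    for row in reader:
        # Skip the None key that DictReader uses for surplus fields in a row.
        rows.append({clean(k): clean(v) for k, v in row.items() if k is not None})
    return json.dumps(rows, indent=2)


if __name__ == "__main__":
    sample = "name , qty\nWidget   A,  3\n"
    print(csv_to_json(sample))
```

All values stay strings here; converting numeric columns would need to be a separate, table-specific step.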