Hi -
I'm trying to copy and paste text from a PDF so I can edit and analyze the contents. The file was created in Hebrew. It is set of Israel's election results and available on their government website: http://www.moin.gov.il/Apps/PubWebSite/mainmenu.nsf/4DF815EA4AC4E503C2 256BA6002EE732/8E408A044EE1D3EDC2257520002817B8/$FILE/News.pdf.
Under document properties, the fonts listed are Helvetica (standard) and two unknown, embedded subsets (TTE1C42600t00 and TTE1DA2290t00).
I have tried:
- Copy and pasting text from Reader 9 --> opening in Word and Excel, changing around fonts
- Copy and pasting text from Acrobat 8 Professional --> opening in Word and Excel, changing around fonts
- Right-click, open table as spreadsheet
- Exporting as .doc, .TIFF, PostScript, .txt, .html
- Export as image, running OCR (trialware Hebrew OCR program I used did not pick up all characters correctly)
- Adobe website mentions an Adobe Reader Middle Eastern Edition 7, but when I go to download it, it takes me to the regular Reader v9 page
Can anyone think of a way to extract the data from this document so that it is editable?
Any help would be appreciated!