Exporting a PDF file to Excel while retaining the structure

Hi
I have a bunch of archival inventories on which I ran OCR. When I tried to export them as Excel documents, the structure was lost.
The structure is always the same, and I am wondering whether there is a script I could write and run before the export in order to retain it.
Or, for instance, could I create a structured Excel document and make the export match it? I did try the automatic export with Acrobat Pro XI and it is a disaster with such documents, though I am pretty sure it works well in other instances.
As I am looking for specific types of archival material (inventories, wills, sales, etc.), I need to be able to count them and retrieve the information attached to them in the inventory.
Typically, an inventory line looks as follows:
Calendro,            Francisco                                June 8,1780               P-400   being the page number  in the actual notarial book          
                              by Pedro Portal
                              Sale of Property
Thanks

I would suggest that there may be something in JavaScript that would help. However, for an individual file you can hold down the Alt key during selection to select a table. There are also some options that try to recognize a table, but recognition works better if the PDF is tagged, which is not something you get from a scan.
I just ran OCR on an image of an Excel sheet. I was not able to save it as a table in Excel; I got a message that no table could be found. This is where the tags would come in. I tried adding tags (in AA9 this is under Advanced>Accessibility), but it did not help.
I was able to select each column with the Select Text and Images tool. To select a column, hold down the Alt key as you drag the cursor over it. You can copy the column to a spreadsheet and repeat this for each column. This is cumbersome, but it does the job. It may be that you can use JavaScript to select text in a window and copy it to an Excel file. I am going beyond my skill set here, but I am suggesting what might be possible. In the meantime, you can work with the individual files.
Good luck.
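
Since the structure of every entry is the same, another route might be to skip the Excel export, save the OCR text as plain text, and rebuild the columns with a short script. The following is only a minimal sketch in Python, not a tested solution for your files. It assumes each entry starts with "Surname, Given name", then a date like "June 8,1780", then "P-" and a page number, followed by indented detail lines. The names HEADER, parse_inventory and to_csv and the column names "by" and "act" are made up for illustration, and the sample omits the explanatory note after P-400.

```python
import csv
import io
import re
from collections import Counter

# Assumed layout of the first line of an entry:
#   Surname,   Given name      Month D,YYYY      P-<page>
HEADER = re.compile(
    r"^(?P<surname>[^,]+),\s*(?P<given>.+?)\s{2,}"
    r"(?P<date>[A-Z][a-z]+ \d{1,2},\s*\d{4})\s+P-(?P<page>\d+)"
)


def parse_inventory(text):
    """Return one dict per inventory entry found in the OCR text."""
    records = []
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line.strip():
            continue  # skip blank lines
        match = HEADER.match(line)
        if match:
            # A new entry starts here; detail fields are filled in later.
            records.append({**match.groupdict(), "by": "", "act": ""})
        elif records and raw[:1].isspace():
            # Indented lines belong to the most recent entry.
            detail = line.strip()
            if detail.lower().startswith("by "):
                records[-1]["by"] = detail[3:].strip()
            else:
                records[-1]["act"] = detail
    return records


def to_csv(records):
    """Serialize the entries as CSV text, which Excel can open."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf,
        fieldnames=["surname", "given", "date", "page", "by", "act"],
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


SAMPLE = (
    "Calendro,            Francisco                                June 8,1780               P-400\n"
    "                              by Pedro Portal\n"
    "                              Sale of Property\n"
)

records = parse_inventory(SAMPLE)
counts = Counter(r["act"] for r in records)  # number of entries per type of act
print(to_csv(records))
```

Lines the OCR has mangled will not match the pattern and are silently skipped here, so it would be worth logging them and fixing those by hand. The counts per type of act (sales, wills, and so on) come from the Counter at the end.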

Similar Messages

  • How to convert a PDF file to HTML while retaining the exact formatting?

    I have a PDF file that I need to convert into HTML. When I do so, all the formatting is lost. Is there any other way?

    Save the pages of the document as images and create an HTML file with references to the images.
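
    The second half of that suggestion can be scripted. This is only a rough sketch in Python: it assumes the pages have already been saved as image files, and the function name build_html and the file names are hypothetical.

```python
from html import escape


def build_html(image_files, title="Document"):
    """Build a simple HTML page that shows one image per PDF page, in order."""
    parts = [
        "<!DOCTYPE html>",
        "<html>",
        "<head>",
        f"<title>{escape(title)}</title>",
        "</head>",
        "<body>",
    ]
    for number, name in enumerate(image_files, start=1):
        # escape() keeps unusual characters in file names from breaking the markup
        parts.append(f'<img src="{escape(name, quote=True)}" alt="Page {number}">')
    parts += ["</body>", "</html>"]
    return "\n".join(parts)


# Hypothetical file names for a two-page document
page = build_html(["page-001.png", "page-002.png"])
print(page)
```

    Keep in mind that the text on such a page is part of the images, so it cannot be selected or searched.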

  • Using my PC desktop, I am trying to convert a PDF file to Excel. When the data is imported into Excel, the majority of the data is vertical in the far left column.

    I am trying to import a PDF file from my desktop PC into Excel using XI Pro. When the data imports into Excel, the majority of the data runs vertically down the far left column instead of horizontally across columns such as name, address, etc. I need to be able to import the data into the horizontal categories so I can categorize it once it is in Excel. Thanks.

    Hi johnc,
    I'm sorry to hear that your file didn't convert well for you. Can you please tell me more about the PDF that you're trying to convert? Do you know how it was created? The quality of a conversion depends on the quality of the PDF, and it sounds like the file you have may not include the tagging information that is required to convert that properly into an Excel file. Would you be willing to share that file with me? If so, let me know and I'll send you a private message with my contact details.
    Best,
    Sara
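
    If the conversion cannot be improved, the single column can sometimes be regrouped afterwards. Here is a minimal sketch in Python, assuming every record has the same number of fields in the same order (for example name, address, city). The function name regroup and the sample values are invented for illustration.

```python
def regroup(cells, fields):
    """Turn a flat list of cell values into one dict per record."""
    values = [c.strip() for c in cells if c.strip()]  # drop empty cells
    width = len(fields)
    if len(values) % width != 0:
        raise ValueError(
            f"{len(values)} values cannot be split into rows of {width}"
        )
    return [
        dict(zip(fields, values[i:i + width]))
        for i in range(0, len(values), width)
    ]


# Hypothetical single-column import: name, address, city repeated per person
column = [
    "Ann Lee", "12 Oak St", "Springfield",
    "",
    "Bob Ray", "9 Elm Ave", "Shelbyville",
]
rows = regroup(column, ["name", "address", "city"])
print(rows)
```

    If even one record is missing a field, every later row shifts, which is why the sketch raises an error when the count does not divide evenly rather than guessing.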

  • Export a PDF file to Excel with Adobe Distiller 9

    How can I export a file with several pages to an Excel file?
    The right-click menu has 3 options (Copy as Table, Save as Table, and Open Table in a Spreadsheet), but they are limited to just one page.

    See https://forums.adobe.com/docs/DOC-2412
    If you need more help, please ask in https://forums.adobe.com/community/acrobatdotcom/

  • How to export a Fireworks file to Flash while retaining image quality?

    I created a banner for our website using Fireworks CS3. Part of the banner has animation (basically a slide show of 6 photos). I've tried to export the files by saving the frames to files, but when I do this, the image quality gets really bad. When I change the export settings and the dither, everything but the last frame looks better, and the last frame looks like a yellow color is splattered all over it. I've also tried just saving the file as an .swf file, but when I do so, the file doesn't work when I insert it into my web page. Is there a better way to do this? Can I export as something else, then import into Flash and still retain my image quality? Thanks for any input and help!

    How are you inserting the file?
    Flash objects need to be placed in an object tag...as described here: http://kb2.adobe.com/cps/415/tn_4150.html

  • I am trying to export a PDF file into excel and it shows an error which says,

    "An error occured while trying to access the service"

    There is absolutely no way to stop any file you put on the internet from being downloaded. That's how the internet works.
    Digital Rights Management can control who can OPEN a copy of a file and when. If you are a big company with 5-6 figure budget, this can be interesting.

  • Getting "Cannot Insert Object" message while attaching .pdf file to excel spreadsheet.

    While I am trying to attach an adobe (.pdf) file in excel spreadsheet I am getting message as “Cannot Insert Object”.
    I am following the below mentioned steps and getting message as “Cannot Insert Object”.
    Open the adobe (.pdf) file from IE browser.
    While saving the adobe file on local machine it gives warning as “This document does not allow you to save any changes you have made to it unless you are using Adobe Acrobat 9, Pro 9 or Pro Extended 9.  You will only be saving a copy of the original document.  Do you want to continue?” On pressing "OK" it successfully saves the file on my local machine.
    When I try to attach the saved Adobe file to an Excel spreadsheet, it gives the message “Cannot Insert Object”.
    Does anyone have any thoughts at all as to how to solve this?

    Deepika,
    The alert dialog your screen shot depicts will only display if there is some kind of form annotation present in a PDF that is not "Reader Enabled".
    Look closer at your 'final' PDF.
    A PDF, not "Reader Enabled", that contains any form annotations will, when opened by Adobe Reader,
    result in the alert dialog that you mention. A "hard wired" default.
    Note that the forms document message bar can be "off" by a selection in Adobe Reader / Acrobat Preferences.
    Select the 'Forms' category. Select "Always hide forms document message bar".
    Be well...

  • How do I convert  a pdf file to excel?

    Will you help me convert a PDF file to Excel with my new PDF Export software?

    Hi John,
    Check out our FAQ: Getting Started with ExportPDF
    Let us know if we can be of further assistance!
    -David

  • How to convert pdf files to Excel file

    Convert pdf file to Excel file

    Hi Farhat,
    You can easily convert a PDF to Excel via Acrobat XI.
    Go to File > Save as other... > Spreadsheet > Microsoft Excel Workbook.
    You can also select any table in the PDF document, right-click, and select 'export as' > Excel workbook.
    Or,
        Open a file in Acrobat XI.
        Choose Tools > Content Editing > Export File to Microsoft Excel Workbook.
        Name the Excel file and save it in a desired location.
    Please refer to: http://www.adobe.com/products/acrobat/pdf-to-excel-xlsx-converter.html

  • FM to Convert PDF file to Excel or Word Doc

    Hi All,
    I am looking for an FM that converts a PDF file to an Excel or Word document.
    Thanks in Advance
    Santu

    I would suggest asking in the Export PDF forum, giving full details about your problem and your setup: Adobe ExportPDF (read only)

  • How do I export a PDF file

    How do I export a PDF file?

    Hi assiniboine,
    Are you trying to export a PDF file to another format, such as Word or Excel? Or, are you wanting to create a PDF file? Adobe PDF Pack would allow you to do either. For more information, see Reliably Create PDFs, Convert PDFs, & Merge PDFs Online | Adobe PDF Pack.
    If you already have a subscription, but just don't know how to use it please let me know what you're using, and what you hope to accomplish. We'll get you going...
    Best,
    Sara

  • I am having a problem converting a scanned pdf file into Excel.

    I am having a problem converting a scanned pdf file into Excel. I do not get the columns and rows to align, just a single column of everything. Any suggestions?

    Export makes use of what is "in" a PDF.
    Good export is the "silk purse" and, ya know, you canna make a silk purse from a sow's ear (which is what any scanned image in PDF is with regards to export).
    The quality of export is dictated by the quality of the PDF. We are talking about the "inner essences" of the PDF (e.g., degree of compliance with the PDF Standard - ISO 32000-1).
    So, what goes in goes out or "GIGO".
    This has nothing to do with Acrobat or Acrobat's export process.
    A well-formed Tagged PDF (compliant to ISO 32000-1 & ISO 14289-1, PDF/UA-1) provides a PDF that proactively supports content export by Acrobat.
    To get the good stuff from export you start with a well-formed Tagged PDF.
    Goodstuff In — Goodstuff Out
    or
    Garbage In — Garbage Out
    "GIGO"
    Be well...
    Message was edited by: CtDave

  • How to embed and open PDF files within excel

    I constantly need to embed PDF files in Excel documents as well as extract, open, and view PDF files from Excel documents. I am unable to do so on a MacBook. I know that there is a workaround, but it's such a tedious process. Is there software I can buy or a setting I can change so that I can embed and open PDF files easily? I need to view these files multiple times a day for work.
    Please advise.

    Download and install Adobe Reader: Adobe Reader Install for all versions
    Open the PDF file using it. If security restrictions don't prevent it you could also print it.

  • HT1338 My mac is becoming too slow. It takes long to open word documents, pdf files or excel documents or even safari. Can anybody suggest something? I have tried to reduce the number of open applications, but does not seem to work.

    My mac is becoming too slow. It takes long to open word documents, pdf files or excel documents or even safari. Can anybody suggest something? I have tried to reduce the number of open applications, but does not seem to work.

    Hi ...
    Checked to see how much free space there is on the startup disk lately?
    Right or control click the MacintoshHD icon. Click Get Info. In the Get Info window you will see Capacity and Available. Make sure there's a minimum of 15% free disk space.
    Freeing Up Hard Disk Space - Mac Guides
    If disk space is not the issue, booting in Safe Mode deletes system caches, which may help.
    A Safe Mode boot takes longer than a normal boot, so be patient.
    Once you see the Desktop, click the Apple menu icon top left corner of the screen.
    From the drop down menu click Restart.
    See if that makes a difference ...

  • SSRS - Export multiple pdf file

    SSRS report background:
    The report generates details for 50 students across 50 pages (one page per student, grouped by student).
    Currently the data exports as one PDF file with 50 pages, where each page holds one student's details.
    Requirement:
    Instead of one single PDF with 50 pages, I want 50 PDF files so that I can send each report to the respective student.

    generating one pdf file with 50 page where each page having one student details.
    Hi ,
    Please follow below steps;
    Step 1. Create one Row Group based on Student.
    Step 2. The Row Group has been created, and Group1 appears in the Row Groups pane.
    Right-click the Group1 column and choose to delete the column only.
    Delete Column window comes, Choose Delete column only and click on
    Step 3. The Group1 column is now deleted, but Group1 is still available in the report and groups the data by student.
    Step 4. Open the Group1 properties by right-clicking Group1 in the Row Groups pane.
    In the Group properties, go to Page Break > Page Break Option and check the boxes for “Between each instance of a group” and “at the end of group”.
    Right-click the Tablix and go to the Tablix properties. In the Tablix properties window, check “Add Page break after” and, under column headers, check “Repeat header columns on each page.”
    Step 5. After implementing the page breaks and grouping, run the report and export it to PDF.
    Thanks.
