OCR for numbers in a PDF

Hi guys
I've got a situation where I need to take numbers from a PDF file which is a previously scanned-in copy of an invoice. The PDF will be opened alongside a data entry form so that the user can complete the form with information from the invoice. However, what I need to do is offer the user suggestions for some of the form fields (such as Net Total, VAT (Tax), Grand Total etc). I know that for specific suggestions for each field I would need some kind of 'zone' OCR, so if I could only pull out all numbers in the scanned image of the invoice from the PDF, I could offer all numbers as suggestions in a drop-down.
I am using CFMX7, so I'm looking for a way to do this or some kind of component which will allow me to do this.
All the best
Wes

I am not sure if it will meet your needs, but you might look into jPedal. IIRC, it has some text extraction capabilities.
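
Once text has been extracted (by jPedal or by any OCR step), picking out the numeric candidates for the drop-down is a pattern-matching job. Here is a rough sketch in Python, assuming the extracted text is already available as a string. The name `extract_numbers` and the sample text are made up for illustration, and the pattern only covers the comma-thousands / dot-decimal style, so it would need adjusting for other invoice formats.

```python
import re

# Matches amounts such as 1,234.56, 1234.56, 200 or 17.5
# (comma as thousands separator, dot as decimal point).
NUMBER_PATTERN = re.compile(r"\d{1,3}(?:,\d{3})+(?:\.\d+)?|\d+(?:\.\d+)?")

def extract_numbers(text):
    """Return the distinct numeric strings in text, in order of first appearance."""
    seen = set()
    result = []
    for match in NUMBER_PATTERN.finditer(text):
        value = match.group(0)
        if value not in seen:
            seen.add(value)
            result.append(value)
    return result

# Hypothetical OCR output for one invoice.
sample = "Net Total 1,000.00 VAT 17.5% 175.00 Grand Total 1,175.00"
print(extract_numbers(sample))
```

The returned list could be used as-is to fill the suggestion drop-down. Duplicates are dropped so the same total does not appear twice.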

Similar Messages

  • Searching for Numbers in Pdf

    Hi, is there a shortcut to search for numbers in a PDF file?

    Not sure what you mean.
    You have the find feature (ctrl + f) or the advanced search feature (ctrl+shift+f) to search for digits, words, strings etc.

  • Can I use OCR for just a single image in a text document?

    Hi All
    I have a 37-page PDF document that is mostly recognised text. I think this document was created in MS Word and then converted to PDF. I did not make this document.
    There are images inserted on some pages that are scans from another document. The pages with the images I want to OCR also have footnotes, page numbers, a title and text paragraphs. I have already used the highlight and sticky note functions on some of the recognised text and don't want them lost.
    I have tried using OCR for the whole document but it doesn't work (renderable error).
    Can I use the OCR function just on a selected image within a document that has renderable text for the most part?
    thanks.

    Read what it says on the ATT page
    http://www.att.com/ipad/?fbid=s18K8c1ujw3
     Cheers, Tom

  • OCR and hidden text in PDF scans of historic documents

    I need to edit the hidden text behind a scanned PDF image of a document.  The image must remain as an “exact” copy of the original scanned document.
    I used Acrobat Pro (versions 7 and 9) to make PDF images of old typed documents from the 1940’s.  When I open those images and run OCR in version 9, then examine the hidden (invisible) text layer behind the image, there are errors.  For example, the word “book” has been picked-up by the OCR as the word “look.”  I need to change the “l” to a “b” in order to make the PDF accurate when it is searched at a later date. 
    I have checked many user forums.  Most people imply that hidden text can be viewed, but NOT edited in Acrobat Pro 7 and 9.  (Hidden text can be viewed in Version 9 by selecting “Document” “Examine Document” and then clicking on the “+” symbol next to “Hidden Text,” then clicking “Show preview.”)  Some say to use Adobe Capture 3.0 to edit hidden text.  Others say to use Photoshop or Illustrator to edit hidden text (I think these folks may have been confused, because Photoshop and Illustrator would be used, logically, to edit the image ON TOP OF the hidden text).  Yet another person seemed to say that a hidden text editor was added to Acrobat 8, but was taken away in Acrobat 9.  (I can’t verify that because I don’t have version 8.)
    The closest answer I was able to find involved using the Text Touch Up Tool on top of the image to edit hidden text behind it, but when you do that you are typing “blind.”  In other words, you highlight a spot on the image (top layer) where you THINK the error MIGHT be, and you type the correction without being able to see what you are typing over.  Then, you go back to the “Examine Document” procedure (described above) to see if you “hit” your mark, and if not, you redo it until you do “hit” your mark.  With the number of documents and corrections that we have, that procedure would be too labor intensive and thus a budget breaker.
    If we have to buy more software, my preference would be to buy a genuine Adobe product because I have experienced problems in the past switching back and forth between Adobe products and other PDF manipulation software.
    Can anyone answer any of these questions: 
    (1) Is there a way in Acrobat versions 7, 8 or 9 to edit hidden text, and if so, how? 
    (2) What Adobe software (other than Acrobat) will edit hidden text behind a PDF image? 
    (3) Assuming no Adobe product will edit hidden text behind a PDF image, are there any non-Adobe products that will do that?
    Thank you!

    Hi,
    Unless you use Acrobat 8 Pro's "Formatted Text & Graphics" or Acrobat 9 Pro's ClearScan you will find that there is no
    practicable means of editing the OCR "hidden text" in a PDF.
    The TouchUp text tool (Advanced Editing toolbar) is reliant upon the selected text having an available system font to use during touchup. However, both Searchable Image and Searchable Image (Exact)  OCR output is of text rendering mode 3 (invisible text) that is provided from within Acrobat and not any installed system or other application installed font.
    With Searchable Image (Exact) you have the untouched image augmented by the invisible text which is provided as a user aid for search or find with Adobe Reader or Acrobat. The invisible text is not intended to support word processor like editing.
    To your questions:
    #1. There is no practicable way to edit invisible text (text rendering mode 3) with Acrobat (any past or current release).
    #2. None.
    #3. A good question. Perhaps a specialty program. Keep in mind, many products provide a promise but those that actually deliver tend to be expensive.
    Something to play with. Using Acrobat 9 Pro or Pro Extended, try the Preflight Fixup to embed hidden text.
    Then try using the TouchUp Text tool. You may also want to see if you can change the font type of this newly embedded font.
    (use copies of the "real" files - just in case <g>).
    Be well...

  • Activate OCR and Enable Comment in PDF document On an Unix platform

    Hi everybody,
    I have a number of PDF documents stored on a Unix server, and I want to enable the "Add comment" feature for all of those documents, so that I can
    open every document in Adobe Reader and add comments (sticky notes, underlines, etc.). This feature is available in Acrobat Pro 9; I tested it and
    it works fine, but I need to do the same thing from the command line. I mean installing a library or something else so that I can do this operation by typing a command in a shell terminal.
    I have the same problem with the OCR feature.
    Thanks for your help

  • How can I sum numbers in a pdf? And, sum a partial list of numbers?

    How can I sum a list (or partial list) of numbers in a pdf?

    Thanks for your answer.  I have Adobe Acrobat 7.0 Professional.  If you
    can help me further by listing the necessary steps that would be great.
    Thanks.
    Barry Baker
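
    One route that does not depend on any Acrobat feature is to copy the numbers out of the PDF as plain text and add them up in a small script. This is only a sketch: it assumes the copied numbers are separated by whitespace, and `sum_numbers` and the sample string are invented names for illustration. A partial list is selected with the optional `start` and `stop` positions.

```python
from decimal import Decimal

def sum_numbers(text, start=0, stop=None):
    """Sum whitespace-separated numbers in text; start/stop select a partial list."""
    # Thousands separators are stripped so "1,000.00" parses as 1000.00.
    values = [Decimal(token.replace(",", "")) for token in text.split()]
    return sum(values[start:stop], Decimal("0"))

# Hypothetical text copied from a PDF column.
copied = "12.50 7.25 1,000.00 3"
print(sum_numbers(copied))        # whole list
print(sum_numbers(copied, 0, 2))  # first two numbers only
```

    `Decimal` is used instead of floats so that currency-style values add up without rounding surprises.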

  • Converting many numbers files to PDFs?

    Hi everyone!
    I would like to convert many Numbers files to PDF documents. Every Numbers document should be a new PDF with the same name!
    I even tried to write a workflow with Automator, but I can't get the "create a PDF" part running.
    Furthermore, I googled for 2 hours, but none of the scripts I found work with the latest Numbers version...
    Please help me... Maybe one of you has an AppleScript or a solution using Automator?
    OS X 10.9.3
    Numbers Version 3.2 (1861)

    Dear Lori,
    promptUser =false, see below.
    Does not work.
    All programs that might disturb PDF output have been removed from the computer.
    Does not work.
    I am now investing some evenings into Word-VBA, making a script loop through the subdirectories over several levels (does Acrobat do that anyway?)...
    Does not work.
    Not yet.
    It will have to...
    best regards,
    Boris
    <?xml version="1.0" encoding="UTF-8"?>
    <Workflow xmlns="http://ns.adobe.com/acrobat/workflow/2012" title="ZOPtest ORT" description="" majorVersion="1" minorVersion="0">
    <Sources defaultCommand="WorkflowPlaybackSelectFolder">
      <Folder path="/nas02/quinsee$/Fachabteilung/ZOP/Standards/Standards NCH"/>
    </Sources>
    <Group label="Unbenannt">
      <Command name="Scan:OPT" pauseBefore="false" promptUser="false">
       <Items>
        <Item name="ApplyMRC" type="boolean" value="false"/>
        <Item name="BkgrRemove" type="integer" value="0"/>
        <Item name="ColorCompression" type="integer" value="4"/>
        <Item name="Descreen" type="boolean" value="false"/>
        <Item name="Deskew" type="boolean" value="false"/>
        <Item name="Format" type="integer" value="1"/>
        <Item name="Language" type="integer" value="-1"/>
        <Item name="MonoCompression" type="integer" value="1"/>
        <Item name="QualityLevel" type="integer" value="1"/>
        <Item name="TextSharpen" type="integer" value="0"/>
        <Item name="doOCR" type="boolean" value="false"/>
       </Items>
      </Command>
    </Group>
    </Workflow>

  • Manual for Numbers 3.1 (1769)

    I upgraded to Mavericks and then upgraded to the new Numbers, Keynote, etc.
    For Numbers '09, I was able to find a PDF manual to download.
    Does such a document exist for Numbers 3.1 (as well as Pages, Keynote and Aperture)?  It is nice to be able to read how to use new features without being tied to the internet.
    Thank you.
    dcwarner

    Wayne,
    Thank you. This is a disappointment.
    The look of the new Numbers is so different from the old one that I was having trouble navigating between sheets and tables (though I saw that options to toggle between them exist at the top).  I was looking for an equivalent of the sidebar in Numbers '09 that shows sheets with their included tables, along with the sum/avg/min/max/count mathematical functions.
    Perhaps it is good that the old program application is still on my Mac, and useable.
    dcwarner
    wilmington, de

  • IPad Numbers guide in PDF

    Hey,
    Can anyone tell me where I can find a PDF guide or manual for Numbers?

    Go to the apple website and select support. Select Manuals then enter "Numbers" in the search box.

  • Exporting Numbers sheet to pdf but keeping formulas?

    Hi!
    I want to know if it's possible to export a Numbers sheet to .pdf in a way that lets me keep the formulas? My goal is to create a PDF which lets the user answer a series of questions where the answers create a result and a couple of diagrams. I have it all drawn out in Numbers but can't figure out how to create a PDF out of this. If I export the sheet to PDF the formulas disappear....
    Please help me!
    Thanks!

    Hi Owen,
    This seems to be a good candidate for the sharing function in Numbers 3 rather than a pdf. You simply send the person the link to the spreadsheet. They enter some data right through their browser and (since, unlike with a pdf, the formulas are there) they will see the results and the charts change on their end. They don't need to have a Mac or Numbers installed; just any computer with a modern browser.
    That works well for one user at a time. If you have different users who will each be inputting different things you can set up multiple copies of your document and share each one with a different person.
    If they have a modern Mac updated to Mavericks you can of course just send them your document and they can open it on their end, since Numbers is now free to all.
    Obviously, none of this is advisable if data integrity is an issue, because once you share a document, both they and you (or anybody else who gets their hands on the link) can go in and change values and formulas until such time as you revoke the sharing.  There's no password protection for shared documents.  Hope that's coming.
    SG
    Edit: Sharing works the same from Numbers for iOS, in case you're running it on an iPad.

  • Automator Actions for Numbers?

    Do they exist?
    If so, how can I install them?
    If not, are there AppleScript actions for Numbers, and where can I learn to use them?
    Any info at all would be greatly appreciated

    Hi Guyzs,
    I'd like to create an action that will let me list my file names in Numbers, along with the file type and the dates of creation and modification.
    The thing is that I have 900 documents (PDFs, ppt, xls, doc, etc.) contained in folders and subfolders, to list one by one in Numbers, with 4 columns: file name (just the name, not the extension, in column A), file extension (with the dot before the extension name, in column B), creation date (in column E) and last modification date (in column F).
    I'm very new to Automator and AppleScript; I only tried once or twice a long time ago and failed miserably, even with the help of tutorials.
    Regards,
    Fab'
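
    This is not an Automator action, but as a possible workaround the same listing can be sketched as a short Python script that writes a CSV file, which Numbers can open. The function names `list_files` and `write_listing` are invented for illustration. The script assumes the column layout described above (A name, B extension, C and D left empty, E creation date, F modification date) and that the true creation time is available as `st_birthtime`, as it is on macOS.

```python
import csv
import os
from datetime import datetime

DATE_FORMAT = "%Y-%m-%d %H:%M"

def list_files(root):
    """Yield (name, extension, created, modified) for every file under root."""
    for folder, _subfolders, filenames in os.walk(root):
        for filename in sorted(filenames):
            info = os.stat(os.path.join(folder, filename))
            # splitext keeps the dot with the extension, e.g. ("report", ".pdf").
            name, extension = os.path.splitext(filename)
            # st_birthtime is the creation time on macOS; where it is missing,
            # st_ctime is used instead (not a true creation time on Linux).
            created = getattr(info, "st_birthtime", info.st_ctime)
            yield (
                name,
                extension,
                datetime.fromtimestamp(created).strftime(DATE_FORMAT),
                datetime.fromtimestamp(info.st_mtime).strftime(DATE_FORMAT),
            )

def write_listing(root, csv_path):
    """Write one CSV row per file, leaving columns C and D empty."""
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        for name, extension, created, modified in list_files(root):
            writer.writerow([name, extension, "", "", created, modified])
```

    It would be called as, for example, `write_listing("/path/to/documents", "/path/to/listing.csv")`, with the CSV saved outside the folder being listed so that it does not show up in its own listing.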

  • How can I add a new Template to My Templates in Pages? I've read most of the discussions on the subject but it doesn't work for me. By the time I reach the Templates folder, I only see templates for Numbers and not for Pages. Need help, please.  Thanks

    How can I add a new Template to My Templates in Pages? I've read most of the discussions on the subject but it doesn't work for me. By the time I reach the Templates folder, I only see templates for Numbers and not for Pages. Need help, please.  Thanks

    If you used the Save As Template command from Pages, there is necessarily a folder
    iWork > Pages
    containing Templates > My Templates
    just as there is a folder
    iWork > Numbers
    containing Templates > My Templates
    From the Finder, press cmd + f
    then set up the search as in this screenshot,
    then run the search.
    That way, you will find your custom templates in their folder.
    On my machine there are a great many of them outside the standard folders, because I rename nearly all my Pages documents wxcvb.template and almost all my Numbers documents wxcvb.nmbtemplate.
    That way, when I work on a document, I am not slowed down by Autosave.
    Sorry, but I will not reply again before tomorrow.
    For me it is time to sleep.
    Yvan KOENIG (VALLAURIS, France)  Wednesday, 23 January 2011 22:39:28
    iMac 21”5, i7, 2.8 GHz, 4 Gbytes, 1 Tbytes, mac OS X 10.6.8 and 10.7.2
    My iDisk is : <http://public.me.com/koenigyvan>
    Please : Search for questions similar to your own before submitting them to the community

  • How to insert page numbers in a PDF document?

    How to insert page numbers in a PDF document using Adobe Acrobat X Pro 10.1.2?
    Thanks.

    OK, I found it myself:
    1. Tools - Pages - Edit Page Design - Header & Footer - Add Header & Footer.
    2. Select the font, size, etc., place the cursor at the appropriate position to insert the page number, click the "Insert Page Number" button, and click OK.
    That's it!

  • How do I find the icon for scanning to a PDF?

    I used to have an icon for scanning to a PDF.  Where should I find it?

    Go to the system preferences. Click on "Network". Then click on "Airport". There should be a checkbox that say "Show Airport status in menu bar". It should be checked in order to show the icon at the top.

  • Withholding tax -number could not be determined for numbering group ID0017

    While posting transaction FB60 with withholding tax for country Indonesia, the following error occurs:
    A number could not be determined for numbering group ID0017
    Message no. 7Q630
    "The system could not determine a certificate number for the numbering
    group.
    Withholding tax types exist that are relevant to numbering. The system
    cannot determine a certificate number because the Customizing settings
    are incomplete.
    System Response
    Payment cannot be made.
    Procedure
    Check the number ranges in the numbering group."
    I have already checked the withholding tax number range and it is correct. Please advise how to resolve this.

    Dear,
    Please check whether you have assigned the number range to the number group in the setting below:
    SPRO
    Financial Account Global Setting > Withholding Tax > Extended Withholding Tax > Posting > India > Remittance Challan > Assign Number Range to Number Groups
