How to determine (in bulk) whether a document has OCR text and what reader versions are supported?

Help!
I've inherited a task which has spanned many years and which was not well thought out over the transitions.  At least three project teams have scanned and stored tens of thousands of documents to PDF.  What was discovered subsequently was that the project teams did not apply a uniform standard for which versions of Adobe would be supported in each PDF, and that not all documents appear to have been OCR'ed as part of the scan process.
This has resulted in two major problems.  First, PDFs which support all Reader versions are bloated and consume significant amounts of storage; second, the automated processing tools which depend upon the OCR text fail once they pass the front and rear cover sheets (which do contain extractable text).  I need to know whether PDFs can be scanned in bulk to determine which Reader versions are supported (say 8.0 to current), and whether the OCR'ed / extractable text extends beyond the first few and last pages of each PDF.
I have been manually fixing individual files with Adobe Acrobat 9.0.  I can force Acrobat to re-OCR and save the files, but I would rather not re-process the existing bulk unless absolutely necessary.  If I could determine which files need fixing and process only those, it would save man-years of work.
Thanks, in advance, for any assistance.
Michael
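
A rough way to triage the OCR side of the question, sketched under assumptions: once the text of each page has been extracted (for example with a PDF library such as pypdf, where `PdfReader(path).pages` and `page.extract_text()` could supply one string per page; that choice of library is an assumption, not something named in this thread), pages with little or no text can be flagged. The function names and the `min_chars` threshold below are hypothetical.

```python
from typing import List, Optional, Sequence


def pages_without_text(page_texts: Sequence[Optional[str]], min_chars: int = 20) -> List[int]:
    """Return 1-based page numbers whose extracted text is shorter than min_chars."""
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if len((text or "").strip()) < min_chars
    ]


def needs_ocr(page_texts: Sequence[Optional[str]], min_chars: int = 20) -> bool:
    """True if at least one page has no usable text layer."""
    return bool(pages_without_text(page_texts, min_chars))


# Example: the cover sheets carry text, the scanned body pages do not.
sample = [
    "Front cover sheet: project 1234, box 7",
    "",
    "   ",
    "Rear cover sheet: end of document 1234",
]
print(pages_without_text(sample))  # [2, 3]
```

Pages that are genuinely blank would also be flagged, so the result is a candidate list for re-OCR rather than a verdict.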

What I meant by supporting a version of Reader is that I don't need the files to be fully backwards compatible all the way back to, say, the first few versions of Adobe Reader.  (There are likely limitations on that much backwards compatibility, anyway.)  One of the scanners that was used was apparently set for full backwards compatibility, to the extent possible, for every PDF that it generated.  Some of those PDFs are huge, commonly 300-400 MB in size.  If I open them in Acrobat 9.0, limit the backwards compatibility to Adobe Reader 8.0 and forward, then resave the file, the size is often significantly reduced.
As to how it is measured, there is something in the PDF itself that indicates a minimum version for Adobe Reader for compatibility purposes.  When you select for compatibility in Acrobat during the save process I mentioned, you get to pick the version at which you want to stop -- so if you selected 8.0, it would be compatible with 8.0, 9.0, and so on.
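
The "something in the PDF itself" is, at least in part, the file header: a PDF begins with a line of the form `%PDF-M.m` (for example `%PDF-1.4`), and PDF 1.7 is the level associated with the Acrobat 8 generation. A minimal sketch for reading that header in bulk follows. The function names are hypothetical, and the sketch ignores the optional `/Version` entry in the document catalog, which can override the header, so treat the result as a first-pass indication only.

```python
import re
from pathlib import Path
from typing import Iterator, Optional, Tuple

HEADER_RE = re.compile(rb"%PDF-(\d)\.(\d)")


def header_version(path: Path) -> Optional[Tuple[int, int]]:
    """Return (major, minor) from the %PDF-M.m header, or None if not found."""
    with open(path, "rb") as handle:
        head = handle.read(1024)  # the header sits at or near the start of the file
    match = HEADER_RE.search(head)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def scan_tree(root: Path) -> Iterator[Tuple[Path, Optional[Tuple[int, int]]]]:
    """Yield (path, version) for every *.pdf file below root."""
    for path in sorted(root.rglob("*.pdf")):
        yield path, header_version(path)
```

A call such as `for path, version in scan_tree(Path("scans")): print(path, version)` (the folder name is a placeholder) would list the header version of each file. Note that the `*.pdf` pattern is case-sensitive on some systems, so files ending in `.PDF` may need a second pass.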

Similar Messages

  • How to determine number range for billing document based on company code ..

    Hi Friends!!
    can anybody tell me how to determine number range for billing document based on company code & tax departure country if required??
    Amit...plz help me!!

    Hi Amit,
    1. Define different Billing Document number ranges in  SPRO -> Sales & Dist -> Billing -> Define number ranges for billing docs. (VN01). Make sure that all are internal number ranges.
    e.g.
    No.  From number  To number   Current number  Ext
    A1   0930000000   0930999999
    A2   0940000000   0940999999
    A3   0950000000   0950999999
    2. Define a Ztable ZNUMB_RANGE as follows
    Comp. Code | Tax departure country | Billing Doc Type | Number Range
    100        | IN                    | F2               | A1
    200        | IN                    | F2               | A2
    200        | US                    | F2               | A3
    3. In user exit RV60AFZZ (USEREXIT_NUMBER_RANGE)
    Read table ZNUMB_RANGE for Number Range with Comp. Code, Tax country and Billing Doc.
    If found pass this number range value to us_range_intern.
    us_range_intern is a standard SAP variable that tells the program which number range to use when creating the document currently being processed.
    Let me know if you are clear.
    Thanks,
    Mandar
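
    The lookup described in step 3 can be sketched outside SAP as follows. This is a Python illustration of the logic only, not ABAP and not the actual code of the user exit; the function name is hypothetical and the rows simply mirror the example table above.

```python
from typing import Dict, Optional, Tuple

# Rows of the example ZNUMB_RANGE table:
# (company code, tax departure country, billing doc type) -> number range
ZNUMB_RANGE: Dict[Tuple[str, str, str], str] = {
    ("100", "IN", "F2"): "A1",
    ("200", "IN", "F2"): "A2",
    ("200", "US", "F2"): "A3",
}


def pick_number_range(company_code: str, tax_country: str, billing_type: str,
                      default: Optional[str] = None) -> Optional[str]:
    """Return the configured range, or the default when no row matches."""
    return ZNUMB_RANGE.get((company_code, tax_country, billing_type), default)


print(pick_number_range("200", "US", "F2"))  # A3
```

    When no row matches, the sketch returns the default, which corresponds to leaving the standard number range untouched.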

  • How can I exchange my pc created documents on Excel, Word, and PowerPoint with my mac and my Pages, Numbers and Keynote created content with my pc?  I need to create and edit between both.

    How can I exchange my pc created documents on Excel, Word, and PowerPoint with my mac and my Pages, Numbers and Keynote created content with my pc?  I need to create and edit and exchange between both types of operating systems.

    Your Windows system will not open any Pages, Numbers or Keynote documents. No applications exist for Windows that can open those formats. You will need to export your documents in Word, Excel and PowerPoint formats, respectively, before you'll be able to view and edit the documents on your Windows system. The iWork applications can natively open Word, Excel and PowerPoint documents, though there are limitations on what can be imported (for instance, Numbers does not support all possible Excel functions). Consult the documentation for the relevant iWork application for details on importing and exporting.
    If you are asking how to get the documents from one system to the other, then there are several ways to do that, including file sharing, using an external drive (hard drive, USB flash drive), or emailing the documents to yourself. Which would be best for your situation I can't say without more details about your usage.
    Regards.

  • HT204394 How can I expunge my iCloud account which has become contaminated and open a new one?

    How can I expunge my iCloud account which has become contaminated and open a new one?

    If you want to create a new iCloud account, go to Settings>iCloud, tap Delete Account, then sign back in with a different Apple ID to create the new account.  If you have any photo stream photos that you want to keep, save them to your camera roll before deleting the account.

  • HT1212 How do i restore my iphone when it has been disabled and cannot be restored through itunes because of the error message that says "The iphone software update server could not be contacted"?

    How do i restore my iphone when it has been disabled and cannot be restored through itunes because of the error message that says "The iphone software update server could not be contacted"?

    Have you jailbroken this device? Jailbreaking can stop the iPhone from talking to the software update server (which signs and accepts software updates). Please refer to the following article: Error 3194, Error 17, or "This device isn't eligible for the requested build"
    The iPhone will talk to the software update servers again once the hosts file is changed; it's a simple process. Sorry if the wording at the top was confusing.

  • I came home and my iphone is now frozen on the apple screen and it hasnt been doing anything i put it in the charger and its not doing anything how do i get it to work so i can text and call and stuff

    i came home and my iphone is now frozen on the apple screen and it hasnt been doing anything i put it in the charger and its not doing anything how do i get it to work so i can text and call and stuff

    I got to that page but clicking on the Subscribe Free had no effect.
    Nothing showed in the Podcasts nor the iTunes U.
    In fact I also subscribed to Stanford's lectures and despite actually managing to DL them there is still no subscription in iTunes U, just the Intro to Computer Science belatedly.
    Peter
    I'd like to attach the screen grab of the out of order and randomly DLed selection of Harvard's course, but the forum won't let me attach it even though it is only 456k. Correction, close post and retry and now it lets me. NB how intermittent the DLs are and in no particular order.

  • My contacts and whats app messages are shown on my sister's iphone! How can I secure my iphone and have a high level of security and privacy! Her Contacts are shown in my iphone as well!!

    I have an iPhone 6 with iOS 8.1. My contacts and WhatsApp messages are shown on my sister's iPhone! She has an iPhone 6 and iOS 8 as well. How can I secure my iPhone and have a high level of security and privacy? Her contacts are shown on my iPhone as well! Settings on the Mac and iPhone are a bit precise and sensitive. Is there any way to solve my issue and increase the safety, security and privacy of my iPhone and its data?

    Your problem is that she used your icloud ID to connect to icloud and thus had all your data synced to her device.  Contacts are not saved in a backup to icloud, since they are stored independently in the Contacts section of icloud.  If someone deletes them, they are gone.  If you had them on the PC would they be available in some backup you frequently make of the PC?

  • How do i fix a portrait photograph that has harsh sunlight and shadows in PSE12? [was: Help please =) ]

    How do i fix a portrait photograph that has harsh sunlight and shadows in PSE12?

    Try this:
    Duplicate the background layer, and convert this layer to black/white (ALT+CTRL+B)
    Select the young lady's face, and place this on a separate layer (CTRL+J)
    Open a Brightness/contrast layer at the top and group the top 2 layers (CTRL+G). Work the brightness and contrast sliders so that the 2 gray scale layers match as much as possible
    Keep the face layer, but delete the duplicated grayscale layer below
    Change the blending mode of the face layer to luminosity
    With the eyedropper tool, sample color from the original skin
    Open a blank layer at the top, and change blending mode to color
    Paint over the face to match colors. Touch up additional spots as needed, e.g. the hand
    Add a levels adjustment layer and work the sliders below the histogram to best advantage
    Sharpen slightly

  • How do I reset my security questions if i have no clue what the answers are to my old ones

    How do I reset my security questions if i have no clue what the answers are to my old ones

    The Three Best Alternatives for Security Questions and Rescue Mail
         1.  Send Apple an email request at: Apple - Support - iTunes Store - Contact Us.
         2.  Call Apple Support in your country: Customer Service: Contact Apple support.
         3.  Rescue email address and how to reset Apple ID security questions.
    A substitute for using the security questions is to use 2-step verification:
    Two-step verification FAQ Get answers to frequently asked questions about two-step verification for Apple ID.

  • How to determine the correct moment when Word has finished to write to a docx file

    Hello,
    we are currently passing a document to Microsoft Word for editing and want to take back the changed/saved document when editing is finished. But we have trouble determining the moment when Word has definitely saved all changes and it is safe to take back the file.
    To pass the document to Word we use ShellExecuteEx with the file name. Then we use the Running Objects Table and a File Moniker to wait for the file to be closed.
    Sample code:
    HRESULT hRes = S_OK;
    CComPtr<IMoniker> spIMoniker;
    CComPtr<IRunningObjectTable> pRT;
    // Get the Running Object Table (ROT).
    hRes = GetRunningObjectTable(0, &pRT);
    if (FAILED(hRes))
       TWTHROW1(TWERR_ANY_ERROR, hRes, _T("GetRunningObjectTable failed."));
    // CComBSTR frees the string on scope exit; a bare AllocSysString() result would leak.
    CComBSTR bstrTempFilename(m_strTempFilename);
    hRes = ::CreateFileMoniker(bstrTempFilename, &spIMoniker);
    if (FAILED(hRes))
       TWTHROW1(TWERR_ANY_ERROR, hRes, _T("CreateFileMoniker failed."));
    // Poll until the file leaves the ROT or a shutdown is requested.
    while (S_OK == pRT->IsRunning(spIMoniker) && !m_bShutdown)
    {
       Sleep(500);
    }
    So we wait for the file to be removed from the Running Objects Table and additionally wait until we can get exclusive access to the file before we take over the changed file. But it still seems to happen that we take the file too early. Hence it seems the file is removed from the Running Objects Table by Microsoft Word before saving the file has completed.
    Is this behaviour of Microsoft Word by design? What is the best practice for identifying the moment when writing the file has completed? Some other applications like Winzip seem to do the same thing.
    Thanks,
    Wolfgang
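
    One heuristic sometimes used for this kind of problem, sketched here in Python rather than the C++ of the question: treat the file as finished only after its size and modification time have stayed unchanged for a quiet period. This is a swapped-in technique, not something Word documents or guarantees; the function name and all timing values are hypothetical.

```python
import os
import time


def wait_until_stable(path: str, quiet_seconds: float = 2.0,
                      poll_seconds: float = 0.5, timeout: float = 60.0) -> bool:
    """Return True once size and mtime stayed unchanged for quiet_seconds."""
    deadline = time.monotonic() + timeout
    last_seen = None
    stable_since = time.monotonic()
    while time.monotonic() < deadline:
        try:
            stat = os.stat(path)
            current = (stat.st_size, stat.st_mtime_ns)
        except FileNotFoundError:
            current = None  # the file may be missing briefly while it is replaced
        now = time.monotonic()
        if current != last_seen:
            # Something changed: restart the quiet period.
            last_seen = current
            stable_since = now
        elif current is not None and now - stable_since >= quiet_seconds:
            return True
        time.sleep(poll_seconds)
    return False
```

    The check could run after the Running Objects Table wait and before the exclusive-access test. It narrows the race window but cannot close it completely.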

    Hi WolfGang,
    Thanks for posting in MSDN forum.
    Based on the description, you are developing an application with the Windows API. I would like to move it to the General Windows Desktop Development Issues forum.
    The reason we recommend posting appropriately is that you will get the most qualified pool of respondents, and other partners who read the forums regularly can either share their knowledge or learn from your interaction with us.
    Thanks for your understanding.
    Regards & Fei

  • Looking for a way to detect whether a document has been changed

    We have a document with a "validate" button.
    Pressing this button performs several validations and the result will be : "validated" or "not validated"
    The result is displayed to the end user as a document status (with its own color).
    Once the new "status" = "validated", the status should be automatically reset to "not validated" when the user makes one or more changes to the document.
    Is there an easy way to do this?
    Is there a sort of document status you can fetch?

    Niall's way will work but this one might be easier.
    Acrobat has a "dirty" flag that gets set each time a change is made to the doc. This is what tells Acrobat that you need to do a save when you quit. You can test this flag yourself. To access it use:
    event.target.dirty
    This will return true if a change has been made and false if no changes have been made. Note that if someone enters data into a field and then deletes that data the flag will still be true as a change was deemed to have been made.
    Paul

  • How does an off-card program know whether the applet has been personalized?

    The off-card program wants to know whether the applet has been personalized. If yes, the off-card program does not send the personalization command to the applet. Is there any way to query the applet state from the off-card program? Thanks.

    I don't know what card framework (OCF, JCOP tools or other) you use.
    For the JCOP tools framework it is very simple. Connect to your applet by creating an instance of com.ibm.jc.OPApplet. This class has a method, public int getState(), which returns "the applet privileges" in the form of the flags NOT_AVAILABLE, LOGICALLY_DELETED, INSTALLED, SELECTABLE, PERSONALIZED, BLOCKED and LOCKED.
    Jan

  • How do I transfer a working excel doc to my iPad and what programs do I need? Thanks

    I am trying to transfer an existing excel doc to my iPad so I can use it on site but I don't know how to. Can anyone let me know how to transfer the doc from my pc to my iPad and then what app is best to buy to use excel on iPad. Thanks so much

    Have a look at the following (in my order of preference)
    http://itunes.apple.com/sg/app/quickoffice-pro-hd-edit-office/id376212724?mt=8&ls=1
    http://itunes.apple.com/sg/app/documents-to-go-premium-office/id317107309?mt=8&ls=1
    http://itunes.apple.com/sg/app/office2-hd/id364361728?mt=8&ls=1

  • Using a PC; converting a Word 2010 document to PDF; upon completion the PDF document has a dark and/or patterned background; I have tried "removing" the background but that has not worked.

    Can someone help me remove what appears to be a 'background' that is created when converting a Word 2010 doc to a PDF format.  I am using the 'save as' method and selecting PDF as the file type.  I have used this method for over a year and have just had this "background" problem today.  It is not happening on all documents; so far it's just happened on 2 resumes but not on the accompanying cover letters which I think is very odd.
    Thanks for any help you can provide!

    Sara –
    I’ve attached the converted documents so you can see what I mean.  The Word document was not shaded in these ways; it was just a white background…or no background at all.
    I don’t know if these were created from scanned documents.
    Thanks!
    Sue Carline

  • How can I view pdf files that existed before I downloaded a newer reader version?

    I'm unable to view older pdf files after downloading the newest version of reader! How can I fix this?

    They should open the same as they did with the older version.
    How are you trying to open (double clicking, using file>Open from within Reader, clicking on a link on a webpage...etc) them and what happens when you try?

Maybe you are looking for

  • PO Change according to material group and G/L Account no.

    Hi Expert, I have a requirement of "PO Change according to material group and G/L Account no.". I am using BAPI_PO_CHANGE. But it is giving error. "I 06 684 Releases already effected are liable to be reset E BA 003 Instance 4500010532 of object type

  • Linked image border

    This is small but irritating issue. Ever since upgrading to CS3, whenever I add a link to an image Dreamweaver puts the blue border around it. I have to manually set the border to 0 in the properties to get rid of it. Like I say it's no big thing but

  • Compare previous column value in BPS Layout

    How to compare the two column values in BPS layout. My layout format PO NO GL Actual Amount 1001 701 1200 User Entry Actual Value is 1200, User will enter the Amount but it should be equal to 1200 or less then that. If user entered more than 1200 in

  • How could I copy an image under labview and to stick it right A side of old

    copying shapes and stick them x times after knowing that one will differemment modify the color of each shape, I have Imaq Vision Builder

  • How to keep the repeating table and all of its contents on the same page

    Hi All, I use 5.6 build version of xml publisher. My problem is about, repeating table in rtf is divided, so i want to keep the object and its contents on the same logical page. How it can be done? Thanks.