Trying to OCR pdf, pdf says it can't perform bc it already contains renderable text-but does not.

I work for a large agency, and we receive PDF's all the time. 98% of the time I am able to OCR a document with no issues. Just recently I have come across this issue several times, and was wondering if anyone can solve this irritating problem!
*Acrobat 8.1 - When going to OCR the document, I receive the following message " Acrobat could not perform recognition (OCR) on this page because this page already contains renderable text. However, it does not. When you go to select text or search for anything the whole page is selected (like it's still in a "picture" format, not a document format that you can search, ect.)
I am not sure if it is how the document is uploaded originally by the other party that causes this, but the only thing I can do as a work-around - is to print out the entire document, scan and then I can OCR the document just fine! The problem is, if the document is 400 pages or so, this can be a huge waste of time, and money just to be able to search the PDF.
*I have also checked the pdf properties to see if this is some sort of permissions issue, and there are not permissions/security settings in place.*
PLEASE HELP! Any assistance in this matter would save me a lot of time, and of course (my sanity!).
Thank you in advance!

While the alert speaks to "renderable text" that is a simplification. The issue is that you've PDF page content consisting of at least one renderable "character".
Look at font families - you will observe that there are many characters that are not "text" characters (i.e., linguistic characters).
So, there's a "renderable character" present. It may be an alpha numeric that has a font color the same as the page background. It may be under the image and thus not visible to the eye.
You might be able to determine just what is present.
You could export the page of interest to a text file then view that file.
You could deplay the page of interest in Acrobat Pro then select the "Content panel" to view the content tree.
Locate and click on the page number for the page of interest.
From the Content panel's Options menu select "Highlight Content".
Walk down the tree. Select the content containers in turn and observe what is highlighted on the PDF page.
Where might the renderable character come from ? Typically that'd be associated with something in the work flow.
Not always easy to find so don't take anything in the work flow for granted.
Be well...

Similar Messages

Maybe you are looking for

  • How soon are they going to fix the problems with iPod Touch IOS 6?

    I'm using iPod Touch 4G with IOS 6.  I didnt noticed the crashes problems until now. I am having a problem with IOS 6.  I've read about the App store problem with crashes that everybody complaining about somewhere in community forum.  But there are m

  • No keyboard in App Store

    When I go into App Store the keyboard does not appear. I have never turned on Bluetooth so that is not the problem as others have been a advised to do. The window comes up to ask I'd and password but no keyboard appears. Can anyone help me.

  • NLB on 2 different switch running HSRP

    Hi, we have implemented NLB on the SGI Server running IRIX OS, it works fine. the connectivity is 2 NIC connected on 3750-1(ruuning HSRP),now if i connect 1 of NIC in to the other switch 3750-2(running HSRP) will the NLB workz & will i get the 2Gbps

  • Set type COMM_PR_UNIT does not exist

    Hi All How do i make set types available for assignment to categories if they are not available but are there in comm_settype. Getting errors as follows: Set type COMM_PR_UNIT does not exist. Please assist Regards, Patrick Edited by: Patrick Chidarar

  • Import netscape.javascript.JSObject;

    import netscape.javascript.JSObject; This library is deprecated? In this case. Can I replace it with another solution? Thanks.