How to force Acrobat to perform OCR?

Hi Folks,
Is there any way I can force Acrobat to perform OCR on pages that it says have "renderable text"? The problem is that when I copy and paste from the document, I just get a bunch of gobbledegook. You can see the problem document here: http://www.gofc-gold.uni-jena.de/redd/sourcebook/Sourcebook_Version_Nov_2009_cop15-1.pdf
Thanks!
Acrobat Professional 9.2.0
XP SP3

That there is no real solution to this after many years is an indication that either:
Adobe does not care, or
Adobe does not get it.
I was about to suggest my employer upgrade to a newer version of Acrobat, but see no indication that Adobe has really fixed the problem.
PDFs can contain multiple layers. When a page has bitmapped text from a scan plus "renderable text," such as a footer containing a document identification number (millions of such pages are used in the legal world every day), then for reasons only the programmers at Adobe can explain, Acrobat is incapable of running OCR on the bitmap and leaving the rendered text in its own layer.
This is not a difficult problem. Since Acrobat can deal with layers making up each page, it should be somewhat trivial to have the OCR function find the bitmapped layer and operate on that layer.
Instead Acrobat gives the LAME error message.
The "fix" of saving to TIFF and back is a pain-in-the-arse work-around. It even froze Acrobat when I tried to save to TIFF.
The "fix" of ignoring renderable text in the margin area and running OCR on the interior bitmap is also not acceptable.
Adobe, you once were a reputable company. You gave the world Postscript. You gave the world PDFs. Why can't you get the OCR function to work properly in Acrobat?

Similar Messages

  • Adobe Acrobat 9 Standard is still seeing my old scanner install.  How to force it to see new scanner install.

    Adobe Acrobat 9 Standard is still seeing my old scanner install.  How to force it to see new scanner install?  Scanner is recognized by and works fine in Windows.  Acrobat however still sees Fujitsu fi-6130dj #3.  I need it to see Fujitsu fi-6130dj which is what is listed in Device Manager.  (Windows XP SP3).
    Thank you for any suggestions.

    To provide more detail, here is what was done to resolve the issue, though there may be a better way.
    Background. When you install a USB device on a PC, it is listed in Windows Device Manager as installed hardware. If you take the USB plug for any device and plug it into a different USB port on the PC, the PC will install drivers again for that device. If you set a system environment variable for Device Manager to show non-present devices, you can see all of the times this may have occurred. In this specific instance there were five installations in Device Manager for one piece of hardware, a Fujitsu fi-6130 scanner. The first listing was Fujitsu fi-6130dj, then Fujitsu fi-6130dj #2, Fujitsu fi-6130dj #3, and so on. Adobe Acrobat only saw the third instance of these driver installs. Once I cleaned Device Manager of all Fujitsu scanner installs, I was able to have a single set of drivers installed for this Fujitsu, but Adobe still recognized the third instance of the driver install, and I could not figure out how to force it to see what Device Manager was seeing: a single instance of the scanner being installed (with no #3 behind it).
    Steps performed to try to resolve.
    Repair install of Adobe - no luck - still sees the #3 instance of the scanner
    Deactivation, uninstall, reboot, reinstall, reactivation of Adobe - no luck, same behavior as above
    Complete removal of all scanner related software.  This included the ISIS drivers in Add/Remove Programs and all other related software.  This also included deleting the contents of the TWAIN_32 directory found within the root of the Windows directory and also included deleting the scanner from Device Manager.
    Rebooted.
    Reinstalled complete packages of both TWAIN and ISIS scanner drivers with the scanner disconnected. Powered the scanner back on and let Windows install the drivers for the hardware. Only at this point did Adobe see the new install of the Fujitsu scanner and stop looking for the #3 instance of that scanner install.

  • Why can't I interact with Acrobat Pro XI while it is performing OCR on a set of large files?

    I am running Acrobat Pro XI on a beefy MacBook Pro with lots of RAM and CPU to spare. I need to perform OCR on hundreds of pages of text spread across dozens of files. That understandably is time consuming and is not perfectly parallelizable.
    However, why am I prevented from doing *anything* with Acrobat while the OCR process runs?
    Two suggestions:
    1. Allow the user to perform OCR on discrete files in parallel. (I realize this will consume additional CPU and memory.)
    2. More urgently, allow the user to access non-OCR features of the application while OCR is in progress.

    Maybe it should be 64-bit and have multi-threading capabilities. I understand the problems associated with that, though, and I won't hold my breath.

  • How do I force Acrobat 11 Professional users to use "Save As" versus "Save"?

    How do I force Acrobat 11 Professional users to use "Save As" versus "Save"? I want the users to type in a fillable form, but not save their edits on the original form. I want them to be able to save with a different file name.

    You can't really force it, but you can encourage it. For example, you can set the file as read-only and use a script to show an instruction to use "Save As" before the file is saved.

  • Acrobat V9 Pro OCR can't produce a file

    I am trying to perform OCR on a credit card statement. The statement has 3 PDF pages and, except for the non-regular header info at the top of the page, everything is in nice columns (five of them). I specify the output file to be an Excel spreadsheet. The OCR engine works OK on pages 1 and 3. It chokes on page 2 with an error that it cannot recognize any table, OR sometimes produces this message: Acrobat could not perform recognition (OCR) on this page because: This page contains renderable text.
    I tried the technote solution of converting to .tiff (http://kb2.adobe.com/cps/333/333110.html), but that did not work. (Actually the instructions are not clear: do you rerun OCR on the .tif file or on the newly created .pdf that was made from the .tif file? No matter, I did both, and both failed.)
    I have also separated the .pdf doc into three individual files and OCR'ed page two, with the same results.
    I took page2.pdf, scanned it (not with Acrobat) at 600 DPI, and tried to OCR it again, with the same results.
    The page contains a bar code in the margin. Could this be killing the OCR process? I tried to edit out some of the noise but can't figure out how to delete parts of the .pdf doc.
    Also, I highlight only the columns, select Document -> OCR Text Recognition -> Recognize Text Using OCR, and it does its thing and says it generates an output document, but WHERE? It does not ask me where it should be placed, and I have no clue where it sticks it.
    Any help is appreciated.
    John
    sample is below:

    Really unresolved, but OK.

  • I have paid for Adobe Acrobat XI Pro OCR but it will not recognise a letter in the serial number for me to update my adobe account and any of the programs I have tried using to convert a PDF file on a Mac to a word doc it is converting file with funny sym

    I have been supplied info in an email to copy and paste for a new product purchase "Adobe Acrobat XI Pro OCR" but am unsure how and where to copy and paste to. There is also a link below the serial number. I have tried entering serial number into my Adobe ID but it is not recognising one of the letters in the 24 Digit serial number??? Also I have tried other products previously downloaded to convert a 7 page PDF file on my Mac and convert it to a Word doc but everything I have tried is converting the file to display some text correctly but also displays random symbols and fonts in place of the handwritten info filled in on the form... also getting blank pages included instead of the info??? Would appreciate some help... I am older generation and not always tech savvy, and it is doing my head in haha.

    Hi Jock,
    I've checked your account, and all is well there. Please make sure that you're logging in with the same Adobe ID/password that you used when you signed up.
    Then, clear the browser cache, and try logging in directly to https://cloud.acrobat.com.
    Please let us know how it goes.
    Best,
    Sara

  • Reader XI & IE10 - how to force PDFs to be viewed in READER?

    I have NO Adobe add-on installed in IE10. Upgraded from Reader 8.1 to Reader XI. Under Reader 8.1 there was an option in settings to force PDFs to be opened in READER instead of IE. I cannot find that option now. Went to Settings / Manage Add-ons to disable Adobe PDF there, but it's not installed, so I can't disable it.
    Any ideas on how to force PDFs to be opened in READER instead of IE10?
    Thanks!

    I do not quite understand your initial comment.  You write that you do not have the Adobe Reader add-on installed on IE10, yet PDFs open in IE?
    See this article on how to enable viewing PDFs in a browser; you will have to do it the other way around to get the opposite effect: http://helpx.adobe.com/acrobat/using/display-pdf-browser-acrobat-xi.html

  • How to force JEditorPane to be refreshed?

    Dear all,
    I have a problem refreshing a JEditorPane using the setPage( ) method.
    I've written a simple Java browser with an analysing system. When a user clicks a hyperlink, the setPage( ) method is called. It is followed by another method, called method2, which analyses the content of the page and takes several seconds to minutes to finish.
    The problem I have now is that the new HTML page is displayed only after method2 has finished. How can I force the page to be displayed before running method2?
    Thanks a lot.
    Frances

    Just to add - invokeLater will free up the swing thread so setPage completes in a timely manner but
    StanislavL's example will run method2 on the swing thread also, just as a separate action
    performed at a later time. This is good if method2 makes further calls to swing methods but it
    does mean that the analysis will block the swing thread for however long it takes. If that's OK then fine, otherwise modify as:
    pane.setPage(url);
    // Run the analysis on its own thread so the Swing thread stays free to repaint.
    Thread t = new Thread() {
      public void run() { method2(); }
    };
    t.start();
    Check out this article for further info: http://java.sun.com/products/jfc/tsc/articles/threads/threads1.html
    Regards
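As a hedged sketch of the second option above (running the analysis off the Swing thread), `SwingWorker` from `javax.swing` can run the slow work in the background and hand the result back on the event thread, where it is safe to touch Swing components. The class name `BackgroundAnalysis`, the `start` helper, and the idea that the analysis returns a `String` are assumptions for illustration and are not part of the original poster's code.

```java
import java.util.function.Consumer;
import java.util.function.Supplier;
import javax.swing.SwingWorker;

// Hypothetical helper: runs a slow analysis in the background and reports
// the result on the Swing event thread.
class BackgroundAnalysis {

    static SwingWorker<String, Void> start(final Supplier<String> analysis,
                                           final Consumer<String> onDone) {
        SwingWorker<String, Void> worker = new SwingWorker<String, Void>() {
            @Override
            protected String doInBackground() {
                // Executed on a worker thread, so the event thread is not blocked.
                return analysis.get();
            }

            @Override
            protected void done() {
                // Executed on the event thread once doInBackground has finished.
                try {
                    onDone.accept(get());
                } catch (Exception e) {
                    onDone.accept("analysis failed: " + e);
                }
            }
        };
        worker.execute();
        return worker;
    }
}
```

In the browser from the question this could be called right after `pane.setPage(url)`, passing a lambda that wraps method2 and a callback that updates whatever component shows the analysis result; both of those are stand-ins, since the original post does not show them.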

  • How can I remove asm and ocr installation in AIX?

    Hi,
    I tried to install a single instance using ASM on AIX,
    but the installation was not successful.
    Now I want to remove the ASM and OCR installation and then
    do a new, clean installation.
    How can I remove ASM and OCR?
    And how can I check that the removal is complete?

    1) ASM Instance Clean-Up Procedures
    Stop all of the databases that use the ASM instance that is running from the Oracle home that is on the node that you are deleting.
    On the node that you are deleting, if this is the Oracle home from which the ASM instance runs, then remove the ASM configuration by completing the following steps. Run the command srvctl stop asm -n node_name for all of the nodes on which this Oracle home exists. Run the command srvctl remove asm -n node for all nodes on which this Oracle home exists. If there are databases on this node that use ASM, then use DBCA Disk Group Management to create an ASM instance on one of the existing Oracle homes on the node, and restart the databases if you stopped them.
    If you are using a cluster file system for your ASM Oracle home, then ensure that your local node has the $ORACLE_BASE and $ORACLE_HOME environment variables set correctly. Run the following commands from a node other than the node that you are deleting, where node_number is the node number of the node that you are deleting:
    rm -r $ORACLE_BASE/admin/+ASMnode_number
    rm -f $ORACLE_HOME/dbs/*ASMnode_number
    If you are not using a cluster file system for your ASM Oracle home, then run the rm or delete commands mentioned in the previous step on each node on which the Oracle home exists.
    2) Deleting an Oracle Clusterware Home Using OUI in Silent Mode
    !!! Oracle recommends that you back up your voting disk and OCR files after you complete the node deletion process.
    If you ran the Oracle Interface Configuration Tool (OIFCFG) with the -global flag during the installation, then skip this step. Otherwise, from a node that is going to remain in your cluster, from the CRS_home/bin directory, run the following command where node2 is the name of the node that you are deleting:
    ./oifcfg delif -node node2
    Obtain the remote port number, which you will use in the next step, using the following command from the CRS_home/opmn/conf directory:
    cat ons.config
    From CRS_home/bin on a node that is going to remain in the cluster, run the Oracle Notification Service Utility (RACGONS) as in the following example where remote_port is the ONS remote port number that you obtained in the previous step and node2 is the name of the node that you are deleting:
    ./racgons remove_config node2:remote_port
    On the node to be deleted, run rootdelete.sh as the root user from the CRS_home/install directory. If you are deleting more than one node, then perform this step on all of the other nodes that you are deleting.
    From any node that you are not deleting, run the following command from the CRS_home/install directory as the root user where node2,node2-number represents the node and the node number that you want to delete:
    ./rootdeletenode.sh node2,node2-number
    If necessary, identify the node number using the following command on the node that you are deleting:
    CRS_home/bin/olsnodes -n
    Perform this step only if you are using a non-shared Oracle home. On the node or nodes to be deleted, run the following command from the CRS_home/oui/bin directory where node_to_be_deleted is the name of the node that you are deleting:
    ./runInstaller -updateNodeList ORACLE_HOME=CRS_home "CLUSTER_NODES={node_to_be_deleted}" CRS=TRUE -local
    Deinstall the Oracle Clusterware home from the node that you are deleting using OUI as follows by running the following command from the Oracle_home/oui/bin directory, where CRS_home is the name defined for the Oracle Clusterware home:
    ./runInstaller -deinstall -silent "REMOVE_HOMES={CRS_home}"
    Perform step 9 from the previous section about using OUI interactively under the heading "Deleting an Oracle Clusterware Home Using OUI in Interactive Mode".

  • Cannot perform OCR (character recognition) on a pdf

    While attempting to perform OCR I get the following message:
    "Unable to process the page because the Paper Capture recognition service unexpectedly terminated".
    I've tried JPEG files on RGB, Grayscale, 600 dpi, 400 dpi, 300 dpi, but it keeps on not performing OCR. How do I make OCR work? Thanks.

    Use TIFF files to get OCR to work.

  • How to force refresh of data through browser or PDF?

    We have the dashboard set to refresh every minute.  We are pulling the data using XML from DB.  When we are in browser and clear the browser cache and then reload the .swf... the data is updated.  We haven't been able to figure out how to force the cache-clear and data refresh with either swf or pdf.
    Your help is greatly appreciated.

    Hi Jeff,
    Is the XML coming from a web page or a web server?
    If yes, then you can give this a go. To stop the caching, mark your web page with extra tags to say it has expired.
    HTML page example:
    <HEAD>
        <META HTTP-EQUIV="PRAGMA" CONTENT="NO-CACHE" />
        <META HTTP-EQUIV="EXPIRES" CONTENT="0" />
    </HEAD>
    JSP example:
    <%
      // Stop Internet Explorer from caching the results of this page.
      // We do this so that every time Xcelsius calls this page it see the latest results.
      response.setHeader("Cache-Control","no-cache"); //HTTP 1.1
      response.setHeader("Pragma","no-cache"); //HTTP 1.0
      response.setDateHeader ("Expires", 0); //prevents caching at the proxy server 
    %>
    Regards,
    Matt
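If the server-side headers cannot be changed, a different and commonly used technique (not the one Matt describes) is to make every request URL unique, so that a cached copy never matches. A minimal sketch follows; the class name `CacheBuster`, the method `withCacheBuster`, and the parameter name `_ts` are all assumptions, and it also assumes the dashboard lets you build the XML URL dynamically.

```java
// Hypothetical helper: appends a changing query parameter so that each
// refresh requests a URL the browser has not cached yet.
class CacheBuster {

    static String withCacheBuster(String url, long stamp) {
        // Keep any #fragment at the very end of the URL.
        int hash = url.indexOf('#');
        String fragment = hash >= 0 ? url.substring(hash) : "";
        String base = hash >= 0 ? url.substring(0, hash) : url;

        // Use '&' if the URL already has a query string, otherwise start one.
        String separator = base.contains("?") ? "&" : "?";
        return base + separator + "_ts=" + stamp + fragment;
    }

    public static void main(String[] args) {
        // The current time in milliseconds differs on every refresh.
        System.out.println(
            withCacheBuster("http://example.com/data.xml", System.currentTimeMillis()));
    }
}
```

The server simply ignores the extra parameter, so the XML it returns is unchanged; only the cache key differs.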

  • How to improve query performance at the report level and designer level

    How can I improve query performance at the report level and the designer level?
    Please let me know in detail.

    First, it's all based on the design of the database, the universe, and the report.
    At the universe level, check your contexts and your joins very well to get the optimal performance out of the universe. Keeping your joins on key fields will give you the best performance.
    At the report level, try to make the reports as dynamic as you can (parameters and so on).
    When you create a parameter, try to match it with the key fields in the database.
    good luck
    Amr

  • How to Measure Function Module Performance?

    Please can you tell me how I can measure the performance and trace the actions of a Function Module in R/3?
    The function module in R/3 is run when a user calls a WebDynpro action from a WebDynpro screen within the SAP Portal.
    I have tried running a trace on a user (ST05) but that only shows table actions (e.g. reads/fetch etc.). Also it does not appear in ST04 or ST03N. I would like to know how long the program actually takes to run.
    Thanks.
    Paul

    Hi,
    If I want to measure the runtime required to run some ABAP, I use SE30. However, I have used it only for normal Dynpro applications, not WebDynpro.
    The detail level of the created trace can be configured. The aggregation level should be set to "Full" or "By call" at the beginning. Disabling aggregation leads to huge trace files.
    You can select which statements should be traced. If you disable an option, its runtime is not lost but added to the traced action at the next level. If, for example, "Open SQL" is disabled, the time used by it is added to the net time of the method, function module, or subroutine. Otherwise, if "Open SQL" is enabled, the net time of a function module does not include SQL time. SQL time is then listed separately.
    Greetings

  • HOW TO USE A SINGLE PERFORM FOR VARIOUS TABLES ?

    perform test TABLES t_header.
    select
           KONH~KNUMH
           konh~datab
           konh~datbi
           konp~kbetr
           konp~konwa
           konp~kpein
           konp~kmein
           KONP~KRECH
           FROM konh INNER JOIN konp
                  ON konp~knumh = konh~knumh
           into table iTABXXX
            "ANY TEMPORARY INTERNAL TABLE.
           for all entries in t_header
           where
                 konh~kschl = t_header-kschl
             AND konh~knumh = t_header-knumh.
    endform.
    How can I use the above PERFORM for various internal tables of DIFFERENT LINE TYPES that all have the fields KSCHL and KNUMH?

    You can use a single PERFORM.
    See this example; I hope this is what you are expecting.
    tables : pa0001.
    parameters : p_pernr like pa0001-pernr.
    data : itab1 like pa0001 occurs 0 with header line.
    data : itab2 like pa0002 occurs 0 with header line.
    perform get_data tables itab1 itab2.
    if not itab1[] is initial.
    loop at itab1.
    write :/ itab1-pernr.
    endloop.
    endif.
    if not itab2[] is initial.
    loop at itab2.
    write :/ itab2-pernr.
    endloop.
    endif.
    *&      Form  get_data
    *       text
    *      -->P_ITAB1  text
    *      -->P_ITAB2  text
    form get_data  tables   itab1 structure pa0001
                            itab2 structure pa0002.
    select * from pa0001 into table itab1 where pernr = p_pernr and begda le sy-datum and endda ge sy-datum.
    select * from pa0002 into table itab2 where pernr = p_pernr and begda le sy-datum and endda ge sy-datum.
    endform.                    " get_data
    Regards
    vasu

  • How to force a new password in portal with LDAP user? external users

    With an external portal (used by agents that do not work for you or reside in your office), company policy is for passwords to be changed every quarter.
    If the users are created as LDAP users, how can we force them to change their password when required?
    Does this need a custom application, so that when they log in to the portal after the quarter has expired, the portal asks them to enter a new password that becomes valid for the next quarter?
    Or is the alternative internally deleting the passwords and emailing all the users a new one?

    Hi Glenn,
    We have a problem. When we create a user in LDAP and log in to the Portal with that user, we get the password change screen. But when we create a user in LDAP, change that user's password in LDAP, and the user then tries to log in to the Portal, we do not see the password change screen.
    If we change the password of that user through the Portal, however, we do see the change password screen.
    Can you help with how we can force the user to change the password when we change it in LDAP or in the SAP system?
    Regards
    Trilochan
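The quarterly rule from the original question can be sketched as a small check. This is only an illustration of the policy logic, not of any portal or LDAP API: the class name `QuarterlyPasswordPolicy` and the method `mustChange` are hypothetical, and it assumes the date of the last password change is available from wherever the passwords are managed.

```java
import java.time.LocalDate;
import java.time.temporal.IsoFields;

// Hypothetical policy check: a password must be changed once the calendar
// quarter in which it was last changed is over.
class QuarterlyPasswordPolicy {

    // Numbers the quarters consecutively across years so they can be compared.
    private static int quarterIndex(LocalDate date) {
        return date.getYear() * 4 + date.get(IsoFields.QUARTER_OF_YEAR) - 1;
    }

    static boolean mustChange(LocalDate lastChanged, LocalDate today) {
        return quarterIndex(lastChanged) < quarterIndex(today);
    }

    public static void main(String[] args) {
        LocalDate lastChanged = LocalDate.of(2024, 3, 31);
        System.out.println(mustChange(lastChanged, LocalDate.of(2024, 4, 1)));
    }
}
```

A login hook could call `mustChange` and redirect to the change-password screen when it returns true; whether the portal offers such a hook is not stated in the thread.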

Maybe you are looking for

  • How to add Document and Comment in BI Query

    Hi All, I need some help from you. I am trying to upload a document and comment in my BI query. After executing the query, if I right click and choose Document --> Upload --> Comment/Document, I am getting the following msg 1. Cannot get properties for assignment

  • No logical system found in the drop down under SAP authentication in CMC

    Hi Swapna, This is quite simple. Click on New and enter the following information: System e.g TS6 Client e.g. 800 Application Server e.g. yourservername System number e.g. 00 Username e.g. Crystal Password e.g Password and Language e.g. EN Press Upda

  • Mismatch values

    Hi friends, in the production report the client requirement is in two ways: 1) Export Value Mismatch (I think it is currency) 2) Not Assigned / Material Code Display in Description column. How do I check this? Please tell me step wise and give me the R/3 tcodes.

  • Sync Manager doesn't actually SYNC

    Am I correct in discovering that the Sync Manager v6.0x does not actually SYNC MP3 audio files to the Zen 32GB correctly? I have spent a lot of hours trying to get my PC-based library sync'ed up with the Zen. Discovered earlier that you can add any f

  • I need to insert some silence in my audio

    I want to give the effect of a natural pause so I want to insert some silence at various locations but can't figure out where because I can't get the audio to preview in the "Edit Audio" section. I see the Play Audio button but it won't play - it als