Even on PDFs scanned at 600dpi, OCR ignores inter-word spaces.

I'm creating PDFs from clear, legible printed documents scanned at 600 dpi, then using OCR on the PDPa. The recognized text, when copied and pasted into fresh documents, appears as a solid block of characters with no spaces between words. What's wrong? I'm using an iMac, OS 10.7.3 and a Samsung SCX-4521F printer/scanner/copier.  

When you recognize the text in the PDF, what setting are you using for the PDF Output Style?

Similar Messages

  • OCR and hidden text in PDF scans of historic documents

    I need to edit the hidden text behind a scanned PDF image of a document.  The image must remain as an “exact” copy of the original scanned document.
    I used Acrobat Pro (versions 7 and 9) to make PDF images of old typed documents from the 1940’s.  When I open those images and run OCR in version 9, then examine the hidden (invisible) text layer behind the image, there are errors.  For example, the word “book” has been picked-up by the OCR as the word “look.”  I need to change the “l” to a “b” in order to make the PDF accurate when it is searched at a later date. 
    I have checked many user forums.  Most people imply that hidden text can be viewed, but NOT edited in Acrobat Pro 7 and 9.  (Hidden text can be viewed in Version 9 by selecting “Document” “Examine Document” and then clicking on the “+” symbol next to “Hidden Text,” then clicking “Show preview.”)  Some say to use Adobe Capture 3.0 to edit hidden text.  Others say to use Photoshop or Illustrator to edit hidden text (I think these folks may have been confused, because Photoshop and Illustrator would be used, logically, to edit the image ON TOP OF the hidden text).  Yet another person seemed to say that a hidden text editor was added to Acrobat 8, but was taken away in Acrobat 9.  (I can’t verify that because I don’t have version 8.)
    The closest answer I was able to find involved using the Text Touch Up Tool on top of the image to edit hidden text behind it, but when you do that you are typing “blind.”  In other words, you highlight a spot on the image (top layer) where you THINK the error MIGHT be, and you type the correction without being able to see what you are typing over.  Then, you go back to the “Examine Document” procedure (described above) to see if you “hit” your mark, and if not, you redo it until you do “hit” your mark.  With the number of documents and corrections that we have, that procedure would be too labor intensive and thus a budget breaker.
    If we have to buy more software, my preference would be to buy a genuine Adobe product because I have experienced problems in the past switching back and forth between Adobe products and other PDF manipulation software.
    Can anyone answer any of these questions: 
    (1) Is there a way in Acrobat versions 7, 8 or 9 to edit hidden text, and if so, how? 
    (2) What Adobe software (other than Acrobat) will edit hidden text behind a PDF image? 
    (3) Assuming no Adobe product will edit hidden text behind a PDF image, is there any non-Adobe products that will do that?
    Thank you!

    Hi,
    Unless you use Acrobat 8 Pro's Formatted Text & Graphics" or Acrobat 9 Pro's ClearScan you will find that there is no
    practicable means of editing the OCR "hidden text" in a PDF.
    The TouchUp text tool (Advanced Editing toolbar) is reliant upon the selected text having an available system font to use during touchup. However, both Searchable Image and Searchable Image (Exact)  OCR output is of text rendering mode 3 (invisible text) that is provided from within Acrobat and not any installed system or other application installed font.
    With Searchable Image (Exact) you have the untouched image augmented by the invisible text which is provided as a user aid for search or find with Adobe Reader or Acrobat. The invisible text is not intended to support word processor like editing.
    To your questions:
    #1. There is no practicable way to edit invisible text (text rendering mode 3) with Acrobat (any past or current release).
    #2. None.
    #3. A good question. Perhaps a specialty program. Keep in mind, many products provide a promise but those those that actually deliver tend to be expensive.
    Something to play with. Using Acrobat 9 Pro or Pro Extended, try the Preflight Fixup to embed hidden text.
    Then try using the TouchUp Text tool. You may also want to see if you can change the font type of this newly embedded font.
    (use copies of the "real" files - just in case <g>).
    Be well...

  • I can no longer attach pdf scans to my emails.   It has all changed.    How do I do it now please.

    I have always attached pdf scans to my emails veyr easily.   Now having been away for five weeks things have changed and I can  no longer do it.   Please help me to find a new way to do this.   I have scanned a document and can open it but cannot attach it what is wrong?.

    You really need to provide some details...
    what is your operating system?
    what is your email client?
    what is your Adobe Reader version?
    what means "can not" (or "no longer");
    how do you try attaching the file?
    what happens when you try?
    also, how large is your scanned document?

  • Forgotten security password for PDF scans

    Is there a way I can unlock my pdf scans? I set security to stop page extraction from a group of pdf scans done years ago and I have now forgotten the password to use to edit them.
    I now maintain a list of passwords but long ago did not. I'm guessing I saved it somewhere but haven't had success locating the password. Thought this might be a shot at being able to edit them.
    I set the files to function at Acrobat v6 and above, if that is any help.
    My thanks,
    MEL

    Not with any Adobe products, and other products are not discussed on these forums...

  • I can't export a PDF scanned file into WORD, help please.

    I can't export a PDF scanned file into Word, please help.

    Hi solarpowerprincess,
    Are you having trouble signing in to your subscription, or having trouble converting files once you're signed in. If you can't sign in, it could be that your subscription just hasn't finished processing yet. Please let us know if you're unable to sign in and we'll go from there.
    Best,
    Sara

  • PDF Scanned in a PDF Text document?

    How can I convert a PDF Scanned file in a PDF text document??
    Regards

    Unfortunately, you cannot change a scan into live text using Adobe Reader. At a minimum you would need Adobe Acrobat to run what's called Optical Character Recognition.
    What I've recommended to people in the past is to use the drawing markup tools to draw boxes around the text they want highlighted then play around with the fill color of the box until they get something close to what they want.

  • Adobe Reader 10.1.3 crashes PDF scan using Presto Scan Buttons

    Following an automatic update from 10.1.2 to 10.1.3 on 12/4/12, Presto scan buttons crashed every time I tried to scan to pdf using the default setting of text under image.  If set to image only It worked.  After much email discussion with NewSoft, we determined that the crashes had only begun after the Adobe update.
    I couldn't establish why there would be a connection, since the scan to pdf should open in Presto Page Manager before being passed manually to AR10.  The scan button applet was crashing before Page Manager opened
    I have now unistalled AR 10.1.3 and installed AR 10.0.1 and Presto now works fine again.  Is there a setting in 10.1.3 that I have overlooked or is there something seriously wrong with the update?
    Pete

    Hi,
    With respect I'm not convinced by your response. Presto Scan Buttons and indeed Page Manager has worked perfectly since it was installed in November 2011 and updated during 2012.  as far as I'm aware it worked fine following the AR 10.1.2 update.
    I run two different PC's, one, Win 7 Pro, the other Win 7 home Premium.  Following the AR 10.1.3 update on 12/4/12, Presto PDF scan button crashes consistently on both. 
    The problem is caused when using the Presto default setting of text under image.  Using image only works fine.
    NewSoft say it's not there fault, their program worked fine prior to your update.  Frankly I have to agree.  I downgraded to Ar 10.00 and I had no problems.  as soon as I installed the AR 10.1.3 update again Presto crashed again.
    The issue is related to the way in which the image is handled.  There are 3 settings; text and image, text under image and image.  Only image works.  AR 10.1.3 must have affected this.
    Please reconsider your response.
    Kind regards
    Pete

  • Problem: Reader XI won't open Lexmark pdf scans

    I recently updated to Adobe Reader XI, and it won't open pdf scans from my Lexmark all-in-one scanner.  (It worked fine with my older version of Reader.)  Any idea how I can fix it?
    (FYI, I use a PC-compatible laptop with Windows 7.)
    Thanks,
    Sholly

    Please mark the question as "answered" so others who have the same trouble can find this solution, thanks.

  • PDF Scans/Mirror Image

    PDF scans view and print as mirror images. How can I correct?

    Okay, this is a Brother setting, because Reader DOESN'T have any settings for scanning PDFs. Acrobat does, but orientation of the scan is set by the scanner software. I looked briefly for troubleshooting for MFC scanner/printers, but was unsuccessful in finding anything on this. Still looking though.

  • All my pdf scans open with  windows photo viewer what can I do to see them in Adobe Reader?

    all my pdf scans open with  windows photo viewer what can I do to see them in Adobe Reader?

    You need to change your file association to open .pdf files with Adobe Reader instead of the photo viewer program; see Change the program that opens a type of file - Microsoft Windows Help

  • I cannot activate copy text tool even though pdf file is not secured.

    I cannot activate copy text tool even though pdf file is not secured.

    What version of Acrobat are you using? Are you sure that the document contains any actual text, as opposed to an image of text?

  • HT1338 Is there a HP Photosmart printer driver update 12.16.1 that supports OS 10.7.5 that will allow me to scan documents with OCR?  If so where/how do I download or get it?  Thanks

    Sorry,  I meant to say Apple printer driver update 2.16.1 for OS 10.7.5.    Is there a HP Photosmart printer driver update 2.16.1 that supports OS 10.7.5 that will allow me to scan documents with OCR?  If so where/how do I download or get it?  Thanks
    <E-mail Edited by Host>

    Hi pmaragoni,
    I understand that you are looking for an updated driver for your HP Photosmart printer. Here is a link to the updated drivers from Apple for HP printers.
    LINK: http://support.apple.com/kb/DL907
    I hope this helps,
    Advance 23
    I work on behalf of HP

  • Picking up PDF URL for insertion as hyperlink in Word

    I just upgraded from Acrobat Pro 8 to 9, and I'm having trouble copying URLs from PDFs. I create a document in Word, then want to insert a hyperlink inside the doc to a PDF at a website.  I used to be able to select text, right-click to bring up the Insert Hyperlink dialog in Word, then open PDF and its URL would automatically insert into the dialog box or I could cut-and-paste the URL from the browser window. Now, all PDFs within a website open not in a browser but in a PDF window with a truncated URL and "[1]" inserted into the filename.  I then have to find the full URL and manually type it into the Word dialog box...and those URLs can get long.  Sorry if this question may be stupid and the answer obvious...I'm happy to wear egg just to get this functionality back.  I'll even downgrade to 8 if I have to...sigh...

    We may not be able to tell you much without seeing the Word document.

  • How can I change the size of a pdf source file, or, convert it to Word?

    How can I change the size of a pdf source file, or, convert it to Word?

    A lot depends on the form of the PDF. Is it graphics, a scan of a text file, pure text, or other? What version of Acrobat are you working with.
    You can do a save as to get a WORD file, but do not expect great results. The ability to get a decent WORD file depends on what the form of PDF you are working from. If it was created from WORD with tags and all, you might get good results. If not, you might get a lot of messed up results.
    Explain what you are starting with and your ultimate goal. Also check the audit of the file (should be under PDF Optimize) to see where the file information is concentrated (text, fonts, graphics, other).

  • How do I convert a pdf in Adobe Acrobat 9 to Microsoft Word document?

    How do I convert a pdf in Acrobat 9 to a Microsoft Word document?

    Hi fireatty,
    In Acrobat 9, you can use the Export command (File > Export) to export your PDF to Word format.
    Please let us know if you need additional assistance.
    Best,
    Sara

Maybe you are looking for

  • Error in Support Package Installation

    Hi , I am trying to install "SAPKIBIIIH" in SAP BI using Tcode :SAINT. OCS Package :SAPKIBIIIH Package Type :Installation Software Comp:BI_CONT Release : 703 After giving the OSSNOTE PASSWORD for SAPKIBIIIH, the installation is starting but after som

  • Lightroom 1.4.1:  Reducing File/Image Size for Web/E-Mail

    This should be a simple thing. I need to reduce my file size to upload it into various things on the web and e-mail and what not.  Sometimes it asks for a specific size. Right now, I'm using the Export function to do this, which means I have to reset

  • After install 2GB memory x2 , Macbook pro start with no screen

    Hello guy, I just bought a pair of Transcend 2GB 200-Pin DDR2 SO-DIMM DDR2 667 memory, and put in my computer. after I installed on my MACBOOK PRO C2D 2.33Ghz, I start comuter with no screen at all, no sound, nothing. Is there any way can fix the pro

  • User Authentication Logical Mode

    Hi Im attaching a logical model. Experts please take a look an guide for changes or to include more details. This is basically a user authentication logical model ERD Eagerly awaiting your reply. I am unable to attach a file if i can share the file i

  • Best way to transfer old files to new computer!!!

    Bought a new macbookpro. I have it setup and loaded my FCP and Logic Studio on it. Now I want to transfer my documents, itunes, address book and things over to the new computer. What's the best way to do this? Thanks RD