A way to undo Formatted Text & Graphics OCR from Acrobat 7?

Over the course of a few months, my company received a large number of PDF files for a project for which the internal policy was that every file should be text searchable.  Unfortunately, we did not save the native files in any sort of convenient way, having at that time not realized that failing to do so was a very bad idea.  We ran OCR on every one of the files that we received, which total approximately 4,000.  At the time that we received the majority of these files, my company was still using Acrobat 7; we've since upgraded to version 8.
Recently we discovered that there were discrepancies between our electronic copies and the hard copy printouts from which our electronic copies had been generated:  in the electronic copies, uppercase F had changed to P, S had changed to 8, etc.  We eventually worked out that it must have been that at some point a computer was mistakenly set to run OCR using the Formatted Text & Graphics setting, as opposed to either Searchable Image or Searchable Image (Exact).  This was absolutely not want we wanted, as for our purposes using a type of OCR that causes the original images to change essentially renders the files useless.  My questions, then, are the following:
1)  As I asked in the title, is there any way of undoing Formatted Text & Graphics OCR that was performed in Acrobat 7?
2)  Is there a way of identifying files that have had Formatted Tex & Graphics OCR performed on them (something stored in the metadata)?
Rebuilding these files from scratch is going to require a gargantuan effort, so any help would be much appreciated.

Hi,
Bernd's been across the mountain and seen the bear; so, you can bank on what he posted.
But, just because, I'll second his "no".
Formatted Text and Graphics (Acrobat 7, 8) and ClearScan (Acrobat 9, X) effectively replace the image of textual characters.
If a character is not recognized as 'something' a bit map is of the thing is left behind.
Now, while Acrobat or other OCR engines (Abbey FineReader, AdLib, Adobe Capture, etc.) are really rather impressive no OCR engine has 100% accuracy 100% of the time. Other variables  come into play (scan lamp age/brightness, platen cleanliness, scanner mechanicals cleanliness, calibration of scanner, hard copy 'quality' (characters' darkness density, contrast between characters and background, presence of lack thereof of boxed in text, text in or adjacent to line arcs/circles, etc.).
All of that is for semantic content that is "textual". Semantic content that is not textual (but, coincidently may contain text) provides little to no useful OCR output (e.g., graphs, drawings, etc.). Validate this by performing OCR on such a PDF then Export to a plain text file. Print this file out and compare that to the source paper or the scanned image.
There is no metadata info that identifies the OCR mode used.
Perhaps something buried in the bowls of PDF page description content; if so, not intrinsically easy to obtain.
My suggestion (fwiw) - move forward with re-scan.
A server product would help to move it along but a high speed scanner hooked to a local machine (with ample resources) and Acrobat Pro 8 or 9 get it done. With Acrobat 8 or 9 use Search Image (Exact).  In Preferences check the category Create PDF or TIFF to assure it is what you desire. Check Acrobat's scan presets to assure you have what you want vis-a-vis Compression and Filtering. Do avoid "Automatic".
Be well...

Similar Messages

  • Is there a way to download a text/imessage conversation from your iphone to your computer to print as a document?

    is there a way to download a text/imessage conversation from your iphone to your computer to print as a document?

    Yes. I do it regularly to archive my messages.
    Try the computer apps PhoneView (Mac) or TouchCopy (Mac & PC):
    http://www.ecamm.com/mac/phoneview/
    http://www.wideanglesoftware.com/touchcopy/index.php
    I use PhoneView and save my texts as a nicely formatted PDF. But, there are other options.

  • Is there a way to print or direct a PDF from Acrobat to another local app?

    Is there a way to print or direct a PDF from Acrobat to another local app? I assume not and there is nothing suitable that I can find in the API.

    Hey there -
    Nope - at least not when printing to PDF. IOS is a tidge, er, limiting.
    I re-posted in the IOS area; we'll see what happens there.
    Thanks again,
    Russ

  • Is there a way to include "Formatted Text" content from a form in the e-mail summary?

    I'd like to be able to include some of the content I've created in a form as "formatted text" in the email summary / notification.  Right now, only the fields that require the end user to input data of some type are included in the summary.  Is there a way to do this?
    Thanks,
    -Jeremy

    Hi,
    You can not include custom text in the notification email.  Only in the summary receipt email.  Can you explain what sort of text you want in the notification email?  Also, you may want to post this to the ideas section.  If others vote for it, there's a higher chance of us adding it in a future release.  The ideas forum can be found here - http://forums.adobe.com/community/formscentral?view=idea
    Thanks,
    Todd

  • Is there any way to paste formatted text into muse, and have it stay formatted?

    Greetings,
    I have a client who sends material to me with intricate formatting (italics and bold only) for posting on the website. Is there any way to avoid losing that formatting when I paste it into Muse? If not, I have several hours of work to format everything all over again, every time he sends an update? Any suggestions?
    Thanks,
    Lori

    Hello Muse Community,
    Ok, so I have figured out how to do this. For those of us who have a large amount of formatted text to paste on a website, but don't want to have to take the time to format every individual word or line, the solution is to embed the following html code:
    <iframe src="filename.html" width="600" height="300" scrolling="yes"></iframe>
    and replace "filename.html" with a the name of a plain html document created from some other software where you can paste formatted text. Muse will pick up the html document as an included file in a window with a scrollbar. It's actually pretty slick.
    Best wishes,
    Lori

  • Recognize Text Using OCR from DLL

    Hi:
    We are a service company,working on a project we need to do OCR on PDF files: convert a PDF to a searchable PDF.
    The customer has licenses for Adobe Acrobat Pro Extended.
    The problem we have to solve it: from a JSP page, run an applet and to have access to Adobe Acrobat Pro Extended for use the funcionality "Recognize Text Using OCR" on a PDF file.
    Ideally, we would be able to access a DLL and invoke this functionality, it is possible?
    If not, what would be the way to access this functionality: IAC? Plug-In?
    Would greatly appreciate any help.
    Thank you very much.
    Raimundo Carlos
    www.base100.com

    [lrosenth:]
    > LiveCycle ES includes lots of PDF functionality that you can use from various APIs.
    I tended to associate the term "LiveCycle" with the newfangled (XML-based) way to handle forms, but it has become clear that LiveCycle is much more than a new Forms paradigm.
    It sounds like the LiveCycle SDK/Library can be used as a (full?) replacement for the original APDFL.
    Is there a table somewhere with the differences between those two SDKs?
    TIA,
    -Ramon

  • Is there a way to undo an iPod touch restore from an iPhone 5?

    I was synching my new iPhone 5 to my Macbook Pro when I accidentally restored it to my old iPod touch instead of starting new at the welcome screen. Is there a way to undo the restore to get it back to the iPhone 5 format instead of the iPod touch one? I've already managed to get my contacts back but can't figure out how to undo back to the iPhone 5 "factory settings" without getting rid of the couple of purchased apps etc. Feeling like SUCH a newbie! Help!

    Yes, restoring deletes everthing on the iPhone. You will then have to resync the apps from iTunes on your computer, or re-download them from the iTunes Store:
    http://support.apple.com/kb/ht2519
    Regards.

  • Is there a way to print all text messages/sms from an iPhone4 3G? I need to go back about 2-3 years back, i also want to see the text messages, not only the tel. number and whether it was sent or received.

    Is there a way to see/print ALL text messages exchanges from my previous phone (Blackberry Storm) to iPhone4 3G?  I want to be able to print ALL text messages from 2 lines on my account. Thank you.

    If they are still stored on your phone, with the BB you may be able to print them; if not natively, you may be able to download an app that will do that.  Offhand, I'm not familiar with one, but google may help.  -The iPhone may have an app that will allow this as well.  Two to three YEARS?  That' s a lot of texting!!

  • Combined text/graphics OCR problem

    Hi.
    I need to convert a dozen of manuals from tiff to (editable) Word files. Each tiff file is one manual consisting about 50 pages.
    There are graphics on some pages with words and/or numbers in it (technical schemes).
    When I open them in Adobe Acrobat Pro and convert/save it to Word using OCR, the schemes are getting messed up because of the OCR.
    How can I let it use OCR on the text pages only and leave the graphics as they are?

    Import the TIFF into a pdf, you can use the Text Recognition tool page by page. Personally, I prefer for a job like your to use a dedicated OCR program that lets me designate which parts of the page should be considered graphics and which are text that needs to be OCRd.

  • Does anyone know of an app or way to recieve a text message notification from an android phone to my iPad - same wireless carrier

    Is there a way for my android obsessed brother to send me a text and I can receive notification on my iPad (which is near me more than my iPhone). It seems like it should be 'easy' because we share the same carrier (and family plan), and my iPhone number is linked to my profile (ie FaceTime, iMessage, etc.). Any ideas? Maybe an app work-around?
    Thanks All!!

    SMS uses the voice channel.
    iPads do not connect to the voice channel and do not do SMS.
    You can download apps like text+ that use their server to receive SMS messages.

  • Is there any way to batch Word 2007 to PDF from Acrobat?

    I am trying to put together my teaching portfolio and I have a lot of papers I had written in Word 2003/2007 and would like to convert them to pdf to include in my ePortfolio. I have been exporting to pdf one at a time but frankly, I have over 200 .DOC files and I am looking for a quicker way. I did not see an option in Batch Processing unless I overlooked it. Can anyone help?

    For the printers, go to START>Printers & ... Then you should see the printers on your computer. Just drag the DOC files to the Adobe PDF printer. That should do it. You might want to check the printer properties in terms of naming files, where they are being saved and such. Try a few first to see what happens. If this works, you are on your way.
    I am not sure why you wanted the OCR. That has no bearing on WORD files. OCR can only be done on pure graphics files with no markup or text.

  • Why so big files in the OCRs from Acrobat X?

    I used Acrobat 8 by many years. Now, I thought maybe was the time to upgrade, and I'm testing Acrobat X. My main use to Acrobat is to scan my own books (photocopy + ADF scan + PDF) and do an OCR scan (usually: exact copy) The reason of this is that I manage maybe 4 or 5.000 books and articles and in my work (history and genealogy) is useful to search the content directly.
    With Acrobat 8, this was done one by one, with certain accuracy.
    With Acrobat X, I try OCR and I noted three big differences:
    1st: The OCR is far better than in the v.8. In the same document where I found after OCR, ie: 50 occurrences, Acrobat X OCR give me now: 150 occurrences to the same word. Nice!!
    2nd: Now, I can do batch OCR. This mean that I can let OCR running a 10 documents list the whole night, or the whole day when I go to office, working one after the other... Nice!!
    I thought I touch the sky... At least I'would OCR all my library...! and all this books downloaded from archive.org and similar ones...
    But all this is nothing compared to:
    3rd: The resultant size of the OCRed files are impossible to manage. A book of 82MB gone to 538MB... As minimum, files doubles the size. With Acrobat 8, maybe gone 10 o 20% up, but not more than double as with the new Acrobat X. I have still 400GB to do OCR... I can't skyrocket this to 1 or 2 TBs...
    I tried: search, exact search and clear scan (no differences) I tried 600, 300 and 75dpi. Even 75dpi was bigger. And illegible, despite at 75dpi at 100%, at least it must appear clear in the screen.
    Im not sure if I must do the upgrade to Acrobat X. Someone can help me? Someone could recommend a specific, commercial software, to do OCR in PDFs, tha don't fat the resultant documents in this way? I can't believe that only text added, can do this.
    Thank you very much,
    Martin

    This is a tough one. One thing to look at is the client side
    buffers and server side buffers. A live stream usually drops data
    to "catch up" to streamtime - buffers size allocated so Im not sure
    how a song can be playing from 15 min ago...
    Are the users that connect up rebuffering alot?
    When troubleshooting try to find the least common
    denominator; see if accessing the stream on the same network is an
    issue. If it's successful, keep moving further away to determine
    what point the bottleneck lies in your network.

  • Washed out text when printing from Acrobat Help?

    When I print PDFs from Adobe Acrobat all the text is washed out. It is supposed to be black but I get gray text.
    I have the print settings at rich text black.
    The print settings on Acrobat are 600dpi.
    It is not a printer issue as I get the same problem with 2 different printer that are different makes and models.
    It is a high resolution PDF with no image compression.
    The file prints fine from Indesign.
    I am running Acrobat Professional 8.2.3
    I am running CS 3 master suite
    any help is appreciated.
    thanks
    Trevor

    IF, (and that is a big if) the pdf is create some some type of text (word open office, pages, etc)  then open PDF  in acrobat and click on Touch up text tool
    (a Samll button with a capital "T" on it -without quote marks).
    Click on select all. then  File menu look for Properties. click on properties
    when window opens click on appearance tab.
    choose color by clicking on colored block. choose black.
    IF your document then your out of luck there are no fonts just a drawing of what appears to be fonts. You might use OCR on Document but then not all Characters might ocr so you would only be partially successful.

  • Is there a way to convert outlined text back into live text?

    I have an Illustrator CS3 doc in which all text has been converted to outlines. (I wish I had an intelligent answer to the question of why I don't have the original doc with live text, but...) I remember seeing a post some time ago that explained how to change the outlines back into live text, which may have involved making a pdf or something, but I can't find that post anywhere in the Adobe forums. Does anyone know how to do this, if it can even be done? Thanks!

    I had to test it. Yes, you can use Acrobat Pro to OCR the text. First you'll need to rasterize the outlined text in AI (at 300dpi bitmap) and save as PDF. Then open in Acrobat and go to "Documents>OCR Text Recognition" to convert the image to text. Click "Edit" in the "Recognize Text dialog window and select "Formatted Text & Graphics" to view only the text in Acrobat. Save and re-open in AI. The text will be editable, but broken into individual words. You can cut multiple words to the clipboard then click with the text tool and paste, to merge the text into a contiguous sentence, but you'll lose the spaces. Exporting as text from Acrobat may be a better route.

  • Formatted text and smartforms/Adobe forms

    Hello all,
    can anybody please tell me if there is a way to display formatted text that was edited with FormattedTextEdit (or is there any other ui element that allows me to format text?) in smartforms/adobe forms with all formats? Also, is it possible to display graphics in the smartforms/adobe forms that was prior uploaded to the application server?
    Thanks and regards, Oliver

    Hi Raja,
    I implenented SAVE_TEXT AND READ_TEXT like you advised but the smartforms still doesn't show the text formats edited via BTF editor.  I even tried to convert it to ITF using "CONVERT_STREAM_TO_ITF_TEXT" but it wouldn't help either. For example the text
    <HTML><HEAD></HEAD> <BODY> <P>This is a test text in Times New Roman</P> <P><FONT face=Arial>This is a test text in Arial</FONT></P> <P><FONT face=Arial><STRONG>This is a test text in Arial and bold</STRONG></FONT></P></BODY></HTML>
    would exactly appear in this manner (using text element in smartforms) and not formatted. Do you any other ideas solving this issue?
    Thanks and regards, Oliver

Maybe you are looking for