Renderable text and indexing

My ultimate goal is to have as complete a “Full Text Index” as possible.  To that end, I have a couple of questions.   I have both scanned pdfs and pdfs that have been created from word or powerpoint and have renderable text.
for those files with renderable text
If a page has renderable text, is all of the text included when the “Full Text Index” is built?
If a page has renderable text and an image, is the text within the image ignored when the “Full Text Index” is built?
If I OCR the document, if a page has renderable text and an image, is the image ignored during the OCR process?
If I OCR the document, if a page has an image and no renderable text, is the text within the image recognized during the OCR process?
Thanks.

1. If a page has renderable text, is all of the text included when the “Full Text Index” is built?
The key question is "Do the renderable texts' fonts map to Unicode?" Fonts that map to Unicode are searchable.
Such will be harvested by the Catalog index.
Using Acrobat Pro you can create a preflight to check for this.
2. If a page has renderable text and an image, is the text within the image ignored when the “Full Text Index” is built?
Ok, a scanned image of text has no "text" just pixels that look like text - it is all just an image.
So, nothing to be harvested by the Catalog index. The page's renderable text will be harvested by the Catalog index.
3. If I OCR the document, if a page has renderable text and an image, is the image ignored during the OCR process?
A page of a PDF that has renderable text cannot be OCR'd.
Provided the page's content is an image containing pixels representing characters OCR will attempt to recognize these and provide an output.
If Searchable Image or Searchable Image (Exact) is used the "recognized" output is a hidden/invisible layer (text rendering mode 3 - no stroke, no fill).
The scanned image remains on the page. Searchable Image 'tweaks' the image. Searchable Image (Exact) does not.
So, for both you have the scanned image and the hidden layer of OCR output.
Alternatively, if you use ClearScan recognized characters (in the image) are replaced with an Acrobat generated font.
Anything not recognized is left as a bitmap of the 'character'.
So, the image is not ignored by OCR as it is what OCR analyses to recognize characters for providing an output.
A Catalog index will harvest the OCR output of any of the three OCR modes.
4. If I OCR the document, if a page has an image and no renderable text, is the text within the image recognized during the OCR process?
OCR will attempt to recognize the pixels that represent characters and when recognized provide an output.
A Catalog index will harvest the OCR output.
Be well...

Similar Messages

  • OCR Renderable Text and Print to PDF Problems

    I have Adobe Acrobat version 8.1 on my PC laptop and have recently had trouble with OCR at work. When I try to run OCR I get a pop-up box telling me Acrobat cannot perform the function because of renderable text, however if I run OCR on the same file using my desktop PC (which has version 10 installed) I have no trouble. Also, I am no longer able to print items (such as a webpage) to PDF, and instead receive a pop-up box telling me something about the AdobePDF.dll file being missing or not functioning, and an error message when I double click the printer icon at the bottom right corner of my screen. I uninstalled Adobe Acrobat and then reinstalled it yesterday, however it did not fix either problem. Does anyone have any suggestions?
    Thanks!

    what windows version you are using? I think your problem might be due to corrupt operating system. Try reinstalling the OS and reinstall Acrobat soon after.
    Good luck!

  • OCR renderable text error

    someone was having the problem below with an older version of acrobat.
    is there now a solution in acrobat mac x?
    i note that exporting to image file loses quality and increases file size
    thanks
    Well, since this is the digital age, it makes sense that I ought to  read the PDFs in digital form (this is a stretch for me, I really like  paper), which is facilitated by a tablet since I can actually see the  page when it’s in the portrait configuration.  It also makes sense that I  ought to mark up the file in Acrobat, using the native highlighting and  searching tools, which is also facilitated by the tablet for obvious  reasons.
    Here’s the problem.  Apparently *every* PDF file, in every digital library, is tagged with headers, or footers, or bates numbers, or some other tag that halts the OCR recognition of the PDF file.   If you google “This page contains renderable text”, you’ll see that  this has been a complaint since Acrobat 6 at least.  So you can’t just  OCR the document and get a nice,  mark-up-able document.
    Now, I know what you’re thinking.  There has to be a workaround,  right?  Of course, there is.  You can manually remove the headers and  try again.  Oh, now there’s a footer; you can take that out too  (manually) and try again.  Oh, now there’s a bates number, okay, take  that out too.  There’s STILL some renderable text in there somewhere,  well, now you can either try and edit out the blocks of renderable text  (again, manually, made more entertaining by the fact that you can’t just  right click on the page and say “remove renderable text”), or you can  export the entire document to a graphics file (say, a TIFF), re-convert  it to a PDF file (which turns the entire document into a rasterized  image), and THEN run the OCR tool to get an actual mark-up-able  document.  This process is made more enjoyable by the fact that Acrobat  will turn that 300 page dissertation you’re reading as part of your  research into 300 distinct TIFF files, which you then need to recombine  into a PDF file.  Multiply this by 100, and you’ll see what sort of a  barrier to productivity this is for me to get started organizing my  existing document collection.
    This is CLOSE TO THE DUMBEST THING I HAVE EVER SEEN.  And I’ve seen a  LOT of bad design.  Rather than prompting me “This document has  renderable text” and giving me “Cancel” as the only option, any  feature-driven developer would say, “Gosh, people get really frustrated  by this.  I know, because I can read the results of a simple google search.    We need to change this right away!  Here, I’ll make it so that you  can just click ‘Treat existing renderable text as white space’ or even  prompt the user to rasterize the renderable text and embed it in the  document, then OCR the resulting file!”
    The only conceivable reason I can imagine that this hasn’t taken  place is because your lovable electronic document vendor wants to make  it a colossally, enormously painful process for someone to actually do anything to the document they’re providing you to use.  Thank you, electronic  document vendor.  You’re going to be wasting about 20% of the time that  you’re saving me by giving me electronic access to this document in the  first place.
    Progress is grand.  Collide it with self-interest, progress seems to lose out more often than not.
    Now, if you’ll pardon me, I’m going to go get some sleep.  Then I’m  going to get up in the morning and go to work.  Then I’m going to come  home, and instead of enjoying some family time with my kids, I’m going  to fart around with manual document conversion.

    Elias,
    I completely agree with your anger. I ran into the same problem and I think I have figured out a workaround. I wrote up a blog post about it.
    http://www.ideationizing.com/2011/03/ocr-acrobat-pdf-with-renderable-text.html
    I hope this works for you.

  • I have a manual that contains headings and index entries that contain less than and greater than characters, and . The Publish to Responsive HTML5 function escapes these correctly in the main body of the text but does not work correctly in either the C

    I have a manual that contains headings and index entries that contain less than and greater than characters, < and >. The Publish to Responsive HTML5 function escapes these correctly in the main body of the text but does not work correctly in either the Contents or the Index of the generated HTML. In the Contents the words are completely missing and in the index entries the '\' characters that are required in the markers remain in the entry but the leading less than symbol and the first character of the word is deleted; hence what should appear as <dataseries> appears as \ataseries\>. I believe this is a FMv12 bug. Has anyone else experienced this? Is the FM team aware and working on a fix. Any suggestions for a workaround?

    The Index issue is more complicated since in order to get the < and > into the index requires the entry itself to be escaped. So, in order to index '<x2settings>' you have to key '\<x2settings\>'. Looking at the generated index entry in the .js file we see '<key name=\"\\2settings\\&gt;\">. This is a bit of a mess and produces an index entry of '\2settings\>'. This ought to be '<key name=\"&amp;lt;x2settings&amp;gt;\" >'. I have tested this fix and it works - but the worst of it is that the first character of the index entry has been stripped out. Consequently I cannot fix this with a few global changes - and I have a lot of index entries of this type. I'm looking forward to a response to this since I cannot publish this document in its current state.  

  • BUG - in flash pro CC, 'bold' and 'italic' properties of TextFormat have no effect on rendered text

    Concise problem statement:
    If you compile with flash pro CC, and use the 'setTextFormat' method of a TextField, the 'bold' and 'italic' properties of the TextFormat argument have no effect on the rendered text. If you compile with flash pro CS6, the 'bold' and 'italic' properties work as expected.
    Apparently, with flash pro CC, the only way to make the text render correctly is to change the font name (add the suffix ' Bold', ' Italic', or ' Bold Italic'.) This means code which dynamically changes font styles only works in CS6 or CC, but not both. For example, if you use the 'bold' property the text renders bold in CS6 and regular in CC, whereas if you change the font name to add the suffix ' Bold', the text renders bold in CC and DOES NOT RENDER at all in CS6. This makes it difficult to transition a team from CS6 to CC.
    Steps to reproduce bug:
    1. Create an xfl with 2 TextFields on the stage, both with font "Trebuchet MS" and style "regular", one named boldTrueText containing the String "bold = true", one named fontNameText containing the String "fontName = Trebuchet MS Bold". Create 2 more TextFields on the stage for visual reference, both with font "Trebuchet MS", one with style "regular", one with style "bold".
    2. Add the following code to the Actions panel on frame 1:
    import flash.text.TextFormat;
    import flash.text.Font;
    var format:TextFormat = boldTrueText.getTextFormat();
    format.bold = true;
    boldTrueText.setTextFormat(format);
    format = fontNameText.getTextFormat();
    format.font = "Trebuchet MS Bold";
    fontNameText.setTextFormat(format);
    var fonts:Array = Font.enumerateFonts(), count:int = fonts.length;
    for (var i:int = 0; i < count; i++) {
        var font:Font = fonts[i];
        trace("fontName: " + font.fontName + ", fontStyle: " + font.fontStyle);
    3. Save, and compile with flash pro CS6 and flash pro CC.
    Results:
    With flash pro CS6, "bold = true" renders bold, and "fontName = Trebuchet MS Bold" DOES NOT RENDER.
    With flash pro CS6, the following is traced:
    fontName: Trebuchet MS, fontStyle: bold
    fontName: Trebuchet MS, fontStyle: regular
    With flash pro CC, "bold = true" renders regular, and "fontName = Trebuchet MS Bold" renders bold.
    With flash pro CC, the following is traced:
    fontName: Trebuchet MS, fontStyle: regular
    fontName: Trebuchet MS Bold, fontStyle: bold
    Expected results:
    The same text is rendered in both flash pro CS6 and CC. I don't know why this behavior was changed in flash pro CC - it causes silent failures in code which dynamically changes font styles. I expected the flash pro CS6 behavior to remain the same in CC, like so:
    With flash pro CC, "bold = true" renders bold, and "fontName = Trebuchet MS Bold" DOES NOT RENDER.
    With flash pro CC, the following is traced:
    fontName: Trebuchet MS, fontStyle: bold
    fontName: Trebuchet MS, fontStyle: regular
    If you don't want to break backward compatibility (any further), you could make both the behaviors work in flash pro CC, like so:
    With flash pro CC, "bold = true" renders bold (font is still "Trebuchet MS"), and "fontName = Trebuchet MS Bold" renders bold also.
    With flash pro CC, the following is traced:
    fontName: Trebuchet MS, fontStyle: bold
    fontName: Trebuchet MS, fontStyle: regular
    fontName: Trebuchet MS Bold, fontStyle: bold
    I submitted this bug with the bug form, and also with adobe bugbase (in case it isn't obsolete) - I'm just trying to maximize my chances of getting a fix.  Has anyone else encountered this bug?

    I just can't believe how there is ZERO documenation for any of this.  Flash's stylesheets have fontStyle and fontWeight properties, but they only recognize regular/italic and regular/bold respectively.
    This change in Flash CC completely breaks systems built in Flash CS6, and the font naming is actually arbitrary and is not a consistant combination of font name and style (e.g. "Eras ITC" family's bold font name is "Eras Bold ITC", but the bold version of Times New Roman is "Times New Roman Bold" (with Bold at the end, rather than the middle), and what's absolutely appaling is that the font name used at runtime is not exposed anywhere in the Flash IDE!!!  In the IDE you select a font family and font style independently, which is absolutely not what's used at runtime, because it actually uses a separate, arbitrarily named field in the font file for the font name. So we can't even know from within Flash what the proper runtime name is, unless we trace it out or open the font properties details tab in Windows explorer.
    It seems that Flash CC is always using the font "Title" that can be found in the properties of the font, NOT the font name displayed in Windows Font Preview or in Flash CC.  For example, the font name for Times New Roman Bold in Windows Font Preview is just "Times New Roman", but the font title in the properties/details tab is "Times New Roman Bold".  If they made the change to allow for specific fonts to be selected, that's fine, but it completely breaks HTML support in TextFields if it's not respecting bold and italic tags.
    This may actually be a trend on the web now, if you read this: http://www.smashingmagazine.com/2013/02/14/setting-weights-and-styles-at-font-face-declara tion/ , it says: "If you’ve used one of FontSquirrel’s amazing @font-face kits, then you’re familiar with this approach to setting weights and styles. The CSS provided in every kit uses a unique font-family name for each weight and style, and sets the weight and style in the @font-face declaration to normal. [...] Notice that the font-family names are unique, with each font-family name accessing the appropriate Web font files."
    But there's just no mention of this in any documentation I can find.  What the hell.
    It's also helpful to realize that font and u tags have been deprecated in HTML5, while b and i tags have been repurposed since they still retain semantic meaning apart from style: https://www.w3.org/International/questions/qa-b-and-i-tags

  • I need in more lines of the Index some words in Bold text and some others in Kursiv text. How can i get it? It seems to me that either i can have all the Style in Bold Text or in Kursiv Text :(

    I need in more lines of the Index some words in Bold text and some others in Kursiv text. How can i get it? It seems to me that either I can edit a Style only in Bold Text or in Kursiv Text
    I make you an example to clear what I really need:
    Index
    Introduction
    I. Leonardo's Monnalisa
    II. Leonardo's Battaglia
    Bibliography
    Please HELP HELP HELP

    What version of Pages are you referring to?
    Basically if you are talking about the Table of Contents in Pages and want to have different character styles within paragraphs in the T.O.C. you will have to export the T.O.C. and bring it back in as text and change that.
    Peter

  • SSRS Export to text and NOTHING BUT TEXT!

    Is it possible to export a report to text only format? When I export my report to csv and then use the old DOS command to "type" the exported csv file there is a horrible little character at the beginning of the file. 
    I need to export just as text and nothing but text to drop the exported report into a 3rd party service which processes it. This squiggly little character is tripping the 3rd party system. I cant believe something so simple is not supported natively by SSRS?
    I'm using SQL Server 2008 R2. 
    Please help!
    LightSwitchShells.com

    Hi Hale,
    Please try to add the following text in the rsreportserver.config to work around the issue.
    <Extension Name="TXT" Type="Microsoft.ReportingServices.Rendering.CsvRenderer.CsvReport,Microsoft.ReportingServices.CsvRendering">
    <OverrideNames>
    <Name Language="en-US">TXT (Pipe Delimited Text File)</Name>
    </OverrideNames>
    <Configuration>
    <DeviceInfo>
    <FieldDelimiter>|</FieldDelimiter>
    <Extension>TXT</Extension>
    <Encoding>ASCII</Encoding>
    <NoHeader>true</NoHeader>
    </DeviceInfo>
    </Configuration>
    </Extension>
    By default, the  rsreportserver.config is locate in:
    C:\Program Files\Microsoft SQL Server\MSRS10_50.MSSQLSERVER\Reporting Services\ReportServer
    There is a document about Export Tab Delimited text file in SSRS 2008 R2, you can refer to it.
    http://revweblog.wordpress.com/2011/09/26/export-tab-delimited-text-file-in-ssrs-2008-r2/
    Hope this helps.
    Regards,
    Alisa Tang
    If you have any feedback on our support, please click
    here.
    Alisa Tang
    TechNet Community Support

  • Rendered text problems in 5.1.2

    I am working on a film project that has a lot of subtitles. (The dalies are digitized at HD DVC-Pro 720p) I was editing on a station running Final Cut Pro 5.1.1 (g5) with no real problems. I recently took the project with me on a trip to work on using my macbook pro (intel) running 5.1.2. I have discovered that all of my rendered text in my timelines is now uneditable. I can't move the text, I can't change the text, I can't even delete the text. I also can't change any of the footage running underneath the rendered text. It has become an immutable part of my film. This is obviously a huge problem. Has anyone else run accross this problem? Is it something that Apple is aware of? Any fixes? If not I may have to move to Avid and start all over again.

    Not that strange, really. Several different NLEs can sometimes lock onto a render file and not update the output until that render file is deleted.
    Rather simple fix, just a little time consuming.
    Next time don't be so quick to threaten to jump ship. First problem and you were ready to move to Avid. I've had more problems working on Avids than I have on FCP. MANY more problems. FCP problems are usually due to user error or third party hardware. Avid problems shut you down for a couple weeks while you ship out your hardware.

  • How to recognize text in XI Pro when files returns "renderable text" error.

    This same file will recognize text & provide searchable document in Adobe 9.
    I saw workaround to convert each page to TIFF, do OCR on each page, convert each back to pdf & combine. That is ridiculous. Will there be a new version of XI Pro that will work correctly for OCR?

    Varinder.Saini wrote:
    David,
    That is how it is. If you run Searchable Image or Searchable Image (Exact), Acrobat will throw error only for pages that contains renderable text. It also gives an option to ignore these error for any further pages containing renderable text.
    If you check this option it will run OCR for rest of the pages and won't show the error again for that PDF.
    This option is not available when OCR'ing using Adobe ClearScan -- ClearScan being the ONLY reason anyway why I own Acrobat Pro AT ALL -- for it's otherwise ridiculous slowness compared to its competitors. Sorry having to tell you that, from the point of view of having worked in a paperless law office for 15 years, therein 4-9 hrs a day with PDF documents usually in the 600 to 1.800 page range.
    CtDave wrote:
    Precluding the entry of anything 'renderable' assures OCR will be accomplished for each page.
    David Peters wrote:
    Just stop bullshitting users about "you better follow well defined protocols" niminy-piminy-finicky.
    There is zillions of cases where one does not have ANY control neither access to the (external) creation of one's PDF files, even if people like you seem to be unable to imagine, and therefore repeat the same pointless sermon over and over that does not add ANYTHING to the case.
    The solution to this problem by the way is not a question of more (lost) decades of rocket science Adobe bloatware engineering, but simply:
    If there is "Renderable Text" somewhere on some fuⅽʞing page:
    then just SKIP IT and continue OCR with the next fuⅽʞing bitmap
    gosh darn it.
    Not that I would hope that any of this would change anything with the course of the megaton tanker Adobe which I usually avoid like the plague -- with the only two exceptions of ClearScan and of course, Acrobat 7 Pro, the last Acrobat version that was not only fairly usable but actually is a pretty amazing piece of software.

  • Conforming and Indexing Errors, Media Pending, Audio won't play in timeline

    I'm working on a desktop PC which is running Windows 7 Professional 64-bit and Adobe Premiere Pro (version CS5.5). It's currently utilizing a second gen. 3.4Ghz i7 2600 processor, 16GB of 1600Mhz RAM, 64GB solid-state drive and a ASUS P8Z68-V Intel Z68 Motherboard with onboard audio (Realtek ALC892 chipset) and onboard video. My problem is this:
    The conforming and indexing of all of my imported media never seems to finish regardless of how many times I reopen the project file and wait for it. On the lower right-hand portion of the screen, next to the conforming/indexing progress bar, is a little red "X". When clicked, it pops up with a list of errors that read: "An unexpected error occurred while performing a conform action on the following file...". As a result, my audio channels have no waveform and during playback there are no audible tones or levels. On some video clips there's just text that reads "Media Pending". This only appears to happen with project files that I saved on external hard drives, and I suspect it has something to do with the Media Cache Files folder and how Premiere Pro locates these conform/index files. I've also encountered this problem in CS3 and CS4.
    I have a few questions:
    1) How do I avoid error messages in regards to indexing and conforming
    2) How do you know when indexing/conforming has completed itself? (there doesn't seem to be a progress log or a list of commands/executions)
    3) Indexing and conforming appears to be an automatic process, but is there a way to do it manually?
    4) What's the best way to setup your media cache files when you click EDIT > PREFERENCES > MEDIA?
    5) If I have approximately 1 hour of footage, what's an average wait time for conforming/indexing? What about 5 hours of footage? 10?
    6) Adobe recommends not editing until the conforming and indexing has completed itself-- how important is this?
    7) Sometimes it appears as though the conforming and indexing has finished, but then I still have problems with playback. Do I have to reopen the project for it to continue with the conforming/indexing progress? I've already determined that the video file I'm working with is intact and free of any corruption.
    I'm fine with having to wait for a project to conform and index, but it never seems to complete itself! Any help regarding this matter would be greatly appreciated.

    Harm filled in pretty much all the salient details, but I'll do another pass here.
    1) How do I avoid error messages in regards to indexing and conforming
    Two parts here.  One, conforming only happens for certain media files, ie the ones where performance is critical and we can't depend on extracting the audio fast enough for realtime playback.  That's basically anything in an .mpeg wrapper, or AVCHD material.  So if you edit XDCAM HD/EX or P2, or RED, or even AVIs or QT, those formats don't require audio conforming.
    If you're stuck editing AVCHD or MPEG2, then it needs to conform.  But, that being said, you shouldn't be getting errors in the first place. I think it's related to your external drives.  More below...
    2) How do you know when indexing/conforming has completed itself? (there doesn't seem to be a progress log or a list of commands/executions)
    Nope, you have a progress status bar indicating which file it's working on.  If there's an error, it shows up in the events panel.
    3) Indexing and conforming appears to be an automatic process, but is there a way to do it manually?
    No.
    4) What's the best way to setup your media cache files when you click EDIT > PREFERENCES > MEDIA?
    While some people like having the check box for having the conform files beside the media, I hate it.  Yes, it means that if you move the project to a different system & reopen, it means that you potentially can avoid recreating CFA files, but I find the drive littering not worth it.  I much prefer having setting the Media prefs to point to a specific media drive.  Usually a raid, if available.  Definitely not an external drive that you disconnect & walk away with.  If you don't have a permanent raid on your system, then preferably a dedicated internal drive for media (think along the lines as your Photoshop 'scratch disk').  Failing that, leave it on your C: drive, although with a 64 Gig SSD, you probably don't have much room for transient temporaries.
    5) If I have approximately 1 hour of footage, what's an average wait time for conforming/indexing? What about 5 hours of footage? 10?
    Like Harm said.  Totally dependant on the media container & the speed of your drive i/o.  The conforming is iterating through the entire file & pulling audio data, so it's not CPU intensive, it's all i/o.
    6) Adobe recommends not editing until the conforming and indexing has completed itself-- how important is this?
    If you're trying to play/scrub while conforming, it's going to be pokey.  Esp. if you're trying to access the file that's actively being conformd.  As I just said, we're hitting the files for all the audio.  The i/o is being saturated already, so unless you have a stellar raid, you don't have much headroom.
    7) Sometimes it appears as though the conforming and indexing has finished, but then I still have problems with playback. Do I have to reopen the project for it to continue with the conforming/indexing progress? I've already determined that the video file I'm working with is intact and free of any corruption.
    You should be good to go.  Sounds like there's something else at play here.
    Okay, back to what I think is wrong:  you don't mention what kind of external drives you're using.  You're making a bad assumption that blowing away conformed files & doing a reconform is buggy - I doubt it, as that's the same process that happened when you initially brought in the files.  I've blown away my media cache folder multiple times and have never seen failures on reconform.  So it's got to be one of two things:  either a read error from the source when attempting to pull the audio, or a write error to the destination.  Now I don't know where you currently are pointing the media cache directory, or what your source drive is, so I can only speculate.
    My suggestion is to do some elimination.   Copy one of the files that failed on you to your C drive, & target your media cache directory also to C:.  Pick a new project, import your copied file, confirm that it conforms correctly & behaves.   Then, try to use the same clip from your external drive, keeping the media cache to C:.  If that's still good, then try targeting another (local/internal) drive as your media cache target; close/restart, then import the clip from C:, and then import the clip from your external drive.  This troubleshooting should give us something.
    PS, if you're trying to edit from external USB drives, good luck.  I find it a major PITA that I avoid as much as possible.  Firewire isn't much better.  I know some people do it successfully, but I think it's a road fraught with peril.  These devices are generally not designed for heavy duty I/O and a flaky connection or drive is nothing but pain.
    Cheers

  • Trying to OCR pdf, pdf says it can't perform bc it already contains renderable text-but does not.

    I work for a large agency, and we receive PDF's all the time. 98% of the time I am able to OCR a document with no issues. Just recently I have come across this issue several times, and was wondering if anyone can solve this irritating problem!
    *Acrobat 8.1 - When going to OCR the document, I receive the following message " Acrobat could not perform recognition (OCR) on this page because this page already contains renderable text. However, it does not. When you go to select text or search for anything the whole page is selected (like it's still in a "picture" format, not a document format that you can search, ect.)
    I am not sure if it is how the document is uploaded originally by the other party that causes this, but the only thing I can do as a work-around - is to print out the entire document, scan and then I can OCR the document just fine! The problem is, if the document is 400 pages or so, this can be a huge waste of time, and money just to be able to search the PDF.
    *I have also checked the pdf properties to see if this is some sort of permissions issue, and there are not permissions/security settings in place.*
    PLEASE HELP! Any assistance in this matter would save me a lot of time, and of course (my sanity!).
    Thank you in advance!

    While the alert speaks to "renderable text" that is a simplification. The issue is that you've PDF page content consisting of at least one renderable "character".
    Look at font families - you will observe that there are many characters that are not "text" characters (i.e., linguistic characters).
    So, there's a "renderable character" present. It may be an alpha numeric that has a font color the same as the page background. It may be under the image and thus not visible to the eye.
    You might be able to determine just what is present.
    You could export the page of interest to a text file then view that file.
    You could deplay the page of interest in Acrobat Pro then select the "Content panel" to view the content tree.
    Locate and click on the page number for the page of interest.
    From the Content panel's Options menu select "Highlight Content".
    Walk down the tree. Select the content containers in turn and observe what is highlighted on the PDF page.
    Where might the renderable character come from ? Typically that'd be associated with something in the work flow.
    Not always easy to find so don't take anything in the work flow for granted.
    Be well...

  • How to send a plain text and HTML email at once?

    Hi all,
    I'm attempting to use cfmailpart to send a HTML and plain text email all in the same cfmail script.  I'm using Outlook and Gmail to test.  I temporarily changed my Outlook settings to "read all standard mail in plain text," but it does not read the plain text cfmailpart of the email, it just attempts to format the text from the HTML email and display the links.  If I remove the HTML cfmailpart from my cfmail script, the plain text version is delivered, but Outlook removes extra line breaks that I actually want to keep intact, and some of the other formatting is improperly rendered.  Is there a better way to make sure email clients hold the plain text formatting (even though there really isn't any formatting with plain text) and a better way to test?  The HTML version looks great in both Gmail and Outlook.
    Thanks!

    Use the wraptext attribute of the cfmailpart tag to add line breaks.

  • Keep getting "renderable text" error when I need to OCR PDF's from FrameMaker.

    My solution has been to individually extract all those pages, then open them up in Photoshop, flatten them and
    widen the canvas size to standard 8.5 x 11.
    But that's a little tedious and time-consuming and you have to delete the original page from your document, after
    importing the OCR-friendly page.
    Is there a printer definition, or something you can set up when you're generating your PDF's in the first place,
    that will get rid of that annoying "renderable text" error?

    Ok...
    I don’t know how it happens but after I save my work in FrameMaker or MS Word, and print
    to PDF for the final output, there are often pages with text in them that isn’t recognizable,
    or that can’t be found with a CTRL+F search.
    That is a serious issue, and one we might be able to help you with, but really, quite separate from the issue here. It's too late to try and fix this once it is a PDF.
    What is it that’s lost when OCR is run?
    Quality. Small file size. Tags (which might be required legally). Almost everything except the basic text, and that might also be lost given that OCR is not guaranteed to work. This is NOT the right way to solve your problem.
    The translator doesn’t have any Adobe products except Reader, so I’m limited to Acrobat
    to show her how the words and pictures are laid out on a page.
    In order for her to copy and paste that text – or search it, to find all of the places where the
    same word might be used – I need to make sure every word is there for her to grab.
    I have heard of translators trying to work with PDFs, and few that succeed. You can reasonably expect a transation service to support FrameMaker. But if they don't I recommend you extract the text from FrameMaker to a simple Word or text file. They should be fine using the PDF as a visual reference, and having the text to translate, and for you to flow back into the original layout. (Again, something I'd expect a full service translation to do themselves, but there are advantages to keeping control too).
    Those were the 2 pages that gave the ‘renderable text’ error. Don’t ask me why or how, they
    look like all the other pages in that document. Except Acrobat thinks they’re scanned graphics,
    that’s how they present when you wave the cursor around in them, hunting for text.
    Renderable text is just text. It means that somewhere on that page there is text. Surely there is layout, page numbers, whatever from FrameMaker on the pages. If not, we really need to look at your production methods - back to the first point.

  • Why won't text and graphics line up in adobe premiere?

    Hey, I have been out of the editing thing for about 8 years now. I recently needed to make a video for another business I run.
    Anway, I am using animated / still graphics and the basic text titles that come with premiere. I see that the graphics line up correctly behind the text in the Previews Folder. When I render it, the text has shifted down completely. Also, the text and graphics look horrible after it's rendered, very pixelated. I used to be a pro at this stuff and in fact, I would use flash to import my animated graphics. But it seems that doesn't work anymore either. (That would probably solve this problem too). Anyone know why this is taking place in the final render??
    I would much rather you ask the questions before i post all sorts of unnecessary info.
    Premiere Pro 1.5.1
    Thanks!

    1. not sure what you mean by "preset". The project settings are:
    General -
    Capture - DV 1394 -
    Video Sequence -
    Default Sequence -
    2. The dimensions of the stills are 2000x2000, *.png
    3.Titles - I do not align them here, I align them in the effects controls
    4. Screen Cap
    5. Export settings
    6. That article is nice, but not what I'm needing. remember, the preview clips shows everything perfectly lined up. but the final export shows all the text being shifted down.
    Thanks for the reply, I look forward to a response and I hope these answer your questions.

  • Space between text and images?

    I have a gallery of images with headings above them and am trying to change the distance between the text and the images. I have tried floats and padding but cant get it to work. My headings are in a table and my images are not so it seems that there is some sort of default padding between the images and the text tables.
    Here is a link..
    http://www.rsicorp.org/rsi/silvagilroy.html

    Copy & paste this into a new, blank html page.  It's a real quick and dirty layout but hopefully it will show you what you need to know.
    FYI Code errors do matter. 90% of browser rendering issues are directly related to code.  Trouble shooting is made that much easier when you work with clean code.
    <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
    <html xmlns="http://www.w3.org/1999/xhtml">
    <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    <title>Untitled</title>
    <style type="text/css">
    CSS layout modified by Alt-Web Design and Publishing www.alt-web.com 8/17/2010
    body {
    margin:0;
    padding:0;
    font-family: Verdana, Arial, Helvetica,sans-serif;
    background: #FFF;
    text-align:center;
    #container {
    width : 1000px; /**adjust width in px, ems or % **/
    margin : 0 auto; /**centered on page**/
    border : 1px solid #020873;
    text-align : left;
    background: navy;
    #header {
    width: 100%;
    min-height: 100px; /**same height as background image; if none; remove height**/
    _height: 100px;
    padding:0;
    color: #FFF;
    #header h1 {
    margin: 0;
    padding: 10px 20px;
    font-size: 14px;
    text-transform:uppercase;
    #header address {
    padding: 0 20px;
    font-size: 14px;
    font-weight:bold;
    #sidebar1 {
    width: 175px;
    height: 1%;
    padding: 15px 5px;
    /**to reduce float drop issues in IE6**/
    overflow: hidden;
    float: left;     
    color: #FFF;
    #mainContent {
    margin: 0 0 0 190px;
    background: #FFF;
    padding: 10px;
    #footer {
    padding: 0 10px;
    background:#DDD;
    clear: both;
    #footer p {
    margin: 0;
    padding: 10px 0;
    font-size: 12px;
    color: #CC3300;
    text-align: center;
    /**BEGIN PHOTOS**/
    /**this styles the image container**/
    #photos p {
    float:left;
    padding: 10px;
    border: 1px solid silver;
    /**this styles caption text**/
    font: italic 12px/1 Geneva, Arial, Helvetica, sans-serif;
    color: #666;
    text-align:center;
    line-height: 2em;
    margin: 22px;
    /**same size images recommended, adjust as req'rd**/
    #photos img {
    width:121px;
    height:90px
    /**photos float clearing**/
    .clearing {
    clear:left;
    height:1px;
    width: 100%;
    /**re-usable CLASSES**/
    .left {text-align: left}
    .center {text-align: center}
    .right {text-align:right}
    </style>
    <!--[if IE]>
    <style type="text/css">
    /* place css fixes for all versions of
    IE in this conditional comment */
    body {font-size: 76%}
    #sidebar1 {
    padding-top: 30px; word-wrap: break-word; }
    #mainContent { zoom: 100%; padding-top: 15px; }
    /* the above proprietary zoom property
    gives IE the hasLayout it needs to avoid several bugs */
    </style>
    <![endif]-->
    </head>
    <body>
    <div id="container">
    <div id="header">
    <h1 class="right">Company Name </h1>
    <address class="right">
    address, city, state,
    telephone
    </address>
    <!-- end #header --> </div>
    <div id="sidebar1">
    <p>#sidebar1 = 175px wide + 10px padding </p>
    <p>insert your list menus here</p>
    <p>insert your ads here</p>
    <p> </p>
    <p>&lt;end of sidebar1&gt; </p>
    <!--end sidebar1 --> </div>
    <!--begin mainContent -->
    <div id="mainContent">
    <h1>#container = 1000px  </h1>
    <h1>#mainContent = left-margin 190px </h1>
    <h3>  </h3>
    <h3>Begin #Photos </h3>
    <!--begin photos -->
    <div id="photos">
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <p>optional caption<br />
    <img src="http://www.rsicorp.org/rsi/images/Thumbs Pre/1001thumb.jpg" alt="some description" /></p>
    <!--end photos --></div>
    <!--IMPORTANT! clear floats after photos div-->
    <hr class="clearing" />
    <!-- end #mainContent --></div>
    <div id="footer">
    <p>Your footer text goes here</p>
    <!-- end #footer --> </div>
    <!-- end #container --> </div>
    </body><!--end body -->
    </html> <!--end document -->
    Good luck with your project,
    Nancy O.
    Alt-Web Design & Publishing
    Web | Graphics | Print | Media  Specialists
    http://alt-web.com/
    http://twitter.com/altweb

Maybe you are looking for