Document Capture OCR missed interperted Western Latin characters

Hi,
I'm using the OCR function to read words with Latin characters, but the reading is not being done in the correct way this is the character "à" appears as "6" and the same is happening for other characters.
Is any way to define the character set?
Edited by: carlos.pereira on Nov 12, 2008 7:01 PM

A new feature was added in the 10.1.3.4.0 release of Oralce Document Caputre to allow for selection of your OCR settings. These settings include the Language to be used for the OCR engine to interpret.
You can get to these settings by going to the System menu and selecting "Settings". If running anything older that 10.1.3.4.0, these settings will not be available.
There is a chance that when you make this setting change that it will still misinterpret the characters in your data. In that case, you may be able to improve your results by adding creating a custom Index macro.

Similar Messages

  • Oracle distributed document capture OCR

    there is no OCR in Oracle distributed document  capture there is any solution fro this issue ??

    Yes, the 11.1.1.8 version has Capture as a managed server and not as a windows based desktop app. So no seperate installer for the same.
    Also, Looks like they'ave covered the ODDC use cases in the Capture itself, that being a web app now.
    regards,
    Pradeep Mahajan.
    http://ucmchronicles.wordpress.com
    http://ucmchronicles.wordpress.com/2013/08/20/whats-new-with-capture-in-the-11-1-1-8-version

  • Webcenter Capture OCR Engine and Languages

    Hello, does anyone know which OCR engines are used in full document OCR recognition? Is there support for recognition of Greek text?
    Thanks in advance!

    WebCenter Capture is new name for the same "old" Oracle Document Capture. Even the official WebCenter links are pointing to the 10gR3 documentation. Latest version is 10.1.3.5.1 and I gave information related to this one. Once, I stumbled upon the list of supported OCR languages but I can't find it any more. Anyway it was useless as there was over 100 languages with only handful marked as "tested", meaning that Oracle doesn't guarantee they all are actually working as expected.
    For the greek language I have looked inside my Document Capture, so support is there for sure.
    Regards,
    Boris

  • Issue with Oracle Distributed Document Capture in table update

    Hi All,
    I installed ODDC and configured with Oracle 11gr2 for document commit.
    I have table with 4 fields i.e id,c_number,content,mime_type. I am storing the image in content which blob datatype.
    when i import and send document from WebCapture screen i am getting document send successfully. But the data is no commiting in database table.
    The pak file are genereated in "/Document Capture/Webpages/ClientAcces".
    how i have to commit this files to database?.
    am I missing something in configuration?. Please suggest.
    thanks
    nr
    Edited by: pnr on Jul 30, 2012 5:50 AM
    Edited by: pnr on 30 Jul, 2012 7:54 AM

    It's likely that Oracle Distributed Document Capture Service (ecNetService) haven't been started as it is responsible for the pak file processing and it doesn't start automatically after install.
    Regards,
    Boris
    Edited by: tombo on 2012.08.01 04:49

  • When I share a pages document as a PDF (via e-mail) all the pictures change from the original document, most are missing with just a few of them repeated in the spots where others had been.  How do I do this without the document changing?

    When I share a pages document as a PDF (via e-mail) all the pictures change from the original document, most are missing with just a few of them repeated in the spots where others had been.  How do I do this without the document changing?
    I need to be able to send it to a PC in order to print it.

    Hard to say what is happening without examining the file.
    If you like click on my blue name and email me both the .pages file and the the .pdf you have made from it.
    Peter
    ps It would help to say what version of Pages you are using and on what you are running it. iOS or Mac and what version.

  • HP AiO on iOS7 document capture not working with iPad2?

    Has anyone else found that the Document Capture does not work with the latest version of HP AiO Remote on iOS7? The app grumbles about "The document capture feature is only available if the camera on your iPad is over 5 megapixels" - any solutions to this problem as we have about 95 iPads out in the business that need to use this function regularly

    Hi VijayJois,
    I understand that you are getting an error message when trying to use the capture feature in HP's AiO Remote app.
    The capture feature in the most recent version of the app will give the error message that you are seeing.  A work around to this is to take a photo instead of using capture.  The photo is now available in the AiO app in your camera roll to send to the printer, or send via email. 
    Regards,
    Happytohelp01
    Please click on the Thumbs Up on the right to say “Thanks” for helping!
    Please click “Accept as Solution ” on the post that solves your issue to help others find the solution.
    I work on behalf of HP

  • Error while using Oracle Document Capture

    Whenever I scan pictures from the local disk, an error -
    Error 4515, The file is not a valid image comes
    If anybody knows the solution to remove this error, please let me know
    thanks
    Sumit

    Hi Sumit,
    Try reinstalling the documents capturing module with no spaces for the installation path. This should solve the problem

  • I am using photoshop cc 2014.2 And when I use save for Web it gives me a dialog box that says it is using Latin characters I've never seen this before. Does anyone have an answer for this problem or do I need to go back to the older version of Photoshop.

    I'm using Photoshop cc 2014.21 I you save for Web and dialog box comes up that says that Photoshop is using Latin characters and that servers cannot recognize Latin characters in webpages and you may have trouble viewing these pages does anyone have an answer for this problem?

    What Firefox version is currently installed on the computer?
    You can find the full version of the current current Firefox release (37.0.2) in all languages and all operating systems here:
    *https://www.mozilla.org/en-US/firefox/all/
    If you have a very old Firefox version then you could consider to uninstall that version to clean up existing registry keys (especial the uninstall key).
    Make sure NOT to remove "personal data" when you uninstall Firefox, because that will remove all profile folders and you lose personal data like bookmarks and passwords including data in profiles created by other Firefox versions.
    Check the Firefox program folder and remove the Firefox program folder if there are still files left in it.
    *(32 bit Windows) "C:\Program Files\Mozilla Firefox\"
    *(64 bit Windows) "C:\Program Files (x86)\Mozilla Firefox\"
    It is important to delete the Firefox program folder to remove all the files and make sure that there are no problems with files that were leftover after uninstalling.
    *http://kb.mozillazine.org/Uninstalling_Firefox
    Your bookmarks and other personal data are stored in the Firefox profile folder and won't be affected by an uninstall and (re)install, but make sure NOT to remove personal data when you uninstall Firefox as that will remove all Firefox profile folders and you lose your personal data.
    *http://kb.mozillazine.org/Profile_folder_-_Firefox
    *http://kb.mozillazine.org/Profile_backup
    *http://kb.mozillazine.org/Standard_diagnostic_-_Firefox#Clean_reinstall

  • Non latin characters in Safari search

    In Safari 6 the search and URL fields are combined. That's fine, except...
    We can no longer search using non-Latin characters, because the field accepts only Latin characters. I was trying to search for a Japanese term, and when I switch to Hiragana input and move to the search field, the input switches back to English.
    What's the workaround??

    The problem has gone away. I suspect it was a problem with corrupted prefs. I trashed the Safari prefs and rebooted to clear another problem and no longer have the problem with search using Japanese characters.
    FWIW, the problem I had when I trashed the prefs was with trying to mail a Safari page. I ran in to this originally a week or two ago and called Apple who told me to delete the Safari prefs and reboot. (Actually they gave an alternate procedure to try first, but I didn't bother.) That appears to be a recurrent problem and since there was no hesitation on the solution when I called I would guess that it will be fixed in an early patch. I had already tried trashing the mail prefs since that's where the problem actually appeared (an extra copy of Mail would open, and then hang) but it was in fact the Safari prefs that was causing the problem. I've had to do the delete-and-reboot routine every few days. Not sure why the reboot is required, but it obviously is since just quitting Safari or even logging out doesn't fix it.

  • Non latin characters in .cfm filename

    Hi - I have users who want to name files with non latin characters.  i.e.
    Логотип_БелРусь_2500x1.cfm
    We get a file not found error, it is not an IIS issue and we have UTF-8 encoding and are running CF8.
    Yes we can rename the files but for now would like to know if non latin characters are allowed in .cfm file names.
    Thank you!
    Sapna

    PaulH wrote:
    en_US is the JRE locale. is that the same as the OS? and what file encoding?
    (check via cfadmin).
    i ask, because pretty sure you can't use non-ascii file names w/cf. there's an
    open bug on that:
    http://cfbugs.adobe.com/cfbugreport/flexbugui/cfbugtracker/main.html#bugId=77177
    only can guess that file encoding isn't latin-1, etc. and/or OS locale equals
    the same language as the file name.
    cfadmin gives pretty much the same information. Here's a direct copy
    Server Product
    ColdFusion
    Version
    9,0,0,241018  
    Edition
    Developer  
    Serial Number
    Operating System
    Windows 2000  
    OS Version
    5.0  
    Update Level
    /C:/ColdFusion9/lib/updates/hf900-78588.jar  
    Adobe Driver Version
    4.0 (Build 0005)  
    JVM Details
    Java Version
    1.6.0_12  
    Java Vendor
    Sun Microsystems Inc.  
    Java Vendor URL
    http://java.sun.com/
    Java Home
    C:\ColdFusion9\runtime\jre  
    Java File Encoding
    Cp1252  
    Java Default Locale
    en_US  
    File Separator
    Path Separator
    Line Separator
    Chr(13)

  • Cannot create file with Non-latin characters- I/O

    I'm trying to create a file w/ Greek (or any other non-latin) characters ... for use in a RegEx demo.
    I can't seem to create the characters. I'm thinking I'm doing something wrong w/ IO.
    The code follows. Any insight would be appreciated. - Thanks
    import java.util.regex.*;
    import java.io.*;
    public class GreekChars{
         public static void main(String [ ] args ) throws Exception{
              int c;
              createInputFile();
    //          String input = new BufferedReader(new FileReader("GreekChars.txt")).readLine();
    //          System.out.println(input);
              FileReader fr = new FileReader("GreekChars.txt");
              while( (c = fr.read()) != -1)
                   System.out.println( (char)c  );
         public static void createInputFile() throws Exception {
              PrintStream ps = new PrintStream(new FileOutputStream("GreekChars.txt"));
              ps.println("\u03A9\u0398\u03A0\u03A3"); // omega,theta,pi,sigma
              System.out.println("\u03A9\u0398\u03A0\u03A3"); // omega,theta,pi,sigma
              ps.flush();
              ps.close();
              FileWriter fw = new FileWriter("GreekChars.txt");
              fw.write("\u03A9\u0398\u03A0\u03A3",0,4);
              fw.flush();
              fw.close();
    // using a printstream to create file ... and BufferedReader to read
    C:> java GreekChars
    // using a Filewriter to create files  .. and FileReader to read
    C:> java GreekChars
    */

    Construct your file writer using a unicode format. If
    you don't then the file is written using the platform
    "default" format -probably ascii.
    example:
    FileWriter fw = new FileWriter("GreekChars.txt",
    "UTF-8");I don't know what version of FileWriter you are using, but not that I know of take two string parameters. You should try checking the API before trying to help someone, instead of just making things up.
    To the OP:
    The proper way to produce a file in UTF-8 format would be this:
    OutputStreamWriter writer = new OutputStreamWriter(new FileOutputStream("filename"), "UTF-8");Then to read the file, you would use:
    InputStreamReader reader = new InputStreamReader(new FileInputStream("filename"), "UTF-8");

  • We cannot type Polish (non-latin) characters in WebDynpro applications

    We cannot type Polish (non-latin) characters in WebDynpro application (in runtime) because 'Browser Help Shortcuts' are fired.
    To type a polish character in polish keyboard you need to press AltGr + letter (ie. AltGr + a/c/e/s/o/l/z/x/n). To type an uppercase polish character you need to press AltGr + Shift + letter. This comination is in fact the same as pressing Alt + Ctrl + Shift + letter (because AltGr produces Alt + Ctrl) and it fires some of 'Browser Help Shortcuts'. For example AltGr + Shift + O should produce a letter O with a dash on it's top but instead it fires 'Show nesting of HTML containers'.
    We tried to turn off sap-wd-lightspeed, but then other key combinations are reserved for u2018Browser Help Shortcutsu2019.
    We need to be able to use AltGr + Shift + a/c/e/s/o/l/z/x/n in runtime.
    Product: SAP NW 7.11 SP04
    WebDynpro for Java
    I hope there is a somewhere a hidden parameter that solves our problem Maybe we're in some kind of debug mode?
    Thanks for your help!!

    The funny thing is that bold font [when message unread in message list] shows OK, ie in greek, but when i click on unread message, it is assumed to have been read, so it changes over to medium [non bold] and the encoding changes as well into the one that is not greek and thus unreadable.  In ~/.sylpheed/sylpheedrc the fonts are:
    widget_font=
    message_font=-microsoft-sylfaenarm-medium-r-normal-*-*-160-*-*-p-*-iso8859-7
    normal_font=-monotype-arial-medium-r-normal-*-12-*-*-*-*-*-iso8859-7
    bold_font=-monotype-arial-bold-r-normal-*-12-*-*-*-*-*-iso8859-7
    small_font=-monotype-arial-medium-r-normal-*-12-*-*-*-*-*-iso8859-7
    In /etc/gtk, for gtk1.2 apps the file refering to greek encoding [el] seems to be fine [exactly the same as in slackware 9.1].

  • Distributed Document Capture and date fields

    Greetings!
    We are using Oracle Distributed Document Capture and Universal Content Management as repository, it works fine. However we defined a custom metadata field with data type "date" in UCM and we defined a scan profile with its asssociated date field and the problem that we have is when we are using the distributed document capture client and the user selects a date from the calendar and send the document, the commit process fails with the following error message :
    The field 'xxx' does not contain a valid date.
    However If the user types the date field as "29/01/2009 00:00" the commit process works fine and the document is stored on UCM.
    My question is how to enable the calendar on the Distributed Document Capture client in order to enter a valid date with format "dd/m/yyyy hh:mm"
    Thanks!

    Thanks for your answer I set the property "Automatically assign a Content ID on check in" Should I specify the minor revision label and the major revision label?.
    Because I set just the "Automatically assign a Content ID on check in" and now I got the following when I am scanning documents:
    ***Error*** Failed to process batch 01354425_01354446. Error Number: 40011, Error: Failed to commit batch (with delete). Commit Profile - ucm: Document file already exists. Commit is configured to abort. Document File: \\svrdocumentos_ejc_ejercito_mil_co\batch\00J\0000000D\0000000D_1.TIF
    The good news is the previous error "Error Number: 40008" is solved
    I really appreciate yor help.
    Thanks.

  • Auto Commit in Oracle Document Capture

    Hi,
            I have recently installed and configured Oracle Document Capture 10g as a part of IPM installation. I have configured ODC to Scan, Index and commit to UCM 11g system. Scanning and indexing are going fine without and problem. However, I could not make the commit automatic. Please help me configure an auto-commit, so that I don't have to go to ODC every time and do a manual commit. Or is it possible to auto-commit at all ?
    Thank you
    Rama

    In this case I think the problem is somewhere else. Normally, an indexed batch is committed immediately when you you confirm the message at the end of the last page. For that you don't even need the Commit server configured. Commit server with Indexing is used mainly in use cases, when you want to schedule committing (e.g. to a certain time). I guess the Commit Server is not activated, because the batch is not fully processed. AFAIK, recently released 11g is much better option for small batches.
    I'd recommend you to try the Commit Server together with the Import/Recognition Server, which can be fully automated. Maybe this will enable you to understand what the problem with Indexing could be.

  • HT2506 hi i can't seem to copy and paste pdf files from preview to a microsoft word document. it always comes up with nonsense characters in the word document. how do i fix this? thanks in advance

    hi i can't seem to copy and paste pdf files from preview to a microsoft word document. it always comes up with nonsense characters in the word document. how do i fix this? thanks in advance

    Hi,
    The quickest way to do this (tested with OS 10.8.2, MS Word 2011) is in one line:
    tell application "Microsoft Word" to close every document whose name begins with "book"
    The reason why your script doesn't work is that you aren't specifying to Word which document you want it to close.
    So, changing one line ought to do it:
    close document incrementvalue -- instead of "close document"
    But... when you close a document, you change the document count. Document 2 becomes document 1, document 3 becomes document 2 etc. So eventually you'd get a message saying "The object you are trying to access doesn't exist".
    Because of this, if you want to do it in a repeat loop, you have to work backwards:
    tell application "Microsoft Word"
      set doc_count to count documents
      repeat with x from doc_count to 1 by -1
      if name of document x begins with "book" then close document x
      end repeat
    end tell
    Hope this helps.

Maybe you are looking for