Unicode Issues

Hi All,
I have a requirement where i need to get content from a file and then loading it into a table.
The file is containing Some Unicode Characters.
The content is inserted into a CLOB Column in a table.
I placed the file in a directory located in DB Server and loaded into table using below proc.
-- (1) Insert a new row into html_documents with an empty CLOB, and
-- (2) Retrieve the empty CLOB into a variable with RETURNING..INTO
INSERT INTO clob_docs(id,code,
title,
html_document,
category)
VALUES( id, Code,
Title,
empty_clob(),
Category)
RETURNING html_document INTO theCLob;
-- (3) Get a BFile handle to the external file
theBFile := BFileName(dir,file);
-- (4) Open the file
dbms_lob.fileOpen(theBFile);
-- (5) Copy the contents of the BFile into the empty CLOB
dbms_lob.loadCLOBFromFile(dest_lob => theCLob,
src_bfile => theBFile,
amount => dbms_lob.getLength(theBFile),
DEST_OFFSET => dst_offset,
SRC_OFFSET => src_offset,
BFILE_CSID => DBMS_LOB.DEFAULT_CSID,
LANG_CONTEXT => lang_ctx,
WARNING => warning);
-- (6) Close the file and commit
dbms_lob.fileClose(theBFile);
COMMIT;
The file is loaded fine, but inverted commas are loaded in the place of unicode characters.
Example: *¿acc¿¿¿*
Characterset in database is NLS_CHARACTERSET -->AL32UTF8
NLS_NCHAR_CHARACTERSET --> AL16UTF16
Please help me out to avoid this situation and all the characters to be properly loaded.
Thanks.

All text files are, by definition, encoded using some character set. You have to know what character set you are using to know how to translate the series of 1's and 0's in the file into a particular character. The same character will have very different binary representations in different character sets.
We need to know what character set your file is encoded using to know how to load it into the database properly. If you don't specify, Oracle assumes that the file is encoded using the database character set (AL32UTF8). If that is not the actual character set used to encode the file, you'll get character set translation errors which would result in replacement characters getting stored (those are the question marks that you're seeing).
Justin

Similar Messages

  • Unicode issue jdbc receiver adapter

    Hi guys,
    I'm facing an unicode issue while trying to put some data into a MS database.
    The sender system is a SAP ERP 6.0 system.
    When we are trying to transfer some text in polish or korean or something else, this textes are not transferred correctly.
    I already checked the SAP Note 831162 FAQ: XI 3.0 / PI 7.0 / PI 7.1 JDBC Adapter
    and used the url parameter sendStringParametersAsUnicode=true
    Furthermore I changed my mapping to get the sql statements, so that I add the attribute hasQuot="no"
    and a leading N in front of the data string as described in the Note and in some threads in SDN
    Unicode problem in JDBC receiver adapter
    JDBC adapter , problem in inserting Korean Charaters in MS SQL
    but nothing solved my problem.
    any ideas?
    Do all the languages have to be installed on the XI system?
    Kind regards
    Jochen

    Hi Sarvesh,
    thanks for helping.
    Unfortunately I´m not sure what I should exactly do.
    If I get you wright I have to test the mapping using the test functionality in message mapping (test tab).
    I did this already
    The result message looks like this
    <?xml version="1.0" encoding="UTF-8"?>
    <ns0:IDOC_CDB_IMPORT_insert xmlns:ns0="http://endress.com/xi/sbo/cdb">
       <ns0:StatementName1>
          <ns0:dbTableName action="insert">
             <ns0:table>CFG.DBO.IDOC_CDB_IMPORT</ns0:table>
             <ns0:access>
                <ns0:IDOC>CHRMAS</ns0:IDOC>
                <ns0:SEGMENT>E1CABTM</ns0:SEGMENT>
                <ns0:ATNAM></ns0:ATNAM>
                <ns0:SPRAS>PL</ns0:SPRAS>
                <ns0:ATBEZ hasQuot="no">N&apos;D?ugo?? ca?kowita Tl; L:&apos;</ns0:ATBEZ>
             </ns0:access>
          </ns0:dbTableName>
       </ns0:StatementName1>
    </ns0:IDOC_CDB_IMPORT_insert>
    How to proceed?
    copy this message to notepad or ultra edit and then doing what?
    please specify clearly.
    this message will be send to the jbdc adapter and this will insert it to database.
    <ns0:ATBEZ hasQuot="no">N&apos;D?ugo?? ca?kowita Tl; L:&apos;</ns0:ATBEZ>
    will end up in a table entry like D?ugo?? ca?kowita Tl; L:
    has anybody an idea how I will get the wright characters in database, when I test my UDF in eclipse, the variable during debugging has the correct text.
    When output this variable via system.ou.print(text); the same issue occurs, the special characters are shown as ?.
    I appreciate for your help, unfortunately my polish colleagues hustle a little bit.
    kind regards
    Jochen

  • Windows 8.1 Pro Non-Unicode Issue: CS6 Standard

    I use WIndows 8.1 Pro English, I installed the CS6 Standard in Spanish, I changed the language for the non-unicode programs to spanish but still cannot use the Illustrator, I am having issue only with the Illustrator, noting that in the installation I was prompt for Spanish(Latin America) or Just Spanish, I clicked on Spanish(Latin America), but in the non-Unicode languages there is no Spanish(non-Unicode) they have Spanish per countries.

    I have the same problem did you manage to find the solution? is impossible to contact adobe helpers, they just hide all contact

  • Java JCo SAP XI RFC SAP R/3 6.C ... Unicode issue

    Hello,
    My scenario is very simple. I'd like to connect a Java program to a SAP R/3 (MDMP) for RFC calls (in both direction), using SAP XI (some of the RFC calls will redirected to some other systems). I use SAP JCo v2.1.8 and tried with Java 1.4.2_09 and Java 6.0.
    It works well when the Java program is calling a function module, but when SAP R/3 is doing it, the returned texts looks like: S#A#P# #J#C#o# #v#2#.#1#.#8# #(#2#0#0#6#-#1#2#-#1#1#)# (the text should be SAP JCo v2.1.8).
    It looks like an issue with Unicode. Although I didn't check the Unicode checkbox in the RFC sender adapter (the RFC receiver one doesn't offer the possibility to specify unicode or not).
    Do anyone have a solution ? Is it a code in the Java program or something to set in SAP XI / SAP R/3 ?
    Thanks in advance for your help.

    This is a workaround solution. In the case of this prototype it is ok, but the final plan is to migrate a web based application. It was directly connected to one SAP R/3, but now we would like to connect it to SAP XI because a few RFC calls will be re-directed to some other systems, but the majority of them will we'll still be sent to the SAP R/3. We don't want to make development on this part for cost reason (and time). Plus on some text there are the "#" character that is legitimate.

  • JCo - Unicode issue

    Hi,
      Recently, we migrated SAP to unicode version. Due to this, we are getting the below error when we try to connect from SapJCo to SAP. The version of JCo used is 2.1.6. SAP is hosted on Unix and Java applications are hosted on Windows. We tried with jco.client.codepage= 4103; we are getting different error(RFC_ERROR_LOGON_FAILURE: >> unknown RFC error, no error log found <<) when we give the codepage property.
    Please let us know how to resolve the below issue:
    Exception in thread "main" com.sap.aii.proxy.framework.core.BaseProxyException: Conversion error between two character sets., error key: RFC_ERROR_SYSTEM_FAILURE
          at com.sap.aii.proxy.framework.core.AbstractProxy.send$(AbstractProxy.java:150)
    At the SAP side, we are getting the below errors
    Error analysis
    The error probably occurred when installing the
    R/3 system.
    The termination was triggered in function "RfcExtendedReceive"
    of the SAP kernel, in line 301 of the module
    "//bas/700_REL/src/krn/rfc/abrfcrcv.c#5".
    The internal operation just processed is "FUNC".
    Internal mode was started at 20101124144800.
      CLUDE INCL_INSTALLATION_ERROR
    Edited by: manohar mirle on Nov 26, 2010 2:03 PM

    This got resolved.
    We had to make changes to the XML which is embedded in the java code of R3Connector classes. Changed the encoding type and data types in the XML. It went through.

  • Help: Unicode issue in Snow Leopard not in Leopard?

    Hello All,
    I've finally switched from Leopard to Snow Leopard.
    I'm loving the improvements.
    But, I do have one issue.
    I like to keep things tidy and orderly on my computer.
    In Leopard I had been using a number of characters from the Unicode Miscellaneous Symbols and Pictorgraphs block (U+1F300-1F3FF) in the names of my folders and files, to make them easy to organise and quickly identify (a book for books, a family for family, a plane for travel, etc).
    Under Leopard, this system worked very well.
    Under Snow Leopard, it's not working at all. Even though I've placed the correct font in the System folder (Symbola).
    Is there something I can do to fix the problem?
    Why did it work under Leopard and not work under Snow Leopard?
    I'm including a screenshot of the problem in just one folder on my computer.
    But the problem is everywhere.
    Incidentally, the airplane is actually from a different Unicode block (well within the BMP).

    Looks like a bug.  You can report here:
    http://www.apple.com/feedback/macosx.html

  • Unicode Issue - Corrupted long texts in Plant Maintenance Work Orders

    we're having an issue regarding corrupted texts in a Unicode system. here's the scenario:
    - long text came from 4.5b wherein, of course, the japanese characters are viewable either in EN or JA logon
    - now, this long text when viewed in our acceptance/testing system which is already, take note, already in unicode long text is garbled. both in EN and JA logon long texts are garbled. we are expecting that since the system is already in unicode it can now support code page for japanese.
    (here's the tricky part -- the one seen in the field is properly displayed but when trying to view the complete long text, those that are not in the field are garbled.)
    - we also have some scenarios that came from 4.5b, when viewed in the testing system while not unicoded - it's garbled in EN. this is normal since it's still non-unicoded. and of course when viewed using JA logon it's not corrupted anymore. we're expecting that this will also be the same with the issue above. but it's not.
    please see link for screenshots. [http://img16.imageshack.us/img16/9760/unicodeissuee.jpg|http://img16.imageshack.us/img16/9760/unicodeissuee.jpg]

    i know it's weird.
    but then i talked to a friend, she's an abaper. she said that those that can be converted or properly viewed are texts elements in database tables and those that are like free texts or comments can not. and there's no possible way for this but to manually retype everything.
    makes sense right.

  • Flex Mobile Unicode issue

    I am building an application for android using flash builder 4.6. I am trying to display unicode character as in this xml "http://www.bbc.co.uk/hindi/index.xml"  Characters are displayed correctly when run on simulator in flash builder but when installed on android, it displays square boxes [][][][][][][][][]. How can i go about resolving this issue so that font embeds correctly *

    Sounds like you are missing mobilecomponents.swc

  • How to Convert a HEX value to CHAR value...Unicode Issue...

    Hi,
    How can I convert HEX value to CHAR value.
    The Code in <b>non Unicode</b> system is:
    DATA: t_text LIKE tline OCCURS 0 WITH HEADER LINE.
    constants:   c_hex_20a5(2) TYPE x            VALUE '20A5'.
    t_text-tdline = 'seller of the item so listed.  A legend of the Seller Code(s) is as'.
          TRANSLATE t_text-tdline USING c_hex_20a5.
    The same code give error in <b>Uni-Code</b> system:
    <b>error is "c_hex_20a5 must be an character type object (C, N, D, T or String type)."</b>
    If anyone know, what is the solution, please let me know.
    Thanks!
    Puneet.

    Hi,
    Try declaring the Hex chars using the ABAP char utilities. This is just a sample piece of code on how to declare and use:
    CLASS cl_abap_char_utilities DEFINITION LOAD.
    DATA:  ws_lf TYPE c VALUE cl_abap_char_utilities=>cr_lf.
    data:    c_newline           TYPE x VALUE '0D'.  [ it is zero D, for carriage return ]
    ws_lf = c_newline.
        CONCATENATE it_tab-maktx
                    ws_lf
                    ws_template
                    INTO it_notificatn-template.
    Hope this helps...
    Regards
    Subramanian

  • Unicode issue (# in FILE) URGENT

    Hi,
    I am uploading a file with GUI_UPLOAD and downloading this file again on the SAP Server with OPEN DATASET IN BINARY MODE / TRANSFER / CLOSEDATA SET on a UNICODE system.
    When I look in AL11 to the downloaded file I see a lot of # (#<#?#x#m#l# #v#e#r#s#i#o#n#=#"#1#.#0#"#?#>#<#L#I#S#T# ...) (In HEX it shows 00)
    I want to use this file for test purposes in my ABAP-mapping. When I read this file into this mapping, the file is stored in a xstring variable. When I convert this to a string the # still occur.
    How can I get rid of these signs??
    Ron

    Hi Ron,
    take a look at this response and MODE ENCODING:
    Re: File Transfer non-unicode -> unicode via client
    also you may take a look at this:
    http://help.sap.com/saphelp_nw04/helpdata/en/79/c554dcb3dc11d5993800508b6b8b11/content.htm
    Regards,
    michal

  • Inbound IDOC with UNICODE issues

    We are trying to send in an IDOC with Romanian characters that are outside of latin-1.  We have upgraded our 4.7 SAP system to be UNICODE, but when loading these characters via IDOC they are not being represented correctly in R/3.  We have tried setting our login language to RO when doing the input and it doesn't seem to matter.  Is there a setting somewhere that we are missing to designate an inbound IDOC as UTF-8?
    TIA...
    Darrin

    Hi Darrin,
    if the sender of the IDOC is an external RFC program, you should use the "Codepage" parameter in the logon string, to declare your character set. The SAP RFC lib will then translate between external and R/3 internal codepage.
    If you find my answer useful, please don't forget the reward.
    Regards,
    Juergen

  • Unicode issues connecting Crystal Report to MySQL 5.0

    Post Author: Charles Wong
    CA Forum: Data Connectivity and SQL
    Hi,
    We are developing a web application by PHP 4.4.5, Crystal Report XI developer edition (English version), MySQL 5.0 and English Windows 2003 server.
    MySQL has been configured to store unicode values in all levels. In other words, connectivity, server itself, tables, etc are set to unicode.
    In the web browser, PHP can store and show special Portugues characters (e.g. çã), Chinese (both traditional and simplified) normally. In MySQL browser, all these values can be shown successfully.
    In Crystal Report, both Traditional and simplifed Chinese characters show as ???, however. The connection between Crystal Report and MySQL is throught MySQL ODBC 3.5.12 version.
    Does anybody know why? Any special setting needed in Crystal Report or in MySQL or in ODBC???

    Post Author: Charles Wong
    CA Forum: Data Connectivity and SQL
    Hi All,
    I'm replying myself.
    I read something in the Internet. Somebody said Windows and Crystal report use UCS-2 (or UTF-16?) as the unicode encoding scheme. In our the system, we set to UTF-8. Can this be a problem?

  • Unicode issue with Boot Camp XP

    My Boot Camp XP is running perfectly fine. However, I have a piece of software that has a process which involves renaming files, and this renaming now results in some weird file names (the software does not do this when running a "regular" Win XP). The support for the software told me to try changing the unicode setting (to UTF-8) - how do I go about this? Thanks!

    Actually, you must have Windows finish its installation before installing the Bootcamp Drivers. When you get in your desktop for the first time, go to: Start> My Computer> Select your Mac's DVD Drive. Look for an "eject" button in the Windows Explorer window.
    You may also want to check the Bootcamp Installation guide for Mac OS X Snow Leopard (10.6), it comes handy in these situations:
    http://manuals.info.apple.com/en_US/Boot_Camp_Install-Setup_10.6.pdf
    This is the one for Mac OS X Lion (10.7):
    http://manuals.info.apple.com/en_US/boot_camp_install-setup_10.7.pdf
    EDIT:
    Added a video tutorial from CNET, but follow the Apple Provided installation guide:
    http://www.youtube.com/watch?v=9dLcJmDIccc
    This video is more up to date:
    http://www.youtube.com/watch?v=X-CXTovYk3A&feature=related
    Note that the tutorial says that Windows 7 is not compatible, this not longer the case after the release of Bootcamp 3.1.
    Message was edited by: vea1083

  • UNICODE Issue With Chinese Characters

    Dear Experts,
    I have chinese characters in my data. I want to downlaod this data on front end as ANSI file type and the chinese data should be convertable.
    Am able to download using 4120 unicode in gui-downalod. but am not getting fiel format of ANSI if I have chinese characters in my file.
    Please hep me.
    Thanks in Advance.
    Kiran.

    Hi
    You can get the code page of a language using function module NLS_GET_FRONTEND_CP.
    Import parameters               Value
    LANGU                          CH
    FETYPE                          MS
    Export parameters               Value
    FRONTEND_CODEPAGE               1160
    Then pass the code to you program and try.
    Shiva

  • ABAP Unicode issue...

    ABAP gurus,
    We are in the process of upgrading our SAP from 4.6C to ECC 6.0. I am trying to pass some of the programs from UCCHECK. I got following exceptions:
    "XK_HEX00" must be a character-type data object (data type C, N, D, T or STRING)
    "XK_HEX09" must be a character-type data object (data type C, N, D, T or STRING)
    "XK_HEXLF" must be a character-type data object (data type C, N, D, T or STRING)
    We have following statements in the program:
          xk_hex00(1)   type x value '00',      "binaer null
          xk_hex09(1)   type x value '09',      "Steuerzeichen Tabulator
          xk_hexcr(1)   type x value '0D',      "Steuerzeichen CR
          xk_hexlf(1)   type x value '0A',      "Steuerzeichen LF
    Please help.
    Thanks.
    Rajesh.

    Please help how to convert hexadecimal values '00', '09', '0D' and '0A' in type C for unicode validation. I know there is a class called CL_ABAP_CHAR_UTILITIES. But I don't know what attribute to use for what ?
    Thanks.
    Regards,
    Rajesh.

Maybe you are looking for