ISO vs UTF-8

La migración de la BBDD de una aplicación desde un encoding ISO-8859-1 a otra con enconding UTF-8 puede presentar problemas durante el proceso de import debido al tratamiento que el encoding UTF-8 realiza de los campos CHAR, VARCHAR y VARCHAR2.
El origen de los problemas viene provocado por que en ISO-8859-1 cualquier carácter (normales, acentuados, ñ, etc…) ocupa 1 byte mientras que en UTF-8, los caracteres “especiales” ocupan más de 1 byte.
Para evitar este problma es necesario especificar los campos CHAR, VARCHAR y VARCHAR2 con el atributo CHAR ya que de esta manera se consigue desligar a los datos del encoding de la BBDD en la que resida. En caso contrario ORACLE asumirá por defecto que el tamaño implica “bytes” y como consecuencia un encoding UTF-8 producirá problemas al manejar caracteres especiales.
Por otro lado, la realización de este cambio deja en manos de ORACLE la decisión sobre el tamaño físico de datos a emplear de forma que, en el caso que los VARCHAR que no contengan caracteres extraños su tamaño coincidirá exactamente al tamaño en bytes en TS pero, un campo que contenga uno (o más) caracteres especial, verá incrementado el espacio empleado en disco de forma proporcional. En este caso, ORACLE recomienda considerar un incremento de un 15% sobre el espacio de disco ocupado en la versión ISO.
http://www.fancypeople.org

Google Translate does it thusly
>
The migration of a DB application from an encoding ISO-8859-1 to another with enconding UTF-8 can present problems during the import process due to treatment than UTF-8 encoding performed in the fields CHAR, VARCHAR and VARCHAR2.
The origin of the problem that is caused by ISO-8859-1 any character (normal, accentuated, ñ, etc ...) while occupies 1 byte in UTF-8 characters "special" occupy more than 1 byte.
To work around this problme is necessary to specify the fields CHAR, VARCHAR and VARCHAR2 to CHAR attribute as is done this way to separate the data encoding of the DB where you live. Otherwise default ORACLE assume that the mean size "bytes" and result in UTF-8 encoding will cause problems when handling special characters.
Furthermore, the implementation of this change ORACLE leaves the decision on the physical size of data to be used so that, for the VARCHAR containing no strange characters will match exactly the size in bytes in size but TS A field containing one (or more) special characters, will increase the disk space used proportionately. In this case, Oracle recommends considering an increase of 15% of the disk space occupied in the ISO version.
>
I do not see a question or issue in there :-)
Srini

Similar Messages

  • Invalid Characters shown in UTF-8 character set

    There is an XMLP report whose template output character set is ISO-8859-1. The character set ISO-8859-1 is required for this report as per Spanish Authorities. When the report is run, output gets generated in the output directory file of application server. This output file doesn't contain any invalid characters.
    But when the output is opened from SRS window, which opens it in a browser, the invalid characters are shown for characters like Ñ , É etc.
    Investigation done:
    Found that the output generated on the server is having ISO encoding and hence doesn't contain any invalid characters. Whereas the output generated from SRS window, it is in UTF encoding, so it seems the invalid characters are displayed when conversion takes place from ISO to UTF-8 format.
    Created the eText output using the data xml and template using BI publisher tool, the output is in ISO encoding. So if i go and change the encoding to UTF-8 by opening it in explorer or Notepad++, invalid charcters are shown for Ñ, É etc.
    Is there any limitation, that output from SRS window will show only in UTF-8 encoding? If not then please suggest.
    Thanks,
    Saket
    Edited by: 868054 on Aug 2, 2012 3:05 AM
    Edited by: 868054 on Aug 2, 2012 3:05 AM

    Hi Srini,
    When customer is viewing output from the SRS window, then it contains invalid characters because it is in UTF-8 character set. Customer is on Oracle OnDemand so they cannot take the output generated on the server.Every time they have to raise a request to Oracle for the output file. So the concern here is, why don't the output from SRS window show output with valid characters ?
    The reason could be conversion of ISO format to UTF-8. How could this be resolved ? Does SRS window output cannot generate in ISO format ?
    A quick reply will be appreciated as customer is chasing for an update.
    Thanks,
    Saket
    Edited by: 868054 on Aug 7, 2012 11:08 PM

  • File character encoding format conversion

    Hi People,
    uname -a
    Linux abcd.us.com 2.6.32-400.21.1.el5uek #1 SMP Wed Feb 20 01:35:01 PST 2013 x86_64 x86_64 x86_64 GNU/Linux
    I am trying to convert a file abcd.dat
    # file abcd.dat
    abcd.dat: binary Computer Graphics Metafile
    #file -i abcd.dat
    abcd.dat: application/octet-stream
    I've tried dos2unix in vain.
    dos2unix: converting file abcd.dat to UNIX format ...
    dos2unix: converting file abc.dat to UNIX format ...
    dos2unix: problems converting file abc.dat
    I've used iconv successfully earlier with this command
    iconv -f UTF-16 -t UTF-8  abcd.dat > abcd.abc
    only, this time I do not know the "from" format of the file,
    #iconv -l
    The following list contain all the coded character sets known.  This does
    not necessarily mean that all combinations of these names can be used for
    the FROM and TO command line parameters.  One coded character set can be
    listed with several different names (aliases).
      437, 500, 500V1, 850, 851, 852, 855, 856, 857, 860, 861, 862, 863, 864, 865,
      866, 866NAV, 869, 874, 904, 1026, 1046, 1047, 8859_1, 8859_2, 8859_3, 8859_4,
      8859_5, 8859_6, 8859_7, 8859_8, 8859_9, 10646-1:1993, 10646-1:1993/UCS4,
      ANSI_X3.4-1968, ANSI_X3.4-1986, ANSI_X3.4, ANSI_X3.110-1983, ANSI_X3.110,
      ARABIC, ARABIC7, ARMSCII-8, ASCII, ASMO-708, ASMO_449, BALTIC, BIG-5,
      BIG-FIVE, BIG5-HKSCS, BIG5, BIG5HKSCS, BIGFIVE, BS_4730, CA, CN-BIG5, CN-GB,
      CN, CP-AR, CP-GR, CP-HU, CP037, CP038, CP273, CP274, CP275, CP278, CP280,
      CP281, CP282, CP284, CP285, CP290, CP297, CP367, CP420, CP423, CP424, CP437,
      CP500, CP737, CP775, CP803, CP813, CP819, CP850, CP851, CP852, CP855, CP856,
      CP857, CP860, CP861, CP862, CP863, CP864, CP865, CP866, CP866NAV, CP868,
      CP869, CP870, CP871, CP874, CP875, CP880, CP891, CP901, CP902, CP903, CP904,
      CP905, CP912, CP915, CP916, CP918, CP920, CP921, CP922, CP930, CP932, CP933,
      CP935, CP936, CP937, CP939, CP949, CP950, CP1004, CP1008, CP1025, CP1026,
      CP1046, CP1047, CP1070, CP1079, CP1081, CP1084, CP1089, CP1097, CP1112,
      CP1122, CP1123, CP1124, CP1125, CP1129, CP1130, CP1132, CP1133, CP1137,
      CP1140, CP1141, CP1142, CP1143, CP1144, CP1145, CP1146, CP1147, CP1148,
      CP1149, CP1153, CP1154, CP1155, CP1156, CP1157, CP1158, CP1160, CP1161,
      CP1162, CP1163, CP1164, CP1166, CP1167, CP1250, CP1251, CP1252, CP1253,
      CP1254, CP1255, CP1256, CP1257, CP1258, CP1361, CP1364, CP1371, CP1388,
      CP1390, CP1399, CP4517, CP4899, CP4909, CP4971, CP5347, CP9030, CP9066,
      CP9448, CP10007, CP12712, CP16804, CPIBM861, CSA7-1, CSA7-2, CSASCII,
      CSA_T500-1983, CSA_T500, CSA_Z243.4-1985-1, CSA_Z243.4-1985-2,
      CSA_Z243.419851, CSA_Z243.419852, CSDECMCS, CSEBCDICATDE, CSEBCDICATDEA,
      CSEBCDICCAFR, CSEBCDICDKNO, CSEBCDICDKNOA, CSEBCDICES, CSEBCDICESA,
      CSEBCDICESS, CSEBCDICFISE, CSEBCDICFISEA, CSEBCDICFR, CSEBCDICIT, CSEBCDICPT,
      CSEBCDICUK, CSEBCDICUS, CSEUCKR, CSEUCPKDFMTJAPANESE, CSGB2312, CSHPROMAN8,
      CSIBM037, CSIBM038, CSIBM273, CSIBM274, CSIBM275, CSIBM277, CSIBM278,
      CSIBM280, CSIBM281, CSIBM284, CSIBM285, CSIBM290, CSIBM297, CSIBM420,
      CSIBM423, CSIBM424, CSIBM500, CSIBM803, CSIBM851, CSIBM855, CSIBM856,
      CSIBM857, CSIBM860, CSIBM863, CSIBM864, CSIBM865, CSIBM866, CSIBM868,
      CSIBM869, CSIBM870, CSIBM871, CSIBM880, CSIBM891, CSIBM901, CSIBM902,
      CSIBM903, CSIBM904, CSIBM905, CSIBM918, CSIBM921, CSIBM922, CSIBM930,
      CSIBM932, CSIBM933, CSIBM935, CSIBM937, CSIBM939, CSIBM943, CSIBM1008,
      CSIBM1025, CSIBM1026, CSIBM1097, CSIBM1112, CSIBM1122, CSIBM1123, CSIBM1124,
      CSIBM1129, CSIBM1130, CSIBM1132, CSIBM1133, CSIBM1137, CSIBM1140, CSIBM1141,
      CSIBM1142, CSIBM1143, CSIBM1144, CSIBM1145, CSIBM1146, CSIBM1147, CSIBM1148,
      CSIBM1149, CSIBM1153, CSIBM1154, CSIBM1155, CSIBM1156, CSIBM1157, CSIBM1158,
      CSIBM1160, CSIBM1161, CSIBM1163, CSIBM1164, CSIBM1166, CSIBM1167, CSIBM1364,
      CSIBM1371, CSIBM1388, CSIBM1390, CSIBM1399, CSIBM4517, CSIBM4899, CSIBM4909,
      CSIBM4971, CSIBM5347, CSIBM9030, CSIBM9066, CSIBM9448, CSIBM12712,
      CSIBM16804, CSIBM11621162, CSISO4UNITEDKINGDOM, CSISO10SWEDISH,
      CSISO11SWEDISHFORNAMES, CSISO14JISC6220RO, CSISO15ITALIAN, CSISO16PORTUGESE,
      CSISO17SPANISH, CSISO18GREEK7OLD, CSISO19LATINGREEK, CSISO21GERMAN,
      CSISO25FRENCH, CSISO27LATINGREEK1, CSISO49INIS, CSISO50INIS8,
      CSISO51INISCYRILLIC, CSISO58GB1988, CSISO60DANISHNORWEGIAN,
      CSISO60NORWEGIAN1, CSISO61NORWEGIAN2, CSISO69FRENCH, CSISO84PORTUGUESE2,
      CSISO85SPANISH2, CSISO86HUNGARIAN, CSISO88GREEK7, CSISO89ASMO449, CSISO90,
      CSISO92JISC62991984B, CSISO99NAPLPS, CSISO103T618BIT, CSISO111ECMACYRILLIC,
      CSISO121CANADIAN1, CSISO122CANADIAN2, CSISO139CSN369103, CSISO141JUSIB1002,
      CSISO143IECP271, CSISO150, CSISO150GREEKCCITT, CSISO151CUBA,
      CSISO153GOST1976874, CSISO646DANISH, CSISO2022CN, CSISO2022JP, CSISO2022JP2,
      CSISO2022KR, CSISO2033, CSISO5427CYRILLIC, CSISO5427CYRILLIC1981,
      CSISO5428GREEK, CSISO10367BOX, CSISOLATIN1, CSISOLATIN2, CSISOLATIN3,
      CSISOLATIN4, CSISOLATIN5, CSISOLATIN6, CSISOLATINARABIC, CSISOLATINCYRILLIC,
      CSISOLATINGREEK, CSISOLATINHEBREW, CSKOI8R, CSKSC5636, CSMACINTOSH,
      CSNATSDANO, CSNATSSEFI, CSN_369103, CSPC8CODEPAGE437, CSPC775BALTIC,
      CSPC850MULTILINGUAL, CSPC862LATINHEBREW, CSPCP852, CSSHIFTJIS, CSUCS4,
      CSUNICODE, CSWINDOWS31J, CUBA, CWI-2, CWI, CYRILLIC, DE, DEC-MCS, DEC,
      DECMCS, DIN_66003, DK, DS2089, DS_2089, E13B, EBCDIC-AT-DE-A, EBCDIC-AT-DE,
      EBCDIC-BE, EBCDIC-BR, EBCDIC-CA-FR, EBCDIC-CP-AR1, EBCDIC-CP-AR2,
      EBCDIC-CP-BE, EBCDIC-CP-CA, EBCDIC-CP-CH, EBCDIC-CP-DK, EBCDIC-CP-ES,
      EBCDIC-CP-FI, EBCDIC-CP-FR, EBCDIC-CP-GB, EBCDIC-CP-GR, EBCDIC-CP-HE,
      EBCDIC-CP-IS, EBCDIC-CP-IT, EBCDIC-CP-NL, EBCDIC-CP-NO, EBCDIC-CP-ROECE,
      EBCDIC-CP-SE, EBCDIC-CP-TR, EBCDIC-CP-US, EBCDIC-CP-WT, EBCDIC-CP-YU,
      EBCDIC-CYRILLIC, EBCDIC-DK-NO-A, EBCDIC-DK-NO, EBCDIC-ES-A, EBCDIC-ES-S,
      EBCDIC-ES, EBCDIC-FI-SE-A, EBCDIC-FI-SE, EBCDIC-FR, EBCDIC-GREEK, EBCDIC-INT,
      EBCDIC-INT1, EBCDIC-IS-FRISS, EBCDIC-IT, EBCDIC-JP-E, EBCDIC-JP-KANA,
      EBCDIC-PT, EBCDIC-UK, EBCDIC-US, EBCDICATDE, EBCDICATDEA, EBCDICCAFR,
      EBCDICDKNO, EBCDICDKNOA, EBCDICES, EBCDICESA, EBCDICESS, EBCDICFISE,
      EBCDICFISEA, EBCDICFR, EBCDICISFRISS, EBCDICIT, EBCDICPT, EBCDICUK, EBCDICUS,
      ECMA-114, ECMA-118, ECMA-128, ECMA-CYRILLIC, ECMACYRILLIC, ELOT_928, ES, ES2,
      EUC-CN, EUC-JISX0213, EUC-JP-MS, EUC-JP, EUC-KR, EUC-TW, EUCCN, EUCJP-MS,
      EUCJP-OPEN, EUCJP-WIN, EUCJP, EUCKR, EUCTW, FI, FR, GB, GB2312, GB13000,
      GB18030, GBK, GB_1988-80, GB_198880, GEORGIAN-ACADEMY, GEORGIAN-PS,
      GOST_19768-74, GOST_19768, GOST_1976874, GREEK-CCITT, GREEK, GREEK7-OLD,
      GREEK7, GREEK7OLD, GREEK8, GREEKCCITT, HEBREW, HP-ROMAN8, HPROMAN8, HU,
      IBM-803, IBM-856, IBM-901, IBM-902, IBM-921, IBM-922, IBM-930, IBM-932,
      IBM-933, IBM-935, IBM-937, IBM-939, IBM-943, IBM-1008, IBM-1025, IBM-1046,
      IBM-1047, IBM-1097, IBM-1112, IBM-1122, IBM-1123, IBM-1124, IBM-1129,
      IBM-1130, IBM-1132, IBM-1133, IBM-1137, IBM-1140, IBM-1141, IBM-1142,
      IBM-1143, IBM-1144, IBM-1145, IBM-1146, IBM-1147, IBM-1148, IBM-1149,
      IBM-1153, IBM-1154, IBM-1155, IBM-1156, IBM-1157, IBM-1158, IBM-1160,
      IBM-1161, IBM-1162, IBM-1163, IBM-1164, IBM-1166, IBM-1167, IBM-1364,
      IBM-1371, IBM-1388, IBM-1390, IBM-1399, IBM-4517, IBM-4899, IBM-4909,
      IBM-4971, IBM-5347, IBM-9030, IBM-9066, IBM-9448, IBM-12712, IBM-16804,
      IBM037, IBM038, IBM256, IBM273, IBM274, IBM275, IBM277, IBM278, IBM280,
      IBM281, IBM284, IBM285, IBM290, IBM297, IBM367, IBM420, IBM423, IBM424,
      IBM437, IBM500, IBM775, IBM803, IBM813, IBM819, IBM848, IBM850, IBM851,
      IBM852, IBM855, IBM856, IBM857, IBM860, IBM861, IBM862, IBM863, IBM864,
      IBM865, IBM866, IBM866NAV, IBM868, IBM869, IBM870, IBM871, IBM874, IBM875,
      IBM880, IBM891, IBM901, IBM902, IBM903, IBM904, IBM905, IBM912, IBM915,
      IBM916, IBM918, IBM920, IBM921, IBM922, IBM930, IBM932, IBM933, IBM935,
      IBM937, IBM939, IBM943, IBM1004, IBM1008, IBM1025, IBM1026, IBM1046, IBM1047,
      IBM1089, IBM1097, IBM1112, IBM1122, IBM1123, IBM1124, IBM1129, IBM1130,
      IBM1132, IBM1133, IBM1137, IBM1140, IBM1141, IBM1142, IBM1143, IBM1144,
      IBM1145, IBM1146, IBM1147, IBM1148, IBM1149, IBM1153, IBM1154, IBM1155,
      IBM1156, IBM1157, IBM1158, IBM1160, IBM1161, IBM1162, IBM1163, IBM1164,
      IBM1166, IBM1167, IBM1364, IBM1371, IBM1388, IBM1390, IBM1399, IBM4517,
      IBM4899, IBM4909, IBM4971, IBM5347, IBM9030, IBM9066, IBM9448, IBM12712,
      IBM16804, IEC_P27-1, IEC_P271, INIS-8, INIS-CYRILLIC, INIS, INIS8,
      INISCYRILLIC, ISIRI-3342, ISIRI3342, ISO-2022-CN-EXT, ISO-2022-CN,
      ISO-2022-JP-2, ISO-2022-JP-3, ISO-2022-JP, ISO-2022-KR, ISO-8859-1,
      ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5, ISO-8859-6, ISO-8859-7,
      ISO-8859-8, ISO-8859-9, ISO-8859-10, ISO-8859-11, ISO-8859-13, ISO-8859-14,
      ISO-8859-15, ISO-8859-16, ISO-10646, ISO-10646/UCS2, ISO-10646/UCS4,
      ISO-10646/UTF-8, ISO-10646/UTF8, ISO-CELTIC, ISO-IR-4, ISO-IR-6, ISO-IR-8-1,
      ISO-IR-9-1, ISO-IR-10, ISO-IR-11, ISO-IR-14, ISO-IR-15, ISO-IR-16, ISO-IR-17,
      ISO-IR-18, ISO-IR-19, ISO-IR-21, ISO-IR-25, ISO-IR-27, ISO-IR-37, ISO-IR-49,
      ISO-IR-50, ISO-IR-51, ISO-IR-54, ISO-IR-55, ISO-IR-57, ISO-IR-60, ISO-IR-61,
      ISO-IR-69, ISO-IR-84, ISO-IR-85, ISO-IR-86, ISO-IR-88, ISO-IR-89, ISO-IR-90,
      ISO-IR-92, ISO-IR-98, ISO-IR-99, ISO-IR-100, ISO-IR-101, ISO-IR-103,
      ISO-IR-109, ISO-IR-110, ISO-IR-111, ISO-IR-121, ISO-IR-122, ISO-IR-126,
      ISO-IR-127, ISO-IR-138, ISO-IR-139, ISO-IR-141, ISO-IR-143, ISO-IR-144,
      ISO-IR-148, ISO-IR-150, ISO-IR-151, ISO-IR-153, ISO-IR-155, ISO-IR-156,
      ISO-IR-157, ISO-IR-166, ISO-IR-179, ISO-IR-193, ISO-IR-197, ISO-IR-199,
      ISO-IR-203, ISO-IR-209, ISO-IR-226, ISO/TR_11548-1, ISO646-CA, ISO646-CA2,
      ISO646-CN, ISO646-CU, ISO646-DE, ISO646-DK, ISO646-ES, ISO646-ES2, ISO646-FI,
      ISO646-FR, ISO646-FR1, ISO646-GB, ISO646-HU, ISO646-IT, ISO646-JP-OCR-B,
      ISO646-JP, ISO646-KR, ISO646-NO, ISO646-NO2, ISO646-PT, ISO646-PT2,
      ISO646-SE, ISO646-SE2, ISO646-US, ISO646-YU, ISO2022CN, ISO2022CNEXT,
      ISO2022JP, ISO2022JP2, ISO2022KR, ISO6937, ISO8859-1, ISO8859-2, ISO8859-3,
      ISO8859-4, ISO8859-5, ISO8859-6, ISO8859-7, ISO8859-8, ISO8859-9, ISO8859-10,
      ISO8859-11, ISO8859-13, ISO8859-14, ISO8859-15, ISO8859-16, ISO11548-1,
      ISO88591, ISO88592, ISO88593, ISO88594, ISO88595, ISO88596, ISO88597,
      ISO88598, ISO88599, ISO885910, ISO885911, ISO885913, ISO885914, ISO885915,
      ISO885916, ISO_646.IRV:1991, ISO_2033-1983, ISO_2033, ISO_5427-EXT, ISO_5427,
      ISO_5427:1981, ISO_5427EXT, ISO_5428, ISO_5428:1980, ISO_6937-2,
      ISO_6937-2:1983, ISO_6937, ISO_6937:1992, ISO_8859-1, ISO_8859-1:1987,
      ISO_8859-2, ISO_8859-2:1987, ISO_8859-3, ISO_8859-3:1988, ISO_8859-4,
      ISO_8859-4:1988, ISO_8859-5, ISO_8859-5:1988, ISO_8859-6, ISO_8859-6:1987,
      ISO_8859-7, ISO_8859-7:1987, ISO_8859-7:2003, ISO_8859-8, ISO_8859-8:1988,
      ISO_8859-9, ISO_8859-9:1989, ISO_8859-10, ISO_8859-10:1992, ISO_8859-14,
      ISO_8859-14:1998, ISO_8859-15, ISO_8859-15:1998, ISO_8859-16,
      ISO_8859-16:2001, ISO_9036, ISO_10367-BOX, ISO_10367BOX, ISO_11548-1,
      ISO_69372, IT, JIS_C6220-1969-RO, JIS_C6229-1984-B, JIS_C62201969RO,
      JIS_C62291984B, JOHAB, JP-OCR-B, JP, JS, JUS_I.B1.002, KOI-7, KOI-8, KOI8-R,
      KOI8-T, KOI8-U, KOI8, KOI8R, KOI8U, KSC5636, L1, L2, L3, L4, L5, L6, L7, L8,
      L10, LATIN-9, LATIN-GREEK-1, LATIN-GREEK, LATIN1, LATIN2, LATIN3, LATIN4,
      LATIN5, LATIN6, LATIN7, LATIN8, LATIN10, LATINGREEK, LATINGREEK1,
      MAC-CYRILLIC, MAC-IS, MAC-SAMI, MAC-UK, MAC, MACCYRILLIC, MACINTOSH, MACIS,
      MACUK, MACUKRAINIAN, MIK, MS-ANSI, MS-ARAB, MS-CYRL, MS-EE, MS-GREEK,
      MS-HEBR, MS-MAC-CYRILLIC, MS-TURK, MS932, MS936, MSCP949, MSCP1361,
      MSMACCYRILLIC, MSZ_7795.3, MS_KANJI, NAPLPS, NATS-DANO, NATS-SEFI, NATSDANO,
      NATSSEFI, NC_NC0010, NC_NC00-10, NC_NC00-10:81, NF_Z_62-010,
      NF_Z_62-010_(1973), NF_Z_62-010_1973, NF_Z_62010, NF_Z_62010_1973, NO, NO2,
      NS_4551-1, NS_4551-2, NS_45511, NS_45512, OS2LATIN1, OSF00010001,
      OSF00010002, OSF00010003, OSF00010004, OSF00010005, OSF00010006, OSF00010007,
      OSF00010008, OSF00010009, OSF0001000A, OSF00010020, OSF00010100, OSF00010101,
      OSF00010102, OSF00010104, OSF00010105, OSF00010106, OSF00030010, OSF0004000A,
      OSF0005000A, OSF05010001, OSF100201A4, OSF100201A8, OSF100201B5, OSF100201F4,
      OSF100203B5, OSF1002011C, OSF1002011D, OSF1002035D, OSF1002035E, OSF1002035F,
      OSF1002036B, OSF1002037B, OSF10010001, OSF10020025, OSF10020111, OSF10020115,
      OSF10020116, OSF10020118, OSF10020122, OSF10020129, OSF10020352, OSF10020354,
      OSF10020357, OSF10020359, OSF10020360, OSF10020364, OSF10020365, OSF10020366,
      OSF10020367, OSF10020370, OSF10020387, OSF10020388, OSF10020396, OSF10020402,
      OSF10020417, PT, PT2, PT154, R8, RK1048, ROMAN8, RUSCII, SE, SE2,
      SEN_850200_B, SEN_850200_C, SHIFT-JIS, SHIFT_JIS, SHIFT_JISX0213, SJIS-OPEN,
      SJIS-WIN, SJIS, SS636127, STRK1048-2002, ST_SEV_358-88, T.61-8BIT, T.61,
      T.618BIT, TCVN-5712, TCVN, TCVN5712-1, TCVN5712-1:1993, TIS-620, TIS620-0,
      TIS620.2529-1, TIS620.2533-0, TIS620, TS-5881, TSCII, UCS-2, UCS-2BE,
      UCS-2LE, UCS-4, UCS-4BE, UCS-4LE, UCS2, UCS4, UHC, UJIS, UK, UNICODE,
      UNICODEBIG, UNICODELITTLE, US-ASCII, US, UTF-7, UTF-8, UTF-16, UTF-16BE,
      UTF-16LE, UTF-32, UTF-32BE, UTF-32LE, UTF7, UTF8, UTF16, UTF16BE, UTF16LE,
      UTF32, UTF32BE, UTF32LE, VISCII, WCHAR_T, WIN-SAMI-2, WINBALTRIM,
      WINDOWS-31J, WINDOWS-874, WINDOWS-936, WINDOWS-1250, WINDOWS-1251,
      WINDOWS-1252, WINDOWS-1253, WINDOWS-1254, WINDOWS-1255, WINDOWS-1256,
      WINDOWS-1257, WINDOWS-1258, WINSAMI2, WS2, YU
    ==================================================
    also,
    #which od
    /usr/bin/od
    but I don't know how to use it.
    ==================================
    #cat -v abcd.dat
    has a lot of ^@
    ===================================
    #echo $LANG
    en_US.UTF-8
    ======================================================================================
    #hexdump -C abcd.dat|head -5
    00000000  00 22 00 34 00 36 00 32  00 39 00 33 00 22 00 7c  |.".4.6.2.9.3.".||
    00000010  00 22 00 32 00 30 00 31  00 33 00 2d 00 31 00 31  |.".2.0.1.3.-.1.1|
    00000020  00 2d 00 31 00 38 00 20  00 30 00 38 00 3a 00 30  |.-.1.8. .0.8.:.0|
    00000030  00 39 00 3a 00 34 00 38  00 22 00 7c 00 22 00 33  |.9.:.4.8.".|.".3|
    00000040  00 36 00 37 00 22 00 7c  00 22 00 53 00 75 00 73  |.6.7.".|.".S.u.s|
    =======================================================================================
    #vi abcd.tst
    testing
    esc:wq
    #file abcd.tst
    abcd.tst: ASCII text
    Let me know the complete iconv command with from-and-to encoding.
    Appreciate any help.

    Hi BalusC,
    as we write in jsp page as <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    is their something we can write in .properties file
    russian words are not correctly displayed in browser......how we can dispaly it in correct format...??
    i have all russian words in my .properties file
    Thanks a lot

  • PD4ML generating a PDF file with weird characters

    I am running SSM SP07 NW 7.1 with Oracle in HP-UX. If we create a PDF via "Print" or "Mail" the created PDF generates a weird character  where ever a blanck space should be.
    I read that it could be a problem of the string "&nbsp" being read from ISO to UTF-8 encoding. I have tried everything with no result.
    If I transport my SSM configuration to an SQL windows environment it works fine. The problem is I need it in Oracle and HP-UX.
    Has anyone an idea?
    Thanks in advance
    Roberto

    Hello Roberto,
    Do you see the problem when using Operational Reviews as well as when exporting to a PDF ( print or mail)?
    The script looks like it is part of the upgrade, there is no separate script that you can download.
    I checked the note and there are no special instructions to run any script so I assume this runs automatically.
    This note is for SSM 7.5 installs and you can upgrade your system to 7.5 SP07 by following the upgrade guide:
    "Strategy Management Upgrade Guide 7.0 Series to 7.5 Series "
    You can get this at: https://websmp209.sap-ag.de/instguidesCPM-STM
    Select the 7.5 link on the left side to get the list of documents for 7.5.
    Thanks,
    Bill

  • Charset issues

    I have a JSP that pulls an RSS news feed from another site, via javascript. The news is in french and the french characters are not displaying properly on some browsers. I tried setting the charset and encoding as followings...
    <%@ page pageEncoding="UTF-8" contentType="text/html; charset=UTF-8" %>
    <html>
    <head>
    <meta http-equiv="Content-Type" content="text/html;charset=UTF-8" />I also tried ISO-8859-1 and neither seem to do the trick.
    In debugging I noticed the following: If I turn off auto-detect on my computer, it works fine in ISO or UTF. On another computer it only works in UTF and not ISO. Even with auto-detect it doesn't work.
    Anything else I can try?
    thanks
    Edited by: black_lotus on 2-Mar-2010 4:46 PM

    Because the page encoding is set via java. Are you implying that the html retrieved from the feed with javascript can have another encoding?Well if it's a completely different request/response then yeah, it will have its own character encoding associated with that exchange.
    The pageEncoding specifies the character set that your Jsp server (Tomcat or whatever) will send the html to the browser in. It doesn't affect the running of javascript at all as I understand.

  • Is it possible to read/write Chinese character into Oracle 8i(WE8ISO8859P1) in Applet

    I can display Chinese character in swing label or text field, however, when I read
    the Chinese character data from (or write to) Oracle DB, it shows ??? instead. I can't
    change the code page of my DB, (if use ZHT16Big5 instead of WE8Iso8859P1 as the DB page
    code, than everything is OK. :< ) is there any solution for this?
    Server:
    DB:Oracle8i(8.1.6)with Character set: WE8ISO8859P1
    System Environment:NLS_LANG=American_America.WE9ISO8859P1
    OS: Win2000 Advance Server(NLS:950 Traditional Chinese)
    client:
    Applet (written by JDeveloper 3.2) compiled with code: ISO8859_1
    JDBC Driver: Oracle JDBC Thin Driver (ver 8.1.7 or 8.1.6)
    Browser: IE5.5 (with code Western Europe(ISO) or UTF-8 or Traditional Chinese...)
    null

    Eric,
    There is no good way to do this. In your ASP/ADO application I think you are creating a garbage in garbage out situation. Because the client NLS_LANG matches the DB characterset no conversion takes place. Therefore your chinese characters get inserted in the DB. But the data is really garbage.
    When you use Java thin driver a conversion automatically takes place from UCS-2 -> WE8ISO8859P1. The Chinese characters in UCS-2 have no equivalent code point in WE8ISO8859P1, so you end up with replacement characters ??

  • Apostrophe etc converting to question mark in black diamond

    I have read the related threads and do not see an actual answer to the problem other than the suggestion that I turn off Straight Quotes with Smart Quotes which does not make sense as everyone else here has it turned on.  I am one user of many on
    our system and only a few have this problem.  When I send an email it looks fine.  When I receive the reply back my apostrophes and 3 dots etc are replaced with a question mark inside a black diamond.  

    It's related to the character set and character encoding you are using and the easiest, sure-fire fix is to disable smart quotes. Using ISO-8859-1 encoding and a standard font should fix it too, but it may still be a problem for a few recipients. Using the
    basic ACSII characters will guarantee it doesn't happen. 
    What encoding are you using? Look in File, Options, Advanced - near the bottom.  Try Western European (ISO), not UTF-8.
    What font do you use? Stick with one of the standard fonts. 
    Diane Poremsky [MVP - Outlook]
    Outlook & Exchange Solutions Center
    Outlook Tips
    Subscribe to Exchange Messaging Outlook weekly newsletter

  • Using include-xml and different character encodings

    I have static XML documents which contain different character encodings in the xml prolog....
    ISO-8859, UTF-8, and SHIFT_JIS.
    Each of these documents has entities declared in an internal doctype declaration.
    I have an XSQL page with several <xsql:include-xml href="???.xml"/> statements.
    I want to make either --
    1) UTF-8 as the ultimate encoding of the resulting XML data
    2) Force the encodings on the individual documents to be changed to UTF-8
    I want to get a final XML data steam which can be successfully parsed and transformed. The xsql page has a stylesheet reference.
    Any assistance would be greatly appreciated.
    null

    karol wrote:XML and XSLT are separate, but they're not suitable for printing / pdf generation - LaTeX is. XML is really fluid and has no notions of typesetting built in. It's perfect for on-line reports you view with a web browser.
    Can I ask why XML and XSLT aren't suitable for printing / pdf generation?  Using XSL-FO I can define a page size, margin widths, etc., right?  Just because it isn't normally done doesn't immediately make it a bad idea.  I was under the impression that XML was intended to be abstract enough that it could be used for more than just web pages or data transfer.

  • [Muse] Problème : les accents ne s'exporte pas en HTML

    Bonjour,
    Après avoir réalisé mon site avec Muse, lorsque je l'exporte en HTML ou que je le télécharge directement vers hôte FTP, les accents, sont directement converti et impossible de les affichés sur mon site.
    Exemple: le "é" devient sur la page "é"
    Est-ce que quelqu'un aurait une solution SVP?
    Merci d'avance pour votre aide.
    Max

    Please contact your hosting provider. It seems they are specifying a character encoding of iso-8859-utf-8 while the content generated by Muse uses utf-8. You can pass on this information from the W3C validator to the provider:
    [Invalid] Markup Validation of http://www.artisanslabastille.fr/ - W3C Markup Validator
    Abhishek

  • Charset problem on E75

    Hello,
    I'm using an exchange mail account on my phone. It seems like the default charset for the phone is ISO-8859-1 but I need it to be UTF-8. How do I change this on a E75?
    /Zarre

    Hi
    I have the same problem with E75 and embeded mail client for exchange, update to latest sw doesn`t solve issue. Using E66 or E71 I have an option when creating a new e-mail message to change charset from ISO to UTF-8. Does anyone know how to resolve this issue?
    Thanks!

  • Como deixar a codificação padrão "iso-8859-1" ao invés de "utf-8" no navegador?

    Quando faço o acesso ao banco do meu cartão de crédito, a primeira pagina deles é codificada com "UTF-8" e também existe o comando "charset=utf-8" no código HTML, após digitar o número do cartão, ele direciona para outra página, porém codificada com "iso-8859-1", mas sem nenhuma codificação no código HTML.
    Identifiquei que o navegador usa por padrão o "utf-8" quando não existe codificação no código HTML, então gostaria de modificar esse padrão para "iso-8859-1".
    O único lugar que encontrei para essa codificação foi ao pressionar o botão de avançado em "Fontes e cores", porém não funcionou.
    A versão do meu navegado é 23.0.1, entretanto desde a versão 18 que encontro esse problema.

    It is possible that the server sends the file(s) by default with UTF-8 and in such a case this encoding send by the server prevails over all other settings.
    You can see the encoding here: Tools > Page Info > General
    *Press the F10 key or tap the Alt key to bring up the hidden "Menu Bar" temporarily.

  • UTF-8 encoding vs ISO 8859-1 encoding

    The iTunes tech specs call for UTF-8 encoding of the XML feed file; a friend of mine uses feed generator software through his blog that uses ISO 8859 encoding. Is there a way to convert the latter to UTF-8 so that iTunes tags may be successfully added?
    When I tried editing his XML file, I got error messages when I submitted the file to RSS feed validator sites (such as http://feedvalidator.org/. Any help or knowledge is appreciated because I am not the least bit expert in this coding arena.

    You don't need to convert iso 8859-1 (us-ascii) to utf-8 unless you have nonstandard characters. Basically, ascii is a subset of utf-8 and for English it will serve you just fine. You can have iTunes tags in the xml file even if the file itself is encoded in iso 8859-1.
    The error you see at feedvalidator.org is most likely a warning.
    Hope this helps!
    - Andy Kim
    Potion Factory
    http://www.potionfactory.com

  • Xml payload encoding from utf to iso

    Hi Experts,
    Could you please let me know how can I encode he xml payload from utf-8 to ISO-8859-1.
    its bit urgent any help is appreciated.
    Thanks & Regards,
    Ranganath.

    Hi Ranganath,
    Here is the java mapping for PI 7.1 and above which will transform encoding type from utf-8 to ISO-8859-1.
    import java.io.FileInputStream;
    import java.io.FileOutputStream;
    import java.io.InputStream;
    import java.io.OutputStream;
    import java.util.Map;
    import com.sap.aii.mapping.api.AbstractTransformation;
    import com.sap.aii.mapping.api.StreamTransformationException;
    import com.sap.aii.mapping.api.TransformationInput;
    import com.sap.aii.mapping.api.TransformationOutput;
    public class addAttributeToTag2 extends AbstractTransformation {
          * @param args
         public void execute(InputStream in, OutputStream out)
                   throws StreamTransformationException {
              // TODO Auto-generated method stub
              try
                   int c;
                   int count=0;
                   String s="";
                   while(1>0)
                        c=in.read();
                        if(c<0)
                             break;
                        if(count<=2 && (char)c=='?')
                             count++;
                        if(count<=2)
                             s=s+(char)c;
                             if(count==2)
                                  s=s.replaceAll("utf-8","ISO-8859-1");
                                  s=s.replaceAll("UTF-8","ISO-8859-1");
                                  count=3;
                                  out.write(s.getBytes());
                             continue;
                        out.write(c);
                        //System.out.print((char)c);
                   in.close();
                   out.close();
              catch(Exception e)
         public void setParameter(Map arg0) {
              // TODO Auto-generated method stub
         public static void main(String[] args) {
              // TODO Auto-generated method stub
              try{
                   addAttributeToTag2 genFormat=new addAttributeToTag2();
                   FileInputStream in=new FileInputStream("C:\\Apps\\my folder\\sdn\\copy.xml");
                   FileOutputStream out=new FileOutputStream("C:\\Apps\\my folder\\sdn\\copy1.xml");
                   genFormat.execute(in,out);
                   catch(Exception e)
                   e.printStackTrace();
         public void transform(TransformationInput arg0, TransformationOutput arg1)
                   throws StreamTransformationException {
              this.execute(arg0.getInputPayload().getInputStream(), arg1.getOutputPayload().getOutputStream());
    if you are working in PI 7.0 the you need following code
    import java.io.FileInputStream;
    import java.io.FileOutputStream;
    import java.io.InputStream;
    import java.io.OutputStream;
    import java.util.Map;
    import com.sap.aii.mapping.api.StreamTransformation;
    import com.sap.aii.mapping.api.StreamTransformationException;
    public class addAttributeToTag2 implements StreamTransformation {
          * @param args
         public void execute(InputStream in, OutputStream out)
                   throws StreamTransformationException {
              // TODO Auto-generated method stub
              try
                   int c;
                   int count=0;
                   String s="";
                   while(1>0)
                        c=in.read();
                        if(c<0)
                             break;
                        if(count<=2 && (char)c=='?')
                             count++;
                        if(count<=2)
                             s=s+(char)c;
                             if(count==2)
                                  s=s.replaceAll("utf-8","ISO-8859-1");
                                  s=s.replaceAll("UTF-8","ISO-8859-1");
                                  count=3;
                                  out.write(s.getBytes());
                             continue;
                        out.write(c);
                        //System.out.print((char)c);
                   in.close();
                   out.close();
              catch(Exception e)
         public void setParameter(Map arg0) {
              // TODO Auto-generated method stub
         public static void main(String[] args) {
              // TODO Auto-generated method stub
              try{
                   addAttributeToTag2 genFormat=new addAttributeToTag2();
                   FileInputStream in=new FileInputStream("C:\\Apps\\my folder\\sdn\\copy.xml");
                   FileOutputStream out=new FileOutputStream("C:\\Apps\\my folder\\sdn\\copy1.xml");
                   genFormat.execute(in,out);
                   catch(Exception e)
                   e.printStackTrace();
    However as Krish has pointedf out file adapter has option to set encoding type, you can try that option first.
    regards
    Anupam

  • Codepage coverting error utf-8 from System codepage to iso-8859-1 (PI 7.1)

    Hello Experts,
    In our Prcess, we receive an Idoc from an IS-U system and then we send this Idoc with some Header-Information via http-Adapter to a Seeburger System.
    In the outbound communication Channel we have a XI Payload manipulation with xml-Code iso-8859-1.
    We get the Error: Codepage coverting error utf-8 from System codepage to iso-8859-1, and only for this Idoc, where othe similar Idocs runs correctly.
    Is it possible, that the Idoc contains non-utf-8 chars so the error occurs?
    PS: another XI in our landscape uses a http-Channel with the same configuration in a similar process, an it work, so guess the Problem is not in the communication channel.
    thanks,
    best regards

    > Is it possible, that the Idoc contains non-utf-8 chars so the error occurs?
    A would rather think, that there could be any non-iso-8859-1 character be in the IDoc. For example an czech or polish character.

  • HTTP adapter - change encoding from UTF-8 to ISO-8859-1

    Hi,
    I am trying to change the encoding used by the HTTP sender adapter in a scenario.
    However, when I enter ISO-8859-1 in the XML Code under XI Payload Manipulation on the comms channel it has no effect - the paylad still shows as UTF-8 in SXI_MONITOR.
    Am I missing a step or entering the field incorrectly ??
    Thanks
    Colin.

    Hi,
    From help
    Enhancing the Payload
    Some external systems, for example, Web servers in marketplaces, can only process data if it is sent as an HTML form using HTTP.
    A typical HTML form comprises named fields. When transferring a completed form to the server or a CGI program, the data must be transferred in such a way that the CGI script can recognize the fields that make up the form, and which data was entered in which field.
    The plain HTTP adapter constructs this format using a prolog and an epilog. Therefore, there is a particular code method that separates form fields and their data from each other. This code method uses the following rules:
         Individual form elements, including their data, are separated from each other by the character &.
         The name and data of a form element are separated from each other by an equals sign (=).
         Blanks in the entered data (for example, in multiple words) are replaced by a plus sign (+).
        All characters with the (enhanced) ASCII values 128 to 255 (hexadecimal 80 to FF) are transcribed using a hexadecimal sequence, beginning with a percentage sign (%) followed by the hexadecimal value of the character (for example, the German umlaut ö in the character set ISO-8859-1 is transcribed as %F6).
       All characters that occur in these rules as control characters (&, +, =, and %) are also transcribed hexadecimally in the same way as high value ASCII characters
    http://help.sap.com/saphelp_nw2004s/helpdata/en/44/79973cc73af456e10000000a114084/content.htm
    Regards
    Chilla

Maybe you are looking for

  • How do I cancel that association of my iPhone 4S with an Apple ID

    my child took my iPhone without my permission and he started downloading song with his Apple ID, somehow he associated his ID with my iPhone, now I want to download old purchases and when I tap on "download" it appears a message that tells me "an iPh

  • How much hard drive space left

    How do I find out how much hard drive space I'm using? My system profiler doesn't tell me didley.

  • 2 x Cinema displays the same colours?

    I have just bought a 2nd 20" Cinma display, for my Mac G5 dual Core. I already had one which was fine, but now have a second one to run for increased space, but I cant get them the same colors? I open the "about this mac" page and place it in the mid

  • ABAP HR- calculation of workind days (excluding weekend) for an employee

    Hi, I need to get number of working days between two dates excluding weekends for an employee. Please let me know below questions in this regard: 1. I have to use Factory calender or holiday calender for this purpose? 2. Will FM RKE_SELECT_FACTDAYS_F

  • Fly in when scrolling down

    I would like to make an object fly over when you're scrolling down on the page. (And when you scroll up, the object goes back) Is it possible to do this with the 'Animation' panel? I can make an object fly by when the page loads, but not when scrolli