Trouble Using Character Sets - Chinese GB2312

Hi,
I am trying to display my site in Simplified Chinese (GB2312). I have verified that all of the files are encoded with the GB2312 character set: when I open them in a language-capable editor, I can see Chinese characters.
I have also used <cfheader name="Content-Type" value="text/html; charset=gb2312" /> at the top of my Application.cfm file.
Yet when I view the file in any browser, it shows only question marks or odd characters in place of the Chinese characters.
When I look at the browser character encoding settings, they are correct (a check mark next to Simplified Chinese).
Any thoughts? Am I missing a step?
Thanks in advance for any help given.

lan99 wrote:
> I have also used <cfheader name="Content-Type" value="text/html;
> charset=gb2312" /> at the top of my Application.cfm file.
What version of CF?
If CF6 or later, have you used cfprocessingdirective at the top of each file?
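
Both symptoms from the question (question marks and odd characters) can be reproduced outside ColdFusion, which may help narrow down where the mismatch happens. The Java sketch below is only an illustration: the class and method names are made up for this example, and it assumes a JDK that ships the GB2312 charset. In CFML, the analogous fix is to declare each template's encoding with the pageEncoding attribute of cfprocessingdirective, so that the server reads the file with the same charset it was saved in.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class CharsetMismatchDemo {
    // Hypothetical helper: write text with one charset, read it back with another.
    static String roundTrip(String text, Charset writeAs, Charset readAs) {
        return new String(text.getBytes(writeAs), readAs);
    }

    public static void main(String[] args) {
        Charset gb2312 = Charset.forName("GB2312");
        String chinese = "\u4E2D\u6587"; // two common Chinese characters

        // Same charset on both sides: the text survives unchanged.
        System.out.println(roundTrip(chinese, gb2312, gb2312).equals(chinese));

        // Written as GB2312 but read as ISO-8859-1: "odd characters".
        System.out.println(roundTrip(chinese, gb2312, StandardCharsets.ISO_8859_1));

        // Written with a charset that has no Chinese characters: question marks.
        System.out.println(roundTrip(chinese, StandardCharsets.ISO_8859_1,
                StandardCharsets.ISO_8859_1));
    }
}
```

If the page shows question marks, the text was probably pushed through a charset that cannot represent it; if it shows accented Latin characters, the bytes are probably intact but being read with the wrong charset.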

Similar Messages

  • Using Character Set recognized by ProTime

    Is it possible to have the SAP program generating the file for ProTime using the character set WEISO8859PI (currently used by ProTime) ?

    I mean source files.
    Hmm... I am not sure that link is what I need, but I find it a bit hard to tell exactly what it is all about.
    I am new to programming, so most of the preferences in Eclipse don't make much sense to me, although I've been using computers extensively all of my life...
    I tried going to File > Properties > Info as suggested to set the encoding, but it seems to do the same thing as if I ctrl-click (right-click) on a class and choose Properties.
    The issue is that files coming from a PC show up with wrong characters in place of the localized characters I use when I comment the code in Danish (I never use localized characters in the code itself). When I set the encoding to UTF-8, the "wrong" characters are replaced by little question marks. Is the right formatting lost, or is there some way to get it back on track?

  • Conversions between character sets when using 'exp' and 'expdp' utilities

    When I export with the exp utility and the NLS_LANG environment
    variable is not set, the export is done with the US7ASCII character set.
    If the server uses some other character set, e.g. EE8ISO8859P2,
    some national characters can be lost in that conversion.
    But when I use the expdp utility without NLS_LANG set,
    the log file does not mention that the export is done in a character set
    other than the server's. Does that mean it uses the server's character set?

  • Importing from a different character set

    Oracle 8.1.7 / Windows NT
    I'm trying to import a dump file which was created with character set WE8ISO8859P9. My database uses character set UTF8. Some of the records can't be inserted because of error "ORA-1401: Value too large for column". Is this because of the different character sets? If I switch my session to WE8ISO8859P9, imp says "character set conversion from x to y not supported."
    How can I get these last records inserted? Here's an excerpt from the log:
    Verbunden mit: Oracle8i Enterprise Edition Release 8.1.7.0.0 - Production
    With the Partitioning option
    JServer Release 8.1.7.0.0 - Production
    Export-Datei wurde von EXPORT:V08.00.05 über konventionellen Pfad erstellt
    Warnung: Die Objekte wurden von NOC_ADMIN exportiert, nicht von Ihnen.
    Importvorgang mit Zeichensatz WE8ISO8859P9 und Zeichensatz UTF8 NCHAR durchgeführt
    Import-Server verwendet Zeichensatz UTF8 (mögliche Zeichensatzkonvertierung)
    Export-Server verwendet Zeichensatz WE8ISO8859P9 NCHAR (mögliche Zeichensatzkonvertierung)
    . Import NOC_ADMIN's Objekte in NOC_ADMIN
    . . Import der Tabelle "ACCESSROUTERIFS_" 782 Zeilen importiert
    . . Import der Tabelle "ITEM_"
    IMP-00019: Zeile zurückgewiesen aufgrund von Oracle-Fehler 1401
    IMP-00003: Oracle-Fehler 1401 gefunden
    ORA-01401: Eingefügter Wert zu groß für Spalte
    Spalte 1 33886
    Spalte 2
    Spalte 3
    Spalte 4 1323
    Spalte 5
    Spalte 6 11
    Spalte 7 18600
    Spalte 8 18600
    Spalte 9 20-NOV-2000:00:00:00
    Spalte 10 processing
    Spalte 11 inactive
    Spalte 12
    Spalte 13
    Spalte 14 35682.0
    Spalte 15
    Spalte 16
    Spalte 17
    Spalte 18 05.12.00: KD weiss noch nix neues, er wird uns inf...
    Spalte 19
    Spalte 20 kschmid
    Spalte 21 09-FEB-2001:15:50:21
    Spalte 22
    Spalte 23 12
    Spalte 24
    Spalte 25 06-NOV-2000:00:00:00

    Please try Oracle RDBMS support. This issue has to do with Oracle Import.
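
One possible explanation for the ORA-01401 rejections, offered as an assumption rather than a diagnosis of this particular dump: a character that takes one byte in a single-byte set such as WE8ISO8859P9 can take two bytes in UTF8, so a value that exactly filled a byte-sized column may no longer fit after conversion. The Java sketch below shows the size change. It uses ISO-8859-1 as a stand-in for the source character set because every JVM is guaranteed to support it, and the class and method names are illustrative only.

```java
import java.nio.charset.StandardCharsets;

class ByteExpansionDemo {
    // Bytes the text needs in a single-byte Western European charset.
    static int latin1Bytes(String text) {
        return text.getBytes(StandardCharsets.ISO_8859_1).length;
    }

    // Bytes the same text needs once it is converted to UTF-8.
    static int utf8Bytes(String text) {
        return text.getBytes(StandardCharsets.UTF_8).length;
    }

    public static void main(String[] args) {
        String sample = "f\u00FCr"; // three characters, one of them u-umlaut
        System.out.println(latin1Bytes(sample)); // 3
        System.out.println(utf8Bytes(sample));   // 4
    }
}
```

If this is the cause, the rejected rows would be the ones whose text columns are at or near their declared byte length and contain non-ASCII characters, such as the free-text value in column 18 of the log.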

  • DB Creation(character set Problem)

    Hi,
    I am in the process of creating a new database from a full export dump file that I got from a client. In the export log file I can see:
    Export done in WE8MSWIN1252 character set and AL16UTF16 NCHAR character set
    I am creating the database manually at my site and plan to import that full export dump file from the client.
    Do I need to create the DB with national character set WE8MSWIN1252, or can I also use AL32UTF8 as the character set?
    I want to set the correct character set so that my import doesn't fail because of a character set problem.
    Please help me.
    Oracle version at client site: 9.2.0.1
    OS: Windows
    Oracle version at my site: 9.2.0.1
    OS: Linux
    Thank you.

    I can't answer your question because of several things.
    The first is that all versions of 9i are currently in desupport and it has been years since I've seen one.
    But if I was forced to work with 9i I certainly wouldn't do anything at all until I had patched to 9.2.0.8.
    My recommendation would be, unless someone else can help you, to try it and see what happens.
    Nothing ventured nothing gained.

  • Importing converts character set

    Hello, everyone. I'm exporting from Windows 8.1.5 and importing into Linux 8.1.6. Both servers use character set WE8ISO8859P1. But when I import, I get this:
    Export file created by EXPORT:V08.01.05 via direct path
    import done in US7ASCII character set and US7ASCII NCHAR character set
    import server uses WE8ISO8859P1 character set (possible charset conversion)
    export client uses WE8ISO8859P1 character set (possible charset conversion)
    export server uses WE8ISO8859P1 NCHAR character set (possible ncharset conversion)
    The import server originally was set to US7ASCII but I changed it using ALTER DATABASE CHARACTER SET and I've set NLS_LANG=.WE8ISO8859P1 on the import server.
    Suggestions? Your help is much appreciated!
    John

    Nothing is jumping out at me in terms of incorrect settings. Where did you specifically run the import from, Windows or Linux and from what tool Net8 etc? Are you sure NLS_LANG on Linux is now WE8ISO8859P1?

  • Russian Character Set Help

    Hello,
    I have a web application developed using JSP/Oracle.
    The site content is in Russian, and that is the problem.
    Oracle uses character set CL8MSWIN1251.
    If I insert Russian text into the database directly, all works fine.
    But if I use the administration part of the web application, that is, enter content in Russian and submit the form, what I get from the request is not what I posted. It is completely different.
    I tried all the options I found in different forums, such as calling request.setCharacterEncoding with many character sets, but still in vain.
    I will greatly appreciate your help. It's really urgent.
    Thanks in advance

    Thank you for your reply.
    I run the query:
    select * from nls_database_parameters
    where parameter in('NLS_LANGUAGE','NLS_TERRITORY','NLS_CHARACTERSET');
    And it returned:
    PARAMETER          VALUE
    NLS_LANGUAGE       AMERICAN
    NLS_TERRITORY      AMERICA
    NLS_CHARACTERSET   CL8MSWIN1251
    Then I set the variable NLS_LANG=AMERICAN_AMERICA.WE8MSWIN1252; export NLS_LANG
    and tried to run ./frmcmp /oracle/formsp/CORE/FORM_NAME.FMB userid=XXX/XXX@XXX compile_all=yes batch=yes window_state=minimize
    but received FRM-91500: Unable to start/complete the build.
    Strangely, frmcmp cannot run whenever I change the NLS_LANG environment variable. If I unset NLS_LANG, frmcmp works properly.
    Maybe it has something to do with the configuration of locale and fonts in Solaris?
    I ask because I cannot enter Russian either in graphical mode (when working through VNC) or in console mode (when connecting via PuTTY).
    It is worth mentioning that when I run the form, all the data from the database appears in the fields correctly, in Russian. Only the characters that belong to the form itself (such as form titles, button labels, etc.) display incorrectly.
    Please help me find a solution.

  • Unrecognised Char in GB2312 character set using java InputStreamReader??

    I am reading the following Chinese GB2312 HTML file from
    http://news.xinhuanet.com/local/2007-02/13/content_5732705.htm
    using an InputStreamReader with GB2312 encoding, as shown below:
    import java.io.BufferedReader;
    import java.io.FileInputStream;
    import java.io.InputStreamReader;
    public class ReadGB2312Html {
        //........TmpText declarations.....
        static StringBuilder TmpText = new StringBuilder(); // assumed type; the original declaration was elided
        public static void main(String[] args) {
            // try-with-resources closes the reader even if reading fails
            try (BufferedReader br = new BufferedReader(
                    new InputStreamReader(new FileInputStream(args[0]), "GB2312"))) {
                String strLine;
                while ((strLine = br.readLine()) != null) {
                    TmpText.append(strLine);
                    TmpText.append("\r\n");
                }
                // bw.close(); // bw is declared in the elided part of the original
            } catch (Exception e) {
                e.printStackTrace();
            }
        }
    }
    The TmpText variable does not display the last character in the article properly: （记者夏珺） comes out as （记者夏?B）.
    Inside the HTML file, the unrecognised character is represented by an undecodable byte followed by B (shown as �B). Why is this so?
    In the internet browser it is displayed and recognised as a Chinese GB2312 character. Why is it not recognised by the Java InputStreamReader?
    Any help or explanation would be much appreciated.

    Yes, it is not a GB2312 character.
    The �B character is the byte pair AC 42 in hex (0x42 is the ASCII code for B), which is outside the GB2312 range, where both bytes must fall between A1 and FE; it is in GBK.
    Copied from Wikipedia:
    GBK is an extension of the GB2312 character set for simplified Chinese characters, used in the People's Republic of China.
    GB stands for National Standard, while K stands for Extension. GBK not only extended the old standard GB2312 with Traditional Chinese characters, but also with Chinese characters that were simplified after the establishment of GB2312 in 1981. With the arrival of GBK, certain names with characters formerly unrepresentable, like the "rong" (镕) character in former Chinese Premier Zhu Rongji's name, are now representable.

    Thanks a lot. I will use the GBK charset to read all GB2312 files, since GB2312 is a subset of GBK.
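
The subset relationship described in this exchange can be checked with a strict decoder, which reports invalid bytes instead of silently substituting a replacement character the way InputStreamReader does. This is a sketch under stated assumptions: the class and method names are invented for the example, and it assumes the JDK provides both the GBK and GB2312 charsets.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;

class GbkSupersetDemo {
    // Strict decode: returns null when the bytes are not valid in the
    // named charset, rather than inserting a replacement character.
    static String strictDecode(byte[] bytes, String charsetName) {
        try {
            return Charset.forName(charsetName).newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes))
                    .toString();
        } catch (CharacterCodingException e) {
            return null;
        }
    }

    public static void main(String[] args) {
        // The last two characters of the reporter's name from the article.
        String name = "\u590F\u73FA";
        byte[] gbkBytes = name.getBytes(Charset.forName("GBK"));

        // true: GBK can represent both characters.
        System.out.println(name.equals(strictDecode(gbkBytes, "GBK")));
        // true: the second character is not valid GB2312, so decoding fails.
        System.out.println(strictDecode(gbkBytes, "GB2312") == null);
    }
}
```

Reading files labelled GB2312 with the GBK charset, as the original poster decided to do, is safe for pure GB2312 content because every valid GB2312 byte pair means the same thing in GBK.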

  • Chinese Character Sets

    I'm setting up a web site, as a favor for a friend, that will
    be in Chinese and French. I'm not sure what the best page encoding
    would be. My friend sent me some sample Chinese, and she said, "The
    font I use is 'Simsun'." Otherwise, she is not very technical.
    There are apparently 4 character sets available in
    Dreamweaver that will display Chinese (after a fashion):
    1. charset=big5: displays in both code and design view, but
    Dreamweaver complains that not all characters can be displayed when
    I save the file, and some characters appear as "?" in both IE7 and
    Firefox. My friend says the characters don't look good.
    2. charset=gb2312: displays correctly in Dreamweaver and both
    of my browsers, and my friend says it looks OK.
    3. charset=hz-gb-2312: displays correctly in Dreamweaver code
    view, but not design view. Displays in both browsers I use, and my
    friend says it looks OK. This is apparently a 7 bit code that will
    encode both Chinese (as two byte characters) and ASCII. There is an
    escape character, "~" for toggling between the two sets. The HTML
    Dreamweaver creates seems to deal with this transparently.
    4. charset=utf-8: doesn't display in Dreamweaver (code or
    design view). Does seem to display correctly in my browsers, but my
    friend doesn't like it.
    I also noticed that although the characters displayed on my
    desktop system, I got all "?" on my laptop. Then it occurred to me
    that my friend had used my desktop to check her e-mail, so she had
    apparently installed the correct font(s). I displayed the pages in
    IE on my laptop, was prompted to install the fonts (Firefox didn't
    mention a problem), and then the text also displayed on my laptop.
    I intend to deal with this problem by creating any home page text
    (there won't be much) as Fireworks images. Anyone looking at the
    other Chinese text will undoubtedly have the correct fonts already.
    If any of you have done anything like this before, I would be
    grateful for any suggestions.

    Clean & Sober wrote:
    > 2. charset=gb2312: displays correctly in Dreamweaver and both of my browsers,
    > and my friend says it looks OK.
    GB2312 is the correct character set for Chinese as used in the People's
    Republic. It's sometimes known as "simplified Chinese". That doesn't
    mean it's simple, but that the characters have been modernized by
    removing some of the more complex strokes.
    To view pages written in Chinese or any other non-alphabetic script, you
    need to have the correct fonts installed. That shouldn't be a problem
    for the target audience.
    David Powers
    Adobe Community Expert
    Author, "Foundation PHP for Dreamweaver 8" (friends of ED)
    http://foundationphp.com/

  • HOW can I enter text using Japanese character sets?

    The "Text, Plates, Insets" section of the LOOKOUT(6.01) Help files states:
    "Click the » button to the right of the Text field to expand the field for multiple line entries. You can enter text using international character sets such as Chinese, Korean, and Japanese."
    Can someone please explain HOW to do this? Note, I have NO problem inputting Hiragana, Katakana, and Kanji into MS Word; the keyboard emulates the Japanese layout and characters (Romaji is default), the IME works fine converting Romaji, and I can also select characters directly from the IME Pad. I have tried several different fonts with success and am currently using MS UI Gothic.ttf as default. Again, everything is normal and working in a predictable manner within Word.
    I cannot get these texts into Lookout. I can't cut/paste from HTML pages or from text editors, even though both display properly. Within Lookout with JP selected as language/keyboard, when I type directly into the text field, the IME CORRECTLY displays Hiragana until <enter> is pressed, at which point all text reverts to question marks (?? ???? ? ?????). If I use the IME Pad, it does pretty much the same. I managed to get the "Yen" symbol to display, though, if that's relevant. As I said, the font selected (in text/plate font options) is MS UI Gothic with Japanese as the selected script. Oddly enough, at this point the "sample" window shows the exact Hiragana character I want displayed in Lookout, but Lookout won't display it. I've also tried staying in English and copying Unicode characters from the Windows Character Map. Same results (Yen sign works, Hiragana WON'T).
    Help me!
    JW_Tech

    JW_Tech,
    Have you changed the regional setting to Japanese?
    Doug M
    Applications Engineer
    National Instruments

  • Character sets - UTF8 or Chinese

    Hi,
    I am looking into enhancing the application I have built in Oracle to save/display data in Chinese and English. I have been looking into how to change the character set of a database to accept different languages, i.e. different characters.
    From what I understand, I can create a database that uses a Chinese character set (apparently English ASCII characters are also part of any Chinese character set), or I can set the database to use a Unicode multi-byte character set (UTF8), which seems to be okay for all languages.
    Has anyone had any experience of a) changing an existing standard 7-bit ASCII database into a database which can handle Chinese, and/or b) the differences and implications of using a Chinese versus a Unicode character set?
    I am using Oracle RDBMS 8.1.7 on SuSE Linux 7.2
    Thanks in advance.
    Dan

    If the data is segmented so that character set 1 data is in a table and character set 2 data is in another table then you may have a chance to salvage the data with help from support. The idea would be to first export and import only your CL8MSWIN1251 data to UTF8. Be careful that your NLS_LANG is set to CL8MSWIN1251 for export so that no conversion takes place. Confirm the import is successful and remove CL8MSWIN1251 data from database. Oracle support can now help you override the character set via ALTER database to say MSWIN1252. Now selectively export/import this data, again make sure NLS_LANG is set to MSWIN1252 for export so that no conversion takes place. Confirm the import is successful and remove MSWIN1252 data from database. And then do the same steps for 1250 data.

  • UTF8 character set conversion for chinese Language

    Hi friends,
    I would like a basic explanation of the UTF8 feature: how does it help when converting data from Chinese?
    I would also like to know which characters UTF8 will not support when converting from Chinese.
    Thanks & Regards
    Ramya Nomula

    Not exactly sure what you are looking for, but on MetaLink, there are numerous detailed papers on NLS character sets, conversions, etc.
    Bottom line: most Chinese characters, traditional or simplified, sit in the Basic Multilingual Plane and take 3 bytes in UTF-8. Only the rarer supplementary characters beyond U+FFFF need 4 bytes, and that is where Oracle's UTF8 and AL32UTF8 differ.
    Do a google search on "utf8 al32utf8 difference", and you will get some good explanations.
    e.g., http://decipherinfosys.wordpress.com/2007/01/28/difference-between-utf8-and-al32utf8-character-sets-in-oracle/
    Recently, one of our clients had a question on the differences between these two character sets since they were in the process of making their application global. In an upcoming whitepaper, we will discuss in detail what it takes (from a RDBMS perspective) to address localization and globalization issues. As far as these two character sets go in Oracle, the only difference between AL32UTF8 and UTF8 character sets is that AL32UTF8 stores characters beyond U+FFFF as four bytes (exactly as Unicode defines UTF-8). Oracle’s “UTF8” stores these characters as a sequence of two UTF-16 surrogate characters encoded using UTF-8 (or six bytes per character). Besides this storage difference, another difference is better support for supplementary characters in AL32UTF8 character set.
    You may also consider posting your question on the Globalization Support forum, which pertains more to these types of questions.
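
The storage difference quoted above (four bytes versus six for characters beyond U+FFFF) can be reproduced with a small calculation. This Java sketch is not Oracle code: the class and method names are made up, and the second method simply models the quoted description of Oracle's "UTF8" by encoding each UTF-16 surrogate as its own three-byte sequence.

```java
import java.nio.charset.StandardCharsets;

class SupplementaryBytesDemo {
    // Bytes needed in standard UTF-8, which the quoted text says AL32UTF8 follows.
    static int standardUtf8Length(String text) {
        return text.getBytes(StandardCharsets.UTF_8).length;
    }

    // Bytes needed when every UTF-16 code unit, surrogates included, is
    // encoded on its own, as the quoted text describes for Oracle's "UTF8".
    static int surrogatePairwiseLength(String text) {
        int total = 0;
        for (int i = 0; i < text.length(); i++) {
            char unit = text.charAt(i);
            if (unit < 0x80) {
                total += 1;
            } else if (unit < 0x800) {
                total += 2;
            } else {
                total += 3; // covers surrogates U+D800..U+DFFF as well
            }
        }
        return total;
    }

    public static void main(String[] args) {
        String bmp = "\u4E2D";                 // common Chinese character
        String supplementary = "\uD840\uDC00"; // U+20000, beyond U+FFFF

        System.out.println(standardUtf8Length(bmp));                // 3
        System.out.println(surrogatePairwiseLength(bmp));           // 3
        System.out.println(standardUtf8Length(supplementary));      // 4
        System.out.println(surrogatePairwiseLength(supplementary)); // 6
    }
}
```

For everyday Chinese text the two schemes give the same size; they diverge only for supplementary characters.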

  • It's possible to convert 'WE8MSWIN1252' character to chinese character set?

    Hi All,
    Does anyone know how to convert "WE8MSWIN1252" characters to a Chinese character set in order to display Chinese words in Oracle APEX?
    My problem is that I can't display Chinese characters in Oracle APEX. The Chinese field is shown like °×ѪÇò¼ÆÊý. I'm using the WE8MSWIN1252 database character set.
    I'm wondering whether it's possible to show the Chinese words.
    I would appreciate it if anyone has a good solution to share with me.
    Thanks a lot in advance!

    WE8 is a Western European character set. If you wish to store and access a globalized multibyte character set you must have a database that supports it: You don't have one at the moment.
    Given this is Apex I'd suggest you read the docs and reinstall.
    Alternatively you could try CSSCAN and CSALTER and perhaps you can make the change but be very careful and have a good backup before you try.
    http://www.morganslibrary.org/reference/character_sets.html
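
The garbled field value in the question looks like Chinese double-byte data being displayed as Windows-1252 text. If that assumption holds, the original bytes may still be intact, and re-reading them with a Chinese charset would recover the text. The Java sketch below is a diagnostic aid only, with made-up class and method names, and it assumes the JDK provides the windows-1252 and GBK charsets. It does not replace the advice above: the database still needs a character set that can store Chinese.

```java
import java.nio.charset.Charset;

class MojibakeRepairDemo {
    // Re-encode the garbled text with the charset it was wrongly read as,
    // then decode the recovered bytes with the charset they were written in.
    static String repair(String garbled, String wronglyReadAs, String actuallyWrittenAs) {
        byte[] originalBytes = garbled.getBytes(Charset.forName(wronglyReadAs));
        return new String(originalBytes, Charset.forName(actuallyWrittenAs));
    }

    public static void main(String[] args) {
        // The garbled field value from the post, written with Unicode escapes
        // so the example does not depend on the source file's own encoding.
        String garbled = "\u00B0\u00D7\u00D1\u00AA\u00C7\u00F2\u00BC\u00C6\u00CA\u00FD";

        // Ten single-byte characters should come back as five Chinese characters.
        System.out.println(repair(garbled, "windows-1252", "GBK"));
    }
}
```

This round trip only works when every byte survived the wrong interpretation. If the data was converted on the way into the database and some characters were replaced, the original text cannot be recovered this way.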

  • Insert Chinese Data(Character Set

    Can any Chinese DBA tell me how to store Chinese data in an Oracle 8.1.5.0.0 database?
    I tried the combination of UTF8 database character set with UTF8 national character set, and UTF8 database character set with ZHS16MACCGB231280 national character set. A column in a table was defined as nchar/nvarchar2. But when I tried to insert Chinese data into the column using an insert into... command, I always got the error message 'Character Set Mismatch'.

    Originally NCHAR was defined for a few selected fixed-width Asian character sets. In 9i it was redefined to support Unicode. For 8.1.5 you will need to store your Chinese data in CHAR, VARCHAR, and CLOB data types.

  • Message uses a character set that is not supported by the internet service

    Does any one have any advice on how to fix this problem?
    E-mails sent from my iphone 3G periodically arrive in an unreadable form at the recipient. The body of the e-mail has been replaced with the message "This message uses a character set that is not supported by the internet service...." The problem e-mails also include an attachment that contains an unformatted text file containing the original message surrounded by what appears to be lots of formatting data that is displayed as gibberish.
    This occurs sometimes, but not always, even with the same recipients. I am sending e-mail through a G-mail account that is configured on the iphone using IMAP. I have tried the gmail account to use the two available formatting options for mail, but neither fixes the problem.
    I have also upgraded to 2.01 and restored a few times without impact.

    Hi,
    I have a somewhat similar problem with special characters (German umlauts ä, ö, ü...).
    I create a file with Java that has special characters in it. If I open this file, I can view the special characters. But if I attach the file and send it using the following code, the receiver cannot see the umlaut characters in it. They get replaced by _ or ?
    MimeBodyPart mbp2 = new MimeBodyPart();
    FileDataSource fds = new FileDataSource(fileName);
    mbp2.setDataHandler(new DataHandler(fds));
    mbp2.setFileName(output.getName());
    Multipart mp = new MimeMultipart();
    mp.addBodyPart(mbp2);
    msg.setContent(mp);
    Transport.send(msg);
    From your message it looks like you are able to send the mail attachment correctly (preserving special characters).
    Can you tell me what might be wrong in my code?
    I appreciate your efforts in advance.
    Prasad
