Problem in reading special characters � Microsoft symbols.

Hi All,
I have a text field where user can enter some string and search. Unusually/unfortunately users can copy/paste text from ms-word file and search for that entry in the database. Here they can even copy ms-word special symbols such as ellipsis (�), em dash(�), en dash(�). Since database has such (special character) entries we cannot restrict them from doing this (after all that is the requirement).
Now the problem is when I read the text value in servlet/jsp I find that those special characters are replaced by question marks(?). Because of this database return �no rows found� though there are entries.
What I have observed is when I read the ASCII value � (int)char I am able to get proper value. Using that I am having if else block to replace the question marks with proper symbols. But is there a direct way to do this? I even tried converting the string to another character set type string. But no luck.
(Also when I print the text value to a JSP, I can see proper vales through Internet Browser.)
Below is the sample program(read_n_print.jsp), sysout prints question marks in console, but JSP/HTML shows proper value in browser.
<html>
<%
     String value = request.getParameter("special_text");
     System.out.println("value " + value);
     if(value != null){
          byte[] bytes= value.getBytes();
          try {
               String output = new String(bytes,"ISO-8859-1");     
               System.out.println("output " + output);     
          } catch (Exception e) {
               // TODO Auto-generated catch block
               e.printStackTrace();
%>
<head>
<title>Insert title here</title>
</head>
<body>
<form method="post" action="./read_n_print.jsp">
<input type="text" name="special_text" value=""> <input type="submit" value="Go" > <br>
<font size="8">Value entered is <%= value %></font>
</form>
</body>
</html>

Hi DrClap,
Thanks for your reply.
That article was helpful in understanding character conversions. And it works fine for JSPs.
But when I tried to apply the same in JSF it does not work. May be this is not the right forum to dicuss JSF related things. But if you know how pageEncoding and contentType can be mentioned in JSF. I am using myfaces and I have tried <h:form id="SearchForm" acceptCharset="UTF-8" accept="text/html;charset=UTF-8">. Didnt work....

Similar Messages

  • Problem in reading special characters

    Hi friends
    I am doing one program which read serial port data (for bar code reader) using java application.
    My program is working well and reading serial data but it is not displaying special characters.When i read through hyperterminal i can see all special characters but when see data throught the output of java program these special characters are missing...
    I don't know i am doing something wrong in stream or ???
    My code is
    [CODE]public void serialEvent(SerialPortEvent event) {
              System.out.println("before serialEvent");
              StringBuffer readBuffer1 = new StringBuffer();
              switch (event.getEventType()) {
              case SerialPortEvent.OUTPUT_BUFFER_EMPTY:
                   System.out.println("Data empty:");
                   break;
              case SerialPortEvent.DATA_AVAILABLE:
                      byte[] readBuffer = new byte[100];
                      int r = 0;
                      try {
                           System.out.print("test"); 
                          //t inputStream = serialPort.getInputStream();
                         while((r = inputStream.read(readBuffer)) != -1) {
                            System.out.print(new String(readBuffer, 0, r));
                      } catch (Exception e) {
                         e.printStackTrace(System.out);
    }[/CODE]Please help

    Hi DrClap,
    Thanks for your reply.
    That article was helpful in understanding character conversions. And it works fine for JSPs.
    But when I tried to apply the same in JSF it does not work. May be this is not the right forum to dicuss JSF related things. But if you know how pageEncoding and contentType can be mentioned in JSF. I am using myfaces and I have tried <h:form id="SearchForm" acceptCharset="UTF-8" accept="text/html;charset=UTF-8">. Didnt work....

  • Problem with reading special characters in unix

    Hi,
    Iam trying to read the data from a file by the following code
    FileInputStream inputFile = new FileInputStream(xx);
    InputStreamReader reader = new InputStreamReader(inputFile);
    BufferedReader bufferedReader = new BufferedReader(reader);
    String s= bufferedReader.readLine();
    one of the line in file has the character �
    It working fine on windows, but on unix Iam getting it as ��.
    I tried with InputStreamReader contructor which accepts the charset name also, like by giving Cp1252, and latin1 etc, but iam not able to get around this problem.
    Any Ideas Please?

    ��This suggests to me that your input file is encoded in UTF-8. But that's only a guess, you need to find out for sure. Asking the person who produced the file would be the most reliable way. When you do find out, specify that encoding as the second parameter of the InputStreamReader constructor.

  • Problem while reading special characters in java

    Hi i am faving the follwing xml.
    <ns1:dc_application_bullets>
    <w:p wsp:rsidR="004E4084" wsp:rsidRPr="007024E6" wsp:rsidRDefault="000D5B97" wsp:rsidP="007024E6">
    <w:r>
    <w:rPr>
    <w:rFonts w:ascii="Arial Unicode MS" w:fareast="Arial Unicode MS" w:h-ansi="Arial Unicode MS" w:cs="Arial Unicode MS" w:hint="fareast" />
    <wx:font wx:val="Arial Unicode MS" />
    <w:lang />
    </w:rPr>
    <w:t>������</w:t>
    </w:r>
    </w:p>
    </ns1:dc_application_bullets>
    no i ma trying to parse this XML using java program and jdom parser. when i try to read the special chacters and print then i ma getting ?? on the console.
    how can i print on to the console as the one what they look.
    any help is appreciated.

    please any bosy throw some light on this.You first. Do you think we're mindreaders? Reading and printing code please.

  • ### Problem in retrieving special characters with Oracle 9i JDBC drivers

    hi,
    We are having some problem with retrieving special characters like '�' from the database.
    Our application is using JDK1.3.1 with Oracle 9i at the back end(Version: 9.0.1.0.0). We are using oracle 9i thin drivers (classes12.zip) for database interaction.
    To relieve the data from database we are using PreparedStatement in two ways
    1. Creating a preparedstatement from connection object without any parameters and then retrieve the
    data using it. This gives the results in correct format i.e. special characters like '�'
    2. Create the preparedstatement by passing the following parameters.
    i) ResultSet.TYPE_SCROLL_INSENSITIVE
    ii) ResultSet.CONCUR_READ_ONLY
    In this case we are not able to retrieve the special character like '�' correctly. Instead the ResultSet
    returns 'h'
    I think this is the problem with Oracle drivers. Does anyone have any information about the mentioned problem.
    rgds

    I don't know exactly (because I am using JDK 1.4 with ojdbc14.jar where these problems seem to be rare...) but you may consider this:
    1. Add nls_charset12.zip to your classpath to ensure that the encoders are present (may or may not help)
    2. Swith to JDK 1.4, and do this:
    Instead of String s = getString(column)
    use
    byte[] bytes = getBytes(column);
    ByteBuffer bb = ByteBuffer.wrap(bytes); // in package java.nio
    CharBuffer cb = Charset.forname("ISO-8859-x").decode(bb);
    String s = cb.toString();
    The latter method allows you to perform the encoding/decoding manually.
    3. Change the character encoding in the database to unicode upon database setup.
    4. Try playing with NLS parameters (alter session ...)

  • Problem with Icelandic special characters on Mac

    Hello
    I am working on a Flash publication for students, and I want it to run on Mac as well as PC. Everything goes fine, except a problem with three special characters in my language, Icelandic. I am working on a registration and login page where I am using text boxes and text input boxes. Everything looks correct on PC, but on Mac the characters Þ Ð Ý are lost.
    I have tried different fonts etc.
    Any idea what is wrong?
    Jónas Helgason

    Hello Jónas,
    Did you ever figure this out ?
    I have a similar problem except only with two letters (both upper and lower case).  These two Icelandic letters can't be entered into a Flex TextInput box in the Flex apps I am creating when they are loaded on a Mac.  The letters are known as &Eth, &eth, &Thorn and &thorn in HTML terminology.  Typing these characters on the keyboard results in the following:  { [ ? /
    However I can copy the characters in question from some other app like TextEdit and paste them into a TextInput box in my Flex app and all is well, they show up correctly.
    This happens regardless of the Mac browser used and the Flash plugin version used (have tried both 9 and 10) and also happens in the standalone Flash Player application.
    Does anyone have any idea how to fix this or is this a bug in Flash Player ?  This is really annoying as it makes text input into Flex apps on Icelandic Macs very difficult.
    There must be something wrong with the mapping of keyboard key codes into character codes on the Mac that is causing this.
    Btw, I just heard from a friend that this problem does not exist in MacOS 10.6.  I am running 10.4 and have tested this on 10.5 and it exists on both of those OS versions.
    Rgds,
    Hordur Thordarson
    Lausn hugbunadur
    http://lausn.is

  • Can't read special characters in an excel file using JDBC

    Hi! I 've a code to read an excel file using JDBC-ODBC bridge. I can read the values, but any special characters is readed wrong, just symbols. The special characters are of spanish language. This is my code:
                    Locale currentLocale;
              currentLocale = new Locale("es", "MX");
              Locale.setDefault(currentLocale);
                   Class.forName( "sun.jdbc.odbc.JdbcOdbcDriver" );
                   c = DriverManager.getConnection("jdbc:odbc:Driver={Microsoft Excel Driver (*.xls)};DBQ=comisionesperfiles.xls");
                   stmnt = c.createStatement();
                   String query = "Select * from [Hoja1$]" ;
                   ResultSet rs = stmnt.executeQuery( query );
                   while( rs.next() ){
                        String valor = rs.getString(2) ;
                        if(valor != null && !"null".equalsIgnoreCase(valor)){
                             if(!comisiones.contains(valor)){
                                  System.out.println(valor);
                                  comisiones.add( valor );
                   rs.close();
                   stmnt.close();As you can see, I've tried to set the locale, but it didn't work.
    I'm using Excel 2003, Java Version 1.4.2_07 and Windows XP Professional (in latin american spanish).
    Hope someone can help me!

    FYI: Apache's POI can read/write Excel files in Java:
    http://jakarta.apache.org/poi/index.html

  • Reading special characters from a flat file and inserting into DB

    I'm reading data with special characters like . etc from a flat file , assigning the data to variable in my anonymous block and inserting into my DB. But the show up as inverted ? s. Any clues about how to do this?
    If i try to do the insert directly it works. It seems like the error occurs when reading this data into a variable
    thanks for the help
    Lalit Bhatia

    lalit, this is probably an character set problem, the default on Database creation tends to be 7bit Ascii which does not support special characters, it's been a while since I set up a db in this way, but you need to change settings in oracle.ini. The db will need to be restarted for this. Also, to check current settings try:
    select * from NLS_DATABASE_PARAMETERS
    You want an 8bit, unicode or multibyte character set. Sorry I cannot remember moer off the top of my head, try searching on NLS or character set

  • PDS - XML format, unable to read special characters in a string.

    Hi All,
    I've been using XML files to create my report with Oracle9i's new featue of PDS.
    Now, I'm unable to read a file if it contains '&' or '<' in the string. In fact, the xml file itself not able to crate a xml page in the browser, if these special characters are present.
    Is there any way to solve this problem ?
    I can't tell the users to remove &, < from the description data. Is there any way to workaround for this ?
    Thanks in advance,
    Satya.

    Hi Rohit and Rajesh
    Thanks for your reply.
    In fact, my problem is like this:
    I'll get data in .txt file and save that as a .xml file in the network before calling Oracle report ,which gets data from the .xml.
    The contents are like this:
    <Comapy name> Ford Motors UK & Europe</Company name>
    <Department> HR & Pensions</Depatrment>
    <Salary>Grad4 < 25000 </Salary>
    As per XML entity character representation, I need to replace '&' with '&amp;' and '<' with '<lt;'
    I'm wondering is there any other way to solve this problem as it is difficult to replace '<' and '>' generically.
    Honestly, I couldn't understand what Mr. Rajesh was trying to tell.
    'for & use & ( with semicolon at the end)'
    'for < use <'
    'for > use >'
    Can you shed some light on this ?
    Many thanks
    Satya.

  • Mysql problem with german special characters

    hi,
    I wrote a software and it worked quite good, but after I installed it on a new machine with j2se 1.4 I've problems with the german special characters.
    this code works good on the old machine (jdk 1.3.1) and prints the wanted characters like �,�,�.
    Class.forName("org.gjt.mm.mysql.Driver").newInstance();
    java.sql.Connection conn;
    conn = DriverManager.getConnection("jdbc:mysql://localhost/testdb?user=testuser&password=xxxx");
    Statement s = conn.createStatement();
    ResultSet r = s.executeQuery("select something from testtb where id='1'");
    r.first();
    System.out.println( r.getString(1) );
    but on the new machine (j2se 1.4) I only receive the character ?.
    I updated my org.gjt.mm.mysql to the current MySQL Connector/J 3.0.9 and added
    conn = DriverManager.getConnection("jdbc:mysql://localhost/testdb?user=testuser&password=xxxx&useUnicode=true&characterEncoding=ISO-8859-1");
    but I've got still the same problem.
    Thanks in advance
    Markus

    with "wanted characters like �,�,�"
    I meant: like &#x00E4;,&#x00FC;,&#x00F6;

  • Problem with norwegian 'special characters' (æøå) in LaTeX

    Having installed just about every latex-package in the repos I still cant get this to work. This is what I have in my .tex file:
    \documentclass[12pt,norsk,a4paper]{article}
    \usepackage[norsk]{babel}
    \usepackage[latin1]{inputenc}
    \usepackage[T1]{fontenc}
    \begin{document}
    Testing æøå - ÆØÅ
    \end{document}
    Any suggestions?

    What characters are you trying to put into math mode?
    Generally, with regular LaTeX, you can't put special characters in math mode. You need to use the LaTeX symbol commands instead. For example, use \rightarrow and not →. The only exception would be if you were using XeLaTeX rather than regular (pdf)LaTeX, which generally has better Unicode support, especially if you had loaded the unicode-math packge: but that's still considered in beta status, I believe.
    You can also escape to text mode inside math with the \text command from the amsmath package, e.g., $\text{ÆØÅ}$ should work.
    Last edited by frabjous (2010-09-17 04:12:07)

  • Jar (or Zip) file with problems in accents, special characters

    Hi! I've a servlet that creates a jar (or zip) file and then send it. My problem is that when I create an entry in which the filename has special characters such as accents, when I unjar or unzip de file, it brings a lot of garbage characters. For example, the nex entry:
    Ex�menes.doc
    when unzipped is:
    Ex?�menes.doc
    I've tried a lot of things:
    -Setting the locale to ES, MX
    -Replacing all the letters with special characters with its unicode (like s.replace('�','\u00E1')
    -Trying to convert it to UTF8 (new String(path.getBytes(),"UTF8") )
    -Replacing the file separator char (according to a workaround that I found in the bug database)
    But nothing of this worked, alone or together. I've read that this is (or was) a bug in the API, but don't know if a solution has been found.
    Any help will be greatly appreciated!

    It's not clear what you are asking. Maybe this will help
    http://www.cfdev.com/code_samples/code.cfm/CodeID/83/Java/Simple_Ant_build_xml_Build_Task

  • Problem with reading special char from file

    Hello Oracle community,
    Got a problem when reading from a file. I am using a croatian keyboard and trying to read special charachters (ČĆŽŠĐ) from a file.
    declare
      l_file utl_file.file_type;
      s varchar2(200);
    begin
      l_file := utl_file.fopen('test_dir', 'test.txt', 'R');
      loop
        utl_file.get_line(l_file, s);
        dbms_output.put_line(s);
      end loop;
    exception
      when no_data_found then
        utl_file.fclose(l_file);
    end;But I keep getting this in dbms_output: ČƎŠĐ. For some reason it keeps skipping 2 chars Š and Ž. If I try to insert or update data the values are show correctly. What could be the cause of such a problem?
    Best regards,
    Igor

    Hi Igor,
    Looks like a NLS_LANGUAGE issue. Check the following threads:
    UTL_FILE and NLS_LANG setting
    Re: Arabic characters not displaying in Forms
    Regards,
    Sujoy

  • Problems with using special characters in Interactive Report Search

    Hi!
    I am currently developing an Application on Application Express 3.1.2.00.02 including a page with an Interactive Report, facing the problem that I cannot use special german characters in the Searchbar.
    So if i try to find a name like 'Schröder' the created Filter looks like this 'Schröder' and i won't get any valid search results. By the way the rest of the application supports these special characters like using them in Buttons or any other Page elements.
    Does anyone have a clue how to fix this problem, because it's driving me nuts ;)
    Thanks in advance
    Philipp
    Edited by: philipp_m on 10.06.2009 11:15

    Does noybody have a clue how to solve this problem. I tried to find out where the Problem occures. The Ajax Request looks like this
    f01     contains
    f01     Schröder
    f01     15
    p_flow_id     100
    p_flow_step_id     50
    p_instance     3176950818119673
    p_request     APXWGT
    p_widget_action     QUICK_FILTER
    p_widget_action_mod     ADD
    p_widget_mod     ACTION
    p_widget_name     worksheet
    p_widget_num_return     15
    x01     14175446766823030
    x02     14176526259823035
    So I guess it has to be inside the Javascript file (apex_ns_3_1.js). I hope someone can help me.
    Bye
    Philipp

  • Problems with Turkish special characters

    Hello!
    We are producing Oracle Help with WebWorks Publisher 2003 Professional for FrameMaker. There are some problems with the Turkish version of our online help: The Turkish special characters (for example the dotless i) aren´t displayed correctly in the navigation panes TOC and index. They are replaced by other characters (for example by a "y")
    Has anyone an idea how to solve this problem?
    Thank you very much.
    Kind regards,
    Miriam Rassenhofer

    Miriam,
    What encoding are the TOC and index XML files being generated in? You should use UTF-8 for the minimum of problems. I presume WebWorks has an option for this. Other than that, make sure the top of those XML files has the proper XML declaration with the encoding:
    <?xml version="1.0" encoding="UTF-8"?>You may want to try opening the XML files in an XML-aware text editor to ensure they look right there (JDeveloper is one such editor).
    If all of that is working, post back and we can talk offline about getting a snippet of one of those XML files for us to look at.
    -brian

Maybe you are looking for