Verity & non-English categories

Hi there
I've attached some test code, which demonstrates what I'm
doing. It looks a bit long-winded - it's 60-odd lines - and I'd
usually not post that much, but I kind of have to here. It's pretty
simple, so I hope you don't get frightened away by the apparent
length of my post.
Note: to run the code, you'll need to D/L and install the
Eastern European language pack from here, first:
http://www.macromedia.com/go/verity.
Note the install instructions are wrong, and you should put the
files in {verity}\k2\common, not in the {cf} install dir. For me
the correct dir was C:\ColdFusionSearchService\k2\common.
I am trying to index some Russian-language data, which also
has Russian-language categorisation. The basic indexing works fine:
I can index and search for Russian text. However if I add
Russian-language categories to my indexed records, I cannot search
on those categories: I always get no results.
Oddly, if I change the language of the collection to English
and search on the Russian-language categories, it "works", but all
the Cyrillic data is munged. So "close but no banana" on that one.
I have to concede that today is the first time I've tried to
use Verity in any language other than English, so I figure (hope!)
I am just doing something daft.
Any thoughts?
Adam

Ballocks.
For those using the newsfeed, this line:
<cfset sCategory = "Россия"><!--- change this from "Россия" to be
something in English, and it works fine --->
Might come out with question marks instead of the word
"Russia" in Russian.
It reads OK on the web-UI.
Adam

Similar Messages

  • Forum-board for non-english speaking users?

    Hello!
    As my English isn't very good, and sometimes I don't know how to describe my problem in English, I'd like to ask if it's possible to open a new board for non-English-speaking ArchLinux users?
    Perhaps one in German, French etc.
    There, non-English-speaking users could ask for help in their own language, and other German/French/Spanish-speaking users could help them.
    CU ActionNews

    sarah31 wrote: well i have noticed that there are some more common non-english language users on this forum. there are quite a few germans, various latin languages (spanish, portuguese, etc), and so forth. i wonder if it would not be possible to have categories on this forum for our foreign language users so that if they feel more comfortable working in their own language they can post there. i mean really it is not that much of a drain of resources. if these categories became very active more organization or splitting could then be done.
    The problem with separate categories for different languages is, as somebody already pointed out, that potentially useful information is then also separated. As a non-native English speaker myself, I still feel much more comfortable trying to understand bad English than perfect German or French.
    I also reckon that there are not so many Finnish archers that we could have a forum of our own.. and even if there was a Finnish category I would probably still post to the English forums.
    just my 0.02€
    .murkus

  • Non-English characters

    Hello, I have read several times that since Java uses Unicode, it solves the problems of non-English characters automatically or something like that.
    But my app is not working as expected. Would someone help please?
    I have a client/server combo written in Java. The server can send messages in English or Japanese. The Japanese messages are hard-coded as String literals in the server source code. On the client side, they are displayed on a JEditorPane. But the Japanese characters are all garbled. The OS on the server side and client side are, of course, different.
    My supposition, which is obviously wrong as it is not working, is that since both ends of communication are Java app, I need not worry about any encoding conversions for String literals.
    Suggest me what is wrong here?

    How is the required encoding/decoding supposed to be done?
    When I didn't worry about non-English characters, I did the following, which WORKED.
    // SENDER side
    Socket socket;
    PrintWriter out = new PrintWriter(socket.getOutputStream(), true);
    String outMessage = "my message";
    out.println(outMessage);
    // RECEIVER side
    Socket socket;
    BufferedReader in = new BufferedReader(new InputStreamReader(socket.getInputStream()));
    String inMessage = in.readLine();
    When non-English characters are involved, I did the following, which DID NOT WORK. Please someone correct me.
    // SENDER side
    Socket socket;
    PrintWriter out = new PrintWriter(socket.getOutputStream(), true);
    String outMessage = "my message";
    String utfString = new String(outMessage.getBytes(), "UTF-8");
    out.println(utfString);
    // RECEIVER side
    Socket socket;
    InputStreamReader ins = new InputStreamReader(clientSocket.getInputStream(), "UTF-8");
    BufferedReader in = new BufferedReader(ins);
    String inMessage = in.readLine();
    The received message is still garbled.
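
    One way to read the failing snippet: new String(outMessage.getBytes(), "UTF-8") encodes with the platform default charset and then decodes those bytes as UTF-8, and the PrintWriter still writes with the default charset, so the bytes on the wire are not necessarily UTF-8. A minimal sketch of the usual alternative follows, where both ends name the charset on the stream wrappers. The class name Utf8RoundTrip and the in-memory byte streams are assumptions for illustration; they stand in for socket.getOutputStream() and socket.getInputStream().

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.PrintWriter;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

final class Utf8RoundTrip {
    // Sender: encode the String as UTF-8 while writing it to the stream.
    static byte[] send(String message) {
        // Assumption: this buffer stands in for socket.getOutputStream().
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        PrintWriter out = new PrintWriter(
                new OutputStreamWriter(sink, StandardCharsets.UTF_8), true);
        out.println(message);
        out.flush();
        return sink.toByteArray();
    }

    // Receiver: decode the incoming bytes as UTF-8 while reading one line.
    static String receive(byte[] wire) {
        // Assumption: this buffer stands in for socket.getInputStream().
        try (BufferedReader in = new BufferedReader(new InputStreamReader(
                new ByteArrayInputStream(wire), StandardCharsets.UTF_8))) {
            return in.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Japanese sample text written as Unicode escapes, so the result
        // does not depend on the encoding the source file is compiled with.
        String original = "\u65E5\u672C\u8A9E";
        String received = receive(send(original));
        System.out.println(original.equals(received));
    }
}
```

    If hard-coded literals still arrive garbled with this arrangement, the source file may have been compiled with a different encoding than it was saved in; javac's -encoding option controls that.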

  • Non-english character display as square box

    Hi all,
    I'm not very sure if this question should be asked here or in the JRE board, thus I'm trying here also
    I have been trying an opensourced application called Alliancep2p (could be obtained from www.alliancep2p.com) using JRE 1.6 on an English Windows XP Pro machine.
    The problem:
    All Chinese input is displayed as a "square box". It looks like the programme "gets" the correct character, only that everything is displayed as a "square box".
    It looks like a font issue, though I'm not that sure. Is there any way the default fonts could be changed, or to get the characters correctly displayed?
    Note: I have East Asian fonts installed, and the Java config panel can display Chinese or other non-English characters correctly.
    I tried the same application under GNU/Linux (locale is UTF-8) and Chinese input displays correctly without any problem at all. Does that mean it is not a problem with the application, or?
    The original question in the JRE board:
    http://forum.java.sun.com/thread.jspa?threadID=5265369&tstart=0
    Thanks for all the input.

    I'm not really sure if it's a problem of the application or not. But the fact that it works perfectly under Linux makes me think maybe it's not the problem of the program, and actually their developers said that unicode is being used all over the program and seems like they're not CJK users also.
    I'm not a java guru so I can't really tell from the source if there's anything wrong.

  • Non English characters in FTP transport

    Hi gurus,
    I have this kind of problem: I need to create a text file from an internal table (a table of 10000-character lines), and the file should be created on a given FTP server (currently using FM FTP_R3_TO_SERVER).
    But, and here is the problem, the text file contains some non-English characters (Czech, to be specific), and after the file is created on the FTP server the non-English characters are replaced by #. The characters are fine in the table, and even after creating the file on the local system everything is fine.
    I checked the RFC connections and they are Unicode; the file is sent in bin mode of FTP (ASCII mode made no difference, except that ASCII mode cuts the line to 256 characters, which is not enough), and the parameter character_mode of FM FTP_R3_TO_SERVER is checked.
    I will appreciate any help or "kick" in right direction.
    Thanks
    Martin

    Hi Salehashaikh,
    I'm not using the FM 'FTP_R3_TO_SERVER' any more to transfer the internal table to ftp, so I do not have the code.  Sorry,
    but i will try to answer the question.
    when you are using function module  'FTP_R3_TO_SERVER' to transfer internal data to ftp the parameter fname should be the name or path of the file you are creating. If you only put file name there for example:
         CALL FUNCTION 'FTP_R3_TO_SERVER'
           EXPORTING
             handle         = w_hdl
             fname          = 'test_file.txt'
             character_mode = 'X'.
    this will create file test_file.txt in root directory on ftp.
    You could also put there whole path on the ftp for example
          fname = '/directory/test_file.txt'
    Another approach is to change to the directory with function module 'FTP_COMMAND', with the parameter command set to cd /directory/. The code will look like this:
    data: lv_command type string,
          lv_path type string.
    constants: lc_change_dir type string value 'cd'.
    * body of the program, including the FM calls that connect to the ftp server
    concatenate lc_change_dir lv_path into lv_command separated by space.
    CALL FUNCTION 'FTP_COMMAND'               "this FM will execute any ftp command on ftp server
      EXPORTING
        HANDLE                = lv_handle
        COMMAND               = lv_command
      TABLES
        DATA                  = lt_result.
    CALL FUNCTION 'FTP_R3_TO_SERVER'
      EXPORTING
        handle         = w_hdl
        fname          = 'test_file.txt'
        character_mode = 'X'.
    this "code" will create file test_file.txt in the ftp server on given directory from internal table passed to fm ftp_r3_to_server.
    list of ftp commands could be found http://www.cs.colostate.edu/helpdocs/ftp.html
    good explanation of ftp is here http://wiki.sdn.sap.com/wiki/display/Snippets/ABAPsolutiontoimplementFTP+transactions
    I hope this is understandable and helps to solve your problem
    Martin
    Edited by: Martin Gabris on Feb 3, 2011 1:38 PM

  • Non english characters in DN cannot be retrieved

    We are using Netscape Directory Server 4, protocol V3. We have a problem related to non-English characters appearing in the RDN.
    We publish entries to LDAP using values from a database. For example, we have published an entry to LDAP which, based on the DB values, should have a DN like: ou=Liege BELGIUM ... LGG1a, <other components of DN>. However, when we call the Netscape search API (searching against the uid attribute, which does not have non-English characters), the search returns the entry, but when we then call the getDN() method on the returned LDAP entry, it only returns Li instead of the complete DN value.
    It seems the entry is corrupted in LDAP. I wanted to delete the corrupted entry and re-create a new one to test. I tried many ways, but none of them worked. I think that is because the DN is corrupted, so there is no key value to identify the LDAP entry for any operation (modify, delete).
    Your help and insights are much appreciated.
    Thanks.
    Han Shen

    LDAP uses the UTF8 encoding. You must store data in the directory using the UTF8 encoding. This includes DN values. This also means that if you want to be able to view the values in your native character set and font, you must use an application that can convert the UTF8 LDAP data back to the native character encoding. The directory console by default should work for LATIN-1 (ISO 8859) languages if the LOCALE is set correctly.
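
    To make the UTF-8 requirement concrete, here is a minimal sketch of a check that could be run on a value before it is published to the directory. It assumes, purely for illustration, that the database value contained an accented character ("Liège"); the class name DnEncodingCheck and the sample string are hypothetical and not taken from the thread. A strict decoder rejects the ISO 8859-1 bytes at the first accented character, which would be consistent with a DN that stops after "Li".

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

final class DnEncodingCheck {
    // Returns true only if the bytes form well-formed UTF-8.
    static boolean isValidUtf8(byte[] bytes) {
        try {
            StandardCharsets.UTF_8.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Hypothetical RDN value with an accented character (U+00E8).
        String rdn = "ou=Li\u00E8ge BELGIUM";
        byte[] latin1 = rdn.getBytes(StandardCharsets.ISO_8859_1);
        byte[] utf8 = rdn.getBytes(StandardCharsets.UTF_8);
        // The ISO 8859-1 bytes are not well-formed UTF-8; the UTF-8 bytes are.
        System.out.println(isValidUtf8(latin1));
        System.out.println(isValidUtf8(utf8));
    }
}
```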

  • Non English characters in BIP email

    Hi, my report contains Japanese characters. When I view the output in HTML format, it is displayed properly. But when I click on the send button, enter email parameters like to, cc, bcc, subject, etc. and send it, the Japanese characters are not displayed properly in the mail I receive. The same problem occurs for Spanish and Portuguese texts, and in general for all non-English characters. I am using Oracle Business Intelligence Publisher Release 10.1.3.4. If someone has faced a similar issue, kindly help. Thanks in advance

    Suggestions
    1) Try with NLS_LANG as
    SWEDISH_SWEDEN.WE8DEC
    2) Make a paramform and enter via paramform (unencoded)
    (This is just for testing purpose)
    3) Change machine locale to swedish and try
    4) Which reports version is this ?
    Please see
    BUG 2713695 - NLS CHARACTERS FOR PARAMETERS CHANGE TO QUESTION MARKS WHEN PASSED ON URL BAR
    Get in touch with Support to see if this is the issue and if "yes" get a one-off patch.
    [    All Docs for all versions    ]
    http://otn.oracle.com/documentation/reports.html
    [     Publishing reports to web  - 10G  ]
    http://download.oracle.com/docs/html/B10314_01/toc.htm (html)
    http://download.oracle.com/docs/pdf/B10314_01.pdf (pdf)
    [   Building reports  - 10G ]
    http://download.oracle.com/docs/pdf/B10602_01.pdf (pdf)
    http://download.oracle.com/docs/html/B10602_01/toc.htm (html)
    [   Forms Reports Integration whitepaper  9i ]
    http://otn.oracle.com/products/forms/pdf/frm9isrw9i.pdf
    ---------------------------------------------------------------------------------

  • "Program files" directory problem during Microsoft Office Customization Installer in non-English versions of Windows

    We have a document-level customization solution for Word and are experiencing problems during deployment in an environment running on terminal services. The OS (Windows 2012) is English and Word (2013) is non-English (German). 
    Installation into the "Program Files" folder succeeds. But when trying to open a Word document linked to the specific template, the "Microsoft Office Customization Installer" pops up with the error:
    "There was an error during installation"
    From: file:///C:/Programme/[CompanyName]/[Productname]/[Productname].vsto
    Downloading file:///c:/Programme/[CompanyName]/[Productname]/[Productname].vsto did not succeed.
    Exception: ....
    System.Deployment.Application.DeploymentDownloadException: Download file:///C:/Programme/[Companyname]/[Productname]/[Productname].vsto did not succeed. ---> System.Net.WebException: Could not find a part of the path 'C:\Programme\[Companyname]\[Productname]\[Productname].vsto'.
    ---> System.Net.WebException: ...... ---> System.IO.DirectoryNotFoundException......
    The problem seems to be that the installer is looking for C:\PROGRAMME instead of C:\PROGRAM FILES. C:\PROGRAMME is the German localized name of PROGRAM FILES (http://en.wikipedia.org/wiki/Program_Files).
    The installer deploys the solution correctly into c:\program files, but when a user later tries to start it and the Microsoft Office Customization Installer is called, it tries to access the non-existent "c:\programme" folder. This doesn't exist, because Windows is English.
    Is there anything to be aware of when deploying solutions on a platform which has different languages (mixing/matching of OS language and Office language)?
    Thank you for your help

    Hello,
    1. First, I would like to confirm whether you handled localization for your document-level customization.
    2. Did you follow the "Create a class that defines the post-deployment action" part of "Put the document of a solution onto the end user's computer (document-level customizations only)", and did you get the path with the Environment.SpecialFolder enum?
    To handle this, I would recommend you consider using Environment.SpecialFolder to set that property.
    Regards,
    Carl

  • Oracle 10g - Defining the column name in Non English

    Hi Experts,
    I have an existing application which is developed on Windows using ASP technology and uses Oracle 10g 10.1.0.2.0.
    The application is supported by a database instance within which multiple tablespaces are created for different clients. The application is developed in such a way that some of the tables are created dynamically and the columns are named using the data entered through the UI.
    This application needs to be globalized now. The problem is, the column name entered through the UI can be in any language based on the client's settings and those values in turn will be used for naming the columns in the tables.
    1) Can I have the column names to be named using non english characters in Oracle 10g DB? If so,
    1.1) what should I do to configure the existing Oracle instance to support it?
    1.2) To what level is that configuration possible, is it per DB instance level (or) can it be done at Tablespace level. I would like to configure each tablespace to host tables with columns defined with different languages, say for example, tablespace 1 will have tables with Japanese column names and tablespace 2 will have tables with German column names?
    2) What should I do to make my entire DB to support unicode data i.e., to accept any language strings. Currently all strings are declared as VarChar2, should I change all VarChar2 to NVarChar2 (or) is there a way to retain the VarChar2 as is and make some database wide setting?
    Please note that I do not have an option of retaining the column in English as per the Business Requirement.
    Environment:
    OS - Windows 2003 32 bit
    Oracle 10g 10.1.0.2.0
    UI forms in ASP
    TIA,
    Prem

    1. Yes, you can.
    SQL> create table ÜÝÞ( ßàá number(10));
    Table created.
    SQL> insert into ÜÝÞ values (10);
    1 row created.
    1.1, 1.2 and 2. You can choose UTF as your default character set. It allows the use of non-English characters in VARCHAR columns in your whole database. It is not per tablespace.
    SQL> create table ÜÝÞ( ßàá varchar2(100));
    Table created.
    SQL> insert into ÜÝÞ values ('âãäçìé');
    1 row created.

  • English reviews in non English app stores

    In the past it was possible to see the English / American reviews in non-English app stores. It would be more than nice to get that ability back. In small countries there are often not that many reviews and also English speaking people in non-English countries should be able to read reviews.

    Are you referring to the Customer Reviews? iTunes doesn't distinguish which language is used by the customer in writing the review so it can't filter out those you don't want to see. In fact someone could write one review and use many languages in it, but it seems those that speak German are the most vocal in the Swiss store.
    This is also a user forum. Responses are from other users of Apple products. It is mainly for other users helping you with a problem that they can fix. If you want to give feedback to Apple select Provide iTunes Feedback from the iTunes menu from within the iTunes application.

  • Non-English characters not displaying correctly - Serious Issue

    My corporate email is on a Lotus Domino server with Lotus Traveler installed.
    I have set my PlayBook (with OS 2) up to synchronize with the corporate email through Active Sync (see http://alturl.com/qh3nn), which works perfectly.
    I have however noticed that in some emails special non-english characters are displayed correctly but in some emails special non-english characters are displayed as a black diamond with a question mark inside.
    This is of course a serious issue as most non English speaking countries use some special characters.
    When trying to understand this problem how can I analyse the emails and see what character set is being used?
    And of course better; has someone solved this?

    I am having the same problem. Is there any update available?
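
    The black diamond with a question mark is how many fonts draw U+FFFD, the Unicode replacement character, which a decoder emits when the bytes do not match the character set it was told to use. A minimal sketch of how the symptom can arise, assuming (this is not confirmed in the thread) that the affected mails carry ISO-8859-1 bytes that get decoded as UTF-8; the class name ReplacementCharDemo and the sample word are hypothetical:

```java
import java.nio.charset.StandardCharsets;

final class ReplacementCharDemo {
    // Decodes bytes leniently as UTF-8; invalid sequences become U+FFFD.
    static String decodeAsUtf8(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // "café" encoded as ISO-8859-1: the final byte 0xE9 is not valid UTF-8.
        byte[] latin1 = "caf\u00E9".getBytes(StandardCharsets.ISO_8859_1);
        String shown = decodeAsUtf8(latin1);
        // Reports whether the decoded text contains the replacement character.
        System.out.println(shown.indexOf('\uFFFD') >= 0);
    }
}
```

    Comparing the charset declared in the Content-Type header of a good mail and a bad mail would be one way to test that assumption.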

  • Problem with Non-English Fields Output to PDF by JASPER in JDev10.1.3

    I am using jsprx files(designed in i-report) to generate pdf reports out of an oracle database.
    The non-English fields are shown correctly when I output the report into an HTML or when I view it with JasperView.
    If I try making PDF files (JasperExportManager.exportReportToPdfFile), the static fields containing e.g. Arabic/Chinese characters won't be displayed and dynamic fields from the database with non-English contents will be shown as ??? or null.
    I received some suggestions about using PARAMETERS to feed the report instead of FIELDS, which I think can not be helpful in this case and in general.
    I think this should be a common problem. These are the components I am using:
    itext-1.4.7.jar
    commons-digester-1.7.zip
    jasperreports-1.2.8.jar
    Any comment or help is appreciated.
    Thanks
    Farbod

  • Reading a non-english character

    Hi, I have a trouble with reading a non-english character from a html page.
    I'm taking the word from the html page, and compare it with itself,
    like this
    string.equals("BİTTİ")
    but it returns false.
    is it possible to correct this?

    Specify an encoding for your InputStreamReader, for example:
    BufferedReader in = new BufferedReader(
                new InputStreamReader(new FileInputStream("infilename"), "8859_1"));
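
    The charset passed to the reader has to be the one the page was actually saved in, or equals() will keep returning false. The sketch below illustrates this with the word from the question, which contains U+0130 (capital dotted I), a character that ISO-8859-1 cannot represent. It assumes, for illustration only, that the page bytes are UTF-8; the class name ReadWithCharset and the in-memory byte array are stand-ins for the real file or HTTP stream.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

final class ReadWithCharset {
    // Reads the first line of the given bytes, decoding with the given charset.
    static String firstLine(byte[] page, Charset charset) {
        try (BufferedReader in = new BufferedReader(
                new InputStreamReader(new ByteArrayInputStream(page), charset))) {
            return in.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // The word from the question, written with Unicode escapes.
        String word = "B\u0130TT\u0130";
        // Assumption: the page is stored as UTF-8.
        byte[] page = word.getBytes(StandardCharsets.UTF_8);
        // Decoding with the wrong charset yields a different string ...
        System.out.println(word.equals(firstLine(page, StandardCharsets.ISO_8859_1)));
        // ... while decoding with the matching charset yields an equal one.
        System.out.println(word.equals(firstLine(page, StandardCharsets.UTF_8)));
    }
}
```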

  • How to retrieve non-english characters from a query

    Hello,
    My apologies if this post is not in its proper place, but I was a bit confused where to add it.
    I'm running a query using SQL Developer on a table which contains the names of several companies from many different countries, and one of the checks I need to make to ensure data consistency is to search for all rows in which the company name contains special or non-English characters (ç, ã, ä, for example).
    I don't know what I can use to do this. I tried to collate using NLS_SORT but it didn't work.
    Is there some way to select only the rows that contain these special or non-English characters, excluding from the results the rows that only have English characters? Please bear in mind that we have many languages in this table.
    The field I would like to make the conditions on is VARCHAR2.
    Please let me know if there is any extra information I should provide you so that you can help me.
    Thank you in advance for the help.
    Regards,
    Luís

    Hi Luis,
    "My apologies if this post is not in its proper place, but I was a bit confused where to add it."
    This is the Forum for the SQL Developer Data Modeler product.
    I suggest you try using the SQL and PL/SQL Forum.
    David
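
    The thread does not contain a SQL answer. As a rough illustration of the check being asked for, here is a client-side sketch that treats "non-English" as "contains any character outside the 7-bit ASCII range". That definition, the class name NonAsciiFilter, and the sample company names are all assumptions; the same idea would still have to be expressed in SQL to run inside the database.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

final class NonAsciiFilter {
    // True if the string contains at least one character outside 7-bit ASCII.
    static boolean hasNonAscii(String s) {
        return s.chars().anyMatch(c -> c > 0x7F);
    }

    // Keeps only the names that contain a non-ASCII character.
    static List<String> namesWithNonAscii(List<String> names) {
        return names.stream()
                .filter(NonAsciiFilter::hasNonAscii)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Hypothetical company names; accents are written as Unicode escapes.
        List<String> names = Arrays.asList(
                "Acme Ltd",
                "A\u00E7a\u00ED Ltda",
                "M\u00FCller GmbH");
        // Prints how many of the sample names contain non-ASCII characters.
        System.out.println(namesWithNonAscii(names).size());
    }
}
```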

  • Lack of World or Non-English language films and search filter in Australia

    iTunes Australia doesn't have a search set up for World or Non-English language films in Australia whereas it does in US. Please fix and please also include more titles - very limited to the point of embarrassing Apple...

    We are fellow users here on these forums, if you want to leave feedback for Apple then you can do so via this page : http://www.apple.com/feedback/itunesapp.html
    In terms of the films that are available there, Apple can only sell the films that the studios/rights-holders have granted them licenses to sell there. You can try requesting that films be added, but unless the studios agree to it then Apple won't be able to add them : http://www.apple.com/feedback/itunes.html
