Oracle Text for Traditional Chinese

I would like to ask how can i specific keywords in Chinese and whenever i type sometimes in Chinese, the Oracle Text's token is indexed very interested and the stop word is definite not sometimes rational in Traditional Chinese. Anyone can help? (DR$xxx$I)

You can specify a stoplist in Chinese. Please refer to the documentation for more information about the CTX_DDL.CREATE_STOPLIST procedure and for the Chinese lexer.

Similar Messages

  • Configuring Oracle Text for 10g

    Hello,
    Sorry, it this sounds like such a newbie question, but is there is a simple guide to the basic configuration and setup of Oracle Text for 10g?
    Thank you for your help.

    Hi,
    Check out note 280713.1. You can verify your install and install manually if necessary.
    If you haven't done anything with the db (created schemas/objects) it might actually be easiest to simply delete the db and create a new one. Just make sure when you do that you select 'custom' from the list of templates and then choose Oracle Text as one of the options.
    Thanks,
    Ron

  • Oracle text for varchar2

    Hi ...
    can i use oracle Text for searching in varchar2 field ....
    IF yes , plz give me the details ....
    Thanks ....

    SELECT OD OID, TAB Layer, COLUM Field, TEX Result,
    score(22) Score FROM VIEW_MASTER
    WHERE CONTAINS ( TEXT_VALUE, SEARCH_TERMS, 22 ) > 0
    ORDER BY Score;
    The search_terms are an inbound parameter. Not sure
    what the 22 does, i think its just an alias name. I
    don't know what the score coming back means.
    Sometimes I get 16, sometimes 12, sometimes 7.
    I could use some help on this myself.Yes, 22 is just an alias. You can use any number here since it is just a label which is used to correlate the CONTAINS function with its corresponding SCORE function.
    The details of how the score is computed are available in the Oracle Text Reference book, Appendix "F The Oracle Text Scoring Algorithm".
    Faisal

  • Oracle text for italian language document

    How i can set Oracle Text index to index an italian text field.
    How can i set the right stop_list, lexer, .....
    Thanks

    I believe if your NLS_LANG settings are set appropriately for Italian, it should automatically use the proper defaults for Italian in a text index.

  • Substring formula problem for Traditional Chinese

    Dear all,
    I created two InfoObjects (type: character, length: 60)in order to load one field with 100 characters from R/3. I created formula in transfer rule using "Substring" function for each of the two InfoObjects (e.g. Substring(field_name, 0, 60) and Substring(field_name, 60, 40). The data is in Traditional Chinese and both R/3 and BW are Unicode verison. The first InfoObject seems always get less than 60 characters, but the second InfoObject get 40 characters correctly. When I use English data to test, both InfoObjects get correct legnth of data. Does anyone come across this problem before? Thanks a lot.

    Download CS6 products
    Mylenium

  • Using Oracle Text for searching with UCM 10g

    I am using Oracle text with UCM 10gR3 and Site Studio 10gR4 and I am trying to sort the search results by relevancy and to also include a snippet of the retrieved document. I have the fields that the SS_GET_SEARCH_RESULTS service returns but the relevancy score is always equals 5 and the snippet contains characters such as < idcnull, /p, etc., which you can see are XML/HTML/UCM tags but which result sin even more strangeness in the snippet if I try to remove them programmatically.
    I have read the Oracle Text documentation and there appear to be ways you can configure Oracle Text but I am not clear at all on what I can do from UCM. It looks like the configuration is either done in database tables or in the query itself, neither of which are readily configurable to me.
    Is anyone experienced in this or know of any documentation this might help?
    Bill

    Hi
    If I remember correctly then this issue was seen with an older version of OTS component and Core Update patch / bundle . Upgrade the UCM instance with the latest CS10gr35 update bundle patchset 6907073 and also upgrade OTS component from the same patchset .
    Let me know how it goes after this .
    Thanks
    Srinath

  • About: Language Pack for Traditional Chinese

    Dear all:
    When support traditional chinese language pack?
    Best regards
    Sam Wu

    No one here knows.
    Apple may release it next week or next year or never. If you want to push them to speed up, you can leave feedback at http://www.apple.com/feedback/pages.html .

  • No keyboard for traditional Chinese?

    How do I type traditional Chinese in iPad? I would hate to see that there's no traditional Chinese input method in iPad.

    You can handwrite the Chinese characters directly into the keypad. You can choose traditional Chinese or simplified Chinese too. Click the "globe" icon in the keyboard to switch to different character sets.
    You have to set up the international character set first in the "Settings" app. Then click "General" > "International" > "keyboards" >"Add new keyboards" > "Traditional Chinese" or "Simplified Chinese"
    Once you set this up, you can use the Chinese keyboard by clicking the "globe" icon in the input keyboard. You can cycle through the keyboard character set back in to English by clicking the "globe" again.
    The handwriting input method is cool. You don't have to use other translation methods. It does a fairly good job to do the character recognition, but if you use traditional Chinese, it is not that smart to figure out the complex characters.

  • Using Oracle Text for Text Mining

    Hi all !
    I have 30000 text documents stored in a table and I want to construct a "dictionnary" that contains all the terms include in those documents. I will use this dictionnary to classify my documents with the text mining solutions of Oracle.
    I created an index with a multi datastore, specific lexer, stoplist and wordlist and it creates well my dictionnary. The problem is that I would like to add some other tokens containing more than a word. I wonder how to use the index created : I tried to work on the table $I created for the index but it's not really successful. So I thought about 2 solutions :
    - work on the index with insert, delete and synchronise operations
    - find a way to insert the additionnal tokens (which corresponds to pattern) during the indexing operation
    Can you help me ?
    Laetitia

    Hi,
    Check out a thesaurus (enter your phrases here) and see if it will do what you need:
    http://download-west.oracle.com/docs/cd/B19306_01/text.102/b14217/cthes.htm#i1006324
    Don't modify the $I table as a general rule (or any system-generated table).
    Thanks,
    Ron

  • Download problem to CS6 (license) for traditional chinese

    Have bought the license of CS6 on Mac and PC, however after receiving the license & serial #, after login to the license page, the download link of CS6 disappeared (only a CS5 download link appeared!).
    Please help!

    Download CS6 products
    Mylenium

  • Localization problem: Traditional Chinese is not working for Taiwan App Store

    I have published an app that has three languages:
    (1) English (as Primary)
    (2) Simplified Chinese
    (3) Traditional Chinese
    I expect I can see Simplified Chinese in China App Store and Traditional Chinese in Taiwan App Store.
    I found out it works well for Simplified Chinese case.
    However, for Traditional Chinese, it's always English in Taiwan App Store. I have tried so many devices (with language setting to Traditional Chinese). I even went to a local Internet Cafe (so all computers are with Traditional Chinese setting), it's still English for Taiwan App Store.
    From iTunes Connect, when I go to "Version Summary" where I can change the "Territory" there. I always see English for Taiwan Territory while it's always Simplified Chinese when China is selected.
    From all the experiments I've done, I really think there is something wrong in App Store or iTunes to link Traditional Chinese with Taiwan App Store.
    How to report this kind of bug to Apple & have them examine this issue?

    Unfortunately you have already done everything we can possibly instruct you to do.. at this point the only option is getting them to exchange it its under warranty its not working you have to get a exchange

  • Is Oracle Text the right solution for this need of a specific search!

    Hi ,
    We are on Oracle 11.2.0.2 on Solaris 10. We have the need to be able to do search on data that are having diacritical marks and we should be able to do the serach ignoring this diacritical marks. That is the requirement. Now I got to hear that Oracle Text has a preference called BASIC_LEXER which can bypass the diacritical marks and so solely due to this feature I implemented Oracle Text and just for this diacritical search and no other need.
    I mean I set up preference like this:
      ctxsys.ctx_ddl.create_preference ('cust_lexer', 'BASIC_LEXER');
      ctxsys.ctx_ddl.set_attribute ('cust_lexer', 'base_letter', 'YES'); -- removes diacritics
    With this I set up like this:
    CREATE TABLE TEXT_TEST
      NAME  VARCHAR2(255 BYTE)
    --created Oracle Text index
    CREATE INDEX TEXT_TEST_IDX1 ON TEXT_TEST
    (NAME)
    INDEXTYPE IS CTXSYS.CONTEXT
    PARAMETERS('LEXER cust_lexer WORDLIST cust_wl SYNC (ON COMMIT)');
    --sample data to illustrate the problem
    Insert into TEXT_TEST
       (NAME)
    Values
       ('muller');
    Insert into TEXT_TEST
       (NAME)
    Values
       ('müller');
    Insert into TEXT_TEST
       (NAME)
    Values
       ('MULLER');
    Insert into TEXT_TEST
       (NAME)
    Values
       ('MÜLLER');
    Insert into TEXT_TEST
       (NAME)
    Values
       ('PAUL HERNANDEZ');
    Insert into TEXT_TEST
       (NAME)
    Values
       ('CHRISTOPHER Phil');
    COMMIT;
    --Now there is an alternative solution that is there,  instead of thee Oracle Text which is just a plain function given below (and it seems to work neat for my simple need of removing diacritical characters effect in search)
    --I need to evaluate which is better given my specific needs -the function below or Oracle Text.
    CREATE OR REPLACE FUNCTION remove_dia(p_value IN VARCHAR2, p_doUpper IN VARCHAR2 := 'Y')
    RETURN VARCHAR2 DETERMINISTIC
    IS
    OUTPUT_STR VARCHAR2(4000);
    begin
    IF (p_doUpper = 'Y') THEN
       OUTPUT_STR := UPPER(p_value);
    ELSE
       OUTPUT_STR := p_value;
    END IF;
    OUTPUT_STR := TRANSLATE(OUTPUT_STR,'ÀÁÂÃÄÅÇÈÉÊËÌÍÎÏÑÒÓÔÕÖØÙÚÛÜÝàáâãäåçèéêëìíîïñòóôõöøùúûüýÿ', 'AAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy');
    RETURN (OUTPUT_STR);
    end;
    --now I query for which name stats with  a P%:
    --Below query gets me unexpected result of one row as I am using Oracle Text where each word is parsed for search using CONTAINS...
    SQL> select * from text_test where contains(name,'P%')>0;
    NAME
    PAUL HERNANDEZ
    CHRISTOPHER Phil
    --Below query gets me the right and expected result of one row...
    SQL> select * from text_test where name like 'P%';
    NAME
    PAUL HERNANDEZ
    --Below query gets me the right and expected result of one row...
    SQL>  select * from text_test where remove_dia(name) like remove_dia('P%');
    NAME
    PAUL HERNANDEZMy entire need was only to be able to do a search that bypasses diacritical characters. To implement Oracle Text for that reason, I am wondering if that was the right choice! More so when I am now finding that the functionality of LIKE is not available in Oracle Text - the Oracle text search are based on tokens or words and they are different from output of the LIKE operator. So may be should I have just used a simple function like below and used that for my purpose instead of using Oracle Text:
    This function (remove_dia) just removes the diacritical characters and may be for my need this is all that is needed. Can someone help to review that given my need I am better of not using Oracle Text? I need to continue using the functionality of Like operator and also need to bypass diacritical characters so the simple function that I have meets my need whereas Oracle Text causes a change in behaviour of search queries.
    Thanks,
    OrauserN

    If all you need is LIKE functionality and you do not need any of the complex search capabilities of Oracle Text, then I would not use Oracle Text. I would create a function-based index on your name column that uses your function that removes the diacritical marks, so that your searches will be faster. Please see the demonstration below.
    SCOTT@orcl_11gR2> CREATE TABLE TEXT_TEST
      2    (NAME  VARCHAR2(255 BYTE))
      3  /
    Table created.
    SCOTT@orcl_11gR2> Insert all
      2  into TEXT_TEST (NAME) Values ('muller')
      3  into TEXT_TEST (NAME) Values ('müller')
      4  into TEXT_TEST (NAME) Values ('MULLER')
      5  into TEXT_TEST (NAME) Values ('MÜLLER')
      6  into TEXT_TEST (NAME) Values ('PAUL HERNANDEZ')
      7  into TEXT_TEST (NAME) Values ('CHRISTOPHER Phil')
      8  select * from dual
      9  /
    6 rows created.
    SCOTT@orcl_11gR2> CREATE OR REPLACE FUNCTION remove_dia
      2    (p_value   IN VARCHAR2,
      3       p_doUpper IN VARCHAR2 := 'Y')
      4    RETURN VARCHAR2 DETERMINISTIC
      5  IS
      6    OUTPUT_STR VARCHAR2(4000);
      7  begin
      8    IF (p_doUpper = 'Y') THEN
      9        OUTPUT_STR := UPPER(p_value);
    10    ELSE
    11        OUTPUT_STR := p_value;
    12    END IF;
    13    RETURN
    14        TRANSLATE
    15          (OUTPUT_STR,
    16           'ÀÁÂÃÄÅÇÈÉÊËÌÍÎÏÑÒÓÔÕÖØÙÚÛÜÝàáâãäåçèéêëìíîïñòóôõöøùúûüýÿ',
    17           'AAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy');
    18  end;
    19  /
    Function created.
    SCOTT@orcl_11gR2> show errors
    No errors.
    SCOTT@orcl_11gR2> CREATE INDEX text_test_remove_dia_name
      2  ON text_test (remove_dia (name))
      3  /
    Index created.
    SCOTT@orcl_11gR2> set autotrace on explain
    SCOTT@orcl_11gR2> select * from text_test
      2  where  remove_dia (name) like remove_dia ('mü%')
      3  /
    NAME
    muller
    müller
    MULLER
    MÜLLER
    4 rows selected.
    Execution Plan
    Plan hash value: 3139591283
    | Id  | Operation                   | Name                      | Rows  | Bytes | Cost (%CPU)| Time     |
    |   0 | SELECT STATEMENT            |                           |     1 |  2131 |     2   (0)| 00:00:01 |
    |   1 |  TABLE ACCESS BY INDEX ROWID| TEXT_TEST                 |     1 |  2131 |     2   (0)| 00:00:01 |
    |*  2 |   INDEX RANGE SCAN          | TEXT_TEST_REMOVE_DIA_NAME |     1 |       |     1   (0)| 00:00:01 |
    Predicate Information (identified by operation id):
       2 - access("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('mü%'))
           filter("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('mü%'))
    Note
       - dynamic sampling used for this statement (level=2)
    SCOTT@orcl_11gR2> select * from text_test
      2  where  remove_dia (name) like remove_dia ('P%')
      3  /
    NAME
    PAUL HERNANDEZ
    1 row selected.
    Execution Plan
    Plan hash value: 3139591283
    | Id  | Operation                   | Name                      | Rows  | Bytes | Cost (%CPU)| Time     |
    |   0 | SELECT STATEMENT            |                           |     1 |  2131 |     2   (0)| 00:00:01 |
    |   1 |  TABLE ACCESS BY INDEX ROWID| TEXT_TEST                 |     1 |  2131 |     2   (0)| 00:00:01 |
    |*  2 |   INDEX RANGE SCAN          | TEXT_TEST_REMOVE_DIA_NAME |     1 |       |     1   (0)| 00:00:01 |
    Predicate Information (identified by operation id):
       2 - access("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('P%'))
           filter("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('P%'))
    Note
       - dynamic sampling used for this statement (level=2)
    SCOTT@orcl_11gR2>

  • Process for Oracle Text

    am working as a dba and we plan to introduce oracle text for text search. since I am new to this concept, I would like to know step by step implementation of Oracle text. I've searched some web sites but still not clear on the implementation part.
    Please help me out

    Hi,
    Oracle Text is included in both standard and enterprise editions of the data server. When you are creating your database, select Oracle Text during configuration (one of the options). You will then have Oracle Text available on your database. The schema name is CTXSYS. You need to unlock this account just like any other.
    To use Text, either grant permissions on the specific objects you need for the user, or use the CTXAPP role. It is up to you to know the permissions required for the objects (in other words, I can't tell you your requirements), so research this in the reference manual.
    At this point, it is ready to use. Just create your indexes according to the Oracle Text Developer's Guide (you can find this with all of the documentation - look at the Application Developer's tab in Doc Library). Your search syntax depends totally on your requirements, and the type of index you choose to create. For example, the CONTEXT index uses the CONTAINS operator, and the CTXCAT index uses CATSEARCH (unless of course you want to use templates, but let's not go there just yet...).
    There are two references you will want to review: The Oracle Text Developer's Guide and the Oracle Text Reference.
    Hope this helps,
    Ron

  • Display problem when reading Traditional Chinese in Crystal Report 2008

    Hi All,
    I have tried to connect MS SQL 6.5 to build up a report. However, I encounter a problem when read Traditional Chinese. all chinese characters turn into adnormal characters. I wonder it is problem of SQL Server or some setting that I may not know. In addition, I can able to read chinese in the application program , ISQL / Query Analyzer. Please help to give idea and suggestion to fix this. Thanks
    SQL Server: MS SQL Server  6.5
    Client OS: Window XP ( Traditional Chinese Version)
    Server OS: Window NT and Window Server 2000
    Stsyou

    Hi
    If you are using a Chinese language build database, Crystal Reports English build might not display the characters properly as non english databases include one byte characters as ASCII code character and double-byte characters as their own language code character.
    Traditional Chinese, Simplified Chinese, Japanese (Kanji, Hiranga, and Katakana), Korean and Vietnamese use double-byte characters.
    However, a possible workaround to this issue is to verify that the encoding on the database client is configured according to the examples below. Also, you will need to install the language pack in the English environment.
    CONFIGURATION EXAMPLES:
    Shift-JIS for a Japanese database
    Big5 for a Traditional Chinese database
    GB2312 for a Simplified Chinese database
    iso-2022-kr for a Korean database
    Windows 1258 for a Vietnamese database
    EXAMPLE SCENARIO:
    An Oracle database saves non-English language data such as Japanese.
    The Windows operating system is English.
    STEP TO CONFIGURE THE CHARACTER CODE IN THE ENVIRONMENT SETTING
    (This step is based on the details of the EXAMPLE SCENARIO.)
    1. Configure the system environment on the database client side to 'Shift-JIS' as the character code set in the Environment Setting.
    This results in the NLS_LANG variable set to Japanese_Japan.JA16SJIS under the registry HKEY_Local_Machine\Software\Oracle\Homeo.
    STEPS TO INSTALL LANGUAGE PACK IN AN ENGLISH ENVIRONMENT
    (These steps are based on the details of the EXAMPLE SCENARIO.)
    1. Add languages to the computer system by clicking:
    Control Panel > Regional Options > General
    2. Select the check box for 'Traditional Chinese', 'Simplified Chinese', 'Japanese', 'Korean' or 'Vietnamese'.
    3. Click 'Apply'.
    4. When you are prompted, insert the Windows CD-ROM to install the language pack.
    5. Restart the computer.
    Upon completing these steps, you are able to display the languages characters in the Crystal Reports Designer in a Windows English environment.
    ==========
    NOTE:
    If the database table and field names use non-English language characters, Data Explorer in CR will not correctly display these names. However, when you preview the report, the non-English data displays correctly.
    ==========
    Configuring the database client according to the examples and installing the language pack will display the characters successfully. However, there are cases when this workaround does not resolve the issue.
    For further information about CR and double-byte languages, refer to knowledge base articles, c2008083 and c2008349.
    Hope this helps!!!
    Regards
    Sourashree

  • Oracle Text in TimesTen?

    If i have a ,lot of documents I am using Oracle Text for Indexing, are there any options to using TimesTen?

    TimesTen doesn't support Oracle Text directly.
    If caching is used then Text Searches are possible via a timesten client and executed on the Oracle  RDBMS database.

Maybe you are looking for