Oracle Text for Traditional Chinese
I would like to ask how I can specify keywords in Chinese. Whenever I type something in Chinese, the tokens that Oracle Text indexes (in the DR$xxx$I table) look very odd, and the stop words are definitely not always sensible for Traditional Chinese. Can anyone help?
You can specify a stoplist in Chinese. Please refer to the documentation for more information about the CTX_DDL.CREATE_STOPLIST procedure and for the Chinese lexer.
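A minimal sketch of what that could look like, assuming a database character set that the Chinese lexers support (for example AL32UTF8 or ZHT16BIG5). The preference, stoplist, table, column, and index names (zh_lexer, zh_stoplist, docs, doc_text, docs_zh_idx) and the two stop words are hypothetical; substitute your own:

```sql
-- Hypothetical names throughout: zh_lexer, zh_stoplist, docs, doc_text, docs_zh_idx.
BEGIN
  -- CHINESE_LEXER segments text into words; CHINESE_VGRAM_LEXER is the
  -- alternative lexer type, which indexes overlapping character n-grams.
  ctx_ddl.create_preference('zh_lexer', 'CHINESE_LEXER');

  -- A BASIC_STOPLIST starts empty, so only the words added here are ignored.
  ctx_ddl.create_stoplist('zh_stoplist', 'BASIC_STOPLIST');
  ctx_ddl.add_stopword('zh_stoplist', '的');
  ctx_ddl.add_stopword('zh_stoplist', '了');
END;
/

CREATE INDEX docs_zh_idx ON docs (doc_text)
  INDEXTYPE IS CTXSYS.CONTEXT
  PARAMETERS ('LEXER zh_lexer STOPLIST zh_stoplist');

-- Inspect the tokens that were actually indexed.
SELECT token_text FROM dr$docs_zh_idx$i ORDER BY token_text;
```

Comparing the token list produced by each lexer type on a small sample is probably the quickest way to see which one segments your Traditional Chinese text sensibly.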
Similar Messages
-
Configuring Oracle Text for 10g
Hello,
Sorry if this sounds like such a newbie question, but is there a simple guide to the basic configuration and setup of Oracle Text for 10g?
Thank you for your help.
Hi,
Check out note 280713.1. You can verify your install and install manually if necessary.
If you haven't done anything with the db (created schemas/objects) it might actually be easiest to simply delete the db and create a new one. Just make sure when you do that you select 'custom' from the list of templates and then choose Oracle Text as one of the options.
Thanks,
Ron -
Hi,
Can I use Oracle Text for searching in a VARCHAR2 field?
If yes, please give me the details.
Thanks.
SELECT OD OID, TAB Layer, COLUM Field, TEX Result,
score(22) Score FROM VIEW_MASTER
WHERE CONTAINS ( TEXT_VALUE, SEARCH_TERMS, 22 ) > 0
ORDER BY Score;
The search_terms value is an inbound parameter. I am not sure what the 22 does; I think it is just an alias. I don't know what the score coming back means. Sometimes I get 16, sometimes 12, sometimes 7.
I could use some help on this myself.
Yes, 22 is just an alias. You can use any number here, since it is only a label that correlates the CONTAINS function with its corresponding SCORE function.
The details of how the score is computed are in the Oracle Text Reference, Appendix F, "The Oracle Text Scoring Algorithm".
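To illustrate how the label pairs each SCORE with its CONTAINS, here is a small sketch. The table docs and its columns id, title, and body are hypothetical, and each text column is assumed to have its own CONTEXT index:

```sql
-- Hypothetical table docs(id, title, body) with CONTEXT indexes on title and body.
SELECT id,
       SCORE(1) AS title_score,   -- pairs with the CONTAINS labelled 1
       SCORE(2) AS body_score     -- pairs with the CONTAINS labelled 2
FROM   docs
WHERE  CONTAINS(title, 'oracle', 1) > 0
   OR  CONTAINS(body,  'oracle', 2) > 0
ORDER  BY GREATEST(SCORE(1), SCORE(2)) DESC;
```

For a CONTEXT index the score is a relevance value between 0 and 100, so values such as 7, 12, or 16 only rank rows against each other within the same query.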
Faisal -
Oracle text for italian language document
How can I set up an Oracle Text index on an Italian text field?
How can I set the right stoplist, lexer, and so on?
Thanks
I believe that if your NLS_LANG settings are set appropriately for Italian, a Text index should automatically use the proper defaults for Italian.
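If the defaults turn out not to be what you want, the preferences can also be set explicitly. This is only a sketch under assumptions: the names it_lexer, it_wordlist, it_stoplist, docs, body, and docs_it_idx are hypothetical, and the three stop words are merely examples of Italian articles:

```sql
-- Hypothetical names: it_lexer, it_wordlist, it_stoplist, docs, body, docs_it_idx.
BEGIN
  -- BASIC_LEXER handles whitespace-delimited languages such as Italian.
  ctx_ddl.create_preference('it_lexer', 'BASIC_LEXER');
  ctx_ddl.set_attribute('it_lexer', 'base_letter', 'YES');    -- match è/é/e alike

  -- Use the Italian stemmer for stem ($) queries.
  ctx_ddl.create_preference('it_wordlist', 'BASIC_WORDLIST');
  ctx_ddl.set_attribute('it_wordlist', 'stemmer', 'ITALIAN');

  -- A custom stoplist with a few example Italian stop words.
  ctx_ddl.create_stoplist('it_stoplist', 'BASIC_STOPLIST');
  ctx_ddl.add_stopword('it_stoplist', 'il');
  ctx_ddl.add_stopword('it_stoplist', 'la');
  ctx_ddl.add_stopword('it_stoplist', 'di');
END;
/

CREATE INDEX docs_it_idx ON docs (body)
  INDEXTYPE IS CTXSYS.CONTEXT
  PARAMETERS ('LEXER it_lexer WORDLIST it_wordlist STOPLIST it_stoplist');
```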
-
Substring formula problem for Traditional Chinese
Dear all,
I created two InfoObjects (type: character, length: 60) in order to load one field with 100 characters from R/3. I created a formula in the transfer rule using the "Substring" function for each of the two InfoObjects (e.g. Substring(field_name, 0, 60) and Substring(field_name, 60, 40)). The data is in Traditional Chinese, and both R/3 and BW are Unicode versions. The first InfoObject always seems to get fewer than 60 characters, but the second InfoObject gets 40 characters correctly. When I test with English data, both InfoObjects get the correct length of data. Has anyone come across this problem before? Thanks a lot.
-
Using Oracle Text for searching with UCM 10g
I am using Oracle Text with UCM 10gR3 and Site Studio 10gR4, and I am trying to sort the search results by relevancy and to include a snippet of the retrieved document. I have the fields that the SS_GET_SEARCH_RESULTS service returns, but the relevancy score always equals 5, and the snippet contains characters such as < idcnull, /p, etc., which are clearly XML/HTML/UCM tags but which result in even more strangeness in the snippet if I try to remove them programmatically.
I have read the Oracle Text documentation and there appear to be ways you can configure Oracle Text but I am not clear at all on what I can do from UCM. It looks like the configuration is either done in database tables or in the query itself, neither of which are readily configurable to me.
Is anyone experienced in this, or does anyone know of any documentation that might help?
Bill
Hi,
If I remember correctly, this issue was seen with an older version of the OTS component and the Core Update patch/bundle. Upgrade the UCM instance with the latest CS10gr35 update bundle, patchset 6907073, and also upgrade the OTS component from the same patchset.
Let me know how it goes after this.
Thanks
Srinath -
About: Language Pack for Traditional Chinese
Dear all:
When will the Traditional Chinese language pack be supported?
Best regards
Sam Wu
No one here knows.
Apple may release it next week or next year or never. If you want to push them to speed up, you can leave feedback at http://www.apple.com/feedback/pages.html . -
No keyboard for traditional Chinese?
How do I type traditional Chinese in iPad? I would hate to see that there's no traditional Chinese input method in iPad.
You can handwrite the Chinese characters directly into the keypad. You can choose traditional Chinese or simplified Chinese too. Click the "globe" icon in the keyboard to switch to different character sets.
You have to set up the international character set first in the "Settings" app. Then click "General" > "International" > "keyboards" >"Add new keyboards" > "Traditional Chinese" or "Simplified Chinese"
Once you set this up, you can use the Chinese keyboard by clicking the "globe" icon in the input keyboard. You can cycle through the keyboard character set back in to English by clicking the "globe" again.
The handwriting input method is cool. You don't have to use other translation methods. It does a fairly good job to do the character recognition, but if you use traditional Chinese, it is not that smart to figure out the complex characters. -
Using Oracle Text for Text Mining
Hi all !
I have 30000 text documents stored in a table, and I want to construct a "dictionary" that contains all the terms included in those documents. I will use this dictionary to classify my documents with Oracle's text mining solutions.
I created an index with a multi datastore and a specific lexer, stoplist, and wordlist, and it builds my dictionary well. The problem is that I would like to add some other tokens that contain more than one word. I wonder how to use the index that was created: I tried to work on the $I table created for the index, but it was not really successful. So I thought about two solutions:
- work on the index with insert, delete and synchronize operations
- find a way to insert the additional tokens (which correspond to patterns) during the indexing operation
Can you help me?
Laetitia
Hi,
Check out a thesaurus (enter your phrases here) and see if it will do what you need:
http://download-west.oracle.com/docs/cd/B19306_01/text.102/b14217/cthes.htm#i1006324
Don't modify the $I table as a general rule (or any system-generated table).
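As a rough sketch of the thesaurus approach: the thesaurus name phrase_thes, the example terms, and the table docs with an indexed body column are all hypothetical. Note that a thesaurus expands queries at search time; it does not add multi-word tokens to the index itself:

```sql
-- Hypothetical names: phrase_thes, docs, body.
BEGIN
  ctx_thes.create_thesaurus('phrase_thes', FALSE);   -- FALSE = case-insensitive
  -- Declare a multi-word phrase as a synonym of a single term.
  ctx_thes.create_relation('phrase_thes', 'OLAP', 'SYN', 'online analytical processing');
END;
/

-- SYN expands the term to every synonym recorded in the named thesaurus,
-- so documents containing the phrase are found as well.
SELECT id
FROM   docs
WHERE  CONTAINS(body, 'SYN(OLAP, phrase_thes)') > 0;
```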
Thanks,
Ron -
Download problem to CS6 (license) for traditional chinese
I have bought a license for CS6 on Mac and PC. However, after receiving the license and serial number and logging in to the license page, the download link for CS6 has disappeared (only a CS5 download link appears).
Please help!
Download CS6 products
Mylenium -
Localization problem: Traditional Chinese is not working for Taiwan App Store
I have published an app that has three languages:
(1) English (as Primary)
(2) Simplified Chinese
(3) Traditional Chinese
I expect I can see Simplified Chinese in China App Store and Traditional Chinese in Taiwan App Store.
I found out it works well for Simplified Chinese case.
However, for Traditional Chinese, it's always English in Taiwan App Store. I have tried so many devices (with language setting to Traditional Chinese). I even went to a local Internet Cafe (so all computers are with Traditional Chinese setting), it's still English for Taiwan App Store.
From iTunes Connect, when I go to "Version Summary" where I can change the "Territory" there. I always see English for Taiwan Territory while it's always Simplified Chinese when China is selected.
From all the experiments I've done, I really think there is something wrong in App Store or iTunes to link Traditional Chinese with Taiwan App Store.
How can I report this kind of bug to Apple and have them examine this issue?
Unfortunately, you have already done everything we can possibly instruct you to do. At this point the only option is to have it exchanged: if it is under warranty and not working, you have to get an exchange.
-
Is Oracle Text the right solution for this need of a specific search!
Hi ,
We are on Oracle 11.2.0.2 on Solaris 10. We need to be able to search data that contains diacritical marks, and the search should ignore those marks. That is the requirement. I heard that Oracle Text has a lexer type called BASIC_LEXER that can ignore diacritical marks, so I implemented Oracle Text solely for this diacritic-insensitive search and for no other need.
I set up the preference like this:
ctxsys.ctx_ddl.create_preference ('cust_lexer', 'BASIC_LEXER');
ctxsys.ctx_ddl.set_attribute ('cust_lexer', 'base_letter', 'YES'); -- removes diacritics
With this, I set up the table and index like this:
CREATE TABLE TEXT_TEST
(
NAME VARCHAR2(255 BYTE)
);
--created Oracle Text index (cust_wl is a wordlist preference whose definition is not shown)
CREATE INDEX TEXT_TEST_IDX1 ON TEXT_TEST
(NAME)
INDEXTYPE IS CTXSYS.CONTEXT
PARAMETERS('LEXER cust_lexer WORDLIST cust_wl SYNC (ON COMMIT)');
--sample data to illustrate the problem
Insert into TEXT_TEST
(NAME)
Values
('muller');
Insert into TEXT_TEST
(NAME)
Values
('müller');
Insert into TEXT_TEST
(NAME)
Values
('MULLER');
Insert into TEXT_TEST
(NAME)
Values
('MÜLLER');
Insert into TEXT_TEST
(NAME)
Values
('PAUL HERNANDEZ');
Insert into TEXT_TEST
(NAME)
Values
('CHRISTOPHER Phil');
COMMIT;
--There is an alternative to Oracle Text: the plain function below, which seems to work neatly for my simple need of ignoring diacritical characters in searches.
--I need to evaluate which is better for my specific needs: the function below or Oracle Text.
CREATE OR REPLACE FUNCTION remove_dia(p_value IN VARCHAR2, p_doUpper IN VARCHAR2 := 'Y')
RETURN VARCHAR2 DETERMINISTIC
IS
OUTPUT_STR VARCHAR2(4000);
begin
IF (p_doUpper = 'Y') THEN
OUTPUT_STR := UPPER(p_value);
ELSE
OUTPUT_STR := p_value;
END IF;
OUTPUT_STR := TRANSLATE(OUTPUT_STR,'ÀÁÂÃÄÅÇÈÉÊËÌÍÎÏÑÒÓÔÕÖØÙÚÛÜÝàáâãäåçèéêëìíîïñòóôõöøùúûüýÿ', 'AAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy');
RETURN (OUTPUT_STR);
end;
--now I query for names that start with P:
--The query below returns an unexpected result of two rows, because Oracle Text parses each word (token) separately for CONTAINS searches...
SQL> select * from text_test where contains(name,'P%')>0;
NAME
PAUL HERNANDEZ
CHRISTOPHER Phil
--Below query gets me the right and expected result of one row...
SQL> select * from text_test where name like 'P%';
NAME
PAUL HERNANDEZ
--Below query gets me the right and expected result of one row...
SQL> select * from text_test where remove_dia(name) like remove_dia('P%');
NAME
PAUL HERNANDEZ
My entire need was only to be able to do a search that ignores diacritical characters. I am wondering whether implementing Oracle Text for that reason was the right choice, especially now that I find the functionality of LIKE is not available in Oracle Text: Oracle Text searches are based on tokens (words), so their results differ from those of the LIKE operator. So maybe I should have just used a simple function like the one above instead of Oracle Text.
This function (remove_dia) just removes the diacritical characters, and maybe that is all I need. Can someone help confirm that, given my need, I am better off not using Oracle Text? I need to keep using the LIKE operator and also ignore diacritical characters, so the simple function meets my need, whereas Oracle Text changes the behaviour of search queries.
Thanks,
OrauserN
If all you need is LIKE functionality and you do not need any of the complex search capabilities of Oracle Text, then I would not use Oracle Text. I would create a function-based index on your name column that uses your function that removes the diacritical marks, so that your searches will be faster. Please see the demonstration below.
SCOTT@orcl_11gR2> CREATE TABLE TEXT_TEST
2 (NAME VARCHAR2(255 BYTE))
3 /
Table created.
SCOTT@orcl_11gR2> Insert all
2 into TEXT_TEST (NAME) Values ('muller')
3 into TEXT_TEST (NAME) Values ('müller')
4 into TEXT_TEST (NAME) Values ('MULLER')
5 into TEXT_TEST (NAME) Values ('MÜLLER')
6 into TEXT_TEST (NAME) Values ('PAUL HERNANDEZ')
7 into TEXT_TEST (NAME) Values ('CHRISTOPHER Phil')
8 select * from dual
9 /
6 rows created.
SCOTT@orcl_11gR2> CREATE OR REPLACE FUNCTION remove_dia
2 (p_value IN VARCHAR2,
3 p_doUpper IN VARCHAR2 := 'Y')
4 RETURN VARCHAR2 DETERMINISTIC
5 IS
6 OUTPUT_STR VARCHAR2(4000);
7 begin
8 IF (p_doUpper = 'Y') THEN
9 OUTPUT_STR := UPPER(p_value);
10 ELSE
11 OUTPUT_STR := p_value;
12 END IF;
13 RETURN
14 TRANSLATE
15 (OUTPUT_STR,
16 'ÀÁÂÃÄÅÇÈÉÊËÌÍÎÏÑÒÓÔÕÖØÙÚÛÜÝàáâãäåçèéêëìíîïñòóôõöøùúûüýÿ',
17 'AAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy');
18 end;
19 /
Function created.
SCOTT@orcl_11gR2> show errors
No errors.
SCOTT@orcl_11gR2> CREATE INDEX text_test_remove_dia_name
2 ON text_test (remove_dia (name))
3 /
Index created.
SCOTT@orcl_11gR2> set autotrace on explain
SCOTT@orcl_11gR2> select * from text_test
2 where remove_dia (name) like remove_dia ('mü%')
3 /
NAME
muller
müller
MULLER
MÜLLER
4 rows selected.
Execution Plan
Plan hash value: 3139591283
| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time |
| 0 | SELECT STATEMENT | | 1 | 2131 | 2 (0)| 00:00:01 |
| 1 | TABLE ACCESS BY INDEX ROWID| TEXT_TEST | 1 | 2131 | 2 (0)| 00:00:01 |
|* 2 | INDEX RANGE SCAN | TEXT_TEST_REMOVE_DIA_NAME | 1 | | 1 (0)| 00:00:01 |
Predicate Information (identified by operation id):
2 - access("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('mü%'))
filter("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('mü%'))
Note
- dynamic sampling used for this statement (level=2)
SCOTT@orcl_11gR2> select * from text_test
2 where remove_dia (name) like remove_dia ('P%')
3 /
NAME
PAUL HERNANDEZ
1 row selected.
Execution Plan
Plan hash value: 3139591283
| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time |
| 0 | SELECT STATEMENT | | 1 | 2131 | 2 (0)| 00:00:01 |
| 1 | TABLE ACCESS BY INDEX ROWID| TEXT_TEST | 1 | 2131 | 2 (0)| 00:00:01 |
|* 2 | INDEX RANGE SCAN | TEXT_TEST_REMOVE_DIA_NAME | 1 | | 1 (0)| 00:00:01 |
Predicate Information (identified by operation id):
2 - access("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('P%'))
filter("SCOTT"."REMOVE_DIA"("NAME") LIKE "REMOVE_DIA"('P%'))
Note
- dynamic sampling used for this statement (level=2)
SCOTT@orcl_11gR2> -
I am working as a DBA, and we plan to introduce Oracle Text for text search. Since I am new to this concept, I would like to know the step-by-step implementation of Oracle Text. I have searched some web sites but am still not clear on the implementation part.
Please help me out.
Hi,
Oracle Text is included in both standard and enterprise editions of the data server. When you are creating your database, select Oracle Text during configuration (one of the options). You will then have Oracle Text available on your database. The schema name is CTXSYS. You need to unlock this account just like any other.
To use Text, either grant permissions on the specific objects you need for the user, or use the CTXAPP role. It is up to you to know the permissions required for the objects (in other words, I can't tell you your requirements), so research this in the reference manual.
At this point, it is ready to use. Just create your indexes according to the Oracle Text Developer's Guide (you can find this with all of the documentation - look at the Application Developer's tab in Doc Library). Your search syntax depends totally on your requirements, and the type of index you choose to create. For example, the CONTEXT index uses the CONTAINS operator, and the CTXCAT index uses CATSEARCH (unless of course you want to use templates, but let's not go there just yet...).
There are two references you will want to review: The Oracle Text Developer's Guide and the Oracle Text Reference.
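The steps above could be sketched roughly as follows. This is an illustration under assumptions, not a complete procedure: the schema app_user, the table docs, its columns, and both index names are hypothetical, and the privileges your own application needs may differ:

```sql
-- As a DBA (app_user is a hypothetical schema name):
ALTER USER ctxsys ACCOUNT UNLOCK;              -- only if the account is locked, as described above
GRANT CTXAPP TO app_user;                      -- role intended for Oracle Text developers
GRANT EXECUTE ON ctxsys.ctx_ddl TO app_user;   -- needed only for creating custom preferences

-- As app_user (hypothetical table and data):
CREATE TABLE docs (
  id    NUMBER PRIMARY KEY,
  title VARCHAR2(200),
  body  VARCHAR2(4000)
);
INSERT INTO docs VALUES (1, 'Oracle Text basics', 'Oracle Text indexes documents for full-text search.');
INSERT INTO docs VALUES (2, 'Backup notes', 'Nightly backups run at two in the morning.');
COMMIT;

-- A CONTEXT index is queried with CONTAINS.
CREATE INDEX docs_body_idx ON docs (body) INDEXTYPE IS CTXSYS.CONTEXT;

SELECT id, SCORE(1) AS relevance
FROM   docs
WHERE  CONTAINS(body, 'documents AND search', 1) > 0
ORDER  BY relevance DESC;

-- A CTXCAT index is queried with CATSEARCH (third argument: structured clause, or NULL).
CREATE INDEX docs_title_cat ON docs (title) INDEXTYPE IS CTXSYS.CTXCAT;

SELECT id
FROM   docs
WHERE  CATSEARCH(title, 'basics', NULL) > 0;
```

Keep in mind that a CONTEXT index does not pick up rows inserted after it was created until it is synchronized, for example with CTX_DDL.SYNC_INDEX or by creating the index with SYNC (ON COMMIT) in its PARAMETERS clause.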
Hope this helps,
Ron -
Display problem when reading Traditional Chinese in Crystal Report 2008
Hi All,
I have tried to connect to MS SQL Server 6.5 to build a report. However, I encounter a problem when reading Traditional Chinese: all Chinese characters turn into abnormal characters. I wonder whether it is a problem with SQL Server or some setting that I may not know about. In addition, I am able to read Chinese in the application program and in ISQL / Query Analyzer. Please help with ideas and suggestions to fix this. Thanks
SQL Server: MS SQL Server 6.5
Client OS: Window XP ( Traditional Chinese Version)
Server OS: Window NT and Window Server 2000
Stsyou
Hi,
If you are using a Chinese-language build of the database, the English build of Crystal Reports might not display the characters properly, because non-English databases store single-byte characters as ASCII code characters and double-byte characters in their own language's character code.
Traditional Chinese, Simplified Chinese, Japanese (Kanji, Hiragana, and Katakana), Korean and Vietnamese use double-byte characters.
However, a possible workaround to this issue is to verify that the encoding on the database client is configured according to the examples below. Also, you will need to install the language pack in the English environment.
CONFIGURATION EXAMPLES:
Shift-JIS for a Japanese database
Big5 for a Traditional Chinese database
GB2312 for a Simplified Chinese database
iso-2022-kr for a Korean database
Windows 1258 for a Vietnamese database
EXAMPLE SCENARIO:
An Oracle database saves non-English language data such as Japanese.
The Windows operating system is English.
STEP TO CONFIGURE THE CHARACTER CODE IN THE ENVIRONMENT SETTING
(This step is based on the details of the EXAMPLE SCENARIO.)
1. Configure the system environment on the database client side to 'Shift-JIS' as the character code set in the Environment Setting.
This results in the NLS_LANG variable being set to Japanese_Japan.JA16SJIS under the registry key HKEY_LOCAL_MACHINE\SOFTWARE\ORACLE\HOME0.
STEPS TO INSTALL LANGUAGE PACK IN AN ENGLISH ENVIRONMENT
(These steps are based on the details of the EXAMPLE SCENARIO.)
1. Add languages to the computer system by clicking:
Control Panel > Regional Options > General
2. Select the check box for 'Traditional Chinese', 'Simplified Chinese', 'Japanese', 'Korean' or 'Vietnamese'.
3. Click 'Apply'.
4. When you are prompted, insert the Windows CD-ROM to install the language pack.
5. Restart the computer.
Upon completing these steps, you are able to display the languages characters in the Crystal Reports Designer in a Windows English environment.
==========
NOTE:
If the database table and field names use non-English language characters, Data Explorer in CR will not correctly display these names. However, when you preview the report, the non-English data displays correctly.
==========
Configuring the database client according to the examples and installing the language pack will display the characters successfully. However, there are cases when this workaround does not resolve the issue.
For further information about CR and double-byte languages, refer to knowledge base articles, c2008083 and c2008349.
Hope this helps!!!
Regards
Sourashree -
Oracle Text in TimesTen?
I have a lot of documents and I am using Oracle Text for indexing. Are there any options for using TimesTen?
TimesTen doesn't support Oracle Text directly.
If caching is used, text searches are possible through a TimesTen client; they are executed on the Oracle RDBMS database.