Full text search for web ? Yes or no ?

Hi,
I have a DB that has more then 1.8mil records in a single table .... and would like to implement full text search or some sort of caching for quicker Web search ....
Let me describe you what I have .... The table that holds 1.8mil records is made out of 30 clob columns ... each holding text .... actually these are alphabetic columns ... so words that start with char 'A' are in the first clob ... 'B' in second 'C' in third and so forth ....
Searching is always done first by customerID and CreateDate which are both indexed columns , and then clobs are searched using instr ...
Execution plan was good ... but searching times started to increase ....
So therefor I would like to improve the search ... by implementing some sort of caching mechanism ....
I read a lot about this and found an example where I would create a table containing unique words and table for occurrences of the words ... but this would then
make like 1.8mil articles containing approc 500 words , which would then repeat through articles ... so ok there would be less then 50.000 unique words (in our language ) , but the occurrences would dramatically increase cause every word inside article has to have a link in occurrences table ... so this would then be like 900mil records inside table ..
Is this at all possible to have so many records inside single table ? And still make it quick ?
Is the Oracle Full text search the only right way in this situation ?
Any suggestions ? Did anyone implement anything like this ?
Thanks,
Kris

Let's start with your Oracle version. Please specify which version you run because Text capabilities vary dramatically between releases.
>
I tried using Oracle Text as suggested ... now if I understand correctly ....
CTXCAT - would be great because when new records are added, index is updated automatically .... but doesn't support CLOBs ... so no go
>
CTXCAT is a concatenated transactional index that is supposed to optimize combined searches on text and other columns. No go for you as it indeed does not support CLOB columns.
>
CONTEXT - supports CLObs , but I need to explicitly synchronize index ....
There are like 4000 inserts per day ..... and they all need to be indexed in a real-time ...
>
Not true, at least since 10g: SYNC(ON COMMIT) parameter makes this index type transactional (it's synchronized automatically on commit with this parameter set.)
>
If CTX_DLL.SYNC_INDEX procedure synchronize the whole table which is now 1.8mil records, this can take a while ... so it can't be run after inserts ....
>
It does not, it only synchronizes changed data since last sync operation.
So CONTEXT is actually perfectly suited for your needs (just redesign those 30 columns into one document column and index it.) Note that you need to regularly maintain CONTEXT indexes by scheduling CTX_DDL.OPTIMIZE_INDEX to run at off-hours and purge stale/removed data and rebuild its own internal index bitmaps for better performance. Otherwise you will see performance degrade as changes to the indexed data accumulate. You might also want to tweak initial indexing parameters, especially MEMORY parameter, as it greatly affects resulting index fragmentation - the more memory you give for initial indexing or optimization, the less fragmented and the more performant the index will be all other things equal.

Similar Messages

  • When can oracle support full text search for Simplified Chinese?

    When I create index using the create index clause,the following errors appears
    "ORA-29855: error occurred in the execution of ODCIINDEXCREATE routine
    ORA-20000: interMedia Text error:
    DRG-11440: operation not supported for the SIMPLIFIED CHINESE language"
    Maybe I have to use like to query words...
    Did somebody encounter the same problem and have a good solutions?I am expecting your help!
    null

    Hi,
    Full-text search capabilities is not currently not supported and Unfortunately, we do not have a timeline as to when it will be available.
    You can post a feedback on the below link.
    http://feedback.azure.com/forums/217321-sql-database
    Regards,
    Mekh.

  • How does full-text search for pdf files work?

    Hi there,
    Basically I can see my pdf file in the content server.. inside the pdf there's a piece of test that says: "Test's Sample" but when I do a search with that string the file gets filtered from the results.
    I think it has to do with the ' (single quote) being there because other text in the pdf works fine.. so I was wondering how does VDK store this full text? where? I'd like to see how it gets translated IF that's how it works with pdf files....
    Following advice from Re: Parse error with search query I tried doing the search by:
    Test\'s Sample
    Test`s Sample
    "Test's Sample"
    The database is db2 if that helps.. how can I fix this problem?

    Nevermind, I fixed it by changing the VDK filters (in case someone is looking for a solution too).
    Cheers,

  • Full-Text Search has not worked since we upgraded to 2012

    I have a filestream database and table. Our full-text searches have always worked until we upgraded to SQL 2012 in December.  Now, no file that has been uploaded since December is searchable.  What has gone wrong here.  It should have been
    a clean upgrade.  We are not getting any error messages.  We are just not getting any records returned when we search on a word that we know are in the documents we've uploaded since December (for instance, the word 'aluminum'). 
    Filestream is enabled for the instance.
    A full-text catalog exists and contains a full-text index (the same one we've always had).  Full-text indexing is ENABLED.
    I've tried rebuilding the catalog and the index.  I've tried to do a FULL POPULATION on the table.
    We haven't changed our queries nor the way the files are uploaded.
    Nothing works.  I have been a database administrator since the SQL 2005 days and I have never seen anything like this.
    Please help.

    Hi GINGER PIERCE,
    Since the issue regards SQL Server Search. I will help you post the question in the related forums. It is appropriate and more experts will assist you.
    According to your description, in theory , if you can do a full-text search for SQL Server 2008, when upgrading the SQL Server version from 2008 to 2012, the Full text indexing feature should be run well in SQL Server 2012 databases. If not , you can try
    to restore your database from SQL Server 2008 to SQL Server 2012, create an new Full-Text Catalog and Index on the table or view in the database, and then use Full Text Index to search words, phrases and multiple forms of word or phrase via FREETEXT() and
    CANTAINS() with “and” or “or” operators.  check if it is normal that the full-text search feature is enabled in the SQL Server 2012 instance. For more information , see:
    Full Text Search step by step in SQL Server 2012.
    Note:  In SQL Server 2012 SP1 , the server will report that Full Text Search is not supported in this edition of SQL Server when it clearly is. The workaround is to create the initial catalog by using a T-SQL query:
    CREATE FULLTEXT CATALOG  
    In addition, since it is a fileStream database, we need to verify if you do Full Text Searches on documents in FileTables, if yes, you should enable FileStream database for your SQL Server, and enable FileTable options for the database. For more information,
    see:
    Full Text Searches on Documents in FileTables.
    Regards,
    Sofiya Li
    If you have any feedback on our support, please click here.
    Sofiya Li
    TechNet Community Support

  • Full text search with XML input in Oracle

    Hi,
    I need to develope a web application which can do full text search for a library. We will use XML as the input data format and stored XML file in Oracle. How can I do full text search to the XML according todifferent tags or elements. e.g search from title or from article.
    Do not only convert XML to Ascii and do the full text search. It should utilize the XML tag and structure.
    InterMedia text in 8i or Oracle text in 9i? Or any other toolkit?
    Thanks a lot
    Jack

    Please refer to Oracle Text documentation

  • Disable Full Text Search in Records Management

    Hi Everyone,
    This is Shiva, we implemented SAP Records management & i am came in middle of this project. Right now client want to do some customizations in Records.
    Actually i need to disable Full Text Search under Records. Under Cross Query i disabled the Full text by the following process
    (Two sample element types are delivered with the service provider
    SRM_SP_QUERY:
    SRM_SPS_QUERY
    Search for records, documents, cases, record models and file plans
    SRM_SPS_QUERY_REC_DOC)
    Under(SPRO) Cross SP search based on Property Unification; you can able to disable the FullText search field(this will be effects only for Cross Query), but under Records, when you right click there will be one option called FIND in which i am unable to disable the FULL TEXT search.
    I used DMWB T-code under LOIO, i  search for Standard/free/instance attributes, but i didn't find any Full Text Search Field in that one.
    Can anyone please help me in this. I really appreciate your inputs & help.
    Thanks,
    Shiva.
    Edited by: Shivasap on Aug 2, 2010 6:01 PM
    Edited by: Shivasap on Aug 3, 2010 4:01 PM

    Dear Shiva,
    Full text search needs to be deactivated at content model level for attributes. You will find a document class in the element type of the record. In transaction DMWB, you will find LOIO & PHIO classes for the document class. For each attribute, you will find a checkbox "index-relevant" which needs to be unticked if you dont want to use TREX. For deactivating full-text search for which a field is given in the FIND popup, TREX needs to be deactivated. There is no option to deactivate just for a record element type in RM. Unfortunately, the field itself cannot be hidden. This is programmed in the logic.
    Best Regards,
    Pragya

  • Full text search on PLM Web UI

    Hello,
      Full text search functionality in DMS with TREX solution , is working with SAP GUI. ( Original is checked in CV01N or CV02N )
      Same functionality is not working with PLM Web UI .
    Is there any setting has to be activated for this.
    Thanking you,
    Regards,
      Satya

    Hi ,
    Yes.  When I checkin Document in CV01N , then I can find in CV04N.
      But if I check original in PLM Web UI , then I can not find in CV04n.
      I assume that indexing will not problem as it is working in SAP GUI case.
       Indexing functionalities ( TREX ) are not called when Document is checked in through PLM Web UI.
      Where we can check these settings ?
    Thanking you,
    Regards,
      Satya

  • How to enable Full text search feature in Azure SQL DB - Web Edition

    I have created an instance of Azure SQL DB web edition (basic), using the BizSpark a/c, for one of my global search feature, I need to enable/install full-text search , I don't find this option in Azure SQL DB web edition that I installed , would
    appreciate any help on this.

    Unfortunately full text search is not supported in Azure SQL DB.  
    Cotega - Azure SQL DB Monitoring, Notification and Scheduling service

  • "Filter/partition key" for full-text searching

    Hi there,
    We have a challenge whereby we have a table of products by store, each store having say 200,000 products.  Basically, for each store, we want to allow searching by product name.  The best solution for this is to have full-text searching, but there
    is no way to have a "filter" or "partition" key on the store ID.
    So in essence what happens, the full-text search scans the entire full-text catalog for the products, then it uses the primary key to match to the table and then filters out the other stores.  Considering we have hundreds of stores in the table, this
    is not a good solution.
    We contemplated adding separate indexed views and full-text catalogs for every store, but this would be a nightmare to manage.
    I was expecting to see some sort of a "partition by Column" in the full-text indexes, but it doesn't exist.  This basically means we have to scrap full-text and look for a third party solution.
    Does anyone have any idea how we could achieve this with just standard SQL full-text searching?

    Hi Adam,
    Thank you for your question.  I am trying to involve someone more familiar with this topic for a further look at this issue. Sometime delay might be expected from the job transferring. Your patience is greatly appreciated. 
    Thank you for your understanding and support.
    If you have any feedback on our support, please click
    here.
    Elvis Long
    TechNet Community Support

  • Full text search across books in iBooks for Mavericks

    Is there a way to do full text search of all documents under iBooks for Mavericks?
    I have quite a few books in categories and collections that are not mutually exclusive. It is becoming quite tedious to find the book I need when I need it. I have tried search in iBooks but it only seems to hit titles, maybe some keywords, and perhaps other fields, but not the full text of the documents.
    I cannot find any way in Spotlight preferences or in iBook preferences to specify that the body of documents under iBooks should be indexed/searched.
    Best regards,
    Tom

    Just to register that I have fixed the problem.
    I tried reinstalling RH, but that didn't work. I was about to
    give up when I noticed that an error was coming up when I compiled
    the project: the infamous "HHC6003: The file itircl.dll has not
    been registered correctly". I found out that you only fix that
    problem by registering another dll: itcc.dll
    Once you fix this, the compilation is perfect and the
    full-text Search works properly again.
    Thanks anyway for the interest.
    Paulo.

  • Clarifications needed for full text search

    Hi,
    I need some clarification regarding full text search.
    1) Is japanese part of the standard Oracle full text search?
    2) if it is not, how to install the japanese lexer?
    3) how oracle is sorting international characters. If a column contains both english, japanese and french, how will be the output?
    Thanks
    Muneer

    Follwoing is the sql statement and the result i got
    select language, description,lengthb(description) bytes, length(description) length, vsize(description) vsize from t2;
    LANGUAGE      DESCRIPTION      BYTES      LENGTH      VSIZE
    English      abcdefghij      10      10      10
    English      zyxwvutsrq      10      10      10
    French      désignéess      16      12      16
    French      réconcilia      13      11      13
    German      Einfuhrzöl      13      11      13
    German      müßtämpfer      19      13      19
    Greek      Î´Î·Î¼Î¿ÏƒÎ¹Î¿Î³ÏÎ±      40      20      40
    Greek      Î±Ï€Î¿ÎºÎ»ÎµÎ¹ÏƒÏ„ι      42      20      42
    Russian      Ð¿Ñ€ÐµÑÑ‚упник      42      20      42
    Russian      Ð¿Ñ€Ð¸Ð²ÐµÐ´ÐµÐ½Ð¸Ñ      41      20      41
    Japanese      å…¥é–“市の会社の銀行口      65      30      65
    Japanese      ç¥žæˆ¸å¸‚ä¸å¤®åŒºã®æ±éŠåœ’      62      30      62
    Korean      ì¶œë°œì ë¶€í„°ì¶œë°œì ë¶€í„°      64      30      64
    Korean      ë³´ì¢Œê´€ì„출발점부터경      64      30      64
    Hindi      à¤•à¤¤à¤œà¤¨à¤¤à¤®à¤¨à¤œà¤¤à¤¶à¥à¤°      73      36      73
    Hindi      à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡      130      60      130 I think it explains a lot. I am facing another problem in searching blob columns when it contains japanese or korean characters. I tried with multi lexer (adding japanese as sub lexer and making english as default lexer). But it is not searching the column. Do i have to set any other parameters (editing registry, changing enviornment setting etc). I used the following script to set the lexer.
    begin
         ctx_ddl.create_preference('english_lexer','basic_lexer');
         ctx_ddl.set_attribute('english_lexer','index_themes','yes');
         ctx_ddl.set_attribute('english_lexer','theme_language','english');
         ctx_ddl.create_preference('german_lexer','basic_lexer');
         ctx_ddl.set_attribute('german_lexer','composite','german');
         ctx_ddl.set_attribute('german_lexer','mixed_case','yes');
         ctx_ddl.set_attribute('german_lexer','alternate_spelling','german');
         ctx_ddl.create_preference('japanese_lexer','japanese_vgram_lexer');
         ctx_ddl.create_preference('korean_lexer','KOREAN_MORPH_LEXER');
         ctx_ddl.set_attribute('korean_lexer','COMPOSITE','NGRAM');
         ctx_ddl.create_preference('global_lexer', 'multi_lexer');
         ctx_ddl.add_sub_lexer('global_lexer','default','english_lexer');
         ctx_ddl.add_sub_lexer('global_lexer','german','german_lexer','ger');
         ctx_ddl.add_sub_lexer('global_lexer','japanese','japanese_lexer','jpn');
         ctx_ddl.add_sub_lexer('global_lexer','korean','Korean_lexer');
    end;Hope i presented enough details.

  • How to Enable Full text search in UCM with Oracle 10g

    Hi,
    Can you please guide me on how to enable full text search in UCM after I have installed it. My database is Oracle 10g. It would be great if you could guide me through the steps since I am very new to this product.
    Regards
    Ashish

    Hi
    Are you sure that your indexer is finishing without any errors? Check activeindex.hda under \search directory to see what is the variable set for activeindex variable? If Oracle Text is the indexer search enabled then the variable activeindex should either OTS1 or OTS2. If anything else then you have indexer problems that is not indexing documents.
    Check the CS log file to see if indexer errors are shown there. If yes, then put it over here so that we can analyse it.
    Hope it helps
    Thanks
    Srinath

  • Kazehakase with full-text search in history using Hyper Estraier

    A guide for Kazehakase with full-text history search using Hyper Estraier
    I adopted qdbm and submitted hyperestraier in AUR, so you can enable full-text search frature by installing Hyper Estraier from AUR and rebuilding Kazehakase using srcpac or yaourt.
    1. Install QDBM and Hyper Estraier from AUR. The easiest way is using yaourt. (If you prefer not to use yaourt, download tarball and do makepkg && pacman -U manually.)
    yaourt -S qdbm hyperestraier
    2. Rebuild Kazehakase using srcpac.
    srcpac -Sb kazehakase
    Of cource you can rebuild Kazehakase using yaourt.
    yaourt -Sb kazehakase
    You don't have to modify configure option in PKGBUILD of Kazehakase, because "--enable-hyper-estraier" is implied by default. If Hyper Estraier is installed successfully, you'll get "Hyper Estraier: yes" in configure messages.
    3. Configure Kazehakase. To enable full-text search in history, run Kazehakase and go Edit>Preference>General and change UI Level to "Expert" and apply settings. Next, go Edit>Preference>History and set Search engine name to "hyper-estraier" and restart Kazehakase. Then you'll see "History Search" box next to "Internet Search" box.
    Sorry for my poor English.

    "ctxsrv" is no longer supported at version 10.1.
    Instead PARAMETERS clause has SYNC option, and you can specify ON COMMIT for this.
    If you created the database with DBCA and chose Oracle Text option, then you have no need to perform any further operations to configure Oracle Text.

  • Problems using and configuring Oracle 10gR2 database full-text search

    I am having problems trying to set up full-text indexing and search with Universal Content Management (UCM). I followed the Oracle Content Server Installation Guide for windows at [http://download-west.oracle.com/docs/cd/E10316_01/cs/cs_doc_10/documentation/integrator/install_cserver_win_10en.pdf].
    What I did was:
    1. Modify E:\oracle\ucm\server\config\config.cfg by adding SearchIndexerEngineName=DATABASE.FULLTEXT to the end of the file.
    2. Restart the content server.
    3. Rebuild the search indexing using Repository Manager.
    However, I keep seeing the following error when I query by entering words in the "Full-Text Search" box.
    Unable to retrieve search results. Unable to retrieve search results. Unable to create result set for query 'SELECT IdcColl1.dID, dDocName, dDocTitle, dDocType, dRevisionID, dSecurityGroup, dDocAuthor, dDocAccount, dRevLabel, dFormat, dOriginalName, dExtension, dWebExtension, dInDate, dOutDate, dCreateDate, dPublishType, dRendition1, dRendition2, VaultFileSize, WebFileSize, URL, dFullTextFormat, dFullTextCharset, DocMeta.*
    FROM IdcColl1, DocMeta
    WHERE IdcColl1.dID=DocMeta.dID AND (((CONTAINS(dDocFullText,'test') > 0 ))) ORDER BY dInDate Desc'. ORA-20000: Oracle Text error:
    DRG-10599: column is not indexed
    Some web searches suggested the following (all of which I have tried but not resolved this problem).
    1. Publish the schema using Configuration Manager (applet) and then rebuild index
    2. Set the dDocFullText as a "zone field". This is not possible, because dDocFullText does not show up under the list of fields under "Database" or "DatabaseFullText" for the Search Engine drop down (when using Zone Fields Configuration).
    3. Reboot the server (did not work either).
    I logged onto the Oracle database and checked the IdcColl1 table. There is indeed, no index for the field, dDocFullText. There is only 1 index for the field, did. The field, dDocFullText, is a BLOB. The question is, if I am supposed to create an index manually for this field, how would I do it? A web search has not been fruitful in answering this question.
    Here are my server settings.
    For UCM:
    Operating System: Windows 2003 Enterprise
    UCM : 10gR3
    Memory: 1 GB
    Web Server: Apache 2.2.11
    For Oracle:
    Operating System: Windows 2003 Enterprise
    Oracle: 10gR2
    Memory: 1 GB
    Thanks.

    I found out what the problem was. The problem was that I had to create the role, stellent_role, as described in the installation manual. After I created this role and assigned the database user to this role, a restart of the Content Server services and collection rebuild of the index fixed the problem.
    However, I did notice one thing. I checked in 3 PDF files, and when I used Repository Manager to do a collection rebuild, I noticed that for Indexer Counters, the count for Full Text was 0 and the count for Meta Only was 3.
    Anyone have any ideas? Is there something else that I missed? From reading the installation manual, it was not clear how database full-text indexing/searching would handle PDF files.

  • E-Recruiment: Full Text Search Not Working

    Hi Guys,
    We have a problem with the integration of SAP E Recruitment and TREX Search Engine. Maybe someone can share their idees if same problems occured.
    We have of course done investigations on SAP marketplace and the sdn community. In Fact the TREX is configured and works fine for searches with drop down selections, etc> When using full text search however no responses are shown. When performing searches on the indexes in TREX itself we have results on free texts.(as example Web Dynpro application Full Text Search )
    Does anyone has a clue where to look? Of course customizing have been reviewed several times. Thanks in advance for any feedback.
    Regards
    Reginald

    I am having the same problem but now in my case the RFC_PERIODICAL_SERVICES job keep running eventually we have to kill it..
    It usually runs for less than a min but now it is running for hours..
    I have reindex a few time and turn that job off when i was reindexing but still no resolution.  Search works in trex admin tool but still not able to search threw website.
    I openned up a high priority ticket with sap.  Any suggestions would be great..

Maybe you are looking for