TREX not indexing document 10 KB

hi...
  TREX search is working fine for documents < 10 KB but if i try indexing documents > 10 KB it gives prperation failed error.Please let me know what as to be done so that it indexes document > 10 KB
Please help me on this as it is very urgent
Thanks in Advance,
Shanshank

I installed an EP7/TREX SP10 two years ago. All worked fine until I decided to apply windows SP2 and some security patch. My machine is a win2003 x64.
Now EP7 works fine, but TREX in not able to reindex documents > 10kb.
Trex preprocessor trace says:
Preprocessor.cpp(03550) : HANDLE: DISPATCH - Processing Document with key '/documents/Segreterie/Documenti PCTP/Doc. in Arrivo/Anno 2009/2009_03123.pdf' failed, returning PREPROCESSOR_ACTIVITY_ERROR (Code 6401)
In the portal security log file I found the corresponding error:
System/Security/Authentication#sap.com/irj#com.sap.engine.services.security.authentication.logincontext#Guest#2####764602d0407311dea83600188b77747b#SAPEngine_Application_Threadimpl:3_17##0#0#Info#1#com.sap.engine.services.security.authentication.logincontext#Plain###LOGIN.FAILED
User: N/A
Authentication Stack: ticket
It seems to me that the portal is not able to pass user name to Trex (User: N/A) and then It is not authorized to retrive the documents.
I tried to change the user of indexmanager service "index_service"; to set the alternative host in url generator service. Nothing changed.
Any suggestion?
Giorgio Peressin

Similar Messages

  • TREX does not index documents

    Now this one is a bit strange. We have created an index on a file system repository that contains a number of different types of files including plain text, MS-Word, Excel and Acrobat PDFs. The queue now has a red status showing a sizeable number of documents failed the preparation, but a few are ready to be transmitted.
    We found that none of the MS-Word, Excel and Acrobat documents passed the preparation stage. Anything that passes preparation is either a directory or a plain text file, though this is not consistent either since a number of files that did not pass preparation where als plain text files.
    What gives? Am I missing something here? Any hints would be very much appreciated.

    Hello Wolfgang,
    we faced exactly the same problem. To be more precisely, documents with more than 10kb were not indexed. The reason for this is, that doc´s smaller than 10kb are automatically transmitted to TREX by the crawler. For documents bigger than 10kb the crawlers only transmit the URI and then the TREX fetches them by itself.
    The URL´s that are sent to TREX are normally built by the URL generator using the host entry. If you have a clustered landscape this might not work, because the Firewall, Load-Balancers, WebServers, or external authentication Systems might cause problems when they are directly called by TREX via the normal portal URL.
    In this case you must specify the name of a specific server (e.g. http://server.com:50000) as Alternative Host in the URL Generator Service. But be careful with that setting. A wrong setting might crash your system (happened on our productive layer unfortunately).
    If you use SSL for the communication between TREX and portal it even gets more complicated. Fortunately I did not have to deal with that.
    Thus, if you have clustered system, be sure that the communication TREX to Portal can take place without problems. Take a look at the trace files of the TREX preprocessor, this should give you a hint on this.
    Hope this helps.
    Kind regards,
    Christopher Höfele

  • TREX not indexing Excel files

    Hi all,
    We are getting the following error when trying to index some MS Excel files:
    14011 - Filter: The file is password protected or encrypted
    We are using TREX 7.10 rev 34.
    We have found that the cause of this is when the Excel file has been set to protect the worksheet with a password (in Excel, go to Tools -> Protection -> Protect Workbook -> set password). It seems odd to me that TREX cannot index a file like this because it should be able to open it. I would think that it only would have problems if it were trying to write to the file (which it is not trying to do).
    Does anyone have any ideas as to why it's like this or what can we do about this other than remove the workbook password? We have dozens of files like this, and I need to find a solution other than removing the password.
    Thanks!
    -StephenS

    Hi Stephen,
    I couldn't find any release note specific to this issue however I do know that there are newer filter versions available in the most recent TREX revisions so the upgrade is usually a good option to resolve such issues. Of course I cannot say for certain if Revision 39 will resolve it without trying to index your specific document on that version myself.
    Anyway I know there are some Oracle filter issues ongoing with TREX 7.1 so I think an upgrade to a newer revision may be inevitable at some stage, but  perhaps in this scenario it would be best to wait for confirmation of the exact revision that contains the fix from the TREX experts in your message.
    Kind regards,
    Lorcan.

  • TREX not indexing

    hi,
    I have installed TREX 6.1 server which is working fine. But when i configure it to a EP 7 for indexing its repositories it is not responding. Can i use a TREX6.1 server for pointing to a EP7 system in the first hand? If yes, why is it not indexing the documents in it? please clarify.
    Regards,
    Ganesh N

    Hi Akhilesh,
    I am also having some problem in TREX search ,documents are giving lots of error:
    <b>8005,time out reached
    preparation failed.</b>
    When i go to KM-> component monitor->repository managers two of my repository are having below errors:
    <b>Name :Company_SAN
    Type: Repository manager
    status: Start up failed
    Des:  Startup Error:  getting mapped math - Invalid argume
    Name :Web repository
    Type: Repository manager
    status: Start up failed
    Des:  Startup Error:  Exception during start up of sub-manager: W2kSecurityManager: problems getting config parameters from Repository Manager</b>
    Please come up with some solution to it.
    Thanks in Advance
    Amit kumar koyal

  • KM TREX not indexing

    Hi
    I have a problem regarding the TREX.
    I have this folder indexed.
    I create resources in this folder by a custom WDP application.
    Basically i will upload a .pdf or .doc file from desktop to this folder.
    Now the problem is some of the documents are not getting indexed.
    There are very few documents though, but i cant find a reason why it is not getting indexed.
    Now, when i check the properties of the documents which gets failed, i dont see any problem.
    If i simple press "Save" button, the document starts gettign indexed and thus it gets indexed.
    Here, i am not changing any property, but simply pressing the save button.
    Till date, i couldnt reproduce the error.
    But since this error was reported from Prod, we want to know why this error comes and how it can be avoided.
    Please give ur thoughts...
    Regards
    BP

    Hi Piyush,
    My case is like some docuemtns didnt get indexed.
    Well some others working perfectly fine.
    Even now if i upload some docuemtns, it is getting indexed.
    And as i stated earlier, i goto details view and press "Save", it starts indexing the doc.
    So communication to the TREX server is working fine.
    The main problem is i cannot replicate the issue.
    Here is the code sample i use to upload the document to KM
    public boolean uploadDocument(
              final String documentName,
              final byte[] documentContent,
              final String location,
              final String fileExtension,
              final String company,
              final String brokerInstitution,
              final Date publishedDate,
              final String title)
              throws UserManagementException, ParseException, WcmException {
              final IContent fileContent;
              final RID rid;
              final ICollection collection;
              final IResource createdRes;
              fileContent = this.getResContent(documentContent, fileExtension);
              rid = RID.getRID(location);
              collection =
                   (ICollection) this.resourceFactory.getResource(
                        rid,
                        this.resourceContext);
              //Check for duplicate
              String tempStr =
                   (new StringBuffer()
                        .append(location)
                        .append('/')
                        .append(documentName))
                        .toString();
              final IResource reportResource =
                   this.resourceFactory.getResource(
                        RID.getRID(tempStr),
                        this.resourceContext);
              if (reportResource == null && collection != null) {
                   final IPropertyMap propertyMap =
                        this.getMetadatapropertyMap(
                             company,
                             brokerInstitution,
                             publishedDate,
                             title);
                   createdRes =
                        collection.createResource(
                             documentName,
                             propertyMap,
                             fileContent);
                   return true;
              } else {
                   return false;
         public IPropertyMap getMetadatapropertyMap(
              final String company,
              final String brokerInstitution,
              final Date publishedDate,
              final String title)
              throws ResourceException, ParseException {
              final IMutablePropertyMap propertyMap = new MutablePropertyMap();
              final String propertyNamespace = getproperty("PROPERTIES_NAMESPACE");
              final String companyNameID = getproperty("COMPANY_NAME");
              final String brokerInstitutionID = getproperty("BROKER_INSTITUTION");
              final String publishDateID = getproperty("PUBLISHED_DATE");
              final String createdDateID = getproperty("CREATED_DATE");
              final String titleID = getproperty("TITLE");
              IPropertyName propertyName =
                   new PropertyName(propertyNamespace, companyNameID);
              IProperty property = new MutableProperty(propertyName, company);
              propertyMap.put(property);
              propertyName = new PropertyName(propertyNamespace, brokerInstitutionID);
              property = new MutableProperty(propertyName, brokerInstitution);
              propertyMap.put(property);
              propertyName = new PropertyName(propertyNamespace, publishDateID);
              property = new MutableProperty(propertyName, publishedDate);
              propertyMap.put(property);
              //Calendar cal = Calendar.getInstance();
              //Date today = new Date(cal.getTime().getTime());
              java.util.Date date = new java.util.Date();
              Date today = new Date(date.getTime());
              propertyName = new PropertyName(propertyNamespace, createdDateID);
              property = new MutableProperty(propertyName, today);
              propertyMap.put(property);
              propertyName = new PropertyName(propertyNamespace, titleID);
              property = new MutableProperty(propertyName, title);
              propertyMap.put(property);
              return propertyMap;
    Regards
    BP

  • TREX not indexing some files

    Hi,
    When I create an index, this gets to index some documents, but others no.   
    It gets to index files txt and html, but it doesn't get to index files pdf, doc,xls, etc. It gives the following error: 14021 message not found / Preparation failed. 
    I already verified the file TREXValidMimeTypes.ini and it seems everything well. 
    I need help......
    Trex 7.0.41   
    Portal 7.0 SP11
    Regards,
    Nelson Costa

    Hi,
    I have also faced same issue and got resolved after setting host name in the URL Generator Service
    Logon to portal as administrator
    Navigate to System Administration -> System Configuration -> Knowledge management -> Content Management -> Global Service -> Show Advanced option -> URL Generator Service
    Then check the Host name.
    It should be according to your portal
    For example if your portal URL is http://GHIKD.sde.rfd:53200/irj/portal then you need to give host name like http://GHIKD:53200.
    https://www.sdn.sap.com/irj/sdn/thread?threadID=471175&tstart=0
    Hope this helps u.
    Regards
    Krishna.

  • TREX does not index all documents fro document class SOLARGNSRC

    Hi all,
    I've setup a connection between a TREX server (which is also used by a portal system) and Solution Manager. I've gone through the settings in SAP Note 750623 and I was able to create a index, the queue and even bypass the basic authentication in the preprocessing for document class SOLARGNSRC
    According to everything I've read, this should be enough to be able to index all documents in Solution Manager and find them with Full Tex Search. But this doesn't work. From the 50.000 documents in de class (35.000 English and 15.000 German) only 6300 documents are passed to TREX. Most of them are German HTML links to help.sap.com. I can see in the trace files that the URL's to some documents in the content server are passed to TREX. I can open dthe document using the links but I cannot find the documents using the full text search which I think means that nothing was really indexed.
    Questions:
    1. Has anyone been able to succesfully index Solution Manager documents for full text search purposes?
    2. Why are only 10% of the documents passed to TREX? Is there a specific setting for this?
    3. Why does TREX use the Content Server HTTP links to index the documents and not the RFC connection?
    Cheers
    Marcel Rabe

    Yep, all lights green
    SSR maintained. DRFUZZY as Search Engine (I have not tried Verity)
    Return Code? Interesting, I don't get an return code when I trigger the Index/Deindex. I just see the hourglass for about 5 minutes (when I run it in the foreground) and after that its back to the way it was. No messages. Nothing appears in the application logs as far as I can see.
    The program RSTIRIDX  is scheduled in the background and runs every hour minutes for about 10 seconds without an error.
    TREX Version 7.00.39.00
    Five languages actived, including German and English. In SKPR07 under Indexed documents I can see that 300 German and 6000 English documents have status indexed.
    No proxy server. Systems sit within the same network segment
    Thnx for your help. I posted a message with SAP as well as this seems strange to me.
    Marcel

  • Website on Mac Mini Server not accessible. Error 403 index document not available.

    Dear all,
    I'm trying to host a website on my Mac Mini Server. As far as I can see, my DNS is working correctly. Also my firewall is setup correctly. (I'm hosting another website which works OK). But I setup a new site. Basically a copy of the working site. But this site doesn't reflect an index.htm page. I receive an Error 403 with further information that there is no index document or that the site is secured. There is a contact link on this page directing to my webmaster. So I conclude that something is working, but no index.htm page. Has anyone similar experience? And offcourse a solution?
    Many thanks.

    Launch Terminal.app and issue the commands:
    cd /Library/Webserver/Documents/
    ls -al
    And see if the files are readable by the web server.   Here's an example index file listing that's owned by root and with group access for admin folks, and this file is marked as readable and writable for the owner and the group, and is readable for everybody else:
    -rw-rw-r--    1 root  admin   5094 Jul 25  2009 index.html
    In this configuration, the web server would be using the other access; the web server can read this file, but not write to it.  Here is how to duplicate these ownership and access settings on your own index.html file, assuming that you have that file present in the ls listing above, and assuming that you have issued the cd (change default directory) command shown above:
    sudo chown root:admin index.html
    sudo chmod ug=rw,o=r index.html
    You will need to enter your administrative password for the sudo command.
    I would very strongly suggest that you start learning some basics of bash and of Apache, because you're going to be needing those skills very soon.

  • Index.log - could not open document

    I get error when indexing a folder.
    The document with the problem is named: document.pdf
    Other documents in this folder works fine.
    The  index.log tells:
    Could not open document: W:\Doc\document.pdf-Press Quality.joboptions
    This file is skipped in the created index.
    What is the problem?
    I tried to regenerate the pdf - still the same problem.
    Is the index process looking for the joboption file when doing the indexing job?
    It's a kind of weared as I use this Joboption in other documents without problem.
    I use software:
    Frame Maker 10.
    Adobe Acrobat X Pro.
    Windows 7 Enterprise 64 bit.

    Hi Bill!
    Thanks.
    My version was 10.0.x
    I now updated to 10.1.13. This did not help.
    How I get there in the first place?
    I work at a company that develops and manufactures machines and systems.
    My job is to produce instruction books, manage and distribute all documentation (manuals, diagrams etc.) to our customers. 
    This is done by gathering all pdf-documents (wiring diagrams manuals etc.) in a doc-folder (documents are contained in sub folders), linking these documents in a master-page (main menu). (Documents in the folders includes pdf documents from other suppliers.)
    When all this is done, I create an index for all documents in the  doc-folder. Then connecting this index to the master page.
    This makes the entire documentation set searchable from the master page.
    Normally, this works just fine.
    But not for this this document.
    I just do not understand what the error actually means.  
    Could not open document: W:\Doc\document.pdf-Press Quality.joboptions
    This "document.pdf" is an instruction manual that can be opened as any other pdf file.
    More ideas are very much appreciated.

  • Trex search not working for not english documents

    Hello, I've got a problem enabling full text search for russian documents
    For enabling full text search for documents stored in kPro theese steps
    were performed:
    - creating rfc connection with TREX instance
    - enabling Index Release and Classify in transaction skpr06 for document
    area SRM
    - marking LANGUAGE attribute in table SDOKCLPROP for custom PHIO class
    Z780003P01 as obligatory
    - creating index category and indexing stored documents in transaction
    skpr07 for class Z780003P01
    The indexing was performed successfully. Under the created index category
    for class Z780003P01 it can be seen two indexed document classes
    (transaction spkr07, button indexed documents) one with language marked E
    (english) and one marked R (russian).
    When trying to perform test search (transaction skpr07) and indicating
    language as RU only documents in english can be searched. When trying
    a search request in russian search engine allways shows 0 results (for
    every request more than 2 characters). Small requests in russian (1-2
    characters) show some results but they are hardly to be right.
    Have you got some experience enabling full-text search for documents in language other than english?

    sorry for long delay, we tried to work with sap support on this problem but still no profit,
    >>First go to the document class for which you need to set Russian language , change the language if you have already created a document class for the perticular document type
    We tried to add a LANGUAGE attribute as a free  attribute in transaction DMWB for our LOIO and PHIO document  classes with attribute value 'RU', but still the same issue.
    >> Indexing as well as document class also need to change the laguage.
    The problem is that the indexing report says that most of documents were marked as indexed in russian successfully, so the problem is somwhere else I think.

  • TREX is not indexing certain excel files

    Hi Experts,
    TREX is not indexing certain excel-files correctly. I get a "no message found error" (14047) Trex: Preparation failed: index operation by certain excel files - but not by all. I found out what they have in common: some have filters activated and some have a lot of columns or rows.
    Is there any way to avoid these errors and index them correctly?
    TREX: 7.0.47
    Portal: 7.01 SP7
    Thank you in advance
    Best Regards
    Norman

    Hi Norman,
    Did you review note 1528705?  It has a fix for this issue.
    Best regards,
    Duncan

  • CM repository FSDB type - documents are not indexed (authentication?)

    Hi guys!
    I have problem with indexing documents on file system. I suppose, there can be problem, that indexing service can not acces file system, because of authentication. Which user accesses files to index? What access permissions must be set for this user?
    Thanx!
    Peter

    Hi Peter,
    in the share security of the fsdb folder, you need :
    Group Administrators : Full Control
    This group include at least administrator, sidadm, sapservicesid
    Everyon : Read
    In the security of the folder itself :
    Administrators : Full control
    Please verify that your host url in the url generator is valid (System administration\System Config\KM\CM\Global Services\Url Generator (Show advanced Options).
    Brad

  • Fast ESP matching engine "Not indexed / Invalid documents"

    I have done full indexing ,and could see content is feed but not indexed and  "Not indexed / Invalid documents" is not zero in Matching engine.Please let me know cause and resolution for the same.

    Could you run resetindex command i.e %FASTSEARCH%\bin\indexeradmin -a resetindex.
    Thank should trigger rebuild of indices.

  • Trex is not indexing Outlook messages .msg

    Hi Everybody,
    Does anyone know why TREX is not indexing .msg (Outlook mail) files?
    In [Help.sap.com|http://help.sap.com/saphelp_nw04s/helpdata/en/55/cb634114b78047e10000000a1550b0/frameset.htm] it says it should be supported...
    Any ideas?
    Thanks in advance,
    Fede

    Thanks Raghu!
    You solved my problem!!
    The definitive solution was:
    In portal (SysAdm > SysConf > KM > CM > Utilities > MIME Types) I've added this settings:
    Name "msg"
    Default Extension ""
    Extension "msg"
    Locale ""
    MIME Type "application/vnd.ms-outlook"
    Resource Bundle File ""
    Resource Key ""
    In <disk_drive>:\usr\sap\trex_<instance_number>\TREXValidMimeType.ini, I've added the following rows:
    application/vnd.ms-outlook
    message/rfc822
    Now TREX is indexing .msg outlook files!
    Thanks Again,
    Fede

  • Problem in indexing documents

    hi all,
    I've created an index and checked The status in TREX Monitor it is showing that 69 documents got satatus ok. but my repository contains some thousands of files. how can i make all file to be search.but one thing wht i obzerved is day by day the number of files got status is increasing from 69 to 80 like that and i am unable to find how the no is increasing.so, how can we  make all the  documents got status ok.
    Regards
    gnana

    Hi,
    Documents with more than 10kb were not indexed. The reason for this is, that doc´s smaller than 10kb are automatically transmitted to TREX by the crawler. For documents bigger than 10kb the crawlers only transmit the URI and then the TREX fetches them by itself.
    The URL´s that are sent to TREX are normally built by the URL generator using the host entry. If you have a clustered landscape this might not work, because the Firewall, Load-Balancers, WebServers, or external authentication Systems might cause problems when they are directly called by TREX via the normal portal URL.
    In this case you must specify the name of a specific server (e.g. http://server.com:50000) as Alternative Host in the URL Generator Service. But be careful with that setting. A wrong setting might crash your system.
    If you have clustered system, be sure that the communication TREX to Portal can take place without problems. Take a look at the trace files of the TREX preprocessor, this should give you a hint on this.

Maybe you are looking for

  • Need to pass the PROFILE options in SQL query

    SELECT ffvs.FLEX_VALUE_SET_NAME , ffvs.FLEX_VALUE_SET_ID , ffv.FLEX_VALUE_ID , ffv.FLEX_VALUE FROM fnd_flex_value_sets ffvs , fnd_flex_values ffv WHERE 1=1 AND ffvs.FLEX_VALUE_SET_ID=ffv.FLEX_VALUE_SET_ID AND ffvs.FLEX_VALUE_SET_ID=1012493  -- SCHEME

  • Error when ruuning Initial services in Endeca Application

    Hi , I am getting this error when i am trying to run initial_servies in my endeca application. [oracle@atgtelco bin]$ cd /u01/app/oracle/endeca/sampleApps/Testen/control [oracle@atgtelco control]$ ./initialize_services.sh \[01.08.13 21:21:32] SEVERE:

  • Colour problems when burning  dvd

    Having created 4 slideshows in one project in idvd, I burnt the project to dvd, when viewing the dvd on tv through dvd player, the colours are all wrong, red is blue, blue is brown, yellow is blue. Can anyone help me solve the problem please

  • How to make iframe work in Jdeveloper ADF

    We have a ADF JSF page made by Jdeveloper. The page has an input field and a go button, we need an iframe embedded in the same page and the iframe loads URL which is typed in the input field. For example, user typed in http:www.google.com, click go,

  • How to do Complete factory reset without Yosemite Disk on Macbook Air

    how to do Complete factory reset without Yosemite Disk on Macbook Air