TREX does not index documents

Now this one is a bit strange. We have created an index on a file system repository that contains a number of different types of files including plain text, MS-Word, Excel and Acrobat PDFs. The queue now has a red status showing a sizeable number of documents failed the preparation, but a few are ready to be transmitted.
We found that none of the MS-Word, Excel and Acrobat documents passed the preparation stage. Anything that passes preparation is either a directory or a plain text file, though this is not consistent either since a number of files that did not pass preparation where als plain text files.
What gives? Am I missing something here? Any hints would be very much appreciated.

Hello Wolfgang,
we faced exactly the same problem. To be more precisely, documents with more than 10kb were not indexed. The reason for this is, that doc´s smaller than 10kb are automatically transmitted to TREX by the crawler. For documents bigger than 10kb the crawlers only transmit the URI and then the TREX fetches them by itself.
The URL´s that are sent to TREX are normally built by the URL generator using the host entry. If you have a clustered landscape this might not work, because the Firewall, Load-Balancers, WebServers, or external authentication Systems might cause problems when they are directly called by TREX via the normal portal URL.
In this case you must specify the name of a specific server (e.g. http://server.com:50000) as Alternative Host in the URL Generator Service. But be careful with that setting. A wrong setting might crash your system (happened on our productive layer unfortunately).
If you use SSL for the communication between TREX and portal it even gets more complicated. Fortunately I did not have to deal with that.
Thus, if you have clustered system, be sure that the communication TREX to Portal can take place without problems. Take a look at the trace files of the TREX preprocessor, this should give you a hint on this.
Hope this helps.
Kind regards,
Christopher Höfele

Similar Messages

  • TREX does not index all documents fro document class SOLARGNSRC

    Hi all,
    I've setup a connection between a TREX server (which is also used by a portal system) and Solution Manager. I've gone through the settings in SAP Note 750623 and I was able to create a index, the queue and even bypass the basic authentication in the preprocessing for document class SOLARGNSRC
    According to everything I've read, this should be enough to be able to index all documents in Solution Manager and find them with Full Tex Search. But this doesn't work. From the 50.000 documents in de class (35.000 English and 15.000 German) only 6300 documents are passed to TREX. Most of them are German HTML links to help.sap.com. I can see in the trace files that the URL's to some documents in the content server are passed to TREX. I can open dthe document using the links but I cannot find the documents using the full text search which I think means that nothing was really indexed.
    Questions:
    1. Has anyone been able to succesfully index Solution Manager documents for full text search purposes?
    2. Why are only 10% of the documents passed to TREX? Is there a specific setting for this?
    3. Why does TREX use the Content Server HTTP links to index the documents and not the RFC connection?
    Cheers
    Marcel Rabe

    Yep, all lights green
    SSR maintained. DRFUZZY as Search Engine (I have not tried Verity)
    Return Code? Interesting, I don't get an return code when I trigger the Index/Deindex. I just see the hourglass for about 5 minutes (when I run it in the foreground) and after that its back to the way it was. No messages. Nothing appears in the application logs as far as I can see.
    The program RSTIRIDX  is scheduled in the background and runs every hour minutes for about 10 seconds without an error.
    TREX Version 7.00.39.00
    Five languages actived, including German and English. In SKPR07 under Indexed documents I can see that 300 German and 6000 English documents have status indexed.
    No proxy server. Systems sit within the same network segment
    Thnx for your help. I posted a message with SAP as well as this seems strange to me.
    Marcel

  • Adobe PDF iFilter 9 for 64-bit platforms does not index my PDF files with Digital Sign

    Adobe PDF iFilter 9 for 64-bit platforms does not index my PDF files with Digital Sign, why?

    hi  Phillip
    i am not sure what you mean
    I downloaded the ifilter and installed it
    then configured everything as shown in the pdf file
    I tried indexing from scratch exactly as i did successfully in the other computer
    and got some errors in the log file
    i checked the sql server log and the event viewer logs and got :
    Error '0x80004005' occurred during full-text index population for table or indexed view '[Pirsumim_ext_ck].[dbo].[T_PUBLICATIONS]' (table or indexed view ID '2073058421', database ID '14'), full-text key value 0x0000027A. Attempt will be made to reindex it.    
    The component 'PDFFilter.dll' reported error while indexing. Component path 'C:\Program Files\Adobe\Adobe PDF iFilter 9 for 64-bit platforms\bin\PDFFilter.dll'.   
    Informational: Full-text retry pass of Full population completed for table or indexed view '[Pirsumim_ext_ck].[dbo].[T_PUBLICATIONS]' (table or indexed view ID '2073058421', database ID '14'). Number of retry documents processed: 1. Number of documents failed: 1.
    Changing the status to MERGE for full-text catalog "Pirsumim_ext_catalog_ck" (5) in database "Pirsumim_ext_ck" (14). This is an informational message only. No user action is required.
    Informational: Full-text Auto population initialized for table or indexed view '[Pirsumim_ext_ck].[dbo].[T_PUBLICATIONS]' (table or indexed view ID '2073058421', database ID '14'). Population sub-tasks: 1
    the same dll worked fine in another computer...
    how can i see more details what is wrong with this dll  ?
    meidad

  • Trex is not indexing Outlook messages .msg

    Hi Everybody,
    Does anyone know why TREX is not indexing .msg (Outlook mail) files?
    In [Help.sap.com|http://help.sap.com/saphelp_nw04s/helpdata/en/55/cb634114b78047e10000000a1550b0/frameset.htm] it says it should be supported...
    Any ideas?
    Thanks in advance,
    Fede

    Thanks Raghu!
    You solved my problem!!
    The definitive solution was:
    In portal (SysAdm > SysConf > KM > CM > Utilities > MIME Types) I've added this settings:
    Name "msg"
    Default Extension ""
    Extension "msg"
    Locale ""
    MIME Type "application/vnd.ms-outlook"
    Resource Bundle File ""
    Resource Key ""
    In <disk_drive>:\usr\sap\trex_<instance_number>\TREXValidMimeType.ini, I've added the following rows:
    application/vnd.ms-outlook
    message/rfc822
    Now TREX is indexing .msg outlook files!
    Thanks Again,
    Fede

  • Adobe PDF iFilter 9 for 64-bit platforms does not index my PDF files in SQL server database

    hi all
    I need your help
    i have this asp.net site which works with sql server
    it searches a specific word in the database with full text search
    in my database i have a column of type image that holds a PDF file
    i want my SP to search for given word in my file
    so i installed the Adobe Ifilter
    configured it and every thing worked fine
    but after i moved to production Server and installed every thing the same way ...it did not create the index for the pdf files
    i can search for doc and docx and even for xml but not pdf
    i know i installed it correctly since it is working great in the testing environment
    i used this link for the configuration
    http://www.adobe.com/special/acrobat/configuring_pdf_ifilter_for_ms_sql_server_2005.pdf
    the only difference i see between my Testing environment and my Production environment  is that
    my testing has
    windows sever 2003 sp2 with Sql server 2005 64 bit std sp2   -- works fine
    my production  has
    windows sever 2003 R2sp2 with Sql server 2005 64 bit ent sp3  --does not work, does not index my pdf files
    can you tell me if there are problems with some environments ?
    are there any known problems with this Ifilter ?
    what should i do ..

    Thank you
    I opened it in the Acrobat Windows forum
    http://forums.adobe.com/message/2557155#2557155
    meidad Evyoni

  • Account currency200114 does not match document currency(1)

    Hi
    While adding A/P invoice i receive error:
    Account currency200114 does not match document currency(1)
    my bp currency is in USD and local is INR
    Plz solve it.

    Found.Thanks.
    Edited by: Malhotra Saurabh on Jul 30, 2009 11:28 AM

  • TREX is not indexing certain excel files

    Hi Experts,
    TREX is not indexing certain excel-files correctly. I get a "no message found error" (14047) Trex: Preparation failed: index operation by certain excel files - but not by all. I found out what they have in common: some have filters activated and some have a lot of columns or rows.
    Is there any way to avoid these errors and index them correctly?
    TREX: 7.0.47
    Portal: 7.01 SP7
    Thank you in advance
    Best Regards
    Norman

    Hi Norman,
    Did you review note 1528705?  It has a fix for this issue.
    Best regards,
    Duncan

  • ICloud does not accept documents (Pages)

    iCloud does not accept documents (Pages) "There was a problem in the upoading" it says (in Swedish)..."Try later". I have tried 2 days now. What shall I do?

    Hi,
    I´m having the same issue - Have you been able to solve it yet? (Your Post is from January).
    iCloud has accepted some of my docs created in Pages on my Mac (Mini 2010),
    but says "Format not supported" when I try draging them from `Documents´ on my Mac into the iCloud.
    The docs I was able to upload to iCloud (From Pages on my Mac), were created on my Mac in Pages
    (But also using photos and elements from elsewhere on my Mac, ie. `Documents´ folder.
    Please let me know if you have meanwhile found a solution, but not posted it.
    Thanks a lot!

  • Keynote 4.0.2 does not recognize documents from 4.0.1

    Since I upgraded to version 4.0.2, Keynote does not open documents created under version 4.0.1. The error message reads that I need a newer version of keynote to open the document.

    Welcome to the discussions, Cristiane.
    Do you also have iWork '06 still installed. If so, then rename Keynote in the iWork '06 folder so there's not two copies of a "Keynoe" app on your system.
    http://www.keynoteuser.com/news/?p=259

  • TREX not indexing document 10 KB

    hi...
      TREX search is working fine for documents < 10 KB but if i try indexing documents > 10 KB it gives prperation failed error.Please let me know what as to be done so that it indexes document > 10 KB
    Please help me on this as it is very urgent
    Thanks in Advance,
    Shanshank

    I installed an EP7/TREX SP10 two years ago. All worked fine until I decided to apply windows SP2 and some security patch. My machine is a win2003 x64.
    Now EP7 works fine, but TREX in not able to reindex documents > 10kb.
    Trex preprocessor trace says:
    Preprocessor.cpp(03550) : HANDLE: DISPATCH - Processing Document with key '/documents/Segreterie/Documenti PCTP/Doc. in Arrivo/Anno 2009/2009_03123.pdf' failed, returning PREPROCESSOR_ACTIVITY_ERROR (Code 6401)
    In the portal security log file I found the corresponding error:
    System/Security/Authentication#sap.com/irj#com.sap.engine.services.security.authentication.logincontext#Guest#2####764602d0407311dea83600188b77747b#SAPEngine_Application_Threadimpl:3_17##0#0#Info#1#com.sap.engine.services.security.authentication.logincontext#Plain###LOGIN.FAILED
    User: N/A
    Authentication Stack: ticket
    It seems to me that the portal is not able to pass user name to Trex (User: N/A) and then It is not authorized to retrive the documents.
    I tried to change the user of indexmanager service "index_service"; to set the alternative host in url generator service. Nothing changed.
    Any suggestion?
    Giorgio Peressin

  • Folder redirection configured in GPO does not create Documents folder and does not redirect

    Hi
    Another Folder Redirect-post - sorry for that, but I could not find an answer for my problem so far: even with consulting many threads here...
    We have an existing environment under Windows XP and want to move away from that. Now I ran into troubles with folder redirection...
    The following folder- and permission structure exists so far:
    \\<server>\<Users$-share>: This is the base folder for all users-directories
    -> Permissions: SYSTEM: Full / Administrators: Full / Users: Read&Execute, only this folder
    -> Share-permissions: Authenticated users: Full control
    \\<server>\<Users$-share>\<username>: base folder for the specific user
    -> Permissions: SYSTEM: Full / Administrators: Full / User: Change, all permissions inherited onwards
    -> Giving only change permission prevent further problems with self-called "advanced users"... ;-)
    \\<server>\<Users$-share>\<username>\profil.V2: Profile directory of the user
    -> Of course here the permissions are set by the system: override the predefined permission
    \\<server>\<Users$-share>\<username>\daten: Atcual Home directory of the user
    \\<server>\<Users$-share>\<username>\daten\Documents: Suposed Documents directory of the user
    Now I am going to Server 2012 and Windows 8.1, configured the GPO to redirect Documents folder into the above mentioned:
    GPO - User configuration - Policies - Windows settings - Folder Redirection - Documents:
    Setting: Standart - redirects all folders to the same path
    Destination folder: Copy to base directory of the user
    I apply policy to the user, log out and in - it doesn't work, no folder Documents created in my home-folder, Folder Documents still configured at C:\Users\<user>\Documents
    A very special point:
    I also do Redirection of the My Pictures-folder: Define it to follow the Documents folder. Funnily that one works and creates and configures \\<server>\<Users$-share>\<username>\daten\Pictures
    -> So in my eyes, it should work!
    Then: I want to do the folder redirection without Offline Files, due to the fact, that our users work with dynamically assigned virtual desktops, which are been cleaned everytime a user logs off a machine. Therefore synchronizing doesn't make sense...
    I just cannot see, why this redirection does not work :-(
    Thank you very much for any help!
    Kind regards
    David

    Hi David,
    Before going further, would you please let me confirm the OS version of the Windows Server which you used to
    configure folder redirection? Based on your description, did you mean that those users (who will be applied folder redirection settings) logged on Windows XP client computer?
    When you configure the folder redirection setting in Document Properties (path:
    User Configuration-> Policies-> Windows Settings-> Folder Redirection-> Documents), please check if you checked “Also apply redirection policy to Windows 2000, Windows 2000 Server, Windows XP, and Windows Server 2003 operating system” in Settings
    tab. As below picture shows.
    à
    GPO - User configuration - Policies - Windows settings - Folder Redirection - Documents:
    à•Setting: Standart
    - redirects all folders to the same path
    à•Destination
    folder: Copy to base directory of the user
    Would you please provide a screenshot of those settings you describe? Meanwhile, please summarily describe
    that how you configure. For example, where this GPO link to? Or any other. It will help me to understand clearly. Thanks for your understanding.
    In addition, please use
    gpresult command to check if the folder redirection group policy was really applied.
    If any update, please feel free to let me know.
    Hope this helps.
    Best regards,
    Justin Gu

  • NIDAQ 6.9.3 does not install documents

    Using NIDAQ 6.9.3 downloaded from NI site. Installs the drivers but does not install the documents. If I run the docsetup.msi it gives the option to select documents and tries to install them, but cannot find the PDF's. I am left with links to non-existent documents from the Start menu. Any suggestions (other than downloading them individually from the NI site)?
    Thanks

    Hello,
    Other than downloading a previous or later version of NI DAQ from the NI website, then next suggestion would be to download the documents individually from the ni website.
    Also how are you installing NI DAQ (If from cd have you tried copying the contents from the cd on to the hard disk and then trying to install it.
    Kind Regards
    LouisP
    Applications Engineer
    NI-UK

  • Account currency does not match document currency

    Hi,
    We are using SAP Business ONE 2005 A PL 50.  After having created an invoice in USD for an export customer, the payment is made in the local currency amount. When trying to do the incoming payment in the local currency we get Sap error message 173-66 " ACCOUNT CURRENCY DOES NOT MATCH THE DOCUMENT CURRENCY". Is there a way we can do the incoming payment in local currency?
    Thanks,
    Monil

    Hiii Monilshah
    Please check this link for your settings
    https://websmp101.sap-ag.de/~form/sapnet?_FRAME=CONTAINER&_OBJECT=011000358700000610242007E
    thanks
    Rizz

  • Spotlight Does Not Index My Time Capsule

    Hi all.
    I've been having this problem for about 6 months now. I've never been able to get Spotlight to pull up anything on my TC when I do a search.
    I've checked every forum on the web that has anything even close to this topic and I've tried all the suggestions (using Terminal, placing TC into the Spotlight "Privacy" tab and then removing it, etc..) and nothing works.
    My TC works fine, even though it is very slow on the wireless network, and I have nothing corrupted on it as far as I can see.
    So my question is this: Is it that TC simply DOES NOT work with Spotlight?
    Or is there something wrong with my Leopard? Everything else on my computer works perfectly fine.
    Thanks, any help would be appreciated.

    I have the same problem and posted the very same question last week, but no one replied.
    Anyone from Apple?

  • TREX does not search PDF files

    Hi,
    we have another problem with TREX 6.0.
    Our file repository is working fine, search also works for .txt files, but doesn't work for pdf files. Out pdf files are indexed correctly, but there are no result for this kind of files if we do a search.
    What can we do?
    Kind regards
    Thomas

    Your situation may already be solved.  However, one thing I did not hear in the details was: 1) how many PDF's were being indexed.  What was the size of the files?  Did you check the TREX Monitor to ensure all the PDF's had been sent through the entire system.  In the crawler monitor, did it state it found the correct number of files you believe to be in the index?  By default, TREX holds documents in a que for 30 minutes between processes unless you either reset this property or flush the que.
    There is a document TREXRecomenations which give some very good tips with regards to file size and other common settings.  For PDF it states:
    You want to index very large documents in PDF format from Adobe. These documents are not being indexed because they fail to pass the preprocessing stage.
    Limitation PDF is a complicated file format to preprocess. Typically PDF files larger than 15 MB cause problems. The time taken for preprocessing and filtering rises to over an hour and the process delivers bad results. Recommendation You should avoid the indexing and processing of PDF files that are larger than 15 MB.
    If you cannot find this document, let me know and I can forward it to you

Maybe you are looking for