Trouble with indexing web repository

Hi All,
We've recently upgraded to TREX version 7.10.34.00, and I'm trying to get one of our web repositories to index.
I can get the web repository to index if I do not include any 'include' result resource filters in the crawler parameters, but it does not index if I do include one. I have had some success using an 'exclude' result resource filter, just not the 'include' one.
The name of the web site that I'm indexing is http://site1.domain.com. When I do not include any result filters, a sampling of the crawler log file looks like this:
INFO Jan 27, 2010 10:10:10 AM /mywebrepository/site1.domain.com http://site1.domain.com/  provided  text/html
INFO Jan 27, 2010 10:10:10 AM /mywebrepository/site1.domain.com/files/index.htm http://site1.domain.com/files/index.htm  provided  text/html
INFO Jan 27, 2010 10:10:10 AM /mywebrepository/site1.domain.com/files/folder1/tableofcontents.htm     
http://site1.domain.com/files/folder1/tableofcontents.htm  provided  text/html
When I go into TREX monitor, the queue has lots of documents that it indexes.
This is what my result filter settings look like:
Include Documents/Web-Pages: <checked>
Include Folders: <checked>
Include Links (Not Applicable For Web-Sites): <unchecked>
Case Sensitive (Folders And Documents/Web-Pages): <unchecked>
Item ID Mode (Documents/Web-Pages Only): include
Item ID Patterns (csv): *.html, *.htm
Mime Type Mode (Documents/Web-Pages Only): include
Mime Type Patterns (csv):
Minimum Content Size (Documents/Web-Pages Only): <blank>
Maximum Content Size (Documents/Web-Pages Only): <blank>
Maximum Age of Last Modification (Documents/Web-Pages Only): <blank>
With the result filter in place in the crawler parameters, I click the button to index. The crawler log files are generated, but nothing shows up in the TREX monitor queue for the index. The Time Stamp doesn't change either. I have tried changing the parameters in the 'Item ID Patterns' field, but it still doesn't work.
Is this a bug with this new version of TREX or am I not using this filter properly? This seemed to work when I was using TREX version 6.
Thanks!
-StephenS

I was never able to resolve this problem but I have now retired the computer

Similar Messages

  • Viewing Troubles with CSS web

    Hello,
    I am still having trouble with a web page that I was asked to
    redesign. I decided to implement CSS into the page to change the
    appearance and navigation, however, the javascript (inherited) that
    was used on the page is having problems. I don't not know
    javascript, therefore I really can't address that issure right now.
    The page is centered in all of the browsers, along with the
    header and the container for the tab menu. The issue is the other
    containers that have side navigation or the content within the tab
    menu container.
    My concern is that the page is viewing correctly in IE6 and
    Opera but not in Netscape and Firefox. When I preview the page in
    the browsers, I get an error message in the status bar "done but
    errors on page" (still views correctly in IE & Opera).
    Hopefully, this all makes sense and everyone is not off
    enjoying pre-4th July celebrations to respond. -Thanks!
    css style

    Did you post a link to your page?
    Murray --- ICQ 71997575
    Adobe Community Expert
    (If you *MUST* email me, don't LAUGH when you do so!)
    ==================
    http://www.dreamweavermx-templates.com
    - Template Triage!
    http://www.projectseven.com/go
    - DW FAQs, Tutorials & Resources
    http://www.dwfaq.com - DW FAQs,
    Tutorials & Resources
    http://www.macromedia.com/support/search/
    - Macromedia (MM) Technotes
    ==================
    "Iwannaknow2" <[email protected]> wrote in
    message
    news:e8b5du$9q2$[email protected]..
    > Hello,
    >
    > I am still having trouble with a web page that I was
    asked to redesign. I
    > decided to implement CSS into the page to change the
    appearance and
    > navigation,
    > however, the javascript (inherited) that was used on the
    page is having
    > problems. I don't not know javascript, therefore I
    really can't address
    > that
    > issure right now.
    >
    > The page is centered in all of the browsers, along with
    the header and the
    > container for the tab menu. The issue is the other
    containers that have
    > side
    > navigation or the content within the tab menu container.
    >
    > My concern is that the page is viewing correctly in IE6
    and Opera but not
    > in
    > Netscape and Firefox. When I preview the page in the
    browsers, I get an
    > error
    > message in the status bar "done but errors on page"
    (still views correctly
    > in
    > IE & Opera).
    >
    > Hopefully, this all makes sense and everyone is not off
    enjoying pre-4th
    > July
    > celebrations to respond. -Thanks!
    > [email protected]
    >

  • Error when indexing web repository

    I'm working on a problem that I'm having with indexing a web repository. For the sake of this post, we will call the web site that I'm indexing for the repository http://mysite1.com. For the most part, things are working just fine. The problem is that there's a couple of links in one of the pages in http://mysite1.com that aren't getting crawled.
    The first link is http://mysite2.com. This link is to a web site that is on our network, but you are normally required to provide a username and password to access it. The message in the crawler error file that's being generated is:
    ERROR     Mar 27, 2009 8:04:02 AM     /webdynamic/mysite2.com     http://mysite2.com/     processing failed     com.sapportals.wcm.repository.AuthorizationRequiredException     
    I created an HTTP System in the System Landscape Definitions for http://mysite2.com, and here's what it looks like:
    Description:         mysite
    Same User Domain:    <unchecked>
    Max Connections:     0
    Password:            <set to the password for the user>
    Server Aliases:      <blank>
    Server URL:          http://mysite2.com
    User:                myuser
    I have verified that the username and password that I have configured here are valid. I have also set up a web site definition for this, and here's what it
    looks like:
    Login Timeout:       <blank>
    System ID:           mysite.com
    All the rest of the options for the web site are blank.
    What else do I need to do to get the crawler to access the content of http://mysite2.com?
    The other link that I'm getting errors on is http://mysite3.com. The error in the crawler error file is:
    ERROR     Mar 27, 2009 8:04:01 AM     /webdynamic/mysite3.com     http://mysite3.com/     processing failed     com.sapportals.wcm.repository.TimeExceededException:
    request to /: Read timed out     
    This site is accessible both internally and externally to our network. I'm not sure what I need to do for this. Can anyone help me out with this?
    Thanks!
    -Stephen Spalding

    Hi Esther Schmitz,
    Thanks for quick reply. As you said, i have changed website url to http://www.cnn.com.
    but still it shows below error messages.
    The target of the link TECH you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted. Contact your system administrator if you think the target /CNN/TECH/TECH should be available.
    Thanks,
    Satya

  • Having trouble with custom web auth page on 4404

    Hi all
    I am having trouble with a custom web auth page on my controller, we have edited the original file, but when we click login it goes to page cannot be displayed and it doesnt redirect to the page I want, however when I close the window and reopen it has already authenticated me.
    Has anyone got a copy of some working html code I can use ?
    cheers

    There is sample Web Authentication bundle avaiable for download from cisco.com. if you go to the software download page and go to Wireless->Standalone Controllers->4404 you should see a link for Wireless Lan Web Authentication Bundle.
    Its the same bundle whether you have a WiSM, 4404 or 2100

  • Trouble with sucure web sites

    I am having trouble with sucure websites. When I log into a sucure site I can navigate to no more than 3or 4 pages (sometimes as little as 1) and the the web brouser will stop loading. I have tried several brouser and it happens with all of them. I am using a rev B iMac running 10.3.9 hook to the internet via a airport extreem. Help?
    rev B iMac   Mac OS X (10.3.9)  

    I was never able to resolve this problem but I have now retired the computer

  • Trouble with Photoshop Web Galleries

    Hello,
    I am trying to create a web gallery. I have created several in the past, but I have a problem with the way the captions display.
    Here is a gallery from an event at my campus last week:
    http://www.alasu.edu/video/forensics/gallery.swf
    In order to get the captions to display, you have to click the "?" icon on the bottom right. It's the same way with all the galleries I have created, but I would like for the captions to show by default without having to click something.
    PLEASE HELP!

    BOILERPLATE TEXT:
    Note that this is boilerplate text.
    If you give complete and detailed information about your setup and the issue at hand,
    such as your platform (Mac or Win),
    exact versions of your OS, of Photoshop (not just "CS6", but something like CS6v.13.0.6) and of Bridge,
    your settings in Photoshop > Preference > Performance
    the type of file you were working on,
    machine specs, such as total installed RAM, scratch file HDs, total available HD space, video card specs, including total VRAM installed,
    what troubleshooting steps you have taken so far,
    what error message(s) you receive,
    if having issues opening raw files also the exact camera make and model that generated them,
    if you're having printing issues, indicate the exact make and model of your printer, paper size, image dimensions in pixels (so many pixels wide by so many pixels high). if going through a RIP, specify that too.
    etc.,
    someone may be able to help you (not necessarily this poster, who is not a Windows user).
    a screen shot of your settings or of the image could be very helpful too.
    Please read this FAQ for advice on how to ask your questions correctly for quicker and better answers:
    http://forums.adobe.com/thread/419981?tstart=0
    Thanks!

  • Trouble with Index.html

    The site is designed with Adobe Muse. When I upload it to a hosting site the index.html page appears different and the images do not link. When I open the index.html on my computer it looks fine, but the minute it's uploaded it changes. I thought it was Yahoo - www.amydealdesign.com and so I made a change to Go Daddy www.amydeal.com - same thing. Why aren't the images linking?

    It appears that your images haven't been uploaded to your webhost.
    How are you uploading your site?
    If you publish to a temporary Business Catalyst site, Muse will upload all the required files.
    If you use the 'upload to FTP host' command from Muse, it should upload everything you need.
    If you export to a local folder and upload using a 3rd party FTP client, be sure to upload everything in the export folder, include the 'images', 'assets' and 'scripts' subfolders.

  • Trouble with tablet web access

    I have a Acer Iconia B1 tablet, I use Comcast Xfinity and have WiFi but can not get the tablet on line ??
     it shows the WiFi signal is strong but can not connect ??
    any ideas ?
    Thanks
    Beachben

    I have a Acer Iconia B1 tablet, I use Comcast Xfinity and have WiFi but can not get the tablet on line ??
     it shows the WiFi signal is strong but can not connect ??
    any ideas ?
    Thanks
    Beachben

  • Web repository manager

    Hi,
    I am working with NW04S.
    I am facing 2 issues which are related with the web repository manager.
    1. When we create a web repository manager, we must be able to see it under content management->KM content. When we choose the web repository, we should be able to see the link of the website that we configured.
    The issue that I am facing is that I am unable to see this link although my web repository manager is seen in the KM content.
    I am able to the see the links for the web sites in the web repository managers that I had created previously.
    I have done all configurations according to the config guide. I have created html system, website, html property extractor, cache and then web repository manager.
    2. When I went back to check how I had configured the older web repository managers, I found that only the ones that I created recently were present. Very old ones were missing. But these are visible under KM content.
    Is there some place where these are archived?
    Could you please help me with this?
    Best Regards,
    Vidhya

    Hi,
    I checked some other posts on the forum and found that i had to check the component monitor. i did so.
    it gives me an error saying that
    2007-04-30T03:55:33Z: GET /: com.sapportals.wcm.WcmException: sending request to: http://www.yahoo.com/ request uri: / unable to connect to www.yahoo.com: unknown host: www.yahoo.com (java.net.UnknownHostException: www.yahoo.com)
    i have tried the same with cnn.com also.
    could someone tell me what i should do?
    regards,
    Vidhya

  • Index and crawler not working on Web Repository

    Hi Team,
    I'm trying to setup a Web Repository and crawling it for indexing. I've followed the steps from a SAP "how-To" document, but I guess the problem might be the way I'm confuring the web site in EP. I've created a Virtual Directory on my laptop's IIS 5.0 web server and the URL of the web site has been set as http://laptop-ashishk/myWebSite.
    Do I need to set the START PAGE as /index.html (as per the spec it says it's not mandatory)...
    Let me know whether you need any information with regards to this problem.
    Ashish

    They've set:
    meta name="viewport" content="initial-scale=2.3, user-scalable=no"
    It's the user-scalable that's the problem. Apple considers the default (per their web coding rules at http://developer.apple.com/iphone/designingcontent.html to be yes.
    I've noticed the same thing.
    Aym

  • Indexing a Web Repository (CNN Website)

    Hello All,
    Am working to configure a Web Repository for Indexing with CNN site.
    Following SAP document -> https://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/library/uuid/77f6aa90-0201-0010-b681-e013540efb3b
    The configuiration is all done as per the above mentioned documetn but its not working.
    Getting exception as:
    Link target is not available
    The target of the link CNN-TECH you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted. Contact your system administrator if you think the target /CNN_WebRepository/CNN-TECH/TECH/ should be available.
    Also cross checked possibilities as mentioned in these posts but no solution:
    https://www.sdn.sap.com/irj/scn/thread?messageID=2135428
    https://www.sdn.sap.com/irj/scn/thread?messageID=1293140
    Investigations done:
    1. Suspecting this to be a proxy issue, have configured proxy in System Admin-> Service Config -> httpservce
    2. Following this link on help.sap site (Case B), also created HTTP System and user mapping with index_service user - http://help.sap.com/saphelp_nw04s/helpdata/en/ae/46833ceb3da02ce10000000a114027/frameset.htm
    None of these changes helped me.
    In addition, I am getting similar exception when I configured Web Repository with a Intranet page thinking in this case proxt will not be required.
    Link target is not available
    The target of the link Intranet_Home you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted. Contact your system administrator if you think the target /CWIntranet/Intranet_Home/index.html should be available
    Any ideas?
    Awaiting Reply.
    Thanks,
    Ritu

    Hi Ritu,
    I am currently facing a similar issue to the original problem you had.
    I am trying to create a link to some iViews in my PCD but it displays the following message:
    Link target is not available
    The target of the link S&OP Calendar you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted.
    Could you explain how you resolved this issue?  It was working before!
    Thanks,
    Oloy.

  • Indexing and searching on a web repository -- No document excerpt available

    Hi everybody,
    I created a web repository with content of our intranet. I also created a index for this web repository. Everything seems to work fine. But when I search on the index no document excerpt is shown. It says: "No document excerpt available" under each search result.
    I think no full text index is performed, where and how can I tell that I want a full text index?
    Thank you in advance, Christoph

    Hi,
    Have you solved this problem ?
    Best Regards,
    Fabien

  • I am really in trouble with AP Div-How do I fix it on the web?

    Hi,
    I am really in trouble with my website. I have added some pictures and text on top of Fireworks Image and have published it on the website.
    But the concern is, when I zoom in & zoom out, I can see the previous text on the screen and also the picture and texts I have added using Ap Div  tag are scattered moving all to the left when I zoom out. Can someone help me how to fix this in one particular place so that it doesn't move when I zoom in or zoom out!!! I am using Adobe Dreamweaver CS3, if this will help.
    Appreciate your sincere help on this.
    Thanks in advance.

    Frankly, there's a lot that is wrong with that page:
    1. Most of your content is in the images - this means that you will get very poor search engine ranking
    2. Your extensive use of absolute positioning for layout - this means that when you enlarge the text size in the browser, you will have overflow problems on the page (for example, the terrible problems at the bottom of the page)
    3. You have used tables for layout - this is because of your use of Fireworks to create the HTML
    Each of these problems is solvable but none of them are solvable easily without a redesign of the page. A web page should be built from the top down, stacking content containers (i.e., <div>, <section>, <article>, <aside>, etc.) vertically or floating them horizontally or both. These containers would be loaded with the text content of the page, and images would be used only for cosmetic appearance. Using CSS to style/locate the content will allow you to completely move away from tables for layout. Most typical pages can be created without the use of absolute positioning which should be used only for special purposes, not for layout of the page elements.

  • Having trouble with web authentication in 5504

    Hi everybody,
    We´re experiencing a trouble with our Wireles LAN solution. We have a WLC 5504, a ACS 4.2 and APs 1131AG.
    After deploying the solution and doing some tests we noticed when a user attempted to connect by wireless network there was too much delay since they clicked ie (internet explorer) until web authentication into WLC was shown. the delay was around 3 minutes. This issue also ocurrs despite of doing a test from my laptop that was next to one access point, then, I moved to another access point and the result was the same, a laptop problem is ruled out.
    Has anybody ever had this kind of trouble? , How could I reduce this time?, is it possible?, Which part of configuration shoud I check?
    Regards,
    Manuel

    Friends,
    I´ve made a mistake. Our WLC is a 4404.  
    Regards,
    Manuel

  • TROUBLE WITH WEB ADDRESS IN SAFARI

    I have previously posted this topic under Safari directly but have received no response. I am having some difficulty with my desktop G5 Quad in respect to logging on to a particular web address that I have been using for years now. When I try to log on, I am receiving a message that says, "Safari cannot locate the server".
    But if I go downstairs in our kitchen on our laptop Macbook, I have no trouble whatsoever in logging on to this address.
    Can someone please tell me why I would be having this problem on my main desktop computer? It seems I do not have any trouble with any other web address on either computer except for this one.
    Thanks

    Most likely caused by a corrupt cookie.
    Delete all cookies for that website, and also delete your bookmark for it. Close Safari.
    Re-open Safari and type in the URL for that site. This will create a fresh cookie.
    Any better?

Maybe you are looking for

  • New Macbook spontaneous restart

    Hi all, I have a month old Macbook pro that has been spontaneously restarting with an error. It doesn't matter what I am doing on it; it will restart at any point in time randomly. I took it to the genius bar and they said the only restart they found

  • Out of control highlighting

    has anyone had a problem where you end up with an out of control highlighting situation where you have to force quit to make it stop? if so...have you figured out why it might be doing that? It's happened to me several times recently.

  • Droid 3 Backup Assistant "Sync Now" Failure

    I've been spending nights and hours on the phone and chat boxes with technical support.  Some folks suggest deleting the Backup Assistant icon (there isn't one).  One said it had to do with security blocks I've put on my account (there don't seem to

  • Automatic Goods Issue after Outbound delivery and shipment.

    Hi, I am working on LES part. Could you please explain me how to do Automatic post goods issue for shipments. (in our scenario where after outbound delivery, picking, packing shipment takes place.) Can we use VL23 for automatic goods issue for shipme

  • Apple TV and PDF presentations

    Is there any way to be able to show PDF presentations on a TV using Apple TV? Powerbook G4 17"   Mac OS X (10.4.9)