Indexing a Web Repository (CNN Website)

Hello All,
I am trying to configure a Web Repository for indexing the CNN site.
I am following this SAP document: https://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/library/uuid/77f6aa90-0201-0010-b681-e013540efb3b
The configuration is all done as per the above-mentioned document, but it is not working.
I am getting this exception:
Link target is not available
The target of the link CNN-TECH you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted. Contact your system administrator if you think the target /CNN_WebRepository/CNN-TECH/TECH/ should be available.
I also cross-checked the possibilities mentioned in these posts, but found no solution:
https://www.sdn.sap.com/irj/scn/thread?messageID=2135428
https://www.sdn.sap.com/irj/scn/thread?messageID=1293140
Investigations done:
1. Suspecting this to be a proxy issue, I configured the proxy in System Admin -> Service Config -> httpservice.
2. Following Case B on this help.sap.com page, I also created an HTTP System and a user mapping with the index_service user: http://help.sap.com/saphelp_nw04s/helpdata/en/ae/46833ceb3da02ce10000000a114027/frameset.htm
None of these changes helped.
In addition, I get a similar exception when I configure a Web Repository with an intranet page, where I assumed no proxy would be required.
Link target is not available
The target of the link Intranet_Home you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted. Contact your system administrator if you think the target /CWIntranet/Intranet_Home/index.html should be available
Any ideas?
Awaiting Reply.
Thanks,
Ritu

Hi Ritu,
I am currently facing a similar issue to the original problem you had.
I am trying to create a link to some iViews in my PCD but it displays the following message:
Link target is not available
The target of the link S&OP Calendar you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted.
Could you explain how you resolved this issue?  It was working before!
Thanks,
Oloy.

Similar Messages

  • Error when indexing web repository

    I'm working on a problem that I'm having with indexing a web repository. For the sake of this post, we will call the web site that I'm indexing for the repository http://mysite1.com. For the most part, things are working just fine. The problem is that there's a couple of links in one of the pages in http://mysite1.com that aren't getting crawled.
    The first link is http://mysite2.com. This link is to a web site that is on our network, but you are normally required to provide a username and password to access it. The message in the crawler error file that's being generated is:
    ERROR     Mar 27, 2009 8:04:02 AM     /webdynamic/mysite2.com     http://mysite2.com/     processing failed     com.sapportals.wcm.repository.AuthorizationRequiredException     
    I created an HTTP System in the System Landscape Definitions for http://mysite2.com, and here's what it looks like:
    Description:         mysite
    Same User Domain:    <unchecked>
    Max Connections:     0
    Password:            <set to the password for the user>
    Server Aliases:      <blank>
    Server URL:          http://mysite2.com
    User:                myuser
    I have verified that the username and password that I have configured here are valid. I have also set up a web site definition for this, and here's what it looks like:
    Login Timeout:       <blank>
    System ID:           mysite.com
    All the rest of the options for the web site are blank.
    What else do I need to do to get the crawler to access the content of http://mysite2.com?
    The other link that I'm getting errors on is http://mysite3.com. The error in the crawler error file is:
    ERROR     Mar 27, 2009 8:04:01 AM     /webdynamic/mysite3.com     http://mysite3.com/     processing failed     com.sapportals.wcm.repository.TimeExceededException: request to /: Read timed out
    This site is accessible both internally and externally to our network. I'm not sure what I need to do for this. Can anyone help me out with this?
    Thanks!
    -Stephen Spalding
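
When a crawl produces many entries like the two above, it can help to group the failures by exception type before working on them one by one. The following is only a minimal sketch: it assumes the crawler error file is plain text with one entry per line in the shape quoted in this post, and the class and method names (CrawlerErrorTally, tally) are made up for illustration and are not part of any SAP API.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: counts how often each exception class appears in ERROR lines.
final class CrawlerErrorTally {
    // Matches a fully qualified exception class name, e.g. com.foo.bar.SomeException.
    private static final Pattern EXCEPTION =
            Pattern.compile("\\b((?:[a-z_]\\w*\\.)+[A-Z]\\w*Exception)\\b");

    static Map<String, Integer> tally(List<String> lines) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String line : lines) {
            // Only ERROR entries are of interest; INFO lines are skipped.
            if (!line.trim().startsWith("ERROR")) {
                continue;
            }
            Matcher m = EXCEPTION.matcher(line);
            if (m.find()) {
                counts.merge(m.group(1), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Sample lines copied from the post above.
        List<String> lines = Arrays.asList(
            "ERROR     Mar 27, 2009 8:04:02 AM     /webdynamic/mysite2.com     http://mysite2.com/     processing failed     com.sapportals.wcm.repository.AuthorizationRequiredException",
            "ERROR     Mar 27, 2009 8:04:01 AM     /webdynamic/mysite3.com     http://mysite3.com/     processing failed     com.sapportals.wcm.repository.TimeExceededException: request to /: Read timed out");
        tally(lines).forEach((exception, count) -> System.out.println(count + "  " + exception));
    }
}
```

Grouping this way separates authentication problems (AuthorizationRequiredException) from network or timeout problems (TimeExceededException), which usually need different fixes.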

    Hi Esther Schmitz,
    Thanks for the quick reply. As you suggested, I have changed the website URL to http://www.cnn.com,
    but it still shows the error message below.
    The target of the link TECH you tried to navigate to is not available. Its repository might be disconnected or the target may have been renamed, moved, or deleted. Contact your system administrator if you think the target /CNN/TECH/TECH should be available.
    Thanks,
    Satya

  • Index and crawler not working on Web Repository

    Hi Team,
    I'm trying to set up a Web Repository and crawl it for indexing. I've followed the steps from an SAP "how-to" document, but I guess the problem might be the way I'm configuring the web site in EP. I've created a Virtual Directory on my laptop's IIS 5.0 web server and the URL of the web site has been set as http://laptop-ashishk/myWebSite.
    Do I need to set the START PAGE as /index.html (as per the spec it says it's not mandatory)...
    Let me know whether you need any information with regards to this problem.
    Ashish

    They've set:
    meta name="viewport" content="initial-scale=2.3, user-scalable=no"
    It's the user-scalable that's the problem. Apple considers the default (per their web coding rules at http://developer.apple.com/iphone/designingcontent.html) to be yes.
    I've noticed the same thing.
    Aym

  • Trouble with indexing web repository

    Hi All,
    We've recently upgraded to TREX version 7.10.34.00, and I'm trying to get one of our web repositories to index.
    I can get the web repository to index if I do not include any 'include' result resource filters in the crawler parameters, but it does not index if I do include one. I have had some success using an 'exclude' result resource filter, just not the 'include' one.
    The name of the web site that I'm indexing is http://site1.domain.com. When I do not include any result filters, a sampling of the crawler log file looks like this:
    INFO Jan 27, 2010 10:10:10 AM /mywebrepository/site1.domain.com http://site1.domain.com/  provided  text/html
    INFO Jan 27, 2010 10:10:10 AM /mywebrepository/site1.domain.com/files/index.htm http://site1.domain.com/files/index.htm  provided  text/html
    INFO Jan 27, 2010 10:10:10 AM /mywebrepository/site1.domain.com/files/folder1/tableofcontents.htm http://site1.domain.com/files/folder1/tableofcontents.htm  provided  text/html
    When I go into TREX monitor, the queue has lots of documents that it indexes.
    This is what my result filter settings look like:
    Include Documents/Web-Pages: <checked>
    Include Folders: <checked>
    Include Links (Not Applicable For Web-Sites): <unchecked>
    Case Sensitive (Folders And Documents/Web-Pages): <unchecked>
    Item ID Mode (Documents/Web-Pages Only): include
    Item ID Patterns (csv): *.html, *.htm
    Mime Type Mode (Documents/Web-Pages Only): include
    Mime Type Patterns (csv):
    Minimum Content Size (Documents/Web-Pages Only): <blank>
    Maximum Content Size (Documents/Web-Pages Only): <blank>
    Maximum Age of Last Modification (Documents/Web-Pages Only): <blank>
    With the result filter in place in the crawler parameters, I click the button to index. The crawler log files are generated, but nothing shows up in the TREX monitor queue for the index. The Time Stamp doesn't change either. I have tried changing the parameters in the 'Item ID Patterns' field, but it still doesn't work.
    Is this a bug with this new version of TREX or am I not using this filter properly? This seemed to work when I was using TREX version 6.
    Thanks!
    -StephenS
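
One way to narrow this down is to check, outside the portal, which of the crawled items would pass an include list such as *.html, *.htm. The sketch below is not TREX or KM code. It assumes that the patterns are simple '*' wildcards applied to the item name (the last path segment) and that the unchecked "Case Sensitive" option means case-insensitive matching. Both are assumptions, and the class name ItemIdIncludeFilter is made up.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical stand-in for an "include" item ID filter with '*' wildcards.
final class ItemIdIncludeFilter {
    private final List<Pattern> patterns = new ArrayList<>();

    ItemIdIncludeFilter(String csvPatterns, boolean caseSensitive) {
        for (String raw : csvPatterns.split(",")) {
            String glob = raw.trim();
            if (glob.isEmpty()) {
                continue;
            }
            // Quote the pattern literally, then let every '*' match any run of characters.
            String regex = ("\\Q" + glob + "\\E").replace("*", "\\E.*\\Q");
            patterns.add(Pattern.compile(regex, caseSensitive ? 0 : Pattern.CASE_INSENSITIVE));
        }
    }

    // An item is accepted when at least one pattern matches its whole name.
    boolean accepts(String itemId) {
        for (Pattern p : patterns) {
            if (p.matcher(itemId).matches()) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        ItemIdIncludeFilter filter = new ItemIdIncludeFilter("*.html, *.htm", false);
        // Item names taken from the crawler log lines quoted above.
        String[] items = {"site1.domain.com", "index.htm", "tableofcontents.htm"};
        for (String item : items) {
            System.out.println(item + " -> " + (filter.accepts(item) ? "included" : "filtered out"));
        }
    }
}
```

Under these assumptions the start item site1.domain.com carries no .htm or .html suffix and would not match either pattern. Whether that has any effect on what TREX 7.10 receives is something only a test with an additional pattern could confirm.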

    I was never able to resolve this problem but I have now retired the computer

  • Indexing and searching on a web repository -- No document excerpt available

    Hi everybody,
    I created a web repository with content from our intranet. I also created an index for this web repository. Everything seems to work fine, but when I search on the index no document excerpt is shown. It says "No document excerpt available" under each search result.
    I think no full-text indexing is performed. Where and how can I specify that I want a full-text index?
    Thank you in advance, Christoph

    Hi,
    Have you solved this problem ?
    Best Regards,
    Fabien

  • Newly created Web repository not showing up in explorer

    I'm trying to create an index which would finally enable me to search the 'CNN' website (following the 'how to' document : 'HOW TO SET UP A WEB REPOSITORY AND CRAWLING IT FOR INDEXING'). I'm unable to assign the web repository which I  created in an earlier step to the index because it doesn't show up in the repository/folder listing. I can't even see it under 'Content Administration -> KM Content -> Repositories'.
    What exactly am I doing wrong?!
    thanks,
    Biju.

    Hi Karsten,
    - We're on EP6 SP2.
    - Yes, I'm referring to the document you mentioned.
    - Yes, the repository does show up with a green light under KM -> Component Monitor.
    BTW, I was able to bring up the web repository now that I've specified under 'System Admin -> System Config -> Service Config -> Applications -> com.sap.portal.ivs.httpservice -> Services -> proxy -> HTTP-Bypass Proxy Servers' that everything under 'mycompany.com' must be bypassed. I'd originally specified the proxy setup in the TREX configuration, which runs on a different (physical) server than the portal.
    I don't really get the connection, but essentially took the cue from the earlier reply I got.
    So, in short, I think the problem is solved at least for the moment.
    Thanks again for your help.
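
On the connection: the bypass list only decides, per target host, whether a request goes through the proxy or directly to the server. A plausible reading of the result above is that the portal itself fetches the web site when the repository is displayed, so the portal's proxy settings matter and not only the ones configured for TREX. The sketch below illustrates the bypass decision only. It assumes that a bypass entry such as mycompany.com covers the domain and all hosts under it, and ProxyBypassCheck is a made-up name, not the portal's implementation.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration of a proxy bypass decision based on domain suffixes.
final class ProxyBypassCheck {
    // Returns true when the host equals a bypass domain or is a subdomain of one.
    static boolean bypassesProxy(String host, List<String> bypassDomains) {
        String h = host.toLowerCase(Locale.ROOT);
        for (String entry : bypassDomains) {
            String domain = entry.trim().toLowerCase(Locale.ROOT);
            if (domain.isEmpty()) {
                continue;
            }
            if (h.equals(domain) || h.endsWith("." + domain)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<String> bypass = Arrays.asList("mycompany.com");
        for (String host : new String[] {"intranet.mycompany.com", "www.cnn.com"}) {
            System.out.println(host + " -> " + (bypassesProxy(host, bypass) ? "direct" : "via proxy"));
        }
    }
}
```

With such a list, intranet hosts are contacted directly, while an external site such as www.cnn.com still needs a working proxy entry.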

  • Web Repository clarifications

    Hi,
    I am an SAP newbie.
    I am creating a web repository in KM.  This is the scenario I want:
    I want all HTML pages contained within a website (i.e. http://www.xyz.com) to be stored in the repository.
    So I created an HTTP system for http://www.xyz.com and then configured a Web Site with the HTTP system, with a start page of /main/index.html and a system path of /main.
    I then created an index and so forth.  However, when I try to access the repository, I can only access the start page.
    Any pointers?  Or was my concept about the repository incorrect?
    Points will be generously awarded.
    Charles

    Thanks for your reply.
    Yes, I have followed all the steps in the link.
    I have created an HTTP system with server URL http://www.xyz.com and a Web Site for the system (with the same system ID), with /main/index.html as the start page and /main as the system path.  I then configured a cache and created a Web repository manager with the Web Site and cache I had just created.
    I gather that all documents of the website with a URL beginning with http://www.xyz.com/main will be stored in the web repository once I have completed the steps above?
    However, when I try to access the web repository via KM Content, I can only see the Web Site, and when I click on it, I arrive at the start page http://www.xyz.com/main/index.html.  How can I make all the resources beginning with http://www.xyz.com/main appear in the web repository?  I have created an index and configured the crawler, but the result is still the same.
    And what's the difference between a Simple and a Standard Web repository?  I have read the SAP documentation and still don't get it.
    Thanks, points will be generously awarded.
    Charles

  • Web repository manager

    Hi,
    I am working with NW04S.
    I am facing 2 issues which are related with the web repository manager.
    1. When we create a web repository manager, we must be able to see it under content management->KM content. When we choose the web repository, we should be able to see the link of the website that we configured.
    The issue that I am facing is that I am unable to see this link although my web repository manager is seen in the KM content.
    I am able to see the links for the web sites in the web repository managers that I had created previously.
    I have done all configurations according to the config guide. I have created html system, website, html property extractor, cache and then web repository manager.
    2. When I went back to check how I had configured the older web repository managers, I found that only the ones that I created recently were present. Very old ones were missing. But these are visible under KM content.
    Is there some place where these are archived?
    Could you please help me with this?
    Best Regards,
    Vidhya

    Hi,
    I checked some other posts on the forum and found that I had to check the Component Monitor, so I did.
    It gives me the following error:
    2007-04-30T03:55:33Z: GET /: com.sapportals.wcm.WcmException: sending request to: http://www.yahoo.com/ request uri: / unable to connect to www.yahoo.com: unknown host: www.yahoo.com (java.net.UnknownHostException: www.yahoo.com)
    I have tried the same with cnn.com as well.
    Could someone tell me what I should do?
    regards,
    Vidhya
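
A java.net.UnknownHostException means that the JVM could not resolve the host name at all, which typically points to DNS or to a missing proxy configuration on the machine that makes the request. A quick check is to run a small lookup on that same machine. The sketch below uses only the standard java.net.InetAddress API. The class name HostLookup and the default host list are illustrative, and running it on the portal server, ideally as the same OS user, is an assumption about where the request originates.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical diagnostic: reports whether host names resolve on this machine.
final class HostLookup {
    static boolean resolves(String host) {
        try {
            InetAddress.getByName(host);
            return true;
        } catch (UnknownHostException e) {
            // Same exception type as in the Component Monitor message above.
            return false;
        }
    }

    public static void main(String[] args) {
        // Hosts can be passed as arguments; otherwise the two sites from the post are used.
        String[] hosts = args.length > 0 ? args : new String[] {"www.yahoo.com", "www.cnn.com"};
        for (String host : hosts) {
            System.out.println(host + " -> " + (resolves(host) ? "resolves" : "unknown host"));
        }
    }
}
```

If the names do not resolve here either, the server most likely has no direct DNS access to external hosts, and the requests would have to go through a proxy configured in the portal's HTTP service.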

  • Web Repository searching - Confusion

    Hi Experts,
    I have a query regarding Web Repositories. I am using EP 2004s.
    I have done the following:
    1. Created a web repository as described in the "How to Create a Web Repository ...." guide.
    2. I have created it for the portal index page and given the page for web services as the start page.
    3. The web repository is working fine.
    4. When I search this repository it displays some indexed HTML pages, which are nothing but parts of the start page itself.
    Now, my understanding of a web repository is that it indexes the links on a website, and when you search it, it displays the documents that are accessible through those links.
    I know it sounds confusing; I am confused as well.
    Can anyone tell me how exactly a web repository should work?
    Note: Helpful answers will be rewarded with points.
    Thanks & Regards,
    Amit Kade

    Hi Amit,
    A Web Repository can work in two ways.
    The first way is to configure one website to index as a data source. You are then able to find the documents/links from that website. You can configure as many websites as you want.
    The second way is accessing documents from a remote server. In this case you have some folders/documents on a remote server. You just create one index and add those folders as data sources (you should have configured a web repository in the portal). You are then able to find the documents from the remote server as well.
    If you want more information or still have a problem, please feel free to ask.
    Regards,
    Chamkaur

  • Crawling Web Repository - Error

    Hi Experts ,
    EP Version - EP 2004s
    I have configured a web repository as per the guide "How to configure a web repository and crawl it for searching ..".
    I have configured this for the portal index page. I can see the folder created under 'root' and one link created in that folder. When I click on that link I can access the portal index page.
    I have created an index for this and crawled it, but after crawling it has indexed only one page. I have tried this with some document iViews (HTML), but unfortunately it still indexes only one page.
    Can anybody tell me what is wrong?
    This is kind of urgent as I am at the customer site.
    Note: Helpful answers will be rewarded with points.
    Thanks & Regards,
    Amit Kade

    Hi Praksh ,
    Thanks a lot for the quick reply . Actually I have already gone through these links .
    To make it simple I have created a simple website containing some html pages and links.
    I have created a web repository and crawled it for indexing, this time with custom index properties like 'IndexContentOfExternalLink' & 'IndexInternalLinks'.
    But to my disappointment it has again indexed only one page, the initial page.
    Any suggestions?
    Thanks in advance .
    Thanks & Regards,
    Amit Kade

  • How to configure a web repository

    Hi All,
    At customer site we have following configuration.
    1. There is one web server and it is connected to 5 document servers (Back-up   servers).
    2. All 5 document servers maintain the same data (HTML documents).
    3. The web server redirects the user to the nearest document server depending on the user ID (normal web server functionality).
    The requirement is to connect this web server to the portal. This can be achieved by configuring a web repository.
    If I configure a web repository ...
    1. How do I pass the user login data to the web server so that it can redirect the user to the nearest document server?
    Thanks & Regards,
    Amit kade


  • PREPROCESSOR_ACTIVITY_ERROR-6401 for web repository

    Hello,
    We have web repositories which consist of HTTP URLs as well as documents. The links appear properly in CM, without errors in the Component Monitor. But while indexing we get a preprocessor error: "preparation failed HTTP status code 401 Unauthorised".
    The start page and system path for the website is a folder path on the web server, e.g. \parent folder\folder1. Is this because the index_service user does not have access to this path?
    But if I create a website for one of the individual links in this folder, i.e. \parent folder\folder1\doc1, then the document is indexed properly. This is the additional error code in the preprocessor log:
    *returning PREPROCESSOR_ACTIVITY_ERROR (Code 6401).
    The index_service user is also not locked.
    EP 7 SP13, Trex 7.0
    Any ideas would be very much welcome.
    Rgds

    Hello,
    Is it possible to integrate SharePoint links only by using a WebDAV repository rather than a web repository?
    The links I am having a problem with are all SharePoint 2007 links.
    I even tried a WebDAV repository, but I get the following error in CM after creating the HTTP system and the WebDAV RM:
    +Authorization Required+
    *The repository you are attempting to access requires specific permissions that you did not provide. Make sure you provide appropriate access information for this repository*.
    The specified user also has the necessary rights, and I can access the same link from the browser. What could be the problem?
    Hope someone can comment on this
    Rgds

  • Searching TREX Index through Web Dynpro

    Hi experts,
    I'm trying to search through my TREX indexes with Web Dynpro. I have found some sources on this website and they seem to work, but when I test my Application, the table I want to fill stays empty.
    The strange thing is that the "getNumberResultKeys()" variable is returning a correct value, but that the "ISearchResultList" stays empty.
    I'm a newbie in Web Dynpro, so any help will be appreciated!
    I wrote the following code:
         try {
              // Map the logged-in UME user to the KM user required by the ResourceContext.
              com.sap.security.api.IUser nwUser = UMFactory.getAuthenticator().getLoggedInUser();
              com.sapportals.portal.security.usermanagement.IUser user;
              user = WPUMFactory.getUserFactory().getEP5User(nwUser);
              ResourceContext resourceContext = new ResourceContext(user);
              // Look up the index management service and a federated search instance.
              IIndexService indexService = (IIndexService) ResourceFactory.getInstance().getServiceFactory().getService("IndexmanagementService");
              IFederatedSearch federatedSearch = (IFederatedSearch) indexService.getObjectInstance("federatedSearchInstance");
              List indexlist = indexService.getActiveIndexes();
              // Build the query and search across all active indexes.
              SearchQueryListBuilder sqb = new SearchQueryListBuilder();
              sqb.setSearchTerm("SomeSearchTerm");
              IQueryEntryList qel = sqb.buildSearchQueryList();
              ISearchSession session = federatedSearch.searchWithSession(qel, indexlist, resourceContext);
              ISearchResultList results = session.getSearchResults(1, session.getTotalNumberResultKeys());
              ISearchResultListIterator iter = results.listIterator();
              // Copy every hit into the PracticeData context node.
              // The original post had no braces here, so the loop body was not a valid block.
              while (iter.hasNext()) {
                   ISearchResult result = iter.next();
                   IPrivateDetailView.IPracticeDataElement PracticeData = wdContext.createPracticeDataElement();
                   PracticeData.setContentSnippet(result.getContentSnippet());
                   PracticeData.setResource(result.getResource().toString());
                   wdContext.nodePracticeData().addElement(PracticeData);
              }
         } catch (ResourceException e1) {
              throw new WDRuntimeException(e1);
         } catch (UserManagementException e2) {
              throw new WDRuntimeException(e2);
         } catch (WcmException e3) {
              throw new WDRuntimeException(e3);
         }
    Thanks in advance for your help,
    Edwin

    My context:
    value node - PracticeData
    value attribute - Resource
    value attribute - ContentSnippet
    The results are written to the context here (inside the while statement):
         IPrivateDetailView.IPracticeDataElement PracticeData = wdContext.createPracticeDataElement();
         PracticeData.setContentSnippet(result.getContentSnippet());
         PracticeData.setResource(result.getResource().toString());
         wdContext.nodePracticeData().addElement(PracticeData);
    I created a table with databinding to the value node (and value attributes).

  • Subscription for a web repository

    Hi Experts ,
    I am using EP 2004s , I have created a web repository and activated a subscription service for that.
    Unfortunately I am not able to receive any subscription notification emails, although I do receive subscription emails for the repositories created inside the portal.
    As per my understanding from SAP Help, for a web repository we should receive subscription notification emails once the crawler detects changes.
    I have scheduled the crawler accordingly so that it can detect the changes.
    Can anybody throw some light on the basis on which the crawler decides that a document has been modified?
    Any advice is welcome; points are assured.
    Thanks & Regards,
    Amit Kade

    Hi Amit
    The following snippet is taken from the description of Subscription Event Mapping:
    Subscription Event Mapping
    A subscription event mapping maps events sent by a repository manager onto events that are meaningful with respect to subscriptions to resources...
    As you can see, the Subscription Event Mapping handles events sent by a repository manager, but since no events can be sent from a Web Repository (as far as I know), you are actually not performing any mapping, and thus it does not work.
    I was once told by SAP support that "Send events" does not work for a WebDAV repository unless the change/event was triggered from within the KM framework. Since a web repository isn't more closely integrated into KM, my guess is that a web repository does not support "Send events" triggered from outside the portal either.
    By "custom built" I mean that "Send events" (and thus notification emails) might work if you develop a custom web repository manager that handles a very tight integration between a website and the portal's KM.
    Kind regards,
    Martin

  • Error while crawling web repository

    Hi Experts,
    System in use - EP 2004s
    We have a web server which holds a number of documents. I have created a web repository for this web server. The repository is working fine, but while crawling it indexed about 20000 documents and reported errors for 600 documents.
    The errors are like:
    1. Crawler error
    2. TREX preparation error
    When I search for the indexed pages I get search results, but when I click on the HTML version I get the error message 'No index service found'.
    Any suggestions?
    (Points are assured ..)
    Thanks & Regards,
    Amit Kade

    Hi Tamil,
    Thanks a lot for the help ! But I have already set everything correctly as per the sap help .
    TREX is indexing the documents but as I mentioned not indexing all the documents and I cannot view the searched documents in HTML format.
    One observation: it is not indexing large documents (greater than 10 - 12 KB).
    Any suggestions?
    Thanks & Regards,
    Amit Kade
