Web Repository searching - Confusion

Hi Experts,
I have a query regarding Web Repositories . I am using EP 2004s.
I have done something like this :
1. Created a web repository as given in "How to Create a Web Repository ...." guide.
2. I have created it for the portal index page and given the start page as the page for webservices.
3. Web repository is working fine .
4. When I search this repository it displayes some indexed html pages , which are nothing but some part of the start page itself.
Now , my understanding of web repository is it indexes the links on a website and when you search for it , it will display the documents which are accessable using those links .
I know it sounds so confising , I am also confused .
Can anyone tell me how exactly a web repository should work ?
Note : Helpful answers will be rewarded with points.
Thanks & Regards,
Amit Kade

Hi Amit,
Web Repository is working as two ways.
The one way is you just configure one website to index as data source. Then you are able to find the documents/links from that website. You can configure websites as many as you want.
The second way accessing the documents from remote server. So in this case you have some folders/documents on remote server. You just create one index and add those folders as data sources (You should have configured web repository in a portal). Now you are able to find the docuents from remote server also.
If you want more information or still have aproblem, please feel free to ask.
Regards,
Chamkaur

Similar Messages

  • Web Repository - Search From Here option

    Hi,
    I've created several repositories along with indexes. Everything is working fine but 1 or 2 of the web repositories do not have the "Search From Here" option when I go to KMContent iView and  click the tag to the right of the repository.  As far as I can tell the properties are the same as the other "working" repositories. I know they were all set up the same way. Any ideas?
    Thanks for any help!

    Hi Brent,
    web repositories are treatet in navigation as single
    resources. 
    'Search from Here' is not offered on a single resource, as it is not reasonable to search on only one document.
    Maybe SAP has to improve this for web repositories.
    As a workaroubnd you can place the web repository in a folder and assign this folder to the index. Then search from here is possible.
    Regards Matthias Röbig-Landau
    PS: Please think of validating my answer.

  • Indexing and searching on a web repository -- No document excerpt available

    Hi everybody,
    I created a web repository with content of our intranet. I also created a index for this web repository. Everything seems to work fine. But when I search on the index no document excerpt is shown. It says: "No document excerpt available" under each search result.
    I think no full text index is performed, where and how can I tell that I want a full text index?
    Thank you in advance, Christoph

    Hi,
    Have you solved this problem ?
    Best Regards,
    Fabien

  • Newly created Web repository not showing up in explorer

    I'm trying to create an index which would finally enable me to search the 'CNN' website (following the 'how to' document : 'HOW TO SET UP A WEB REPOSITORY AND CRAWLING IT FOR INDEXING'). I'm unable to assign the web repository which I  created in an earlier step to the index because it doesn't show up in the repository/folder listing. I can't even see it under 'Content Administration -> KM Content -> Repositories'.
    What exactly am I doing wrong?!
    thanks,
    Biju.

    Hi Karsten,
    - We're on EP6 SP2.
    - Yes, I'm referring to the document you mentioned.
    - Yes, the repository does show up with a green light under KM -> Component Monitor.
    BTW, I was able to bring up the web repository now that I've specified under 'System Admin ® System Config ® Service Config ® Applications ® com.sap.portal.ivs.httpservice ® Services ® proxy ® HTTP-Bypass Proxy Servers' that everything under 'mycompany.com' must be bypassed. I'd originally specified the proxy set up in the TREX configuration, which runs on another (physical) server than the portal.
    I don't really get the connection, but essentially took the cue from the earlier reply I got.
    So, in short, I think the problem is solved at least for the moment.
    Thanks again for your help.

  • Error while crawling web repository

    Hi Experts,
    System in use - EP 2004s
    We have a web server which has number of documents on that server. I have created a web repository for this web server . Repository is working fine , but while crawling it has indexed about 20000  documents and given some errors for 600 documents.
    Errors are like :
    1. Crawler error
    2. TREX preparation error
    and when I search for the indexed pages it gives search result but when I click on html version it gives an error message 'No index service found'.
    Any suggessions !
    (Points are assured ..)
    Thanks & Regards,
    Amit Kade

    Hi Tamil,
    Thanks a lot for the help ! But I have already set everything correctly as per the sap help .
    TREX is indexing the documents but as I mentioned not indexing all the documents and I cannot view the searched documents in HTML format.
    One observation : It is not indexing large documents (Greater than 10 - 12 KB).
    Any suggessions !
    Thanks & Regards,
    Amit Kade

  • How to see properties in details dialogue in a web repository

    Hi everyone
    I have set up a web repository that uses a number of web property extractors to extract content from meta tags into predefined properties. The funny part is that I can use these predefined properties in my search scenario, but I'm not able to see the properties in my details > properties dialogue. Is there any special configuration that needs to be done in order to see these properties when one is using a Web Repository Manager?
    I have configured a property group in "Property Structures"-configuration that contains all predefined properties and added this group to "all_groups". But still I dont see the predefined properties even though there must be something in them when I can use them in my search. And just to specify: I cannot see the predefined properties at all - its not just the content that is missing.
    Any help will be rewarded!
    Best regards,
    Martin Søgaard

    hi sap2008,
    abaper only can help u abt this
    refer this
    http://abaplog.wordpress.com/2007/07/23/displaying-sap-error-messages-in-a-nice-way/
    kaustubh

  • Web Repository Manager and robots.txt

    Hello,
    I would like to search an intranet site and therefore set up a crawler according to the guide "How to set up a Web Repository and Crawl It for Indexing".
    Everything works fine.
    Now this web site uses a robots.txt as follows:
    <i>User-agent: googlebot
    Disallow: /folder_a/folder_b/
    User-agent: *
    Disallow: /</i>
    So obviously, only google is allowed to crawl (parts of) that web site.
    My question: If I'd like to add the TRex crawler to the robots.txt what's the name of the "User-agent" I have to specify here?
    Maybe the name I defined in the SystemConfiguration > ... > Global Services > Crawler Parameters > Index Management Crawler?
    Thanks in advance,
    Stefan

    Hi Stefan,
    I'm sorry but this is hard coded. I found it in the class : com.sapportals.wcm.repository.manager.web.cache.WebCache
    private HttpRequest createRequest(IResourceContext context, IUriReference ref)
            HttpRequest request = new HttpRequest(ref);
            String userAgent = "SAP-KM/WebRepository 1.2";
            if(sessionWatcher != null)
                String ua = sessionWatcher.getUserAgent();
                if(ua != null)
                    userAgent = ua;
            request.setHeader("User-Agent", userAgent);
            Locale locale = context.getLocale();
            if(locale != null)
                request.setHeader("Accept-Language", locale.getLanguage());
            return request;
    So recompile the component or changing the filter... I would prefer to change the roberts.txt
    hope this helps,
    Axel

  • Web Repository - how can i delete the cache ?

    Hi,
    we are using a webrepository for searching with TREX in our intranet.
    So far so good. Now we switched our Intranet from a Lotus Notes App to Opentext Websolutions,  but the URL remains the same (just IP change).
    When i now reindexed the webrepository it still shows the pages it crawled when it was a Lotus Notes App.
    When i click in the web repository folder on the entry with the Intranet name, it opens a new window with the new Intranet.
    A clearing of the Caches under System Administration > Monitoring > Knowledge Management > Cache-Monitor
    did not change anything.
    Anything else what I can do ?
    Regards,
    Kai

    By default, when you type in the location bar it matches your bookmarks and your history. You can change that in preferences to match only bookmarks, only history, or not provide any matches at all. This article explains how: [https://support.mozilla.com/en-US/kb/Location%20bar%20autocomplete#os=linux&browser=fx35 Location bar autocomplete | How to | Firefox Help].
    Does that let you make it work the way you want?
    If you meant the Google search box, that's somewhere else...

  • Crawling web repository

    Hello,
                 I have configures a simple web repository and i am able to search on this repository. For crawling this repository i first used standard crawler but it was crawling only first page, then i created one new crawler in which i explicitly specified the depth to crawl the repository but it doesn't work. Please suggest how to crawl through all the pages.
    Note our wer repository is local intranet and there's no restriction to crawl through it.

    hi,
    go through these links.. this will help you..
    http://help.sap.com/bp_epv260/EP_JA/documentation/How-to_Guides/27_WebRepEn.pdf
    http://help.sap.com/saphelp_nw04s/helpdata/en/46/5d5040b48a6913e10000000a1550b0/content.htm
    http://download.microsoft.com/download/e/c/c/ecc8b7a5-ddea-4f4e-a6c6-
    5e96dc1a2908/06_Application%20Interop%200%20-%20MOSS%20and%
    20Biztalk.pdf
    http://blogs.msdn.com/saptech/archive/2007/02/15/enterprise-search-combining-sap-and-microsoft-realtime-crawling-results.aspx
    reward me points..

  • Crawling Web Repository - Error

    Hi Experts ,
    EP Version - EP 2004s
    I have configured a web repository as per the guide "How to configure a web repositiry and crawl it for searching ..".
    I have configured this for portal index page. I can see the folder created under 'root' and one link created in that folder . When I click on that link I can access the portal index page.
    I have created an index for this and crawled but after crawling it has indexed only one page . I have tried this with some document iViews (HTML).But unfortunately it is indexing only one page.
    Can anybody tell me what is wrong !
    This is kind of urgent as I am at the customer site.
    Note: Helpful answers will be rewarded with points.
    Thanks & Regards,
    Amit Kade

    Hi Praksh ,
    Thanks a lot for the quick reply . Actually I have already gone through these links .
    To make it simple I have created a simple website containing some html pages and links.
    I have created a web repository and crawl it  for indexing , this time with custome properties of index like 'IndexContentOfExternalLink' & 'IndexInternalLinks'.
    But to my disappointment it has again indexed only one page that is initial page.
    Any suggessions ?
    Thanks in advance .
    Thanks & Regards,
    Amit Kade

  • HTML property extractor for web repository

    Hi All,
    I was just wondering has anyone worked on this ...
    We are using EP 2004s and for one of our web server we have created a web repository and now we want to use 'html property extractor" to extract the values of META tag of the html documents.
    This will help us in filtering the search result , so we have followed following steps :
    1. Created a html property extractor for the meta tags (META all =<meta tag name>).
    2. Assigned this extractor to the web repository.
    3. Crawled the website .
    After this we tried to search for the documents using meta tag values but not able to find any document.
    We have even tried to filter the result by adding this custom property in 'custom properties' , but this also didn't work.
    Is there anything we are missing or this has to work in some other way ?
    Note : We have tried a html extractor with <title> tag and extracted it successfully .
    Useful answers will be rewarded points !
    Thanks & Regards,
    Amit Kade

    Hi Amit
    I'm working on a solution using Web Property Extractors as well, and would like to know if you managed to find a solution to your problem.
    Kind regards,
    Martin Søgaard

  • How to change the default operators in sap web ui Search screen?

    How to change the default operators in sap web ui Search screen?
    For eg. Using advance search option , I have some fields with default operators like equals, contains,is between, is less than and is greater than. I don't need all these operators for this field.
    I need only "equals" operator. How do i remove the rest of the operators?

    There is a view cluster crmvc_dq where all the standard setting is present related to you r issue. Please try if you can modify that, that way you will avoid the code.
    Incase you are not able to make any changes there then in that case you have to redefine the method GET_DQUERY_DEFINITION () of the IMPL class to delete the operators for a particular serach field.
    Regards,
    Harshit

  • SAP Web Repository, how can acess it from outside

    I'm reading this http://help.sap.com/saphelp_sm40/helpdata/en/14/030fc5b63f11d5993900508b6b8b11/content.htm
    and it says this:
    "You use a Web repository (manager) to provide read access to documents stored on remote Web servers."
    This could be really useful to me, but how does this work? I'm able to add objects to it using the transaction SMW0 but how can I get a valid url so that the users in my network can acess these files.
    the only FM I know is 'DP_PUBLISH_WWW_URL' but this generates a kind of links that my browser can't process (something like SAPR3://WebRepository/0123456789/ZMY_FILE?Version=00001")
    How can I do to create valid links to these documents, so that I can open them im my web browser??

    though they have the same name, they are different! The SAP library you mention is about Knowledge Management.
    If you want to generate an external URL for the Web Repository (SMW0), first read Note 865853 - WebReporting/WebRFC obsolete as of NW2004s. I don't know your SAP release, but if you are on a 7.0, you need to "release for internet" WWW_GET_MIME_OBJECT function module via SMW0, and probably activate /sap/bc/webrfc service using SICF transaction. Then you'll be able to access your ZZZZZ web document using URL like this: http://youserver:port/sap/bc/webrfc/!?_function=WWW_GET_MIME_OBJECT&_object_id=ZZZZZ&client=220&language=EN
    Note: DP is only for allowing the HTML browser (and other SAPGUI controls) to access objects transmitted from SAP to the SAPGUI using the Data Provider service.
    Edited by: Sandra Rossi on Jun 30, 2010 10:45 PM CET

  • Error while generating Static web pages hierarchy for Web Crawler searching

    Hi All,
    When we are trying to generate Static web pages for Web Crawler searching for our B2C application it is resulting in an error :
    u201CUnable to initiate generation of static pages; check logsu201C
    After tracing the log files we found out the below detailed information:
    application [catalogtool] Error in method getResourceAsStream(WEB-INF/cfg/catalog-site-config.xml). The error is: com.sap.engine.services.servlets_jsp.server.exceptions.WebMalformedURLException: A resource path must begin with [/]. The error occurred in [].
    at com.sap.engine.services.servlets_jsp.server.runtime.context.ServletContextImpl.getResource(ServletContextImpl.java:452)
    at com.sap.engine.services.servlets_jsp.server.runtime.context.ServletContextImpl.getResourceAsStream(ServletContextImpl.java:481)
    at com.sap.isa.core.ActionServlet.getResourceAsStream(ActionServlet.java:297)
    at com.sap.isa.catalog.impl.CatalogSite.loadConfigFile(CatalogSite.java:666)
    at com.sap.isa.catalog.impl.CatalogSite.initBackendObject(CatalogSite.java:209)
    at com.sap.isa.core.eai.BackendObjectManagerImpl.createBackendBusinessObject(BackendObjectManagerImpl.java:190)
    at com.sap.isa.catalog.CatalogFactory.getCatalogSite(CatalogFactory.java:261)
    at com.sap.isa.catalog.webcatalog.WebCatInfo.init(WebCatInfo.java:237)
    at com.sap.isa.businessobject.webcatalog.CatalogBusinessObjectManager.createCatalog(CatalogBusinessObjectManager.java:103)
    at com.sap.isa.catadmin.actions.WebCrawlerAction.doPerform(WebCrawlerAction.java:93)
    at com.sap.isa.core.BaseAction.execute(BaseAction.java:212)
    at org.apache.struts.action.RequestProcessor.processActionPerform(RequestProcessor.java:484)
    at com.sap.isa.core.RequestProcessor.processActionPerform(RequestProcessor.java:683)
    at org.apache.struts.action.RequestProcessor.process(RequestProcessor.java:274)
    at com.sap.isa.core.RequestProcessor.process(RequestProcessor.java:400)
    at org.apache.struts.action.ActionServlet.process(ActionServlet.java:1482)
    at com.sap.isa.core.ActionServlet.process(ActionServlet.java:243)
    at org.apache.struts.action.ActionServlet.doPost(ActionServlet.java:525)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:760)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:853)
    at com.sap.engine.services.servlets_jsp.server.runtime.RequestDispatcherImpl.doWork(RequestDispatcherImpl.java:321)
    at com.sap.engine.services.servlets_jsp.server.runtime.RequestDispatcherImpl.forward(RequestDispatcherImpl.java:377)
    at org.apache.struts.action.RequestProcessor.doForward(RequestProcessor.java:1069)
    at org.apache.struts.action.RequestProcessor.processForwardConfig(RequestProcessor.java:455)
    at com.sap.isa.core.RequestProcessor.processForwardConfig(RequestProcessor.java:276)
    at org.apache.struts.action.RequestProcessor.process(RequestProcessor.java:279)
    at com.sap.isa.core.RequestProcessor.process(RequestProcessor.java:400)
    at org.apache.struts.action.ActionServlet.process(ActionServlet.java:1482)
    at com.sap.isa.core.ActionServlet.process(ActionServlet.java:243)
    at org.apache.struts.action.ActionServlet.doPost(ActionServlet.java:525)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:760)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:853)
    at com.sap.engine.services.servlets_jsp.server.HttpHandlerImpl.runServlet(HttpHandlerImpl.java:401)
    at com.sap.engine.services.servlets_jsp.server.HttpHandlerImpl.handleRequest(HttpHandlerImpl.java:266)
    at com.sap.engine.services.httpserver.server.RequestAnalizer.startServlet(RequestAnalizer.java:387)
    at com.sap.engine.services.httpserver.server.RequestAnalizer.startServlet(RequestAnalizer.java:365)
    at com.sap.engine.services.httpserver.server.RequestAnalizer.invokeWebContainer(RequestAnalizer.java:944)
    at com.sap.engine.services.httpserver.server.RequestAnalizer.handle(RequestAnalizer.java:266)
    at com.sap.engine.services.httpserver.server.Client.handle(Client.java:95)
    at com.sap.engine.services.httpserver.server.Processor.request(Processor.java:175)
    at com.sap.engine.core.service630.context.cluster.session.ApplicationSessionMessageListener.process(ApplicationSessionMessageListener.java:33)
    at com.sap.engine.core.cluster.impl6.session.MessageRunner.run(MessageRunner.java:41)
    at com.sap.engine.core.thread.impl3.ActionObject.run(ActionObject.java:37)
    at java.security.AccessController.doPrivileged(Native Method)
    at com.sap.engine.core.thread.impl3.SingleThread.execute(SingleThread.java:100)
    at com.sap.engine.core.thread.impl3.SingleThread.run(SingleThread.java:170)
    Can anyone assist as to what is getting wrong?
    I have a doubt in specifying the parameters "dumpFolder" & "templateFolder" in catalogtool XCM settings, We need to pass full path of the server directory here. please suggest what exactly has to be passed here.
    Currently i am passing values like this "/catalog/dump"& "/catalog/templates".
    We are running on CRM-7.0
    Thanks in advance.
    Pnakaj.

    Any Updates friends??

  • Index and crawler not working on Web Repository

    Hi Team,
    I'm trying to setup a Web Repository and crawling it for indexing. I've followed the steps from a SAP "how-To" document, but I guess the problem might be the way I'm confuring the web site in EP. I've created a Virtual Directory on my laptop's IIS 5.0 web server and the URL of the web site has been set as http://laptop-ashishk/myWebSite.
    Do I need to set the START PAGE as /index.html (as per the spec it says it's not mandatory)...
    Let me know whether you need any information with regards to this problem.
    Ashish

    They've set:
    meta name="viewport" content="initial-scale=2.3, user-scalable=no"
    It's the user-scalable that's the problem. Apple considers the default (per their web coding rules at http://developer.apple.com/iphone/designingcontent.html to be yes.
    I've noticed the same thing.
    Aym

Maybe you are looking for

  • The document "Domain.Sites2" could not be opened

    I upgraded to iWeb '08 last week, and it has been working OK since then. My '06 sites were transfered to '08 sites - everything ok. Until last night, when I tried to open iWeb '08 again... I got this error message: [img]www.av123.nl/Misc/apple/domain

  • Displays VGA or HDMI

    I have just been advised that I should use the VGA connection over the DVI - HDMI route for the best quality viewing of viewing my FCP footage from my macpro to an HDTV monitor. Is this correct? I thought that HDMI offfered the best quality pictures

  • 3L4 to 3M0

    Hi, Has anyone encountered any problems with migrating their application from 3L4 to 3M0 ? When we upgraded our development machines, our application would no longer run, complaining that it was not able to connect to the database or create our servi

  • RDA implemented scnarios

    Hi All, I want to know where we can use the RDA ( Real Time Data Acquizition) data flow. Its is supposed to be used for enabling the user to take teh tacticale decisions. I want the scnarios where tacticals decisions are needed from BW reports. I am

  • Le mode bureau étendu n'apparaît avec airplay et appletv sous moutain lion. Existe-t'il une solution pour retrouver le mode bureau étendu avec airplay et apple tv sous mountain lion ?

    Le mode bureau étendu dans les préférences moniteur a disparu sous Mountain Lion. Seul Airplay apparaît et le mode bureau étendu ne figure pas (il n'y a que le mode recopie vidéo AirPlay). Par conséquent, sous AppleTv 2 on ne peut pas afficher un dia