URL Boundary Rules/Include Domain

Hi,
I would like to restrict Ultra Search to get result only within the URL http://www.myWeb.com. So I set the starting address to http://www.myWeb.com, and I added a Domain rule to Include myWeb.com. This should limits the Ultra Search crawler to hosts belonging to myWeb.com.
But the results page lists a lot of other links such as crri.rutgers.edu/post/03/photos1.htm...
In the log file, these links seem to be rejected "WKG-30027: Not allowed URL: ...", but nevertheless processed !!
What am I doing wrong ?
Thanks for help.

Hi Dominique & Chris,
I'm having the similar problem - trying to limit the crawler to a specific website. I did set the crawler's source to www.x.com and set the URL Boundary Rule to Include host x.com. The crawler is only caching the first page and nothing else. If I dont include any rule it goes out the x.com website. I cant use depth cos' the leaf pages are at variable depth. Any suggestions??
thx!
ornov

Similar Messages

  • URL Boundary Rules and the minus sign "-"

    Hi!
    I need to set a Boundary Rule which contains one or more minus signs "-" in the hostname/domain, but Ultra Search doesn't accept such a rule and displays "Hostname can not contain multibyte characters.
    I've searched the net for hours now, but haven't found a single explanation, hint or fix for this problem.
    All help is welcome,
    Karsten

    Hi Dominique & Chris,
    I'm having the similar problem - trying to limit the crawler to a specific website. I did set the crawler's source to www.x.com and set the URL Boundary Rule to Include host x.com. The crawler is only caching the first page and nothing else. If I dont include any rule it goes out the x.com website. I cant use depth cos' the leaf pages are at variable depth. Any suggestions??
    thx!
    ornov

  • URL Boundary Rules

    Hi,
    I'm trying to do the following. I would like to index the site "www.mysite.com/topic". But I don't want to index the root site "www.mysite.com" itself, links which point to this site from my /topic pages should not be indexed.
    I did the following configuration Ultrasearch 9i:
    - create a source on "http://www.mysite.com/topic"
    - create a domain rule : include www.mysite.com
    - create a path rule : include /topic
    This configuration does index the link which point to www.mysite.com.
    How should I configure the URL Boundary to achieve my goal?
    Do you have here some input ?
    Best regards
    Dominique

    Hi Dominique & Chris,
    I'm having the similar problem - trying to limit the crawler to a specific website. I did set the crawler's source to www.x.com and set the URL Boundary Rule to Include host x.com. The crawler is only caching the first page and nothing else. If I dont include any rule it goes out the x.com website. I cant use depth cos' the leaf pages are at variable depth. Any suggestions??
    thx!
    ornov

  • Sieve Rule on domain

    Hello,
    I want to implement some sieve rules on domain basis.
    The attribute for that is mailDomainSieveRuleSource but when i put a simple sieve rule in it, it does not work!
    The same rule in mailSieveRuleSource in user entry work fine.
    What have i missed?
    The rule is:
    #RULE: $Name="Test" $Order=1 $Type="DEFAULT_TYPE" require ["fileinto"]; #BEGINFILTER if allof ( header :matches ["From","Sender","Resent-from","Resent-sender","Return-path"] "testtest*" ){         fileinto "Spam";                 } #ENDFILTER
    Messaging server version:
    Sun Java(tm) System Messaging Server 7.0-3.01 64bit (built Dec 9 2008)
    libimta.so 7.0-3.01 64bit (built 09:57:35, Dec 9 2008)
    Using /opt/sun/comms/messaging64/config/imta.cnf
    SunOS sunpp2 5.10 Generic_141414-09 sun4v sparc SUNW,Sun-Fire-T200
    Regards
    Daniel

    Joe, your HRs look ok for me in FF 3.6.  But your text is overlapping big time. See screenshot.
    You have this in your CSS:
    .copy {
        float: left;
        height: 238px;
        width: 350px;
        margin-left: 1.5em;
    Height is a restriction in all but IE browsers.  You should remove height limitations and I think your float should be right; not left.
    .copy {
        float: right;
        width: 350px;
        margin-left: 1.5em;
    Nancy O.
    Alt-Web Design & Publishing
    Web | Graphics | Print | Media  Specialists
    http://alt-web.com/
    http://twitter.com/altweb
    http://alt-web.blogspot.com

  • URL Rewrite Rule: HTTP to HTTPS Not Working

    Here is my problem. I cannot get HTTP to HTTPS redirect to work using the URL Rewrite module. I am using version 2, by the way, which I understand is the latest version. I've also enabled the "Proxy Server" and Application Request Routing"
    features.
    I've trolled through the Internet for 2 days now for solutions to my problem, including the ones provided by the TechNet forums and by MVP Scott Forsyth. I've tried over 30 solutions, and none have resolved my issue. Granted some of the solutions I've tried
    may have been repeats of others. After trying so many, I have had a hard time discerning the differences. Does it really matter, for instance if the pattern match for the {HTTPS} input is specified as
    "^off$" or just "off"? I see this discrepancy (among others) everywhere. It seems like there are so many variations out there. It is quite confusing for the uninitiated and the newbies like myself to the
    URL Rewrite technology. I have enabled and consulted my "Failed Request Tracing Rules" logs but cannot make any sense of the cryptic information it is providing.
    I just cannot get my reverse proxy server to redirect http requests from the Internet to https to my internal web server. When a user specifies "https://server1.xxxxx.com". he is able to access the internal server via the reverse proxy (IIS) server.
    But when he specifies "http://server1.xxxxx.com" he receives "Server Error 500 - Internal server error.There is a problem with the resource you are looking for, and it cannot be displayed."
    At this point I am at my wits end, and am even considering configuring the protocol translation on our firewall. But this not my preference as it presents another layer of complexity, and device to troubleshoot later on. Below I've pasted my entire web.config
    file with my most current version of the rule in question (in bold) for your review. I started to paste just the specific rule, but thought you might need to see the entire file in case something else may be conflicting with the rule.
    Thank you in advance for your help.
    Dave Robinson
    <?xml version="1.0" encoding="UTF-8"?>
    <configuration>
        <system.webServer>
            <rewrite>
                  <rules>
                    <clear />
    <rule name="HTTP to HTTPS Redirect" enabled="true" stopProcessing="true">
                    <match url="(.*)" />
                   <conditions>
                      <add input="{HTTPS}" pattern="off" />
                    </conditions>
                      <action type="Redirect" url="https://{HTTP_HOST}/{REQUEST_URI}" redirectType="Found" />
                    </rule>
    <rule name="ReverseProxyInboundRule1" stopProcessing="true">
                        <match url="(.*)" />
                        <conditions logicalGrouping="MatchAll" trackAllCaptures="false">
                            <add input="{CACHE_URL}" pattern="^(https?)://" />
                        </conditions>
                        <serverVariables>
                            <set name="HTTP_X_ORIGINAL_ACCEPT_ENCODING" value="{HTTP_ACCEPT_ENCODING}" />
                            <set name="HTTP_ACCEPT_ENCODING" value="" />
                    </serverVariables>
                        <action type="Rewrite" url="{C:1}://server1.xxxxx.com/{R:1}" />
                    </rule>
                </rules>
                <outboundRules>
                    <rule name="RestoreAcceptEncoding" preCondition="NeedsRestoringAcceptEncoding">
                    <match serverVariable="HTTP_ACCEPT_ENCODING" pattern="^(.*)" />
                    <action type="Rewrite" value="{HTTP_X_ORIGINAL_ACCEPT_ENCODING}" />
                    </rule>
                    <rule name="ReverseProxyOutboundRule1" preCondition="ResponseIsHtml1">
                        <match filterByTags="A, Form, Img" pattern="^http(s)?://server1.xxxxx.com/(.*)" />
                        <action type="Rewrite" value="http{R:1}://server1.xxxxx.local/{R:2}" />
                    </rule>
                    <preConditions>
                        <preCondition name="ResponseIsHtml1">
                            <add input="{RESPONSE_CONTENT_TYPE}" pattern="^text/html" />
                        </preCondition>
                        <preCondition name="NeedsRestoringAcceptEncoding">
                            <add input="{HTTP_X_ORIGINAL_ACCEPT_ENCODING}" pattern=".+" />
                        </preCondition>
                    </preConditions>
                </outboundRules>
            </rewrite>
            <tracing>
                <traceFailedRequests>
                    <add path="*">
                        <traceAreas>
                            <add provider="ASP" verbosity="Verbose" />
                            <add provider="ISAPI Extension" verbosity="Verbose" />
                            <add provider="WWW Server" areas="Authentication,Security,Filter,StaticFile,CGI,Compression,Cache,RequestNotifications,Module,FastCGI,WebSocket"
    verbosity="Verbose" />
                        </traceAreas>
    <failureDefinitions timeTaken="00:00:00" statusCodes="404" />
                    </add>
                </traceFailedRequests>
            </tracing>
            <httpErrors errorMode="Custom" />
            <httpRedirect enabled="false" destination="" exactDestination="false" childOnly="false" httpResponseStatus="Permanent" />
        </system.webServer>
    </configuration>

    On Wed, 28 Jan 2015 17:53:41 +0000, dwrobins2000 wrote:
    Here is my problem. I cannot get HTTP to HTTPS redirect to work using the URL Rewrite module. I am using version 2, by the way, which I understand is the latest version. I've also enabled the "Proxy Server" and Application Request Routing"
    features.
    Web/IIS related issues are better posted where the IIS experts are:
    http://forums.iis.net
    Paul Adare - FIM CM MVP
    "The equivalent of treating dandruff by decapitation"
    -- Frank Zappa on the Parents Music Resource Center' censorship plans

  • Running Windows 7, all versions of Firefox beyond 3.6.0 will not access websites, showing Error 404 and displaying only the portion of the URL that follows the domain name

    Choosing any of several dozen bookmarks, Firefox will show a screen reporting Error 404 and display only part of the complete URL (whatever follows the slant mark beyond the domain name). When going to a new site, such as provided by a Google search, the initial page may show up but clicking on links within that page again brings up the 404 error. My only solution has been to uninstall whatever new version of Firefox has been automatically downloaded and then re-install version 3.6.0, which works.

    A possible cause is security software (firewall) that blocks or restricts Firefox without informing you about that, possibly after detecting changes (update) to the Firefox program.
    Remove all rules for Firefox from the permissions list in the firewall and let your firewall ask again for permission to get full unrestricted access to internet for Firefox.
    See [[Server not found]] and [[Firewalls]] and http://kb.mozillazine.org/Firewalls
    See also http://kb.mozillazine.org/Error_loading_websites

  • Add a URL guessing for custom domain.

    When I select text on webpage which contains an url, after right mouse button click on text popup menu appears.
    In that menu I have options something like "Open link" (or "Open URL", I have a Russian language in Firefox), "Open link in new tab" and "Open link in new window".
    It works fine, but I want to customize guessing mechanism - I want to add a .i2p domain into it.
    Please help me find the way to solve this issue.

    Hi,
    </p>
    How you have created that button?? You can also create a link text instead of button to do the same URL redirect.
    For example.
    Try putting this code in Region Footer, or Region Header
    </p>
    '<'
    a href="#" onclick="javascript:redirect('f?p=&APP_ID.:1:&APP_SESSION.::NO:::');">Take Me Home'</a>
    </p>On click of "Take me Home” it will redirect to the page 1. You can customize it as per your need.
    </p>
    Regards,</p>
    Manish

  • URL Rewrite Rules in 10.1.4

    Hi,
    I am having some trouble rewriting the urls, I can't make it work ...
    What I want is to change the url:
    http://www.mycompany.com/portal/page/portal/portal_sverige/
    to
    http://www.mycompany.com/sverige
    Where "portal_sverige" is the page group.
    What I did was:
    Configure the properties in the page group:
    Search pattern: /portal/page/portal/portal_sweden/
    Replace pattern: /sweden/
    And I went to enterprise manager, MT, http server, administration, advanced server properties, httpd.conf, and put in the lines:
    RewriteEngine On
    RewriteRule ^/sweden/(.*)$/portal/page/portal/portal_sweden/$1 [PT]
    However, when I try to use navigator to edit a page in that group, I get:
    http://www.mycompany.com/sweden/pagename?_mode=16
    Returns:
    Object not found. (WWC-50003)
    And just typing in www.mycompany.com/sweden returns 404 error instead of the root page as I would have liked.
    Any ideas about what to do? I can't really find any information ... is the change made in the MT ? IM ? Do I have to do it in some specific order? Is the syntax wrong?
    Appreciate any help, thanks

    Hello Sayeeas,
    I posted an answer to this question a couple months back... check out the thread
    Re: Portal URL Rewrite
    Here is the answer:
    In this example the name of the page group is "www":
    Our rewrite rule is also set up in a virtual host section of the httpd.conf file as well so some of the options may not be required for your set up:
    RewriteEngine on
    RewriteOptions inherit
    RewriteCond %{REQUEST_URI} !^/(portal|pls|images).*$
    RewriteRule ^(.+) /portal/page/portal/www$1 [PT]
    The rewrite condition allows us to ignore the rewrite for certain items in the path. This allows the portal navigator to work. There may be a couple more items that need to be added to the condition because I recently noticed that the help pages dont function properly.
    In the portal configuration for the rewrite rule we specify:
    Search pattern: /portal/page/portal/www
    Replace pattern:
    Thats right, the replace pattern is blank.
    To see it in action check out http://www.sagelogix.com
    -sean
    Solutions Architect
    http://www.sagelogix.com

  • URL field shows base domain only

    In Safari 6.0.3, OS 10.8.5, the URL field only shows base domain, not the full URL.
    How can I restore the normal information, so it will show me the complete URL?

    Not seeing that... Aha! Issue has resolved itself with no changes from me! Must be intermittent.

  • URL Content Rule with SSL

    Hi
    I have two different ssl services on the same servers. One service is published with standard 443 port and 444 port.
    I would like to balance those services with two different contents. They should have the same VIP address and the standard ssl port (443 tcp). Then, the difference between both contents would be the url.
    content 1
    url "//myserver.com/APL1/*"
    vip address 192.168.1.1
    port 443
    add service service1_443
    add service service2_443
    content 2
    url "//myserver.com/APL2/*"
    vip address 192.168.1.1
    port 443
    add service service1_444
    add service service2_444
    I've tried it but it doesn't work.
    Have I done anything wrong?
    Regards

    the main purpose of SSL is security.
    So, what is security ?
    Securit means you don't want other person/devices to see the content of your traffic.
    If nobody can see the content, this includes the CSS.
    So, the CSS is unable to see the URL which is part of the content.
    Your solution can't work.
    Unless you install an ssl module with the key of the server so the module can decrypt the traffic.
    Regards,
    Gilles.

  • Send portal screen parameters to one absolute url in a different domain

    Hi Experts,
    We have a requirement wherein we have to send some screen parameters of the portal application to one url which is in a different domain.
    We are currently using Iframe UI element and binding the source property of the UI element to a string attribute wherein we have set the url and the parameters.
    The portal application is a webdynpro java application. The only requirement is to send the screen parameters to one particular url and nothing else. We do not want to create any html files and moreover we also do not want to make Iframe ui element visible in our webdynpro application.
    Can you please suggest how can I acheive this?
    Thanks & Regards,
    Anurag

    you can add the portal parameters to the target application URL
    target app : http://some.com/application.jsp&param1=myvalue&param2=value
    read URL parameters in Dynpro:
    WDWebContextAdapter.getWebContextAdapter().getRequestParameter("param1");
    Lemme know your requirement passing to URL or getting portal params
    Regards,
    Nikhil

  • Rewrite Rules with Domain Alias

    Version:
    iPlanet Messaging Server 5.2 HotFix 2.03 (built Nov 22 2004)
    libimta.so 5.2 HotFix 2.03 (built 13:20:08, Nov 22 2004)
    I am hosting 1 domain with an domain alias for the same hosted domain.
    I would like all mail coming in as domain A to be rewritten as domain B. Same for all outgoing mail outgoing as domain A to rewritten as domain B
    What should my rewrite rules look like?
    domain.a $U%domain.b

    That makes senses. I am using mailequivalentaddress with a domain alias. I also have the following rule in my imta.cnf:
    domain.A $R$U%domain.B
    domain.A $U%domain.B
    There rewrites all of the mailheaders but I noticed the the envelope headers are not rewritten. Is it a good idea to rewrite the envelope headers although the domain beinging rewritten to is an alias? The mail headers on the to: and from: address are rewritten as domain.B but envelope headers show [email protected] for delivery.

  • Does the "The Project Owner has the same RBS value as the User" dynamic category rule include Status Managers?

    I tested the following scenario in Project Server 2010 and 2013:
    Assume you have three users: User1, User2, and User3.
    User1 has RBS value Org.UnitA
    User2 and User3 both have RBS value Org.UnitB
    User3 belongs to a group that gets permissions to view projects from a category with "The Project Owner has the same RBS value as the User" setting enabled.
    User2 creates ProjectABC and by default becomes the project Owner. User2 creates a task in the project, and by default becomes the Status Manager of the task.
    User2 then changes the project owner to User1 and republishes the project. User2 remains the Status Manager in the project task.
    User3 can still see ProjectABC. Apparently this is because User2, who has the same RBS value as User3, is still a Status Manager of project tasks.
    If I change User2 RBS value to Org.UnitA, then User3 can no longer see ProjectABC. Note that User2 is not the project Owner at this stage, but the change in User2's RBS still affects the visibility of the project.
    It seems to me that in "The Project Owner has the same RBS value as the User" dynamic category rule, the "Owner" should be "Owner or Status Manager".
    Is this a bug or by design?

    Hi Barbara,
    Thanks for going through the trouble of reproducing the behaviour. It's interesting to know that Effective Rights in Project Online displays different results.
    The system in which I tested this scenario is updated with the December CU, so I'm guessing you'll still get the same result after updating your environment.
    In my usage scenario, a user needs to create projects on behalf of other users, and then change the Owner. Unfortunately this doesn't change the Status Manager of tasks, and consequently the visibility of the project is not what was expected.
    The user who creates the project can change the Owner, but can't change the Status Manager. The new owner would have to open the project for editing in Project Pro, change the Status Manager value of every task, and republish. Precisely the sort of extra
    work that the users want to avoid.
    Using delegation might be a workaround, but opens up a whole new can of worms in terms of information security. It's unfortunate that the "The Project Owner has the same RBS value as the User" setting doesn't do what it says, but potentially
    opens up visibility of projects to a wider audience than intended.

  • How set Url path in include .css or .js

    Dear Pakar
    i have .css with content below
    #pageWrapper {
         /*background: #5b93cc;*/
         background:#999999;
         background-image:url(/image/background.jpg);
    how to set url directory in apex style directory ?
    Thanks
    Imansyah

    Hello Imansyah,
    If the JPG is on your application server in the (virtual) /i/ directory (referencing the /images/ directory of APEX), you should try:
    #pageWrapper {
    /*background: #5b93cc;*/
    background:#999999;
    background-image:url(/i/background.jpg);
    }Regards,
    Roel
    http://roelhartman.blogspot.com/
    http://www.bloggingaboutoracle.org/
    http://www.logica.com/

  • Parameterizing the target base URL (e.g. Domain name) in e-Test scripts

    Hi there,
    I am currently building all my functional tests (scripts) using e-Test, given a target TG1 machine with URL (http://www.myfirsturl.com)
    Later I am planning to run these tests against another target TG2 environment, let us say (http://www.mysecondurl.com).
    Is there a way to do easily it, thus avoiding the need to re-record all tests again (i.e. if the target system changes)?
    Many thanks,
    Nuno

    Yes, it should be available as part of the eTester group of products. You have to check the checkboxes for these products when you are installing eTester. Also, eManager and eLoad need separate licenses than eTester. Script Updater is a new Utility that is available with eTester 8.2. So if you have an older version, you will not see it. There are other ways to change URLs, using VBA, so if you cannot find Script Updater or can't upgrade to 8.2, you can try them.
    Do you have access to the Archived Empirix Forum? There are some threads that deal with this issue.

Maybe you are looking for

  • Goods receipt can able to post without releasing the scheduling agreement.

    Hi experts, We have the SA which is created with document type LPA. Line item is having the confimation control key 0001.We believe that without releasing the SA we can't make the GR. ASN has been posted against the SA but without releasing the agree

  • Performance of Network Monitoring in SCOM 2012 SP1

    Hi all! I´ve problems with performance of the network monitoring feature in SCOM 2012 SP1. According to the guide the following should be adeqate: •1000 network devices (approximately 12,500 monitored ports) managed by a resource pool that has three

  • Multiple connector instances

    Hello,           how can I access multiple instances of an EIS? Is it possible to deploy more than one instances of a resource adapter? If so, how should I deploy those resource adapter instances. Thanks in advance.           

  • Objects in OWB

    The OWB documentation tells you how to 'drive' objects - but is not clarify when to use them - or give examples of how to use So, can anybody tell me Why and when would I use: - a Mapping Input Parameter - a Mapping Output Parameter - an External Pro

  • Planned order dates

    Hi Guru's,               please explain how the planned order dates are calculated. in MTS scenario if a delivery exists and another planned order is raised the system is not calculating dates as per dates given in the Material master.it is cheduling