Guest VM failover cluster on Hyper-V 2012 Cluster does not work across hosts

Hi all,
We are evaluating Hyper-V on Windows Server 2012, and I have bumped in to this problem:
I have a Exchange 2010SP2 DAG installed on 2 vms in our Hyper-V cluster (a DAG forms a failover cluster, but does not use any shared storage). As long as my vms are on the same host, all is good. However, if I live migrate or shutdown-->move-->start one
of the guest nodes on another pysical host, it loses connectivity with the cluster. "regular" network is fine across hosts, and I can ping/browse one guest node from the other. I have tried looking for guidance for Exchange on Hyper-V clusters but have not
been able to find anything.
According to the Exchange documentation this configuration is supported, so I guess I'm asking for any tips and pointers on where to troubleshoot this.
regards,
Trond

Hi All,
so some updates...
We have a ticket logged with Microsoft, more of a check box exercise to reassure the business we're doing the needful.  Anyway, they had us....
Apply hotfix http://support.microsoft.com/kb/2789968?wa=wsignin1.0  to both guest DAG nodes, which seems pretty random, but they wanted to update the TCP/IP stack...
There was no change in error, move guest to another Hyper-V node, and the failover cluster, well, fails with the following event ids I the node that fails...
1564 -File share witness resource 'xxxx)' failed to arbitrate for the file share 'xxx'. Please ensure that file share '\xxx' exists and is accessible by the cluster..
1069 - Cluster resource 'File Share Witness (xxxxx)' in clustered service or application 'Cluster Group' failed
1573 - Node xxxx  failed to form a cluster. This was because the witness was not accessible. Please ensure that the witness resource is online and available
The other node stays up, and the Exchange DB's mounted on that node stay up, the ones mounted on the way that fails failover to the remaining node...
So we then
Removed 3 x Nic's in one of the 4 x NIC teams, so, leaving a single NIC in the team (no change)
Removed one NIC from the LACP group on each Hyper-V host
Created new Virtual Switch using this simple trunk port NIC on each Hyper-V host
Moved the DAG nodes to this vSwitch
Failover cluster works as expected, guest VM's running on separate Hyper-V hosts, when on this vswitch with single NIC
So Microsoft were keen to close the call, as there scope was, I kid you not, to "consider this issue
resolved once we are able to find the cause of the above mentioned issue", which we have now done, as in, teaming is the cause... argh.
But after talking, they are now escalating internally.
The other thing we are doing, is building Server 2010 Guests, and installing Exchange 2010 SP3, to get a Exchange 2010 DAG running on Server 2010 and see if this has the same issue, as people indicate that this is perhaps not got the same problem.
Cheers
Ben
Name                   : Virtual Machine Network 1
Members                : {Ethernet, Ethernet 9, Ethernet 7, Ethernet 12}
TeamNics               : Virtual Machine Network 1
TeamingMode            : Lacp
LoadBalancingAlgorithm : HyperVPort
Status                 : Up
Name                   : Parent Partition
Members                : {Ethernet 8, Ethernet 6}
TeamNics               : Parent Partition
TeamingMode            : SwitchIndependent
LoadBalancingAlgorithm : TransportPorts
Status                 : Up
Name                   : Heartbeat
Members                : {Ethernet 3, Ethernet 11}
TeamNics               : Heartbeat
TeamingMode            : SwitchIndependent
LoadBalancingAlgorithm : TransportPorts
Status                 : Up
Name                   : Virtual Machine Network 2
Members                : {Ethernet 5, Ethernet 10, Ethernet 4}
TeamNics               : Virtual Machine Network 2
TeamingMode            : Lacp
LoadBalancingAlgorithm : HyperVPort
Status                 : Up
A Cloud Mechanic.

Similar Messages

  • Hyper-V 2012 R2 does not disable the protocols on the physical network adapter

    Hello, 
    I have a problem with the creation of an external network in Hyper-V (2012 r2), when I create the external network does not want to share with the host, therefore I take the check to share it. Hyper-V on the network adapter "real" does not disable
    all protocols, but active all inclusive TCP4 and 6. 
    How so? 
    I have 2 other servers with the same configuration and run regularly.
    Thanks in advance

    Hi Fabio,
    "when I create the external network does not want to share with the host, therefore I take the check to share it. "
    Based on my understanding ,after creating a external Vswitch  the physical NIC will only have protocol  "hyper-v external virtual switch " whatever we check "allow management operation...." or not .
    Do you mean after you create the external Vswitch , the physical adapter still have another protocol ?
    If yes , my suggestion is to remove the physical NIC from "device manager " , then restart server and reload the driver  then create the external Vswitch again .
    Best Regards
    Elton Ji
    We
    are trying to better understand customer views on social support experience, so your participation in this
    interview project would be greatly appreciated if you have time.
    Thanks for helping make community forums a great place.

  • Hyper V over SMB does not work on non-Microsoft Samba servers

    I want to use for Hyper V (2012 R2 Standard) over SMB. I tried to move a virtual machine from local storage over to the SMB share and receive the error: "Remote SMB share does not support resiliency."
    I have verified the SMB share is SMB 3.0. According to Microsoft, this should be supported:
    "Using Hyper-V with SMB has the following requirements:
    One or more computers running Windows Server 2012 R2 or Windows Server 2012 with the Hyper-V role installed. You can also use non-Microsoft file servers that implement the SMB 3.0 protocol."
    http://technet.microsoft.com/en-us/libr ... 34187.aspx
    But according to others, this is an ongoing issue with Samba. (http://www.readynas.com/forum/viewtopic.php?f=119&t=77809)
    Microsoft, is this supported or not? If so, what else can I try to get this to work?

    I understand Samba will always be in catch-up mode, and I would not blame MSFT for developments within Samba. But, Microsoft does specifically state in their support documents "You can also use non-Microsoft file servers that implement the
    SMB 3.0 protocol."
    The fact the the 3.0.x *nix versions may not in fact include the features Microsoft needs means Hyper V is not in fact able to be used as identified. So, Microsoft should not promote this as being a supported feature. They should simply stop after saying "One
    or more computers running Windows Server 2012 R2 or Windows Server 2012 with the Hyper-V role installed." 
    It's not MSFT problem some people understand SMB spec too "freely"... There are some known working SMB3 non-MSFT implementations (EMC, NetApp and maybe Nutanix) and at least few of them are indeed BASED on an open source SAMBA. But these companies
    did a huge amount of work to make their fork-outs from SAMBA actually work. Unfortunately nobody bothered to back-contribute their fixes to original code base. Again, it's not MSFT problem. 
    StarWind VSAN [Virtual SAN] clusters Hyper-V without SAS, Fibre Channel, SMB 3.0 or iSCSI, uses Ethernet to mirror internally mounted SATA disks between hosts.

  • Load Balancing on RDS 2012 R2 does not work

    Hello,
    I'm working on a test deployment with 3 Session Hosts,  GW/WA/CB on one server and licensing on another.  
    GW and CB are also on a Failover cluster so i don't want High Availability now (I want to start with simpler and work out the errors first) 
    One issue is when I try load balancing option on my Collection's Properties by changing the relative weight of the servers, nothing happens at all. All the sessions are created on a single server (GW/WA/CB) and nothing on the server I want to redirect to.
    Any help is appreciated!

    Hi,
    Thank you for posting in Windows Server Forum.
    Please see that you have added following registry key to the connection broker.
    HKLM\SYSTEM\CurrentControlSet\Control\TerminalServer\ClusterSettings
    DefaultTsvUrl  tsv://VMResource.1.Virtualpool1
    After this you can use “mstsc /admin” to perform remote desktop.
    In addition, we can use DNS RR method to configure Load Balancing. Please check below thread for information.
    2012 R2 Load-Balanced RDS farm
    https://social.technet.microsoft.com/Forums/en-US/2ec5e96e-7ca6-44b5-85f8-5286c6881dea/2012-r2-loadbalanced-rds-farm?forum=winserverTS
    Hope it helps!
    Thanks.
    Dharmesh Solanki
    TechNet Community Support

  • Windows 8.1 with Hyper-V external switch - 802.1x not working in host nor guest

    Hi,
    I have Windows 8.1 with installed Hyper-V and virtual machines connected via Hyper-V Switch - External. 802.1x wired authentication is not working in host nor in guest machine (computer is not responding to switch requests). If I will change switch mode
    to Internal/Private it will start working.
    Do you have any idea how to get it working also in external mode?

    Hi,
    What is the physical network configuration in your environment? Are you using wireless or wired connection?
    We tested this issue as you described here, and the external switch is working fine.
    Have you tried to delete the created switches and recreate one, test this issue again?
    Yolanda Zhu
    TechNet Community Support

  • SCVMM 2012, Conntect via conosle does not work when logged in with specified credentials

    Hi all
    I have a problem with the SCVMM 2012 console. When I start the console I select the "Specify credentials" option and then I enter my admin credentials. Afterward the console opens
    and everything works except the "Connect via Console" function.
    When I try to connect to a VM I get the following message:
    Virtual Machine Manager lost the connection to the virtual machine for one of the following reasons: Another connection was established to the console of this machine. The virtual machine has been shut down or put into the saved state. The user credentials
    provided do not have the necessary privilege to connect (0x0003, 0x0300)
    It seems that the Virtual Machine Viewer runs not under the admin credentials which I enter at the beginning and therefore the console cannot connect.
    Has someone the same issue ? Is it by design that the console connection does not work with specified credentials or is a sort of a bug or error ?
    Thx
    J0fe

    Being new to SCVMM, I've just come across this issue myself, though I came to a different (not necessarily better, but good enough for me) resolution.
    I have a somewhat convoluted but not abnormal configuration where the SCVMM server lives in a separate untrusted forest to that which the Hyper-V Server hosts live.
    As people have noted in this and other threads, the VirtualMachineViewer.exe process launches in the same security context with which you logged onto your pc. While the VmmAdminUI.exe process also runs in the same security context,
    the network logon it issues makes use of the credentials provided at the SCVMM MMC's logon screen, which VirtualMachineViewer.exe does not.
    I elected to use Credential Manager to resolve this issue.
    In Credential Manager, I created a "Windows Credential" entry where the hostname matched the FQDN of the hostname that appears within SCVMM as the virtual host entry. Using an IP does not match if you've used an FQDN in the SCVMM
    registration, and vice versa. If you used an IP for the host in SCVMM, then you need to supply that IP as the hostname in Credential Manager, and likewise if you used a FQDN.
    This solution isn't perfect as it requires you to manage numerous Credential Manager entries (one per host), but that still appealed to me far more than one per guest. If I had a larger environment, I'd script the account configuration
    given it only needs to happen once per host, rather than on an ongoing basis of once per guest as some of the above approaches would require.
    I haven't tested the different combinations to verify the least permissions required. For the time being I've gone with the model of using a non-privileged domain user account (to avoid managing multiple identities and passwords)
    which I added to the local Administrators group of each Hyper-V Server host. This stuck me as being a sound enough compromise between rigid security and ease-of-administration (given the per host arrangement in Credential Manager).
    With the account correctly specified in Credential Manager, I'm able to move on past the 0x0003 error and successfully use the remote function.
    If I was to point out an upside to this, it would be that this approach would allow me to remote Hyper-V servers in untrusted forests (including workgroups), which is something I couldn't do if the VirtualMachineViewer.exe process
    did in fact use my SCVMM credentials alone. Ultimately, that's something you have to come to terms with yourself, though.
    Cheers,
    Lain
    Thanks a bunch Lain, this helped! I'd still consider this a bug, however, and not a workable solution for a large number of servers/admins... But for now this does the trick, thanks!

  • I am trying to allow guests on windows to connect to shares on my Lion server but it keeps asking for a password for guest. I have allow guest users to access this share enabled but it still does not work.

    I am trying to allow guests on windows to connect to shares on my Lion server but it keeps asking for a password for guest. I have allow guest users to access this share enabled but it still does not work.

    Ditto. Guest accounts shouldn't have a password. No way to enter one in System Prefs...

  • External display on late 2012 macmini does not go to sleep.

    External display on late 2012 macmini does not go to sleep, instead the content freezes. After Pram reset it does go to sleep, till it undergoes next system sleep cycle (hibernatemode 25). I need to suspend it to disk because it is an offgrid setup and the power needs to be cut between Sessions. Display is connected to the hdmi port. Runing OS 10.8.4. Reinstalling did not help, neither. I have struggled with this for half a year now, it accured about 2 month after first use of the machine.
    Thank you for possible help!

    Hey
    Thank you for the quick anwswer!
    I am aware of what you described, here is the preferece file:
    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
    <plist version="1.0">
    <dict>
              <key>Custom Profile</key>
              <dict>
                        <key>AC Power</key>
                        <dict>
                                  <key>AutoPowerOff Delay</key>
                                  <integer>14400</integer>
                                  <key>AutoPowerOff Enabled</key>
                                  <integer>1</integer>
                                  <key>Automatic Restart On Power Loss</key>
                                  <integer>0</integer>
                                  <key>Disk Sleep Timer</key>
                                  <integer>10</integer>
                                  <key>Display Sleep Timer</key>
                                  <integer>0</integer>
                                  <key>Hibernate File</key>
                                  <string>/var/vm/sleepimage</string>
                                  <key>Hibernate Mode</key>
                                  <integer>25</integer>
                                  <key>PrioritizeNetworkReachabilityOverSleep</key>
                                  <integer>0</integer>
                                  <key>Sleep On Power Button</key>
                                  <integer>1</integer>
                                  <key>Standby Delay</key>
                                  <integer>4200</integer>
                                  <key>Standby Enabled</key>
                                  <integer>0</integer>
                                  <key>System Sleep Timer</key>
                                  <integer>0</integer>
                                  <key>TTYSPreventSleep</key>
                                  <integer>1</integer>
                                  <key>Wake On LAN</key>
                                  <integer>0</integer>
                        </dict>
              </dict>
              <key>LastSleepUUID</key>
              <dict>
                        <key>Date</key>
                        <date>2013-07-20T21:41:21Z</date>
                        <key>UUID</key>
                        <string>53DDFFCE-AFA3-402F-B35E-8CBED6E293DF</string>
              </dict>
    </dict>
    </plist>
    Next on my list would be an installation from scracht - wich i would obviously love to avoid
    The bewildering thing is that through a pram reset i can get the display to sleep once again...
    Maybe a hardware prob?
    Might want to check using the thunderbolt port instead, as soon as i can afford the adaptor.
    Anyway i will go to offline world for some time now, on a crazy bicycle tour trough remote eastern europe.
    I would love to hear from you on an occasion of a peek to cyberworld during my trip!
    So long
    Senior KLikklac

  • Do the Mac minis support using AirPlay and all of its functions?   I thought I saw a comment yesterday that suggested the Mac mini (particularly the late 2012 models) does not support AirPlay.  Thank you.

    Do the Mac minis support using AirPlay and all of its functions?   I thought I saw a comment yesterday that suggested the Mac mini (particularly the late 2012 models) does not support AirPlay.  Thank you.

    Hello,
    AirPlay Mirroring
    Requires a second-generation Apple TV or later. Supports the following Mac models:
    iMac (Mid 2011 or newer)
    Mac mini (Mid 2011 or newer)
    MacBook Air (Mid 2011 or newer)
    MacBook Pro (Early 2011 or newer)
    http://support.apple.com/kb/HT5444

  • Cluster Example shipped with 7.0 SP4 does not work !!!

              I have tried and tried to setup a cluster example using the instructions shipped
              in the cluster example and it does not work. When I call the Session.jsp via
              the proxy URL I get the following output in the proxy log.
              <Mon Apr 26 08:49:39 BST 2004>: Start connection timeout scheduler
              <Mon Apr 26 08:49:39 BST 2004>: GenericProxyServelt: init()
              <Mon Apr 26 08:49:39 BST 2004>: HttpClusterServlet:init()
              <Mon Apr 26 08:49:43 BST 2004>: ===New Request===GET /ClusterTest/Session.jsp
              HT
              TP/1.1
              <Mon Apr 26 08:49:43 BST 2004>: Found session from url: null
              <Mon Apr 26 08:49:43 BST 2004>: attempt #0 out of a max of 5
              <Mon Apr 26 08:49:43 BST 2004>: #### Trying to connect with server cl1!9001!443
              <Mon Apr 26 08:49:43 BST 2004>: Failed to connect: cl1!9001!443
              <Mon Apr 26 08:49:43 BST 2004>: marked bad: cl1!9001!443
              <Mon Apr 26 08:49:43 BST 2004>: attempt #0 out of a max of 5
              <Mon Apr 26 08:49:43 BST 2004>: #### Trying to connect with server cl2!9002!443
              <Mon Apr 26 08:49:43 BST 2004>: Failed to connect: cl2!9002!443
              <Mon Apr 26 08:49:43 BST 2004>: marked bad: cl2!9002!443
              <Mon Apr 26 08:49:43 BST 2004>: attempt #0 out of a max of 5
              <Mon Apr 26 08:49:43 BST 2004>: === whole list is bad, reverting to the all serv
              ers list ===
              <Mon Apr 26 08:49:43 BST 2004>: #### Trying to connect with server cl1!9001!443
              <Mon Apr 26 08:49:43 BST 2004>: Failed to connect: cl1!9001!443
              <Mon Apr 26 08:49:43 BST 2004>: marked bad: cl1!9001!443
              <Mon Apr 26 08:49:43 BST 2004>: Sleeping for 2 secs .....
              <Mon Apr 26 08:49:45 BST 2004>: attempt #1 out of a max of 5
              <Mon Apr 26 08:49:45 BST 2004>: #### Trying to connect with server cl2!9002!443
              <Mon Apr 26 08:49:45 BST 2004>: Failed to connect: cl2!9002!443
              <Mon Apr 26 08:49:45 BST 2004>: marked bad: cl2!9002!443
              <Mon Apr 26 08:49:45 BST 2004>: Sleeping for 2 secs .....
              <Mon Apr 26 08:49:47 BST 2004>: attempt #2 out of a max of 5
              <Mon Apr 26 08:49:47 BST 2004>: === whole list is bad, reverting to the all serv
              ers list ===
              <Mon Apr 26 08:49:47 BST 2004>: #### Trying to connect with server cl2!9002!443
              <Mon Apr 26 08:49:47 BST 2004>: Failed to connect: cl2!9002!443
              <Mon Apr 26 08:49:47 BST 2004>: marked bad: cl2!9002!443
              <Mon Apr 26 08:49:47 BST 2004>: Sleeping for 2 secs .....
              <Mon Apr 26 08:49:49 BST 2004>: attempt #3 out of a max of 5
              <Mon Apr 26 08:49:49 BST 2004>: #### Trying to connect with server cl1!9001!443
              <Mon Apr 26 08:49:49 BST 2004>: Failed to connect: cl1!9001!443
              <Mon Apr 26 08:49:49 BST 2004>: marked bad: cl1!9001!443
              <Mon Apr 26 08:49:49 BST 2004>: Sleeping for 2 secs .....
              <Mon Apr 26 08:49:51 BST 2004>: attempt #4 out of a max of 5
              <Mon Apr 26 08:50:17 BST 2004>: #### Trying to connect with server cl2!9002!443
              <Mon Apr 26 08:50:17 BST 2004>: Failed to connect: cl2!9002!443
              <Mon Apr 26 08:50:17 BST 2004>: marked bad: cl2!9002!443
              <Mon Apr 26 08:50:17 BST 2004>: Sleeping for 2 secs .....
              <Mon Apr 26 08:50:19 BST 2004>: attempt #5 out of a max of 5
              <Mon Apr 26 08:50:19 BST 2004>: #### Trying to connect with server cl1!9001!443
              <Mon Apr 26 08:50:19 BST 2004>: Failed to connect: cl1!9001!443
              <Mon Apr 26 08:50:19 BST 2004>: marked bad: cl1!9001!443
              <Mon Apr 26 08:50:19 BST 2004>: Tried all servers but didn't succeed
              Any ideas ?
              For starters I think the documentation is wrong.
              It states
              servlet weblogic.servlet.internal.HttpClusterServlet
              yet in the weblogic upgrade 6.0 to 7.0 documentation it says replace this with
              weblogic.servlet.proxy.HttpClusterServlet
              Thanks
              Justin
              

    Have you tried deleting the iDVD preference file, com.apple. iDVD.plist, that resides in your User/Library/Preferences folder?
    OT

  • WLS6sp2 Hot deployment in cluster does not work

              I have two WLS6sp2 servers in cluster in two different machines. There is an EJB
              deployed in the cluster that works fine. Now I go to the console and I undeploy
              it: the servers show a message of "ejb undeployed". Now I activate the "deploy"
              checkbox again: the following error appears:
              weblogic.ejb20.EJBDeploymentException: Error deploying stateless EJB; nested exception
              is:
              javax.naming.NameAlreadyBoundException: Can't rebind anything but a replica-aware
              stub to a name that is currently bound to a replica-aware stub; remaining name
              javax.naming.NameAlreadyBoundException: Can't rebind anything but a replica-aware
              stub to a name that is currently bound to a replica-aware stub; remaining name
              The object is obviously clusterable (it is a stateless EJB with "home-is-clusterable=true").
              It has been deployed to the cluster target. When is working it is load balanced
              between both servers. It deploys perfectly when starting up both servers, but
              hot deployment does not work.
              This works fine when only one server is up.
              ¿Is not possible to do hot deployment in cluster?
              

    Hm, where are you seeing the javax.naming.NameAlreadyBoundException? In one of your managed
              servers, in all of your managed server, or in your admin server?
              If you see this in your admin server, then this means you got your admin joined in the cluster,
              which is not good. Remove it from the cluster if this is so.
              Gene
              "David Agudo" <[email protected]> wrote in message news:[email protected]...
              >
              > Yes, target="mycluster".
              >
              > "Gene Chuang" <[email protected]> wrote:
              > >Did you target the ejb to the cluster, or to each individual Node in
              > >your cluster? There is a
              > >difference...
              > >
              > >Gene
              > >
              > >"David Agudo" <[email protected]> wrote in message news:[email protected]...
              > >>
              > >> I have two WLS6sp2 servers in cluster in two different machines. There
              > >is an EJB
              > >> deployed in the cluster that works fine. Now I go to the console and
              > >I undeploy
              > >> it: the servers show a message of "ejb undeployed". Now I activate
              > >the "deploy"
              > >> checkbox again: the following error appears:
              > >>
              > >> weblogic.ejb20.EJBDeploymentException: Error deploying stateless EJB;
              > >nested exception
              > >> is:
              > >> javax.naming.NameAlreadyBoundException: Can't rebind anything but a
              > >replica-aware
              > >> stub to a name that is currently bound to a replica-aware stub; remaining
              > >name
              > >> ''
              > >> javax.naming.NameAlreadyBoundException: Can't rebind anything but a
              > >replica-aware
              > >> stub to a name that is currently bound to a replica-aware stub; remaining
              > >name
              > >> ''
              > >>
              > >> The object is obviously clusterable (it is a stateless EJB with "home-is-clusterable=true").
              > >> It has been deployed to the cluster target. When is working it is load
              > >balanced
              > >> between both servers. It deploys perfectly when starting up both servers,
              > >but
              > >> hot deployment does not work.
              > >>
              > >> This works fine when only one server is up.
              > >>
              > >> ¿Is not possible to do hot deployment in cluster?
              > >>
              > >
              > >
              >
              

  • Guest network does not work

    Hi
    I have latest Airport Extreme with 2TB hard disk (Time Capsule). I have working private network after latest firmware update (7.7.2 here) except my guest network. I have tried to setup the guest network as open network and WPA/WPA2 network with password but it does not work.
    I can see the guest network but when I try to connect to it the connection does not work (= there is no internet connection or I cannot connect at all). Private network works fine (WPA2 Personal). Router mode is off (bridged network).
    Any ideas how to get it work?

    Router mode is off (bridged network).
    Any ideas how to get it work?
    The TC will nor produce a working guest network without being in Router mode.. ie DHCP + NAT.. you will not get it working in bridge.. it tricks you.. it looks so like it is going to work.. but no. it cannot.
    You must rearrange your network so the TC is main router..
    Or
    IMHO the easier way is to go and buy a cheap wireless router.. a $35 tp-link etc.. and simply use that as AP. It is cheaper, easier and much faster.. guest network is still adding clients to the same wireless.. whereas setting up a second AP allows you to have an entirely separate connection.

  • Primary network does not work, guest network does work

    I updated the firmware on the Time Capsule, after which we had a power outage. When the Time Capsule came back up, after a bunch of fiddling around we discovered that although the primary network cannot connect the Internet through our phone system router, the guest network works fine. The primary network does not work with the guest network turned off. Resetting the Time Capsule had no effect.

    After an entire day of messing around, we went into Airport Utility, selected Internet, selected DHCP, and changed DHCP Beginning Address to match the one showing on the guest network (172.16). Now the primary network can see the Internet.

  • Cluster does not work after a while

    Hi :
              I have 2 clustered EJB server, the IP address is 192.168.0.226 and
              192.168.0.227.
              A servlet server is calling these two EJB servers with
              t3://192.168.0.226,192.168.0.227:7001
              All three machine use weblogic 5.1 sp9, Win2K Advanced Server.
              If I restart weblogic service, then the cluster works fine. but after while
              the cluster does not work, all the client request goto only one machine, in
              most case it's 227.
              It has no use even if I restart the service in the losted server. I have to
              restart all the server.
              The weblogic.log in the losted server is like this :
              Thu Apr 26 00:04:16 GMT 2001:<I> <RJVM> Signaling
              peer -6817319611378695685S192.168.0.204:[7001,7001,7002,7002,7001,-1] gone:
              weblogic.rjvm.PeerGoneException:
              - with nested exception:
              [java.io.EOFException]
              Thu Apr 26 00:04:47 GMT 2001:<I> <RJVM> Signaling
              peer -2123734719233546013S192.168.0.227:[7001,7001,7002,7002,7001,-1] gone:
              weblogic.rjvm.PeerGoneException:
              - with nested exception:
              [java.io.EOFException]
              Thu Apr 26 00:14:16 GMT 2001:<E> <ConMan> Attempt to sendMsg using a closed
              connection
              Thu Apr 26 00:14:16 GMT 2001:<E> <RJVM> Exception on send :
              weblogic.rmi.ConnectException: Attempt to sendMsg using a closed connection
              Please help me
              thanks
              andrew
              

    Are you running with any proxy? Is the servlet server also the static httpd
              server?
              What type of network equipment are you using? If its high-end (Cisco) make sure
              all ports are set to 100/Full.
              Also, try setting the SendDelay slightly higher (ie. 25, 30, 35). Experiment
              with different values.
              weblogic.cluster.multicastSendDelay=25
              andrew wrote:
              > Mike, thanks for reply
              > The 192.168.0.204 is the servlet server.
              > I changed the NICs to use 100Mbps/Full duplex. it looks better: the error
              > msg is less than before, the it still happened.
              > Any suggestion?
              > thanks
              > andrew
              >
              > "Mike Kincer" <[email protected]> wrote in message
              > news:[email protected]...
              > > Which box is 192.168.0.204 ??
              > > It is communicating via multicast, which I would say is suspect.
              > > Otherwise, I'd say you have some network issues.
              > > Make sure all switch ports and all NICS are NOT set to "auto" config.
              > Select
              > > 100Mbps/Full duplex on all ports.
              > >
              > > andrew wrote:
              > >
              > > > Hi :
              > > > I have 2 clustered EJB server, the IP address is 192.168.0.226 and
              > > > 192.168.0.227.
              > > > A servlet server is calling these two EJB servers with
              > > > t3://192.168.0.226,192.168.0.227:7001
              > > >
              > > > All three machine use weblogic 5.1 sp9, Win2K Advanced Server.
              > > >
              > > > If I restart weblogic service, then the cluster works fine. but after
              > while
              > > > the cluster does not work, all the client request goto only one machine,
              > in
              > > > most case it's 227.
              > > >
              > > > It has no use even if I restart the service in the losted server. I have
              > to
              > > > restart all the server.
              > > >
              > > > The weblogic.log in the losted server is like this :
              > > >
              > > > Thu Apr 26 00:04:16 GMT 2001:<I> <RJVM> Signaling
              > > > peer -6817319611378695685S192.168.0.204:[7001,7001,7002,7002,7001,-1]
              > gone:
              > > > weblogic.rjvm.PeerGoneException:
              > > > - with nested exception:
              > > > [java.io.EOFException]
              > > > Thu Apr 26 00:04:47 GMT 2001:<I> <RJVM> Signaling
              > > > peer -2123734719233546013S192.168.0.227:[7001,7001,7002,7002,7001,-1]
              > gone:
              > > > weblogic.rjvm.PeerGoneException:
              > > > - with nested exception:
              > > > [java.io.EOFException]
              > > > Thu Apr 26 00:14:16 GMT 2001:<E> <ConMan> Attempt to sendMsg using a
              > closed
              > > > connection
              > > > Thu Apr 26 00:14:16 GMT 2001:<E> <RJVM> Exception on send :
              > > > weblogic.rmi.ConnectException: Attempt to sendMsg using a closed
              > connection
              > > >
              > > > Please help me
              > > > thanks
              > > > andrew
              > >
              > > --
              > > /\/\i|<e
              > >
              > > Mike Kincer
              > > Solutions Developer/Engineer
              > > Atlas Commerce "ebusiness evolved"
              > > Office phone: +1-607-741-8877
              > > mailto:[email protected] [http://www.atlascommerce.com]
              > >
              > >
              /\/\i|<e
              Mike Kincer
              Solutions Developer/Engineer
              Atlas Commerce "ebusiness evolved"
              Office phone: +1-607-741-8877
              mailto:[email protected] [http://www.atlascommerce.com]
              

  • Jrockit multicast bug on RHEL4, WLS cluster does not works!

    OS and Kernel Version: Linux achilles1 2.6.9-34.0.1.ELsmp #1 SMP Wed May 17 16:59:36 EDT 2006 x86_64 x86_64 x86_64 GNU/Linux
    We use Bea Weblogic 8.1SP5 with Jrockit JVM 1.4.2 on RHEL3. We upgraded to operating system to RHEL4, and found that the interface binding for multicast communication does not bind to the correct interface, it is always using the default interface. We tried it from a sample program, and found that the error is in the JVM.
    This is a critical error for us, because our application server cluster use multicast for cluster communication.
    No error msg. It seems like this BUG:
    http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4082533 but it is solved.
    List steps to reproduce the problem:
    socket = new MulticastSocket(i);
    if(s2 != null)
    try
    InetAddress inetaddress = InetAddress.getByName(s2);
    System.out.println("Using interface at " + inetaddress.getHostAddress());
    socket.setInterface(inetaddress);
    System.out.println(socket.getInterface());
    catch(SocketException socketexception)
    throw socketexception;
    }

    Helo,
    You can reproduce it by this way (we have 6 RHEL4 servers, problem is the same). We tried with programs written in C (mcast,Tibco), multicast perfectly works on all interfaces on all servers. But in Java:
    Server:
    [wlxapp@webex3]$ uname -a
    Linux webex3 2.6.9-34.0.1.ELsmp #1 SMP Wed May 17 17:05:24 EDT 2006 i686 i686 i386 GNU/Linux
    Problem - with Bea mukticast tester (or any Java class), using the nondefault webex3-priv interface(eth1):
    [wlxapp@webex3]$ java -cp wl9.jar utils.MulticastTest -N webex3_priv -I webex3_priv -A 224.1.1.11 Starting test. Hit any key to abort Using interface at 192.168.1.1 Using multicast address 224.1.1.11:7001 Will send messages under the name webex3_priv every 2 seconds Will print warning every 600 seconds if no messages are received
    I (webex3_priv) sent message num 1
    I (webex3_priv) sent message num 2
    I (webex3_priv) sent message num 3
    Did not receive any message.
    But if we use the default webex3 interface (eth0)
    [wlxapp@webex3]$ java -cp wl9.jar utils.MulticastTest -N webex3 -I webex3 -A 224.1.1.11 Starting test. Hit any key to abort Using interface at 10.4.20.4 Using multicast address 224.1.1.11:7001 Will send messages under the name webex3 every 2 seconds Will print warning every 600 seconds if no messages are received
    I (webex3) sent message num 1
    I (webex3) sent message num 2 Received message 2 from webex3
    I (webex3) sent message num 3 Received message 3 from webex3
    Works perfectly.
    TCPDUMP:Messages are out from server(other srv also see it), so we have problems probably with receiving them:
    [root@webex3]# tcpdump -i eth1
    tcpdump: verbose output suppressed, use -v or -vv for full protocol decode listening on eth1, link-type EN10MB (Ethernet), capture size 96 bytes
    11:42:56.647362 IP webex3_priv.idmz.t-mobile.intra.afs3-callback > 224.1.1.111.afs3-callback: rx type 95 (1472)
    11:42:56.648168 IP webex3_priv.idmz.t-mobile.intra > 224.1.1.111: udp
    11:42:56.648245 IP webex3_priv.idmz.t-mobile.intra > 224.1.1.111: udp
    We examined the lsof output, the tester bind to all interfaces, so it should receive the messages:
    [wlxapp@webex3]$ /usr/sbin/lsof -p 11127
    gij 11127 wlxapp 8u IPv4 150320 UDP *:afs3-callback
    We tried the test with Java 1.4.2 and 1.5 (Jrockit and Sun), the results are the same.
    You can easily reproduce the problem with an RHEL4 server with 2 network interface.
    Our real problem that if multicast have problems, our Weblogic clusters does not works.

Maybe you are looking for