Node failed to join the cluster because it could not send and receive failure detection network messages

One of my customers has a Windows Server 2008 R2 cluster for an Exchange 2010 Mailbox Database Availability Group. Lately, they've been having problems with one of their nodes (the one node that is on a different subnet in a different datacenter) where their Exchange databases aren't replicating. While looking into this issue, it seems that the problem is that the Network Manager isn't started because the cluster service is failing. Since the issue seems to be with the cluster service, and not Exchange, I'm asking here.
When the cluster service starts, it appears to start working, but within a few minutes the following is logged in the system event log.
Source: FailoverClustering
Event ID: 1572
Level: Critical
Task Category: Cluster Virtual Adapter
Node 'nodename' failed to join the cluster because it could not send and receive failure detection network messages with other cluster nodes. ...
It seems that the problem is with the 169.254 address on the cluster virtual adapter. An entry in the cluster.log file says: Aborting connection because NetFT route to node nodename on virtual IP 169.254.1.44:3343 has failed to come up.
In my experience, you never have to mess with the cluster virtual adapter. I'm not sure what happened here, but I doubt it has been modified. I need the cluster to communicate with its other nodes on our routed 10.x network. I've never experienced this before and found little in my searches on the subject. Any idea how I can fix this?
Thanks,
Joe
Joseph M. Durnal MCM: Exchange 2010 MCITP: Enterprise Messaging Administrator, Exchange 2010 MCITP: Enterprise Messaging Administrator, MCITP: Enterprise Administrator
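
For context on the 169.254.1.44 address in the log entry above: it lies in the IPv4 link-local block 169.254.0.0/16, which routers do not forward between subnets, whereas the nodes' own addresses are on the routed 10.x network. A minimal Python sketch using the standard ipaddress module illustrates the distinction. The address 10.0.0.1 is a hypothetical stand-in for a node address, not one taken from this thread, and describe() is an illustrative helper name.

```python
import ipaddress

def describe(addr):
    """Classify an IPv4 address as link-local, private, or other."""
    ip = ipaddress.ip_address(addr)
    # Check link-local first: Python also reports 169.254.0.0/16 as private.
    if ip.is_link_local:
        return "link-local"
    if ip.is_private:
        return "private"
    return "other"

netft = describe("169.254.1.44")  # address quoted in the cluster.log entry
node = describe("10.0.0.1")       # hypothetical node address on the 10.x network
print(netft, node)
```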

Hi,
I suspect an issue with communication on UDP port 3343. Please confirm that the firewall rules for port 3343 are enabled on all nodes and for all firewall profiles, and verify connectivity between all of the nodes.
Run ipconfig /flushdns on each node to clear its DNS resolver cache, then confirm that the nodes' entries on your DNS server are correct.
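
Because UDP is connectionless, a port cannot be confirmed open just by connecting to it; a reply has to come back. The following is a minimal Python sketch of a round-trip check, not a Microsoft tool: one side echoes a single datagram, and the other side sends a payload and waits for it to return. The function names are hypothetical, and since port 3343 may already be held by the Cluster service on a running node, the sketch assumes you run the echo side where the port under test is free.

```python
import socket
import threading

def udp_echo_once(sock):
    """Wait for one datagram on an already-bound socket and echo it back."""
    data, addr = sock.recvfrom(1024)
    sock.sendto(data, addr)

def udp_probe(host, port, timeout=2.0):
    """Send one datagram and report whether the same payload comes back."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as s:
        s.settimeout(timeout)
        try:
            s.sendto(b"ping", (host, port))
            data, _ = s.recvfrom(1024)
        except OSError:
            # Covers timeouts and "port unreachable" style errors.
            return False
        return data == b"ping"

def loopback_self_test():
    """Exercise both halves on 127.0.0.1 using an ephemeral port."""
    server = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    server.bind(("127.0.0.1", 0))
    port = server.getsockname()[1]
    worker = threading.Thread(target=udp_echo_once, args=(server,), daemon=True)
    worker.start()
    ok = udp_probe("127.0.0.1", port)
    worker.join(timeout=2.0)
    server.close()
    return ok

if __name__ == "__main__":
    print("loopback round trip ok:", loopback_self_test())
```

To use it between nodes, run udp_echo_once on one node with a socket bound to that node's 10.x address and the port under test, then call udp_probe from the other node. A False result suggests that datagrams are being dropped in at least one direction somewhere in between.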
An article on a similar issue:
Exchange 2010 DAG - NetworkManager has not yet been initialized
https://blogs.technet.com/b/dblanch/archive/2012/03/05/exchange-2010-dag-networkmanager-has-not-yet-been-initialized.aspx?Redirected=true
Hope this helps.

Similar Messages

  • Node fails to join the cluster

    We are observing a problem where a node, after getting restarted, fails to join the cluster.
    We run two Coherence clusters across three boxes. Each box runs 8 Java processes: 4 processes of one cluster and 4 processes of the other cluster. They all run as Windows NT services. Sometimes a node goes down and gets restarted, but then it fails to join the cluster with the following exception:
    "com.tangosol.net.RequestTimeoutException: Timeout during service start: ServiceInfo(Id=8, Name=DistributedFIIndicativeCacheWithPublishingCacheStore, Type=DistributedCache"
    Has anyone experienced and addressed such a problem? If required, I can provide exact details of the cluster setup.
    -Bharat

    Hi Bharat,
    This may be caused by a stuck or slow DistributedService thread on one of your nodes. Please log into http://support.oracle.com and take a look at Note 845363.1 (https://support.oracle.com/CSP/main/article?cmd=show&type=NOT&id=845363.1) for more details. Additionally, consider upgrading to Coherence 3.5, as it includes the Service Guardian for deadlock detection/resolution (http://blackbeanbag.net/wp/2009/07/20/coherence-3-5-service-guardian-deadlock-detection/).
    Thanks,
    Patrick

  • 11gR1 1 node won't join the cluster after reboot.

    This is a high level description of a problem.
    We usually run a two node cluster.
    This week we had an issue where one node needed to be taken down. It became non responsive and upon reboot the other node no longer functioned correctly.
    So one node was left running until the maintenance window.
    Apparently when it's brought back up it has the MAC of the second node in the arp cache.
    This leads to node1 not being able to join the cluster.
    I've seen workarounds that involve refreshing the arp cache but is there anything else to this?


  • How can I view the conversations in my messages? The screen is blank after I send and receive.

    I cannot view the messages that I send and receive. Usually I can view the conversation, but now it's empty. What settings do I need to change?

    The NEF file only gives a little "nef" icon.  i was able to convert the NEF to DNG on the old computer and then brought the files over to the new computer.  On the old computer, the NEF files were depicted as little jpeg's so I would be able to see a picture of what each file represented.  On the new computer, both the NEF files and the DNG files are depicted with icons, "nef" or "dng", but not as little pictures.  On the new computer, though, I am to import the DNG's into Adobe and edit them so that is working regardless of whether or not a picture is present for the icon of each file.  I've downloaded the DNG converter from Adobe and will try the conversion directly on the new computer when I get home from work.  I do appreciate your responses and will let you know how it works.
    I am curious then, since right now the files are represented by little "nef"'s and little "dng"'s: If I also download the viewNX (spelling might be wrong, sorry) from nikon, do you know if that maintains the file as a RAW format file but allows a picture representation of the RAW file for the icon?  (That would make it easier to choose which RAW files I would choose to edit and which to not even bother with as an initial culling if that makes sense.
    Cheers.

  • My iPad 2 verifies my email address, but I cannot send or receive messages. What is the fix?

    My Ipad2 verifies my email address on it but I can not send or receive messages. How do I fix this?

    I had this problem also. Yesterday, i called my e-mail provider. Turns out I had to change the port numbers in both the incoming and outgoing servers to numbers specific to that provider. Solved the problem.

  • How do I shut off receiving phone calls and text messages when I am out of the country but still want to send and receive emails?

    How do I shut off receiving phone and text messages when I am out of the country.  I still want to be able to email and receive email on wifi

    Enable Airplane mode followed by turning on wi-fi access.

  • HT1338 After migration from MacBook 13' running OSX 10.6.8 to MacBook Pro 15' Retina OSX 10.8.2 using Migration assistant, the Bluetooth assistant mentions "name not available" and it cannot detect my Magic trackpad. There is no mention of any mishap.

    I just migrated from MacBook 13' OSX 10.6.8 to MacBook Pro 15' Retina OSX 10.8.2 with the use of Migration Assistant. When attempting to install my Magic Trackpad, the Bluetooth Assistant mentions "name not available" so that the device cannot be detected. I have disabled the firewall. Thanks for your assistance.

    Create a new, temporary admin account and see if you can set it up in that account.

  • I have been unable to view the content of any messages since installing OS X Mavericks.  I can send and receive mail, but viewing message content crashes the mail app.

    Need assistance accessing mail message content.  Message display in inbox, but content cannot be viewed.  I have attempted reindexing without success.  Send, compose and receive all work fine.  When opening content of message, the mail app crashes.

    Mail: Troubleshooting sending & receiving email messages

  • Apple ID stolen, password changed, cannot reset the account because I was too late and received no notice of an ID change. Help, what should I do? He changed the email so I cannot recover anything!

    Alright, here's the problem: I went to log in one day and found that my recovery email, password, and security question had been changed. I received no email about it, and I cannot log in to anything on my main account. I spent a lot of money with Apple and I'd like it back. How do I get it? :(
    I have all the information from when last I had it..

    Contact iTunes store support.
    If you had a credit card linked to the account, report it to the issuing bank ASAP.

  • Why do the port settings on my iPad change every time I sync with my computer? After syncing I have to go in and change the port settings so I can send and receive emails on my iPad.

    Every time I sync my iPad 2 with my computer, it changes the port settings. Has anyone else had this issue?

    When you sync the iPad, do you have mail settings configured to sync to the iPad? You could have different settings on your computer that might be affecting the email set up on the iPad when you sync.

  • After patching the node, the node is not joining the cluster.

    Dear All,
    We have a two-node Sun Cluster with the release below:
    Sun Cluster 3.2u1 for Solaris 10 sparc
    Copyright 2008 Sun Microsystems, Inc. All Rights Reserved.
    And nodes are
    Node Name Status
    scrbdomdefrm005 Online
    scrbdomderue005 Offline
    We are patching the nodes with the Q2 2009 quarterly patches, and we patched node scrbdomderue005 first. We followed the steps below to patch the server.
    1) Our root d0 consists of d1 (c0t0d0s0) and d2 (c1t0d0s0).
    2) We detached d2 from d0, then ran metaclear d2.
    3) Mounted c1t0d0s0 on /mnt.
    4) Used patchadd -R /mnt to patch the server. While patching we got only one error: patch 126106-27 needs to be installed in noncluster mode.
    5) Switched the RGs from node scrbdomderue005 to scrbdomdefrm005.
    6) Shut down scrbdomderue005, booted it from c1t0d0s0 in noncluster single-user mode, and installed patch 126106-27, which was successful.
    7) Shut down scrbdomderue005, booted it from c1t0d0s0 in cluster mode, and got the following error.
    Booting as part of a cluster
    NOTICE: CMM: Node scrbdomdefrm005 (nodeid = 1) with votecount = 1 added.
    NOTICE: CMM: Node scrbdomderue005 (nodeid = 2) with votecount = 1 added.
    WARNING: CMM: Open failed for quorum device /dev/did/rdsk/d5s2 with error 1.
    NOTICE: clcomm: Adapter nxge7 constructed
    NOTICE: clcomm: Adapter nxge3 constructed
    NOTICE: CMM: Node scrbdomderue005: attempting to join cluster.
    NOTICE: nxge3: xcvr addr:0x0a - link is up 1000 Mbps full duplex
    NOTICE: nxge7: xcvr addr:0x0a - link is up 1000 Mbps full duplex
    WARNING: CMM: Open failed for quorum device /dev/did/rdsk/d5s2 with error 1.
    NOTICE: CMM: Cluster doesn't have operational quorum yet; waiting for quorum.
    NOTICE: clcomm: Path scrbdomderue005:nxge7 - scrbdomdefrm005:nxge7 errors during initiation
    NOTICE: clcomm: Path scrbdomderue005:nxge3 - scrbdomdefrm005:nxge3 errors during initiation
    WARNING: Path scrbdomderue005:nxge7 - scrbdomdefrm005:nxge7 initiation encountered errors, errno = 62. Remote node may be down or unreachable through this path.
    WARNING: Path scrbdomderue005:nxge3 - scrbdomdefrm005:nxge3 initiation encountered errors, errno = 62. Remote node may be down or unreachable through this path.
    exit from console.
    We are able to boot node scrbdomderue005 in noncluster mode successfully; please see the details below.
    scrbdomderue005:/# uname -a
    SunOS scrbdomderue005 5.10 Generic_138888-07 sun4u sparc SUNW,SPARC-Enterprise
    scrbdomderue005:/#
    Before patching the server scrbdomderue005, the kernel version was:
    SunOS scrbdomderue005 5.10 Generic_137111-07 sun4u sparc SUNW,SPARC-Enterprise
    If I boot scrbdomderue005 from d1 (c0t0d0s0), the server joins the cluster properly without issue.
    Could anyone please guide me on what the problem could be and how to resolve it?

    Hi
    It could be because you have installed patch 138888. It has problems with nxge interfaces used as interconnect.
    Rgds
    Carsten

  • Managed server not able to join the cluster

    Hi
    I have two storage-enabled Coherence servers on two different machines. These two are able to form the cluster without any problem. I also have two Managed Servers. When I start the first one, it joins the cluster without any issue, but when I start the second one (the fourth member), it does not join the cluster. Only one Managed Server joins the cluster. I am getting the following error.
    2011-12-22 15:39:26.940/356.798 Oracle Coherence GE 3.6.0.4 <Info> (thread=[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)', member=n/a): Loaded cache configuration from "file:/u02/oracle/admin/atddomain/atdcluster/ATD/config/atd-client-cache-config.xml"
    2011-12-22 15:39:26.943/356.801 Oracle Coherence GE 3.6.0.4 <D4> (thread=[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)', member=n/a): TCMP bound to /172.23.34.91:8190 using SystemSocketProvider
    2011-12-22 15:39:57.909/387.767 Oracle Coherence GE 3.6.0.4 <Warning> (thread=Cluster, member=n/a): This Member(Id=0, Timestamp=2011-12-22 15:39:26.944, Address=172.23.34.91:8190, MachineId=39242, Location=site:dev.icd,machine:appsoad2-web2,process:24613, Role=WeblogicServer) has been attempting to join the cluster at address 231.1.1.50:7777 with TTL 4 for 30 seconds without success; this could indicate a mis-configured TTL value, or it may simply be the result of a busy cluster or active failover.
    2011-12-22 15:39:57.909/387.767 Oracle Coherence GE 3.6.0.4 <Warning> (thread=Cluster, member=n/a): Received a discovery message that indicates the presence of an existing cluster:
    Message "NewMemberAnnounceWait"
    FromMember=Member(Id=2, Timestamp=2011-12-22 15:22:56.607, Address=172.23.34.74:8090, MachineId=39242, Location=site:dev.icd,machine:appsoad4,process:23937,member:CoherenceServer2, Role=WeblogicWeblogicCacheServer)
    FromMessageId=0
    Internal=false
    MessagePartCount=1
    PendingCount=0
    MessageType=9
    ToPollId=0
    Poll=null
    Packets
    [000]=Broadcast{PacketType=0x0DDF00D2, ToId=0, FromId=2, Direction=Incoming, ReceivedMillis=15:39:57.909, MessageType=9, ServiceId=0, MessagePartCount=1, MessagePartIndex=0, Body=0}
    Service=ClusterService{Name=Cluster, State=(SERVICE_STARTED, STATE_ANNOUNCE), Id=0, Version=3.6}
    ToMemberSet=null
    NotifySent=false
    ToMember=Member(Id=0, Timestamp=2011-12-22 15:39:26.944, Address=172.23.34.91:8190, MachineId=39242, Location=site:dev.icd,machine:appsoad2-web2,process:24613, Role=WeblogicServer)
    SeniorMember=Member(Id=1, Timestamp=2011-12-22 15:22:53.032, Address=172.23.34.73:8090, MachineId=39241, Location=site:dev.icd,machine:appsoad3,process:19339,member:CoherenceServer1, Role=WeblogicWeblogicCacheServer)
    2011-12-22 15:40:02.915/392.773 Oracle Coherence GE 3.6.0.4 <Warning> (thread=Cluster, member=n/a): Received a discovery message that indicates the presence of an existing cluster:
    Message "NewMemberAnnounceWait"
    FromMember=Member(Id=2, Timestamp=2011-12-22 15:22:56.607, Address=172.23.34.74:8090, MachineId=39242, Location=site:dev.icd,machine:appsoad4,process:23937,member:CoherenceServer2, Role=WeblogicWeblogicCacheServer)
    FromMessageId=0
    Internal=false
    MessagePartCount=1
    PendingCount=0
    MessageType=9
    ToPollId=0
    Poll=null
    Packets

    Hi,
    By default, Coherence uses a multicast protocol to discover other nodes when forming a cluster. Since you are having difficulties establishing a cluster via multicast, can you please perform a multicast test and see whether multicast is configured properly?
    http://wiki.tangosol.com/display/COH32UG/Multicast+Test
    Make sure you are using the same configuration files across the cluster members; all members of the cluster must specify the same cluster name in order to be allowed to join the cluster.
    <cluster-name system-property="tangosol.coherence.cluster">xxx</cluster-name>
    I would suggest trying the unicast-listener with well-known-addresses instead of the multicast-listener.
    http://wiki.tangosol.com/display/COH32UG/well-known-addresses
    Add entries similar to the ones below in your tangosol override XML:
    <well-known-addresses>
      <socket-address id="1">
        <address>172.23.34.91</address>
        <port>8190</port>
      </socket-address>
      <socket-address id="2">
        <address>172.23.34.74</address>
        <port>8090</port>
      </socket-address>
    </well-known-addresses>
    This list is used by all other nodes to find their way into the cluster without the use of multicast, thus at least one well known node must be running for other nodes to be able to join.
    Hope this helps!!
    Thanks,
    Ashok.

  • Client Application without joining the cluster

    Hi All,
    Is it possible for a client application to access the cache within a Coherence cluster? The client application is not part of the cluster, and it didn't start with any cache config files or anything else.
    The client application just uses :
    NamedCache cache = CacheFactory.getCache("VirtualCache");
    If a client application starts with a cache-config file, it will also join the cluster. In that case, will the JVM of the client app also be loaded with the distributed/replicated cache contents?
    Please clarify my doubts.
    Regards
    Srinivas.

    The only clean way of NOT joining the cluster is to connect via Extend.
    You can join the cluster, and specify LocalStorage=False parameter, however, that is only applicable for distributed cache. Replicated cache data still exists on every node. A bigger issue in my mind, is that your node will be actively managing membership of other members in the cluster, and that can become a problem.
    Timur

  • Every movie purchase through iTunes fails to transfer to my iPhone or iPad and the message I get is this: some of the items in the itunes library, including were not copied to the ipad because you are not authorized for them on this computer... help?

    For a couple months now, every movie purchase through iTunes fails to transfer to my iPhone or iPad giving me the error: "some of the items in the iTunes library, including were not copied to the iPad because you are not authorized for them on this computer". I have read many forums about deleting, reinstalling, de-authorizing and authorizing my iMac through the iTunes store. Many suggestions don't work except to rename the movie to *.MP4 in Finder under the iTunes movie directory. I imagine this is a setting or a bug somewhere... any suggestions on how to bypass having to rename the files in Finder? My research into Google suggests this problem dates back to 2007 or so but it is now more prevalent than ever.
    Thanks.

    Try:
    iTunes repeatedly prompts to authorize computer to play iTunes Store purchases

  • "An error has occurred saving the file because it could not be written to. Check ..."

    I have the familiar Mavericks issue of failing to save scanned files from my Mac:
     "An error has occurred saving the file because it could not be written to. Check the properties for the file to make sure it is not read-only."
    As far as I can see, there has been no resolution to this issue, despite reassurances dating to last fall (2013).
    The support website acts funky, too, sending me around in circles when I try to download the drivers; it may be a Safari-only problem, but still, please fix your software!

    Hi,
    What is your printer model? what Mac OS X version?
    Have you tried scanning with Image Capture or Preview?
    http://h10025.www1.hp.com/ewfrf/wc/document?docname=c03967506&tmp_task=useCategory&cc=us&dlc=en&lc=e...
    Say thanks by clicking the Kudos thumb up in the post.
    If my post resolves your problem, please mark it as an Accepted Solution.
