Lost quorum using quorum server

I have a two node cluster with a third node being used as a quorum server. The quorum server is on a different VLAN than the cluster nodes.
If I shut down or reboot node 1, node 2 dies. I can reboot node 2 without any impact on node 1, it works as it's supposed to. In this scenario the nodes seem to be unable to talk to the quorum server so the cluster thinks its lost quorum and it dies. After it is back up the quorum server is corrupt and shown as offline so the only way to restore it is to remove it from the cluster and readd it.
Node 2 never seems to be able to see the quorum server, it always shows it as offline, even when I add it from node 2. Node 3 is not a cluster configured server, it only has the quorum server installed on it to support the cluster of node 1 & 2. Node 3 is on another vlan, therefore there is a router in the path.
Does the quorum server have to be on the same VLAN so that there's no router in the loop?
Does anybody have any ideas on what could be wrong here? Does the quorum server really work or do I have to add a third server to the cluster just to have a stable cluster?
--- Further info, the clqs show command on node 3 shows disabled false, and keys for node 1, but nothing for node 2 and node 2 can't open the qs.
Thanks,
Terry
Edited by: taccooper on Jan 21, 2010 4:00 AM
Edited by: taccooper on Jan 26, 2010 3:58 AM
Edited by: taccooper on Jan 26, 2010 6:19 AM

I may have answered my own question by applying patch 127406-04. With this patch applied both cluster nodes are now seeing the quorum server. The patch among other things solves an issue with simultaneous opens failing. I will have to do some failure testing to confirm this.
Update: failure testing complete, this patch solved the issue.
Terry
Edited by: taccooper on Jan 27, 2010 5:20 AM

Similar Messages

  • Cannot start a WSFC node with -ForceQuorum in a cluster that's lost quorum

    Hi,
    I've got a really simple setup: a two node healthy cluster constisting of SRV1 and SRV2. Current Vote is 1 for SRV1 and 0 for SRV2. To simulate a lost node (and in this case cluster losing quorum) I remove SRV1 from the network. Failover Cluster Manager
    (FCM) on SRV1 pretty instantly reports the status of the nodes as:
    SRV1 - UP
    SRV2 - DOWN
    Fine. On SRV2 however, nothing happens in FCM for some time. After about a minute, FCM loses contact with the cluster. When I try to reconnect FCM to the local node (SRV2), I get the following error:
    Node 'SRV2' is in the process of being started. The remote server has been paused or is in the process of being started.
    Waiting does not help - the problem persists. I then resort to PowerShell and "Start-ClusterNode -ForceQuorum". It responds with State=Joining. But the node is never started. Cannot connect to it in FCM. And any other PowerShell command (e.g. Get-ClusterNode)
    returns "The remote server has been paused or is in the process of being started".
    What am I doing wrong? How can I manually force a node to start in a cluster that's lost quorum?
    Kindly,
    Fredrik

    Hi Fredrik,
    Could you clarify how you “remove SRV1” did you use the FCM or unplug-in the network? If you use FCM the cluster resource will move two SRV2 automatically and you needn’t to
    force the SRV 2 up, but why you say the cluster resource is:
    SRV1 - UP
    SRV2 – DOWN
    Does the SRV1 has “removed” right? if it is mistype and the scenario is
    SRV1 – DOWN, it must your cluster may have some incorrect configuration, please run the cluster validation first then post the warning and error section. With two node witness we need to use Node and Disk Majority quorum mode please confirm
    you have choose the correct witness mode.
    The related KB:
    Appendix B: Additional Information About Quorum Modes
    https://technet.microsoft.com/en-us/library/cc770830(v=ws.10).aspx
    Overview and Requirements for a Two-Node Failover Cluster
    https://technet.microsoft.com/en-us/library/cc772544(v=ws.10).aspx
    I’m glad to be of help to you!
    Please remember to mark the replies as answers if they help and unmark them if they provide no help. If you have feedback for TechNet Support, contact [email protected]

  • Quorum Server Redundancy Question

    Hi All,
    I'm just investigating my options for a new cluster configuration and was trying to find out about multiple quorum servers hosts. All the examples I have come across in the documentation have 1 physical host acting as a quorum server for an n+1 node cluster. I'm ssuming that there will be quorum issues in the event the physical host, hosting the quorum server, is down and the cluster nodes performed a reconfiguration/switch when the quorum server was unavailable.
    Is it possible to have 2 physical hosts, with quorum servers defined on each, that can then be confidured in to the cluster. Effectivly pointing at two different quorum servers for votes?

    Correct, the QS is only used if the cluster changes state, i.e. nodes leave or join. However, having more than 1 QS for a single cluster does not help. You simply lower your overall availability because there are more failure scenarios where one of these is down, leading to insufficient votes for the remaining cluster node to obtain.
    Active monitoring and prompt repair of the QS (or QD) is the right approach.
    Tim
    ---

  • Quorum Server Bug?

    I have a 2 node cluster using a quorum server running on a third server. I turned off one node this morning, and then used svcadm to disable the quorum server on the third server. THis was in an attempt to panic the remaining node for testing purposes.
    It's been about 20 minutes now, and the remaining node has not panicked from losing quorum. clq status and scstat -q show essentially the same things, though in diffent formats. There are 2 votes needed, 2 present, and 3 possible. Under the details, the first node has 0 votes present, the up node has 1, and the disabled quorum server has 0.
    cletus# clq status
    Cluster Quorum ===
    --- Quorum Votes Summary ---
                Needed   Present   Possible
                2        2         3
    --- Quorum Votes by Node ---
    Node Name       Present       Possible       Status
    brandine        0             1              Offline
    cletus          1             1              Online
    --- Quorum Votes by Device ---
    Device Name       Present      Possible      Status
    hwi_san_qs        0            1             Offline
    cletus# scstat -q
    -- Quorum Summary --
      Quorum votes possible:      3
      Quorum votes needed:        2
      Quorum votes present:       2
    -- Quorum Votes by Node --
                        Node Name           Present Possible Status
      Node votes:       brandine            0        1       Offline
      Node votes:       cletus              1        1       Online
    -- Quorum Votes by Device --
                        Device Name         Present Possible Status
      Device votes:     hwi_san_qs          0        1       Offline
    cletus# cletus# clq status
        1             1              Online
    --- Quorum Votes by Device ---
    Device Name       Present      Possible      Status
    hwi_san_qs        0            1             OfflineCould this be a bug with quorum math when used with quorum server? Is it related to me using svcadm to shut down the quorum server gracefully?

    To add what Tim has described. The improved quorum monitoring feature will detect that you shut down the quorum server, but will not panic the remaining node. Why should it? It knows it is the only node left and can safely continue to run. When the second node failed it had enough votes to reconfigure!
    Regards
    hartmut

  • Quorum Server Question

    Hi All,
    If using two or more clusters, can cluster nodes be quorum servers for
    for other clusters?
    /Regards
    Ulf

    Yes, if is feasible. There have been discussions internally about the possibility of making an HA quorum server. Personally, I'm not sure of the value of doing this as you need to guard against the possibility of correlated failured that cause everything to fail in a cascading manner.
    Tim
    ---

  • SC3.2 Quorum Server in S10 container?

    Hi,
    Is it possible to have a single server act as a quorum device for several (test / development) clusters? If it's not supported by the software, perhaps by installing multiple copies of the software within S10 zones?
    TIA. Tom

    Hi Tom,
    It is supported and documented in http://docs.sun.com/app/docs/doc/819-5360
    you just have to add separate lines in /etc/scqsd/scqsd.conf and differentiate by instance name and port.
    You only have to be aware that a single quorum server is s single point of failure and thatt's it.
    Cheers
    Detlef

  • Lync 2013 FE quorum server

    Hi,
    1. how i can check which is the perimary quorum server in the FE pool lync 2013 ?
    2. how i can change the quorum server before restart the FE server ?
    Thanks,

    1. Get-CsPoolFabricState will get you the node order. 2. You can't. Well not that I've been able to work out. The only option is to reset the fabric, which you can do of you run into issues with services not starting as a result of fabric issues.
    If this helped you please click "Vote As Helpful" if it answered your question please click "Mark As Answer" | Blog
    www.lynced.com.au | Twitter
    @imlynced

  • Clq status shows quorum server offline even though the clq service is runni

    Hi,
    In a 2 Node + 1 QS sun cluster 3.2 cluster, clq status is showing quorum sever offline even though the clq process is running on the quorum server. to make the quorum server online, i have to either remove and add the quorum server from cluster, or incase if there is a failure on any one of the node's both th nodes will reboot and once both joined to the cluster, I can see clq status showing quorum server online!!!
    Why is the quorum server going offline automatically?
    Any help would be highly appreciated
    Many thanks in advance
    Ushas Symon

    Hi,
    I asssume you mean the scqsd process is running on the QS, right?
    A QS is shown as offline, it the monitor could not reach it when it last tried. This is usually due to a networking problem.
    If you issue a clq status, the monitor checks again and if it can reach the QS will change its status back to online.
    If this does not happen, check your logs, what kind of error message showed up.
    Does clqs show on the QS show the correct information?
    It is obvious, that if a node dies and the QS has been offline prior to the node death, that the other node will die as well due to lack of quorum, i.e. it has less votes than needed. You seem to have a basic networking problem or something is really wrong with your QS.
    Regards
    Hartmut

  • Clq status shows quorum server offline eventhough it is online

    Hi,
    I am facing a problem in my cluster that when I bring all the three nodes in a cluster (2 app-nodes and one quorum server) at almost the same time, the clq status on any of the cluster nodes is showing quorum server as offline. when I do the clqs show on the quorum server, i ge the below output.
    clqs show
    === Quorum Server on port 9000 ===
    --- Cluster beacluster (id 0x4916625B) Reservation ---
    Node ID: 1
    Reservation key: 0x4916625b00000001
    --- Cluster beacluster (id 0x4916625B) Registrations ---
    Node ID: 1
    Registration key: 0x4916625b00000001
    Node ID: 2
    Registration key: 0x4916625b00000002
    this is cluster 3.2
    any inputs will be appreciated
    Thaks in advance
    Ushas Symon

    Hi, this is solaris cluster 3.2u1..
    I got the quorum server online by
    # clq status
    --- Quorum Votes by Device ---
    Device Name Present Possible Status
    rac1 1 1 Offline
    # clq add -t quorum_server -p qshost=xxxxxxxxxx -p port 9001 rac2
    # clq status
    --- Quorum Votes by Device ---
    Device Name Present Possible Status
    rac1 1 1 Online
    rac2 1 1 Online
    By just adding one another QS both the QS status came online !!!!!!!!!!!!
    no IDEA, what is happening...
    anyways I have deleted the second QS by #clq remove rac2 and #clq reset
    now it is fine..
    Thanking you all
    Ushas Symon

  • Adobe cloud has lost connection with the server

    Adobe cloud has lost connection with the server, can't get adobe cloud to work so I can re-install photoshop to fix a corrupted DLL file.

    *Adding new items/removing orphans*
    Try iTunes Folder Watch or iTunes Library Updater. Folder Watch is much faster on the adding files front, can be set to run in the background and includes a useful exclusion feature, however it’s slow at removing orphans. iTLU is better for this although doing it manually after looking at a list of proposed removals generated by Folder Watch is probably faster still. iTLU can also be set to update iTunes when you've used 3rd party tools to change tag info.
    You may need to amend the list of file types these programs look for. My list includes:
    .mp3 .mp4 .m4a .m4b .m4p .m4v .mov .wav .aif .mid .ipa .ipg .ite .itlp .m4r .pdf
    Note the last 6 types may not be recognised as already being in the library so should either be omitted from the search or you can add (at least for Folder Watch) individual exclusions for files you know are already in your library.
    tt2

  • PI 7.11: Cannot connect to server using message server:...

    Hello Guys,
    we make the Application Management for a Customer PI System.
    Scenario:
    - the SAP Gui Connection to the ABAP Stack is routed via SAPRouter and Works fine.
      SAP Gui -> our SAP Router -> VPN Box from Customer -> Firewall Customer -> ABAP Stack PI System
    - WebAccess its working fine, the Customer use Webdispatcher on every PI Server...
      Browser -> VPN Box from Customer -> Firewall Customer -> Java Stack (Port: 5xx00 btw. 81xx (Webdispatcher))
    Problem:
    Our Problem ist, we can not proceed the Integration Builder or the ESB, the Java Web Start works fine and open the Logon Screen Correctly -> but i fill the Logon Screen with my User name and Password and press Logon come the follwing Error:
    "Cannot connect to server using message server: ms://<hostname>.<domain>:8134/P4"
    In the Details from the Error Message:
    "<hostname>.<domain>:53404 Reason: com.sap.engine.services.rmi_p4.P4IOException:
    Cannot open connection to host: <IP-Adress of Central Instance> and Port: 53404"
    The Customer says, the Firewall is open with the IP Adresses and P4 Port but i dont think so...
    Can everybody help me, or have tips for me! I have checked a lot of OSS Messages (PI High Availabilty etc... its all correct on the System)
    Sorry for my bad English
    Best Regards,
    Markus

    Hi Markus,
    did you check if the browser is using a proxy? (In this case your scenario unfortunately won't work).
    P4-port should generally be routed via a proxy (described in the help.sap.com), but within the PI-Tools(JNLP) the proxy-usage is not implemented.  There is even a SAP-note that describes how to check the JavaWebStart-Proxyconfiguration, but this won't help either.
    If there is a proxy defined in the browser everything is working fine till you pass the logon-screen but even with the correct "javaws"-settings you won't be able to go on.
    (This problem is pretty bad if you do have developers and the SAP-servers seperated because of security issues. I'm hoping that this malfunction will be solved with upcoming patches.)
    Solution: Establish a connection without any proxy in between.
      E.g.: a terminal server in the same network
    It would be helpful to find more people with the same problem to force a fix from SAP for that.
    If anyone else is having problems with this, please add a comment to this thread.
    Best regards
    Christian

  • "cannot send email message using the server icloud" on 10.6.8

    I use mail from my desktop, not from iCloud on the Internet. I have a MacBook Pro with 10.6.8 Snow Leopard. My mail was working fine until yesterday.
    It says "Cannot send message using the server iCloud. connections to the server smtp.me.com on the default ports timed out. Select a different outgoing mail server from the list below.
    The list has:
       ICloud offline
       and Icloud
    Neither of them work.
    What has Apple changed regarding this in September?
    (I know others have posted similar message, but they were on 10.7)

    Same problem here.
    Recently an @icloud.com version (alias) of my existing @me.com email address appeared on my account. I can still send messages form the OSX build in Mail client as long as I am using the @me.com version of my email, but I do get this message when I try to use the new @icloud.com account.
    My account with iCloud states that both @me.com and @icloud.com versions of my email are active.
    Would be nice to know if this is a temporary problem of if this is a permanent one.
    The settings on the outgoing mail server are default as retrieved from apple when I configured the @me.com email for the first time. I went through the troubleshooting suggestions as provided on Apple website, double checking all the settings, no joy. At the very list I can still use the old @me.com alias with no problem (for now)
    MacBook Pro / Mountain Lion 10.8.2

  • "Cannot send message using the server....."

    Hi all,
    Considering the nature of the problem I am about to relate I would have to say at the outset that I would be very very surprised if other people have not come across this problem, so here goes...
    We have around 60 users of Apple Mail from both 10.4 and 10.5, so varying degrees of versions of Apple Mail however most if not all are updated to 10.4.11 and 10.5.2.
    We have been plagued with people being frustrated about emails bouncing back with an immediate error which is basically the following...
    "Cannot send message using the server smtp.xxx.com:user
    Sending the message content to the server failed.
    Select a different outgoing mail server from the list below etc etc"
    I am sure a lot of you have seen this error.
    However, it is totally random but I am at the end of my tether with it. It generally revolves around emails with attachments and can be totally random. I was trying to send a screenshot today, very small screenshot, using the Apple-Shift-4 technique, sent the .png file, then saved it out as a .jpg, nothing. Tiny file, around 5k. Got the error above, took it out, sent no problem. Other similar files on the desktop refused to send but a .pdf did. I then thought it might be our server, so sent teh same attachments using my .mac account. Same result and failed to send. Reports from other users in our group show that they too get random results, maybe moving the attachment in the email makes it go, sometimes putting it before your signature, sometimes putting your signature copied and pasted in so many times makes it work, all sorts of methods but all resulting in the same conclusion, Apple Mail can be very unreliable.
    We have even migrated some users to Entourage and the problem disappears. Even to Thunderbird, but those users miss the search capability as it is quicker and more reliable. So they want to go back.
    Considering I have been struggling with this issue back in the day when we were on the Apple Mail related version in 10.4 I was hoping that the version released in 10.5 would remedy the problems. Sometimes I feel it has just got worse.
    Is anyone else experiencing this sort of difficulty in Apple Mail, I really feel isolated and at a loss with how to remedy this for so many users.
    If anyone can share their experiences and how they have got around similar issues in Mail I am all ears and open to any suggestions.
    Thanks everyone for taking the time to read through this. There is more but the experiences are so random it is not worth trying to put it all down.
    Thanks again.
    Gerry McCoy

    I went in to Connection Doctor and. oddly enough, for this Mac account it said I was on Port 25. Si I changed it to Port 587 and saved the changes.
    Still, I have the same problem with the same error messages.
    I go back to the mail preferences > Accounts > Advanced and it shows Port 143 still there grayed out.
    What about SSL - it's not checked.
    Odd that this problem only seems to be from one .mac account emailing to another .mac account. Could the server be down?

  • Cannot send message using the server (null)

    i use mail 2.1.
    i have a .mac account and have three other email accounts attached to my mail account.
    lately, i cannot send any email.
    the switchiing ports fix hasn't helped either.
    this is the error message:
    CANNOT SEND MESSAGE USING THE SERVER (null)
    The server response was: 5.1.0 <email [email protected]>...
    From address does not match authentication.
    Use the pop-up menu below to try a different outgoing mail server. All messages will use this server until you quit Mail or change your network settings.
    Message from: email <[email protected]>
    Send message using: [there is a combo box here with all the four accounts servers listed]
    no matter which one i pick it doesn't work and no email is sent.
    anyone have this error before? or now how to fix it?
    i'd be appreciative.
    thanks
    1.67 GHz Power PC PowerBook G4   Mac OS X (10.4.6)   Sony HDR HC3 HD HandyCam MiniDV

    I was having a similar problem (don't feel like typing all the details)
    I was about to to delete my com.apple.mail.plist, when finally it hit me.
    I ran ethereal (again, I'm sorry, but learning how to use ethereal is a topic unto itself). Following the TCP stream (ie. looking at the smtp messages being sent back and forth) I came across two problems. For some reason my port number was set to 567 or something like that, when it's supposed to be 25, as I had originally set it to.
    Once I corrected the port number I started receiving an error message from the smtp server. It said the return email address could not be authenticated. (using xyz.com as an example) The correct return email address was supposed to be [email protected], but for some reason it was changed to john@xyz in the account settings.
    Anyway, to get to the point, another thing to check is that your return address has been set correctly, and if all else fails, make sure you have X11 installed and use fink to install and run ethereal. This will let you know if you are actually connecting to the server, and will show you any error messages.
    PS. I think this problem started occurring with the last update made to mail. I believe it somehow corrupted my settings. This would explain how my port number could have been changed to the default port number of .mac mail.

  • Cannot Send Messages Using  the Server

    I am dependent (during the day) on a wireless connection to the Minneapolis Wi-Fi system (U.S. Wireless).
    I've got an e-mail account with comcast and I have successfully interfaced this with the Apple Mail application that came with OS X 10.5.4 and for many weeks have been happily sending and receiving e-mails. But....
    I've been on the road and had to connect with Hotel internet services and I was picking up free WiFi in NYC when I was there.
    I first noticed the problem when I was staying at a hotel in Vermont.
    I would try to send e-mails and I would get the message:
    CANNOT SEND MESSGE USING THE SERVER __________.
    Select a different outgoing mail server....
    Now, I am back home and using my U.S. Wireless connection (which has been really bad lately).
    I keep getting these blasted messages and my mail sometimes goes through but more than often, I get these "cannot send message.." notices and my e-mail just sits there going nowhere in the outbox.
    How can I solve this problem?

    Beside the SMTP name -- smtp.comcast.net -- there is a pair of arrows, with one pointing up and one pointing down. If you click on those arrows you will be presented with a list of all SMTP ever enter (you may only have one), and also the command to Edit Server List. If you choose Edit Server List, you will be presented with a completely new setup window, dealing only with SMTP servers, and that window will have two tabs, one of which is also Advanced.
    From the name, smtp.comcast.net, without your Username appended, would indicate that an Authentication of None is currently in effect. With changes that Comcast has made recently, whether you use Port 25 or Port 587, I believe you would have to use Password Authentication, most certainly if the latter Port 587 is chosen.
    If you click on the link below, although not for Comcast, you will nevertheless see in section 12 through 15, screenshots that cover the SMTP setup that I am describing above.
    http://wildblueworld.com/dishmail.net/howdoi-applemail.php#2
    Ernie

Maybe you are looking for

  • What are the folder/file names of PSE12 catalogs?

      I'm curious about what files/folders are created by PSE 12 when setting up the original catalog and then each subsequent catalog.  Also, what files/folders are created for the PSE backup and is there a default folder/file structure tied to the cata

  • Error converting files to PDF in Internet Explorer- Adobe Acrobat Pro 8

    I've recently upgraded to Adobe Acrobat Pro 8, from AAP 7. With AAP 7 I was able to open my windows based program, right click and convert the page to PDF. Since I've upgraded I am unable to do this anymore. When I right click and click to convert to

  • Save Copy to Complete Later

    Any suggestions on how to create forms that allow users this option? What I'm experiencing so far is that Acrobat either saves the form as an unfillable PDF, or saves the fillable form without any of the data the user has entered... how does one save

  • Inputting from the keyboard

    hi everyone, can anyone help me on how to read what the user inputs from the keyboard when running a program in command-line(ms-dos prompt)interface.Does it have something to do with System.in any help would be appreciated. (Example) System.out.print

  • Alv hierarchy subtotal in 2 fields

    Hi gurus, First, thanks to all those who helped me with my first problem. I have a new question, still regarding subtotal. The client now wants to have a subtotal in alv hierarchy per vendor and purchase doc. My output displays a subtotal per purchas