Checking cluster ocfs2 is offline,Starting cluster ocfs2: Failed

Checking cluster ocfs2 is offline. And when I online the ocfs2 ,display the following error message,who can tell me what's wrong with the ocfs2? thanks!!!!
# /etc/init.d/o2cb status
Module "configfs": Loaded
Filesystem "configfs": Mounted
Module "ocfs2_nodemanager": Loaded
Module "ocfs2_dlm": Loaded
Module "ocfs2_dlmfs": Loaded
Filesystem "ocfs2_dlmfs": Mounted
Checking cluster ocfs2: Offline
# /etc/init.d/o2cb force-reload
Unmounting ocfs2_dlmfs filesystem: OK
Unloading module "ocfs2_dlmfs": OK
Unmounting configfs filesystem: OK
Unloading module "configfs": OK
Loading module "configfs": OK
Mounting configfs filesystem at /config: OK
Loading module "ocfs2_nodemanager": OK
Loading module "ocfs2_dlm": OK
Loading module "ocfs2_dlmfs": OK
Mounting ocfs2_dlmfs filesystem at /dlm: OK
Starting cluster ocfs2: Failed
Cluster ocfs2 created
o2cb_ctl: Configuration error discovered while populating cluster ocfs2. None of its nodes were considered local. A node is considered local when its node name in the configuration maches this machine's host name.
Stopping cluster ocfs2: OK
# /etc/init.d/o2cb online ocfs2
Starting cluster ocfs2: Failed
Cluster ocfs2 created
o2cb_ctl: Configuration error discovered while populating cluster ocfs2. None of its nodes were considered local. A node is considered local when its node name in the configuration maches this machine's host name.
Stopping cluster ocfs2: OK
ocfs2_hb_ctl: Unable to access cluster service while starting heartbeat
mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted"
: Could not mount /dev/sdb1
o2cb_ctl: Unable to access cluster service while creating node

Hi
Getting exactly the same errors when trying to implement OCFS2 on a VMWARE solution
[root@raclinux1 init.d]# ./o2cb status
Module "configfs": Loaded
Filesystem "configfs": Mounted
Module "ocfs2_nodemanager": Loaded
Module "ocfs2_dlm": Loaded
Module "ocfs2_dlmfs": Loaded
Filesystem "ocfs2_dlmfs": Mounted
Checking cluster ocfs2: Offline
[root@raclinux1 init.d]# ./o2cb online ocfs2
Starting cluster ocfs2: Failed
Cluster ocfs2 created
o2cb_ctl: Configuration error discovered while populating cluster ocfs2. None of its nodes were considered local. A node is considered local when its node name in the configuration maches this machine's host name.
Stopping cluster ocfs2: OK
And from ocfs2console we get the following errors when trying to add our node to the configuration
o2cb_ctl: Unable to access cluster service while creating node
Could not add node raclinux1
Any one got it to work on VMWARE?
PMcM

Similar Messages

  • Starting cluster ocfs2: Failed,Checking cluster ocfs2 is offline

    Checking cluster ocfs2 is offline. And when I online the ocfs2 ,display the following error message,who can tell me what's wrong with the ocfs2? thanks!!!!
    # /etc/init.d/o2cb status
    Module "configfs": Loaded
    Filesystem "configfs": Mounted
    Module "ocfs2_nodemanager": Loaded
    Module "ocfs2_dlm": Loaded
    Module "ocfs2_dlmfs": Loaded
    Filesystem "ocfs2_dlmfs": Mounted
    Checking cluster ocfs2: Offline
    # /etc/init.d/o2cb force-reload
    Unmounting ocfs2_dlmfs filesystem: OK
    Unloading module "ocfs2_dlmfs": OK
    Unmounting configfs filesystem: OK
    Unloading module "configfs": OK
    Loading module "configfs": OK
    Mounting configfs filesystem at /config: OK
    Loading module "ocfs2_nodemanager": OK
    Loading module "ocfs2_dlm": OK
    Loading module "ocfs2_dlmfs": OK
    Mounting ocfs2_dlmfs filesystem at /dlm: OK
    Starting cluster ocfs2: Failed
    Cluster ocfs2 created
    o2cb_ctl: Configuration error discovered while populating cluster ocfs2. None of its nodes were considered local. A node is considered local when its node name in the configuration maches this machine's host name.
    Stopping cluster ocfs2: OK
    # /etc/init.d/o2cb online ocfs2
    Starting cluster ocfs2: Failed
    Cluster ocfs2 created
    o2cb_ctl: Configuration error discovered while populating cluster ocfs2. None of its nodes were considered local. A node is considered local when its node name in the configuration maches this machine's host name.
    Stopping cluster ocfs2: OK
    ocfs2_hb_ctl: Unable to access cluster service while starting heartbeat
    mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted"
    : Could not mount /dev/sdb1
    o2cb_ctl: Unable to access cluster service while creating node

    http://unirac.in/qtoa/122/o2cb_ctl-unable-access-cluster-service-while-creating-node
    TODO:
    - Quit ocfs2console
    - Stop the service
    - Remove file /etc/ocfs2/cluster.conf
    - Restart ocfs2console
    - Configure the nodes again
    [root@rac1 ~]#  /etc/init.d/ocfs2 stop
    Stopping Oracle Cluster File System (OCFS2)                [  OK  ]
    [root@rac1 ~]# /etc/init.d/o2cb offline ocfs2
    [root@rac1 ~]# /etc/init.d/o2cb unload
    Unmounting ocfs2_dlmfs filesystem: OK
    Unloading module "ocfs2_dlmfs": OK
    Unmounting configfs filesystem: OK
    Unloading module "configfs": OK
    [root@rac1 ~]# rm -f /etc/ocfs2/cluster.conf
    [root@rac1 ~]# /usr/sbin/ocfs2console &

  • Cluster resource does not start automatically

    Hello,
    I have a strange issue. last saterday i replaced the motherboards of three HP DL380G6 servers, there is an HW issue with the production date of this hardware.
    After change of the HW, the cluster resources does not start automatically, when booting the servers. The resources come in state 'offline'(except the master IP resource). When start manually, the cluster resources works fine. I checked the eDirectory , and changed also the startup parameter of the cluster (autostart-manually-autostart) but with no results.
    When i create a new (simple) CL-resource, i have the same issue with this resource. Only the master ip CL-resource starts automatically??
    Does anyone have an answer for this problem....
    Thanks in advanced
    3-node SLES9sp3 SLES/OES1 cluster (i know its old...)
    - iSCSI connection to an SLES11-SAN

    Originally Posted by changju
    The state is stored in eDir, so your problem is most likely caused by an eDir issue.
    Please check the attribute "NCS:CRM Failover Mode" of the resource object in eDir. If its value is an odd number, it means the resource will load automatically. If it's not, add 1 to the number and save the new value.
    Then, you need to check file "/var/opt/novell/ncs/resources.xml". Find the section for the resource, and check "nCSCRMFailoverMode" to make sure it matches what you have in eDir. If not, please run command "/opt/novell/ncs/bin/ncs-configd.py -init" to try to update it.
    If the problem persists after all the above, please contact NTS. We will have it resolved.
    Best regards,
    Changju

  • Grid Infrastructure Does Not Start Cluster Resources

    Hello Gurus,
    I configured a 2 node RAC cluster using VirtualBox.It has been running fine all along and each time I started one of the nodes, I will definitely see all of other Cluster Resources will be started eventually.
    However, after I left it untouched for a month (VM is stopped), I found out that after starting up the machine, only local resource which is ONLINE.
    This is what I get:
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource -t
    NAME           TARGET  STATE        SERVER                   STATE_DETAILS      
    Local Resources
    ora.CRS.dg
                   ONLINE  ONLINE       oel6-112-rac1                               
    ora.DATADG.dg
                   ONLINE  ONLINE       oel6-112-rac1                               
    ora.FRADG.dg
                   ONLINE  ONLINE       oel6-112-rac1                               
    ora.LISTENER.lsnr
                   OFFLINE OFFLINE      oel6-112-rac1                               
    ora.asm
                   ONLINE  ONLINE       oel6-112-rac1            Started            
    ora.gsd
                   OFFLINE OFFLINE      oel6-112-rac1                               
    ora.net1.network
                   ONLINE  ONLINE       oel6-112-rac1                               
    ora.ons
                   ONLINE  ONLINE       oel6-112-rac1                               
    Cluster Resources
    ora.LISTENER_SCAN1.lsnr
          1        OFFLINE OFFLINE                                                  
    ora.LISTENER_SCAN2.lsnr
          1        OFFLINE OFFLINE                                                  
    ora.LISTENER_SCAN3.lsnr
          1        OFFLINE OFFLINE                                                  
    ora.cvu
          1        OFFLINE OFFLINE                                                  
    ora.oc4j
          1        OFFLINE OFFLINE                                                  
    ora.oel6-112-rac1.vip
          1        OFFLINE OFFLINE                                                  
    ora.oel6-112-rac2.vip
          1        OFFLINE OFFLINE                                                  
    ora.racdb.db
          1        OFFLINE OFFLINE                               Instance Shutdown  
          2        OFFLINE OFFLINE                                                  
    ora.scan1.vip
          1        OFFLINE OFFLINE                                                  
    ora.scan2.vip
          1        OFFLINE OFFLINE                                                  
    ora.scan3.vip
          1        OFFLINE OFFLINE                                               
    and these are my other resources
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource -t -init
    NAME           TARGET  STATE        SERVER                   STATE_DETAILS    
    Cluster Resources
    ora.asm
          1        ONLINE  ONLINE       oel6-112-rac1            Started          
    ora.cluster_interconnect.haip
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.crf
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.crsd
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.cssd
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.cssdmonitor
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.ctssd
          1        ONLINE  ONLINE       oel6-112-rac1            ACTIVE:0         
    ora.diskmon
          1        OFFLINE OFFLINE                                                
    ora.evmd
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.gipcd
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.gpnpd
          1        ONLINE  ONLINE       oel6-112-rac1                             
    ora.mdnsd
          1        ONLINE  ONLINE       oel6-112-rac1                       
    Where do I supposed to check to see why the Cluster Resource like SCAN Listener, Database and etc not running?
    I've been checking on the logs but I haven't figured out what I should be looking at.
    Can some body help me?
    Thank you in advanced,
    Adhika

    Hi Freddie,
    I saw these lines on that Clusterware alert log:
    2013-07-01 22:39:20.084
    [crsd(3338)]CRS-1012:The OCR service started on node oel6-112-rac1.
    2013-07-01 22:39:20.208
    [evmd(3145)]CRS-1401:EVMD started on node oel6-112-rac1.
    2013-07-01 22:39:21.549
    [crsd(3338)]CRS-1201:CRSD started on node oel6-112-rac1.
    2013-07-01 22:39:22.715
    [/u01/app/11203/grid/bin/oraagent.bin(3450)]CRS-5016:Process "/u01/app/11203/grid/bin/lsnrctl" spawned by agent "/u01/app/11203/grid/bin/oraagent.bin" for action "check" failed: details at "(:CLSN00010:)" in "/u01/app/11203/grid/log/oel6-112-rac1/agent/crsd/oraagent_grid/oraagent_grid.log"
    2013-07-01 22:39:22.728
    [/u01/app/11203/grid/bin/oraagent.bin(3450)]CRS-5016:Process "/u01/app/11203/grid/bin/lsnrctl" spawned by agent "/u01/app/11203/grid/bin/oraagent.bin" for action "check" failed: details at "(:CLSN00010:)" in "/u01/app/11203/grid/log/oel6-112-rac1/agent/crsd/oraagent_grid/oraagent_grid.log"
    2013-07-01 22:39:22.772
    [/u01/app/11203/grid/bin/oraagent.bin(3450)]CRS-5016:Process "/u01/app/11203/grid/bin/lsnrctl" spawned by agent "/u01/app/11203/grid/bin/oraagent.bin" for action "check" failed: details at "(:CLSN00010:)" in "/u01/app/11203/grid/log/oel6-112-rac1/agent/crsd/oraagent_grid/oraagent_grid.log"
    2013-07-01 22:39:22.811
    [/u01/app/11203/grid/bin/oraagent.bin(3450)]CRS-5016:Process "/u01/app/11203/grid/bin/lsnrctl" spawned by agent "/u01/app/11203/grid/bin/oraagent.bin" for action "check" failed: details at "(:CLSN00010:)" in "/u01/app/11203/grid/log/oel6-112-rac1/agent/crsd/oraagent_grid/oraagent_grid.log"
    2013-07-01 22:39:23.069
    [/u01/app/11203/grid/bin/oraagent.bin(3450)]CRS-5016:Process "/u01/app/11203/grid/opmn/bin/onsctli" spawned by agent "/u01/app/11203/grid/bin/oraagent.bin" for action "check" failed: details at "(:CLSN00010:)" in "/u01/app/11203/grid/log/oel6-112-rac1/agent/crsd/oraagent_grid/oraagent_grid.log"
    2013-07-01 22:39:23.567
    [crsd(3338)]CRS-2772:Server 'oel6-112-rac1' has been assigned to pool 'Generic'.
    2013-07-01 22:39:23.568
    [crsd(3338)]CRS-2772:Server 'oel6-112-rac1' has been assigned to pool 'ora.racdb'.
    The I started looking in the /u01/app/11203/grid/log/oel6-112-rac1/agent/crsd/oraagent_grid/oraagent_grid.log file and found out that at the same time (2013-07-01 22:39:22),
    I saw the following lines:
    2013-07-01 22:39:22.433: [ora.CRS.dg][1644160768] {1:13152:2} [check] DgpAgent::getConnxn connection failure 1
    2013-07-01 22:39:22.433: [ora.CRS.dg][1644160768] {1:13152:2} [check] DgpAgent::getConnxn failed CRS-5000: Expected resource ora.asm does not exist in agent process
    2013-07-01 22:39:22.434: [ora.CRS.dg][1644160768] {1:13152:2} [check] DgpAgent::getConnxn try getInstanceInforWhenASMFail
    2013-07-01 22:39:22.434: [ora.CRS.dg][1644160768] {1:13152:2} [check] CrsCmd::ClscrsCmdData::stat entity 1 statflag 33 useFilter 0
    But that does not prevent the asm from being started properly.
    The only local resource that didn't start up automatically was the LISTENER.
    The following command shows that the local LISTENER has hard dependency on ora.cluster_vip_net1.type which is the ora.oel6-112-rac1.vip
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource ora.LISTENER.lsnr -p | grep -ie dependencies
    START_DEPENDENCIES=hard(type:ora.cluster_vip_net1.type) pullup(type:ora.cluster_vip_net1.type)
    STOP_DEPENDENCIES=hard(intermediate:type:ora.cluster_vip_net1.type)
    NAME=ora.oel6-112-rac1.vip
    TYPE=ora.cluster_vip_net1.type
    START_DEPENDENCIES=hard(ora.net1.network) pullup(ora.net1.network)
    STOP_DEPENDENCIES=hard(ora.net1.network)
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource ora.net1.network -p | grep -ie dependencies
    START_DEPENDENCIES=
    STOP_DEPENDENCIES=
    The ora.net1.network resource started properly and I didn't see that this prevent the ora.oel6-112-rac1.vip from starting up.
    The following lines also show that the ora.asm resource is has a weak dependency only against the ora.LISTENER.lsnr
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource ora.racdb.db -p | grep -ie dependencies
    START_DEPENDENCIES=hard(ora.DATADG.dg,ora.FRADG.dg) weak(type:ora.listener.type,global:type:ora.scan_listener.type,uniform:ora.ons,global:ora.gns) pullup(ora.DATADG.dg,ora.FRADG.dg)
    STOP_DEPENDENCIES=hard(intermediate:ora.asm,shutdown:ora.DATADG.dg,shutdown:ora.FRADG.dg)
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource ora.CRS.dg -p | grep -ie dependenci
    START_DEPENDENCIES=hard(ora.asm) pullup(ora.asm)
    STOP_DEPENDENCIES=hard(intermediate:ora.asm)
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource ora.DATADG.dg -p | grep -ie dependencies
    START_DEPENDENCIES=hard(ora.asm) pullup(ora.asm)
    STOP_DEPENDENCIES=hard(intermediate:ora.asm)
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource ora.FRADG.dg -p | grep -ie dependencies
    START_DEPENDENCIES=hard(ora.asm) pullup(ora.asm)
    STOP_DEPENDENCIES=hard(intermediate:ora.asm)
    [grid@oel6-112-rac1 bin]$ ./crsctl status resource ora.asm -p | grep -ie dependencies
    START_DEPENDENCIES=weak(ora.LISTENER.lsnr)
    STOP_DEPENDENCIES=
    I'm a little lost here.
    A suggestion would be very much appreciated.
    Thank you,
    Adhika

  • How can i start cluster stack ?

    Dears,,
    I am using Oracle Enterprise Linux 32-bit
    and trying to install Oracle Clusterware 10g
    While trying to use ocfs2console to Configure Oracle Cluster File System (OCFS2)
    Error appear to me as following:
    Could not start cluster stack error.This must be resolved before any OCFS2 filesystem can be mounted.
    How can solve this please?
    thanks & regards,,

    M o h a m m e d wrote:
    While trying to use ocfs2console to Configure Oracle Cluster File System (OCFS2)
    Error appear to me as following:
    Could not start cluster stack error.This must be resolved before any OCFS2 filesystem can be mounted.What is the contents of the file +/etc/ocfs2/cluster.conf+ ? Does the same file (same contents) exist on all nodes in the cluster?
    What is the contents of the file +/etc/fstab+ ? Is the ocfs2 device configured to be mounted on a mount point? Does this mount point exist? Is this (config and mount point) identical across all cluster nodes?
    Lastly, what happens when you issue the following commands (on any cluster node) as root:
    +/etc/init.d/o2cb load+
    +/etc/init.d/o2cb online+
    +/etc/init.d/ocfs2 start+

  • App Server 8.1 cluster not all members starting up.

    Hi,
    Solaris 10.
    Two instances of app servers, each in a separate zone, on same machine.
    App servers were just configured with the configure-ha-cluster command to operate in HA mode.
    When restarting the cluster with two members, only one of them comes up:
    ./asadmin start-cluster --user admin --passwordfile domain1.pwd idm-cluster
    Not all server instances in cluster idm-cluster were successfully started.
    Failed to retrieve RMIServer stub: javax.naming.NameNotFoundException: management/rmi-jmx-connector
    Command start-cluster executed successfully.Funnily enough, it's the instance on the local machine that will not start up. I had a similar problem earlier where creating a hadb domain could not contact the local hadb instance...
    The node-agent is running though, but the instance will not start up.
    ./asadmin list-clusters --user admin --passwordfile domain1.pwd
    idm-cluster partially running
    Command list-clusters executed successfully.
    ./asadmin list-node-agents --user admin --passwordfile domain1.pwd
    idm1 running
    idm2 running
    Command list-node-agents executed successfully.
    ./asadmin start-instance --user admin --passwordfile domain1.pwd idm1-instance
    Operation 'startServerInstance' failed in 'servers' Config Mbean.
    Target exception message: Failed to retrieve RMIServer stub: javax.naming.NameNotFoundException: management/rmi-jmx-connector
    CLI137 Command start-instance failed.Any pointers or ideas?
    Thanks!

    Hmmm. I seem to be having the same problem.
    Both node agents are running fine, but the DAS is unable to talk to the cluster.
    Exactly the same error occurs.
    Doesn't seem particularly robust software, imho. Happened several times.

  • Stop/start cluster

    Which is the better one to stop/start the cluster 11.2.0.3;
    What is the difference b/n these commands below ?
    #crsctl start cluster
    # crsctl start has
    #crsctl start crs

    Hello,
    To understand the basic difference b/w all three commands, first you need to understand the daemon process architecture:
    There are four levels of daemons are there from 11.2 onward.
    Level-1 : OHASD'S Spawn_
    Here OHASD(Super Daemon process) will spawn itself and it will start :
    a) CSSDAGENT, b) CSSDMONITOR, c) ORAROOTAGENT and d) ORAAGENT
    Level-2: OHASD cssdagent, cssdmonitor, orarootagent and oraagent spawn_
    Here cssdagent, cssdmonitor, orarootagent and oraagent, they all will start:
    a) CSSD, b) CRSD, c) CTSSD, d) DISKMON, e) GIPCD, f) GPNPD, g) EVMD, h) MDNSD etc.     
    Level-3: CRSD spawns:_
    In this level CRSD will start again:
    a) ORAROOTAGENT and b) ORAAGENT
    Level-4: CRSD orarootagent and oraagent SPAWNS:_
    Both orarootagent and oraagent both will start all the resources like ASM, DB, LISTENER, VIPs etc.
    Now Answer for your question:
    *# CRSCTL STOP CLUSTER:*
    This command will stop all the resources being managed by CRSD's Oragent & Orarootagent.
    *# CRSCTL STOP CRS:*
    This command will stop entire gridInfra i.e all daemons.
    *# CRSCTL STOP HAS:*
    This command is similar like CRSCTL STOP CRS but better to use this command in standalone server having GridInfra.
    Thanks,
    Ashish Shukla
    Edited by: Ashish Shukla on May 20, 2013 3:39 PM

  • SQL Cluster services is not start in secound node.

    SQL Cluster services is not start in secound node.

    Start or Stop the Cluster Service on a Cluster Node
    https://technet.microsoft.com/en-us/library/cc771742.aspx
    Ahsan Kabir Please remember to click Mark as Answer and Vote as Helpful on posts that help you. This can be beneficial to other community members reading the thread. http://www.aktechforum.blogspot.com/

  • Hello. I have a problem with OEL 6.5 and ocfs2. When I mount ocfs2 with mount -a command all ocfs2 partitions mount and work, but when I reboot no ocfs2 partitions auto mount. No error messages in log. I use DAS FC and iSCSI FC.

    Hello.
    I have a problem with OEL 6.5 and ocfs2.
    When I mount ocfs2 with mount -a command all ocfs2 partitions mount and work, but when I reboot no ocfs2 partitions auto mount. No error messages in log. I use DAS FC and iSCSI FC.
    fstab:
    UUID=32130a0b-2e15-4067-9e65-62b7b3e53c72 /some/4 ocfs2 _netdev,defaults 0 0
    #UUID=af522894-c51e-45d6-bce8-c0206322d7ab /some/9 ocfs2 _netdev,defaults 0 0
    UUID=1126b3d2-09aa-4be0-8826-0b2a590ab995 /some/3 ocfs2 _netdev,defaults 0 0
    #UUID=9ea9113d-edcf-47ca-9c64-c0d4e18149c1 /some/8 ocfs2 _netdev,defaults 0 0
    UUID=a368f830-0808-4832-b294-d2d1bf909813 /some/5 ocfs2 _netdev,defaults 0 0
    UUID=ee816860-5a95-493c-8559-9d528e557a6d /some/6 ocfs2 _netdev,defaults 0 0
    UUID=3f87634f-7dbf-46ba-a84c-e8606b40acfe /some/7 ocfs2 _netdev,defaults 0 0
    UUID=5def16d7-1f58-4691-9d46-f3fa72b74890 /some/1 ocfs2 _netdev,defaults 0 0
    UUID=0e682b5a-8d75-40d1-8983-fa39dd5a0e54 /some/2 ocfs2 _netdev,defaults 0 0

    What is the output of:
    # chkconfig --list o2cb
    # chkconfig --list ocfs2
    # cat /etc/ocfs2/cluster.conf

  • Guest Cluster error in Hyper-V Cluster

    Hello everybody,
    in my environment I do have an issue with failover clusters (Exchange, Fileserver) while performing a live migration of one virtual clusternode. The clustergroup is going offline.
    The environment is the following:
    2x Hyper-V Clusters: Hyper-V-Cluster1 and Hyper-V-Cluster2 (Windows Server 2012 R2) with 5 Nodes per Cluster
    1x Scaleout Fileserver (Windows Server 2012 R2) with 2 Nodes
    1x Exchange Cluster (Windows Server 2012 R2) with EX01 VM running on Hyper-V-Cluster1 and EX02 VM running on Hyper-V-Cluster2
    1x Fileserver Failover Cluster (Windows Server 2012 R2) with FS01 VM running on Hyper-V-Cluster1 and FS02 VM running on Hyper-V-Cluster2
    The physical networks on the Hyper-V Nodes are redundant with 2x 10Gb/s uplinks to 2x physical switches for VMs in a LBFO Team:
    New-NetLbfoTeam
    -Name 10Gbit_TEAM -TeamMembers 10Gbit_01,10Gbit_02
    -TeamingMode SwitchIndependent -LoadBalancingAlgorithm HyperVPort
    The SMB 3 traffic runs on 2x 10Gb/s NIC without NIC-Teaming (SMB-Multichannel).
    SMB is used for livemigrations.
    The VMs for clustering were installed according to the technet guideline:
    http://technet.microsoft.com/en-us/library/dn265980.aspx
    Because my Hyper-V Uplinks are allready redundant, I am using one NIC inside the VM.
    As I understand, there is no advantage of using two NICs inside the VM as long they are connected to the same vSwitch.
    Now, when I want to perform a hardware maintenance, I have to livemigrate the EX01 VM from Hyper-V-Cluster1-Node-1 to Hyper-V-Cluster1-Node-2.
    EX02 VM still runs untouched on Hyper-V-Cluster2-Node-1.
    At the end of the livemigration I see error 1135 (source: FailoverClustering) on EX01 VM, which says that EX02 VM was removed from Failover Cluster and I have to check my network.
    The clustergroup of exchange is offline after that event and I have to bring it online again manually.
    Any ideas what can cause this behavior?
    Thanks.
    Greetings,
    torsten

    Hello again,
    I found the cause and the solution :-)
    In the article here: http://technet.microsoft.com/en-us/library/dn440540.aspx
    is the description of my cluster failure:
    ########## relevant part from article #######################
    Protect against short-term network interruptions
    Failover cluster nodes use the network to send heartbeat packets to other nodes of the cluster. If a node does not receive a response from another node for a specified period of time, the cluster removes the node from cluster membership. By default, a guest
    cluster node is considered down if it does not respond within 5 seconds. Other nodes that are members of the cluster will take over any clustered roles that were running on the removed node.
    Typically, during the live migration of a virtual machine there is a fast final transition when the virtual machine is stopped on the source node and is running on the destination node. However, if something causes the final transition to take longer than
    the configured heartbeat threshold settings, the guest cluster considers the node to be down even though the live migration eventually succeeds. If the live migration final transition is completed within the TCP time-out interval (typically around 20 seconds),
    clients that are connected through the network to the virtual machine seamlessly reconnect.
    To make the cluster heartbeat time-out more consistent with the TCP time-out interval, you can change the
    SameSubnetThreshold and CrossSubnetThreshold cluster properties from the default of 5 seconds to 20 seconds. By default, the cluster sends a heartbeat every 1 second. The threshold specifies how many heartbeats to miss in succession
    before the cluster considers the cluster node to be down.
    After changing both parameters in failover cluster as described the error is gone.
    Greetings,
    torsten

  • SAP cluster active/active Windows and cluster active/pasive Database

    Dear Experts,
    I need to know if SAP is restricted to work in an active / active  cluster Windows R2 and database cluster with passive / active. Maybe Oracle database.
    Maybe they can give me some information on how to do it or tell me if this combination is NOT possible. Maybe Oracle as database.
    Please any information you can give me I thank.

    Hi Kumar,
    Please gothru the Installation guides under http://service.sap/com/instguides for required SAP DVD's and PAM for compatible DB's.
    In Node A --> You start with ASCS + SCS installation as below
    1.) Central Services Instance for ABAP (ASCS) -- Consists of Msg. service & Enqueue Service (Shared Disk)
    2.) Central Services Instance (SCS) -- Consists of Msg. service & Enqueue Service (Should be installed after installation of 1.) (Same Shared Disk)
    3.) First Microsoft Cluster Service (MSCS) -- This step creates the SAP cluster group on the first MSCS node and adds the ASCS and SCS instance(s) to the SAP cluster group
    4.) Database Instance
    Node B
    5.) Additional Microsoft Cluster Service (MSCS)
    6.) Enqueue Replication Server BOTH NODES, First in NODE A followed by NODE B
    7.) Central Instance NODE A-- If you install the central instance on one of the NODE A, make sure that you use a local drive, not a shared drive.
    8.) Dialog Instance NODE B-- Same procedure as in NODE A.
    When NODE A fails, the ASCS and SCS failsover to NODE B and Dialog instance on NODE B act as CI.
    Refer to below doc for more information  - http://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/library/uuid/00004f25-6190-2b10-2c8b-fa419234c28b
    http://scn.sap.com/docs/DOC-25454
    Thanks,
    Arjun

  • Can we install a new mssql cluster on the same windows cluster which already containes a mssql cluster with named instance

    We have a MSSQL 2008R2 Enterprise edition with a two node active passive fail-over cluster running on 2008R2 windows cluster with out any issues,
    Now my question is can we add one more MSSSQL cluster instance for the same setup with out disturbing the existing one ?
    Also give thoughts on load sharing as the second node is mostly ideal now except fail-over scenarios,
    Why we go for this situation is because of the collation setting which can be set only one per instance(Database collation setting change not working), we need a different default collation for the new setup

    hi,
    >>Now my question is can we add one more MSSSQL cluster instance for the same setup with out disturbing the existing one ?
    Yes it is possible .You need to add new drives as cluster aware and install SQL server and put data and log files on thse drives.YOu would need to create named instance of SQL server and need to create different resource group.Both old installation and new
    onw would work separately.
    >>Also give thoughts on load sharing as the second node is mostly ideal now except fail-over scenarios,
    Good point indeed.You are about to create Multi instance cluster and should plan for scenario where one node is down and other node is handling load for both instances.Memory and CPU should be enough to handle the load.
    >>Why we go for this situation is because of the collation setting which can be set only one per instance(Database collation setting change not working), we need a different default collation for the new setup .
    Just for collation if you are installing new instance seems little wierd to me.You can manage collation at column ,database and at server level.
    http://technet.microsoft.com/en-us/library/aa174903(v=sql.80).aspx
    Please mark this reply as the answer or vote as helpful, as appropriate, to make it useful for other readers

  • How to upload files in cluster without bringing down the cluster.

    Hi,
    Here is my problem:
    My weblogic portal 7.0 application should run in 24x7 environment. The production
    system is running in a cluster mode with external staging. Both the managed servers
    are pointing to the same application sync database.
    So to give a solution to this problem, we are trying to bring down one managed
    server down and apply the patches into that managed server without bringing down
    the other. Since webflow files and entitlements needs to be synchronized with
    the portal database. If we perform a sync on the portal database, there might
    be a chance that it might corrupt the existing portal database state. Our deployment
    would not be complete unless we perform a sync on the other managed servers. So
    we could not able to update the latest pathces into the cluster without bringing
    down the cluster.
    What I would like to know is "Is there any work around to get rid of this problem?".
    I would be delighted if any one provides the solution.
    Thanks & Regards,
    Ratnakar

    Hi Ratnakar,
    When you perform sync you do it against the admin server which in turn does it
    to the managed server. If you want to apply any changes to webflow you can do
    it on the live system itself. I mean apply the new webflow and entitlements and
    do the sync on the live system. Also while doing sync there are two modes. One
    mode will sync only the modified changes and the other will apply all of it. You
    can opt for the first option by which you can sync only the modified changes.
    This may bring a small disturbance but it will be unnoticed.
    The other option is to have two separate domains(each having one admin & one or
    more managed servers) which may be having two separate portal databases. In this
    way you can apply patches to one domain while other domain may provide the service.
    Regards
    Shaan
    "Ratnakar Sonti" <[email protected]> wrote:
    >
    Hi,
    Here is my problem:
    My weblogic portal 7.0 application should run in 24x7 environment. The
    production
    system is running in a cluster mode with external staging. Both the managed
    servers
    are pointing to the same application sync database.
    So to give a solution to this problem, we are trying to bring down one
    managed
    server down and apply the patches into that managed server without bringing
    down
    the other. Since webflow files and entitlements needs to be synchronized
    with
    the portal database. If we perform a sync on the portal database, there
    might
    be a chance that it might corrupt the existing portal database state.
    Our deployment
    would not be complete unless we perform a sync on the other managed servers.
    So
    we could not able to update the latest pathces into the cluster without
    bringing
    down the cluster.
    What I would like to know is "Is there any work around to get rid of
    this problem?".
    I would be delighted if any one provides the solution.
    Thanks & Regards,
    Ratnakar

  • Power book G4 Leopard 10.5.8. File sharing preference checked but not working after start up until unchecked and the rechecked?

    I hope the title is enough info, please ask. In the mean time I have created an applescript to open preferences/ sharing and uncheck then check the file sharing box. This works but I would obviously prefer it just work normally. I have no corrupt preference files. Perhaps a file is missing? Thank you.

    Greetings Kwnez
    I'm not up to your level of expertise yet. I'm networking and file sharing too, on my Mac Powerbook 1.33 GHz PowerPC G4; OS 10.5.8 Lepoard. At the public library I found by asking the clerk a book Authored by David Pogue. "Mac OS X Leopard the Missing Manual". It (the book) has helped me ask better questions. I'm not saying that your question "
    Power book G4 Leopard 10.5.8. File sharing preference checked but not working after start up until unchecked and the rechecked?
    is hard for me to understand, I went to system preferences and looked at the sharing box. when I clik it it opens up and the first thing I see is my computers name, and under that i see, " Computers on your local network can access your computer at: ...-powerbook-g4-15.local"
    What I'm saying is "Are we on the same Page"

  • Why does firefox put a check mark on work offline?

    I have a problem with a fresh install of firefox.
    It seems to place a check mark on work offline.
    After removal it worked fine for a time & then firefox put a check mark back on work offline.
    Please help.

    Firefox detects when your computer goes offline and automatically puts a check mark next to "Work Offline".

Maybe you are looking for