ONS failed to start on second node

Hi,
I have a problem with ons on 10g rac running on linux 5.3
on node 1 it is running without problem but on second node i got this error
2009-04-08 16:30:41.318: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
onscfg[1]
{node = rac2, port = 6
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: 200}
Adding remote host rac2:6200
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission d
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: enied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server loca
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: l port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
o
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: nscfg[1]
{node = rac2, port = 6200}
Adding remote host rac2:6200
onsctl: ons failed to start
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: env ORACLE_CONFIG_HOME=/u01/app/crs
2009-04-08 16:30:41.319: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: cmd = /u01/app/crs/bin/racgeut -e USRORA_DEBUG=0 540 /u01/app/crs/bin/onsctl start
2009-04-08 16:30:41.320: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: rc = 1, time = 2.580s
2009-04-08 16:30:42.148: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the
2009-04-08 16:30:42.150: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
2009-04-08 16:30:42.150: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
onscfg[1]
{node = rac2, port = 6
2009-04-08 16:30:42.150: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: 200}
Adding remote host rac2:6200
ons is not running ...
2009-04-08 16:30:42.151: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: env ORACLE_CONFIG_HOME=/u01/app/crs
2009-04-08 16:30:42.151: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: cmd = /u01/app/crs/bin/racgeut -e USRORA_DEBUG=0 540 /u01/app/crs/bin/onsctl ping
2009-04-08 16:30:42.151: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: rc = 1, time = 0.840s
2009-04-08 16:30:42.153: [    RACG][3065611968] [16553][3065611968][ora.rac2.ons]: end for resource = ora.rac2.ons, action = start, status = 1, time = 3.620s
2009-04-08 16:30:44.376: [    RACG][3066242752] [17061][3066242752][ora.rac2.ons]: onsctl: shutting down ons daemon ...
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
onscfg[1]
{node = rac2, port = 6200}
Adding remote host rac2:6200
Any idea how to fix this?
Thanks

check the output for crs_getperm for the resource from both nodes. If you could, post them here.
Regards,
Ganesh

Similar Messages

  • Rs-ora:resource group failed to start on chosen node; it may end up failing

    I have configured two node failover cluster environment using netra a/d 1000 storage. When I try to deploy oracle server application it throws the following error
    rs-ora: resource group failed to start on chosen node; it may end up failing over to other node(s)
    I created metaset and gave one raw did disk to that metaset.
    I created logical hostname resource, ha-storage plus resource. Later I brought the resource group to online using following command
    #clrg online –emM rg-ora
    Later I created oracle cluster resource using following command.
    #clrs create -g rg-ora -t SUNW.oracle_server -p ORACLE_HOME=/global/oracle/product/10.2.0/db_1 -p ORACLE_SID=infra -p Alert_log_file=/global/oracle/product/10.2.0/db_1/admin/infra/bdump/alert_infra.log -p Connect_string=sysdba/dbadmin1@infra -p Resource_dependencies=rs-ora-has rs-ora
    node1 - Validation failed. ORACLE_HOME /global/oracle/product/10.2.0/db_1 does not exist
    node1 - ALERT_LOG_FILE /global/oracle/product/10.2.0/db_1/admin/infra/bdump/alert_infra.log doesn't exist
    node1 - PARAMETER_FILE: /global/oracle/product/10.2.0/db_1/dbs/initinfra.ora nor server PARAMETER_FILE: /global/oracle/product/10.2.0/db_1/dbs/spfileinfra.ora exists
    node1 - This resource depends on a HAStoragePlus resouce that is not online on this node. Ignoring validation errors.
    rs-ora: resource group failed to start on chosen node; it may end up failing over to other node(s)
    The status of oracle resource shows as follows.
    Resource Name Node Name State Status Message
    rs-ora node1 Start failed Faulted
    I used solaris 10 update 6 patch level is Generic_137137-09, Oracle version 10.2.0, Sun clusters 3.2 update1. Following are the vfstab and /var/adm/messages of both nodes.
    Node1#grep ora /etc/vfstab
    /dev/md/oradg/dsk/d300 /dev/md/oradg/rdsk/d300 /global/oracle ufs 5 no logging
    Node2#grep ora /etc/vfstab
    /dev/md/oradg/dsk/d300 /dev/md/oradg/rdsk/d300 /global/oracle ufs 5 no logging
    Node1#more /var/adm/messages
    Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_prenet_start> for resource <ha-
    host-1>, resource group <rg-ora>, node <node1>, timeout <300> seconds
    Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
    lib/rgm/rt/hafoip/hafoip_prenet_start>:tag=<rg-ora.ha-host-1.10>: Calling security_clnt_connect(..., host=<node1>, sec_typ
    e {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_prenet_start> completed successfully for
    resource <ha-host-1>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <300 seconds>
    Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_prenet_start> for resour
    ce <rs-ora-has>, resource group <rg-ora>, node <node1>, timeout <1800> seconds
    Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
    lib/rgm/rt/hastorageplus/hastorageplus_prenet_start>:tag=<rg-ora.rs-ora-has.10>: Calling security_clnt_connect(..., host=<tes
    tlab5>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:18 node1 Cluster.RGM.rgmd: [ID 375444 daemon.notice] 8 fe_rpc_command: cmd_type(enum):<2>:cmd=<null>:tag=<rg-
    ora.rs-ora-has.10>: Calling security_clnt_connect(..., host=<node1>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<0>, ...)
    Oct 17 05:19:18 node1 Cluster.RGM.rgmd: [ID 316625 daemon.notice] Timeout monitoring on method tag <rg-ora.rs-ora-has.10>
    has been suspended.
    Oct 17 05:19:20 node1 Cluster.Framework: [ID 801593 daemon.notice] stdout: becoming primary for oradg
    Oct 17 05:19:21 node1 Cluster.RGM.rgmd: [ID 375444 daemon.notice] 8 fe_rpc_command: cmd_type(enum):<3>:cmd=<null>:tag=<rg-
    ora.rs-ora-has.10>: Calling security_clnt_connect(..., host=<node1>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<0>, ...)
    Oct 17 05:19:21 node1 Cluster.RGM.rgmd: [ID 316625 daemon.notice] Timeout monitoring on method tag <rg-ora.rs-ora-has.10>
    has been resumed.
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_prenet_start> completed successful
    ly for resource <rs-ora-has>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <1800 seconds>
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_start> for resource <ha-host-1>
    , resource group <rg-ora>, node <node1>, timeout <500> seconds
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
    lib/rgm/rt/hafoip/hafoip_start>:tag=<rg-ora.ha-host-1.0>: Calling security_clnt_connect(..., host=<node1>, sec_type {0:WEA
    K, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_start> completed successfully for resourc
    e <ha-host-1>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <500 seconds>
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_monitor_start> for resource <ha
    -host-1>, resource group <rg-ora>, node <node1>, timeout <300> seconds
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_start> for resource <rs-
    ora-has>, resource group <rg-ora>, node <node1>, timeout <90> seconds
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
    lib/rgm/rt/hafoip/hafoip_monitor_start>:tag=<rg-ora.ha-host-1.7>: Calling security_clnt_connect(..., host=<node1>, sec_typ
    e {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
    lib/rgm/rt/hastorageplus/hastorageplus_start>:tag=<rg-ora.rs-ora-has.0>: Calling security_clnt_connect(..., host=<node1>,
    sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_monitor_start> completed successfully for
    resource <ha-host-1>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <300 seconds>
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_start> completed successfully for
    resource <rs-ora-has>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <90 seconds>
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_monitor_start> for resou
    rce <rs-ora-has>, resource group <rg-ora>, node <node1>, timeout <90> seconds
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
    lib/rgm/rt/hastorageplus/hastorageplus_monitor_start>:tag=<rg-ora.rs-ora-has.7>: Calling security_clnt_connect(..., host=<tes
    tlab5>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_monitor_start> completed successfu
    lly for resource <rs-ora-has>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <90 seconds>
    Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_server_validate> for resour
    ce <rs-ora>, resource group <rg-ora>, node <node1>, timeout <120> seconds

    Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 375444 daemon.notice] 8 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/SUNWscor/
    oracle_server/bin/oracle_server_validate>:tag=<rg-ora.rs-ora.2>: Calling security_clnt_connect(..., host=<node1>, sec_type
    {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <bin/oracle_server_validate> completed successful
    ly for resource <rs-ora>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <120 seconds>
    Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_server_init> for resource <
    rs-ora>, resource group <rg-ora>, node <node1>, timeout <30> seconds
    Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/SUNWscor
    /oracle_server/bin/oracle_server_init>:tag=<rg-ora.rs-ora.4>: Calling security_clnt_connect(..., host=<node1>, sec_type {0
    :WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <bin/oracle_server_init> completed successfully f
    or resource <rs-ora>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <30 seconds>
    Oct 17 05:19:38 node1 Cluster.CCR: [ID 973933 daemon.notice] resource rs-ora added.
    Oct 17 05:19:39 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_server_start> for resource
    <rs-ora>, resource group <rg-ora>, node <node1>, timeout <600> seconds
    Oct 17 05:19:39 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/SUNWscor
    /oracle_server/bin/oracle_server_start>:tag=<rg-ora.rs-ora.0>: Calling security_clnt_connect(..., host=<node1>, sec_type {
    0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 17 05:19:48 node1 SC[SUNWscor.oracle_server.start]:rg-ora:rs-ora: [ID 876834 daemon.error] Could not start server
    Oct 17 05:19:48 node1 Cluster.RGM.rgmd: [ID 938318 daemon.error] Method <bin/oracle_server_start> failed on resource <rs-o
    ra> in resource group <rg-ora> [exit code <1>, time used: 1% of timeout <600 seconds>]
    Node2# more /var/adm/messages
    Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 529407 daemon.notice] resource group rg-ora state on node node2 change to RG_PENDIN
    G_OFFLINE
    Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_MON_STOPP
    ING
    Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_MON_STOPPI
    NG
    Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_monitor_stop> for resource <ha-host
    -1>, resource group <rg-ora>, node <node2>, timeout <300> seconds
    Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_monitor_stop> for resource <
    rs-ora-has>, resource group <rg-ora>, node <node2>, timeout <90> seconds
    Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 268902 daemon.notice] 45 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
    rgm/rt/hafoip/hafoip_monitor_stop>:tag=<rg-ora.ha-host-1.8>: Calling security_clnt_connect(..., host=<node2>, sec_type {0:WEAK
    , 1:STRONG, 2:DES} =<1>, ...)
    Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
    rgm/rt/hastorageplus/hastorageplus_monitor_stop>:tag=<rg-ora.rs-ora-has.8>: Calling security_clnt_connect(..., host=<node2>, s
    ec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_monitor_stop> completed successfully f
    or resource <rs-ora-has>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <90 seconds>
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_ONLINE_UN
    MON
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_STOPPING
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource rs-ora-has status on node node2 change to R_FM_UNKNO
    WN
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource rs-ora-has status msg on node node2 change to <Stopp
    ing>
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_stop> for resource <rs-ora-h
    as>, resource group <rg-ora>, node <node2>, timeout <1800> seconds
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
    rgm/rt/hastorageplus/hastorageplus_stop>:tag=<rg-ora.rs-ora-has.1>: Calling security_clnt_connect(..., host=<node2>, sec_type
    {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_monitor_stop> completed successfully for reso
    urce <ha-host-1>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <300 seconds>
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_ONLINE_UNM
    ON
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_stop> completed successfully for resou
    rce <rs-ora-has>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <1800 seconds>
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_STOPPED
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_STOPPING
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_stop> for resource <ha-host-1>, res
    ource group <rg-ora>, node <node2>, timeout <300> seconds
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource ha-host-1 status on node node2 change to R_FM_UNKNOW
    N
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource ha-host-1 status msg on node node2 change to <Stoppi
    ng>
    Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
    rgm/rt/hafoip/hafoip_stop>:tag=<rg-ora.ha-host-1.1>: Calling security_clnt_connect(..., host=<node2>, sec_type {0:WEAK, 1:STRO
    NG, 2:DES} =<1>, ...)
    Oct 14 20:20:06 node2 ip: [ID 678092 kern.notice] TCP_IOC_ABORT_CONN: local = 192.168.032.244:0, remote = 000.000.000.000:0, s
    tart = -2, end = 6
    Oct 14 20:20:06 node2 ip: [ID 302654 kern.notice] TCP_IOC_ABORT_CONN: aborted 0 connection
    Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource ha-host-1 status on node node2 change to R_FM_OFFLIN
    E
    Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource ha-host-1 status msg on node node2 change to <Logica
    lHostname offline.>
    Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_stop> completed successfully for resource <ha
    -host-1>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <300 seconds>
    Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_OFFLINE
    Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_POSTNET_S
    TOPPING

  • Crs doesn't start on second node

    Guys,
    RAC on 2 nodes
    Release 10.2.0.5.0
    Solaris 10
    There was a problem with the cable that enables connection for the interconnect, but the problem has been solved. One of the nodes was evicted and all resources were move to the other node. Once the problem was solved I tried to start the cluster that was evicted but to no success. when I run crs_stat -t I get the infamous CRS-0184.
    I have checked the ocr and olsnodes; ocr seems to be fine and the second node is recognized as part of the cluster.
    cluvfy comp ocr -n lenin,trotsky -verbose
    Verifying OCR integrity
    Checking OCR integrity...
    Checking the absence of a non-clustered configuration...
    All nodes free of non-clustered, local-only configurations.
    Uniqueness check for OCR device passed.
    Checking the version of OCR...
    OCR of correct Version "2" exists.
    Checking data integrity of OCR...
    Data integrity check for OCR passed.
    OCR integrity check passed.
    Verification of OCR integrity was successful.
    oracle@trotsky > cluvfy comp nodereach -n lenin,trotsky -srcnode trotsky -verbose
    Verifying node reachability
    Checking node reachability...
    Check: Node reachability from node "trotsky"
    Destination Node Reachable?
    lenin yes
    trotsky yes
    Result: Node reachability check passed from node "trotsky".
    I have checked /var/adm/messages and crs and cssd log but I didn't see anything that stands out...
    I have also tried to delete the content of /var/tmp/.oracle and restart crs but again to no success.
    I have read in another thread in this forum that crs problems are either related to the interconnect or ocr/voting disks but as mentioned before they seem to be OK.
    I'm running out of ideas, any suggestions?
    One of the nodes now holds both vip addresses:
    bge0:1: flags=1040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4> mtu 1500 index 2
    inet 192.168.191.184 netmask ffffff00 broadcast 192.168.191.255
    bge0:2: flags=1040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4> mtu 1500 index 2
    inet 192.168.191.182 netmask ffffff00 broadcast 192.168.191.255
    Do I need to manually reconfigure the interface do that is then held by the second node?
    Thanks in advance for your help

    Cheers for your input!
    The results on the suggested cluvfy command is: passed on all checks with the exception of the daemon liveness (as expected).
    Excerpts from the different logs:
    alert.log
    2010-11-19 13:12:35.033
    [cssd(4928)]CRS-1605:CSSD voting file is online: /dev/rdsk/c1t500601604BA03AEAd0s5. Details in /u01/crs/10.2.0/crs_1/log/trotsky/cssd/ocssd.log.
    2010-11-19 13:12:35.050
    [cssd(4928)]CRS-1605:CSSD voting file is online: /dev/rdsk/c1t500601604BA03AEAd0s4. Details in /u01/crs/10.2.0/crs_1/log/trotsky/cssd/ocssd.log.
    2010-11-19 13:12:35.062
    [cssd(4928)]CRS-1605:CSSD voting file is online: /dev/rdsk/c1t500601604BA03AEAd0s6. Details in /u01/crs/10.2.0/crs_1/log/trotsky/cssd/ocssd.log.
    cssd.log
    [    CSSD]2010-11-19 13:16:47.059 [21] >WARNING: clssnmLocalJoinEvent: takeover aborted due to ALIVE node on Disk
    [    CSSD]2010-11-19 13:16:47.059 [21] >WARNING: clssnmRcfgMgrThread: not possible to join the cluster. Please reboot the node.
    [    CSSD]2010-11-19 13:16:47.059 [21] >WARNING: clssnmReconfigThread: state(1) clusterState(0) exit
    I have tried rebooting the node but that did not help.
    crsd.log
    2010-11-19 13:53:49.652: [  CRSRTI][1] CSS is not ready. Received status 3 from CSS. Waiting for good status ..
    2010-11-19 13:53:50.889: [ COMMCRS][1802]clsc_connect: (1009ac310) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_trotsky_))
    2010-11-19 13:53:50.889: [ CSSCLNT][1]clsssInitNative: connect failed, rc 9
    2010-11-19 13:53:50.890: [  CRSRTI][1] CSS is not ready. Received status 3 from CSS. Waiting for good status ..
    2010-11-19 13:53:51.899: [    CRSD][1][PANIC] CRSD exiting: Could not init the CSS context
    2010-11-19 13:53:51.899: [    CRSD][1] Done.
    Does this help?

  • Timed out waiting for the CRS stack to start on Second node.

    Hi,
    I am trying to setup 2 node 11gR2 RAC in vmware. I face the Timed out waiting for the CRS stack to start error on second node while running root.sh.I checked the cluster log files located in  /u01/app/11.2.0/grid/log/node1/alertnode2.log it shows as mentioned below.But when i logged into ASM instance and checked the diskgroup and it is in mount state on both nodes,So i am really confused why i ended up with error in 2nd node on running root.sh. Can anyone tell me how to correct this error or any other checking needs to be done?
    /u01/app/11.2.0/grid/log/node1/alertnode2.log
    [ohasd(23261)]CRS-2765:Resource 'ora.crsd' has failed on server 'node2'.
    2014-01-21 21:40:10.314
    [crsd(25309)]CRS-1006:The OCR location +DATA is inaccessible. Details in /u01/app/11.2.0/grid/log/node2/crsd/crsd.log.
    2014-01-21 21:40:11.217
    [ohasd(23261)]CRS-2765:Resource 'ora.crsd' has failed on server 'node2'.
    2014-01-21 21:40:12.422
    [crsd(25335)]CRS-1006:The OCR location +DATA is inaccessible. Details in /u01/app/11.2.0/grid/log/node2/crsd/crsd.log.
    2014-01-21 21:40:13.327
    [ohasd(23261)]CRS-2765:Resource 'ora.crsd' has failed on server 'node2'.
    2014-01-21 21:40:14.514
    [crsd(25356)]CRS-1006:The OCR location +DATA is inaccessible. Details in /u01/app/11.2.0/grid/log/node2/crsd/crsd.log.
    2014-01-21 21:40:15.439
    [ohasd(23261)]CRS-2765:Resource 'ora.crsd' has failed on server 'node2'.
    2014-01-21 21:40:15.440
    [ohasd(23261)]CRS-2771:Maximum restart attempts reached for resource 'ora.crsd'; will not restart.
    root.sh output
    [root@node2 ~]# /u01/app/11.2.0/grid/root.sh
    Running Oracle 11g root.sh script...
    The following environment variables are set as:
        ORACLE_OWNER= oracle
        ORACLE_HOME=  /u01/app/11.2.0/grid
    Enter the full pathname of the local bin directory: [/usr/local/bin]:
    The file "dbhome" already exists in /usr/local/bin.  Overwrite it? (y/n)
    [n]: y
       Copying dbhome to /usr/local/bin ...
    The file "oraenv" already exists in /usr/local/bin.  Overwrite it? (y/n)
    [n]: y
       Copying oraenv to /usr/local/bin ...
    The file "coraenv" already exists in /usr/local/bin.  Overwrite it? (y/n)
    [n]: y
       Copying coraenv to /usr/local/bin ...
    Entries will be added to the /etc/oratab file as needed by
    Database Configuration Assistant when a database is created
    Finished running generic part of root.sh script.
    Now product-specific root actions will be performed.
    2014-01-21 21:37:55: Parsing the host name
    2014-01-21 21:37:55: Checking for super user privileges
    2014-01-21 21:37:55: User has super user privileges
    Using configuration parameter file: /u01/app/11.2.0/grid/crs/install/crsconfig_p                   
    arams
    LOCAL ADD MODE
    Creating OCR keys for user 'root', privgrp 'root'..
    Operation successful.
    Adding daemon to inittab
    CRS-4123: Oracle High Availability Services has been started.
    ohasd is starting
    CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node 11gdb, number 1, and is terminating
    An active cluster was found during exclusive startup, restarting to join the cluster
    CRS-2672: Attempting to start 'ora.mdnsd' on 'node2'
    CRS-2676: Start of 'ora.mdnsd' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.gipcd' on 'node2'
    CRS-2676: Start of 'ora.gipcd' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.gpnpd' on 'node2'
    CRS-2676: Start of 'ora.gpnpd' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.cssdmonitor' on 'node2'
    CRS-2676: Start of 'ora.cssdmonitor' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.cssd' on 'node2'
    CRS-2672: Attempting to start 'ora.diskmon' on 'node2'
    CRS-2676: Start of 'ora.diskmon' on 'node2' succeeded
    CRS-2676: Start of 'ora.cssd' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.ctssd' on 'node2'
    CRS-2676: Start of 'ora.ctssd' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.drivers.acfs' on 'node2'
    CRS-2676: Start of 'ora.drivers.acfs' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.asm' on 'node2'
    CRS-2676: Start of 'ora.asm' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.crsd' on 'node2'
    CRS-2676: Start of 'ora.crsd' on 'node2' succeeded
    CRS-2672: Attempting to start 'ora.evmd' on 'node2'
    CRS-2676: Start of 'ora.evmd' on 'node2' succeeded
    Timed out waiting for the CRS stack to start.
    Regards,
    007

    Hi,
    Are you trying to install on Vmware Workstation or Vmware Server?
    If you are using vmware server add below line in your .vmx file for both node
    scsi1.sharedBus = "VIRTUAL"
    First check which scsi serial number are you using as above I have selected scsi1 serial during disk addition, 

  • Why I failed to start a second webserver in same weblogic server installed one machine

    Everyone,
              I want to start a second webserver bindding with intranet IP address,
              same time, one webserver had been started in this weblogic server in
              extranet ip address.
              But i cann't configurate it successfully.Why ? Help me !
              Thinks everyone
              [email protected]
              

              They need to use different IP address and the same port number if they are to
              be clustered. If they are not to be cluster they can simply use different port
              numbers.
              On Unix, the ifconfig command can be used to create virtual IP addresses.
              Mike
              [email protected] (jiangxianlou) wrote:
              >Everyone,
              >
              >I want to start a second webserver bindding with intranet IP address,
              >same time, one webserver had been started in this weblogic server in
              >extranet ip address.
              >But i cann't configurate it successfully.Why ? Help me !
              >
              >Thinks everyone
              >
              >[email protected]
              

  • RAC, ASM failed to start up on second node , ORA-03113: end-of-file on comm

    i'm installing an RAC with 2 nodes on top of ASM
    when creating ASM Diskgroup , it failed and reported error CRS-0215 failed to start asm on node2
    Oracle 10.2.0.1
    linux CentOs 4.x
    u01/app/oracle/product/10.2.0/db_1/bin/dbca  -progress_only   -configureASM -templateName NO_VALUE -gdbName NO -sid NO      -emConf
    iguration NONE    -diskList /dev/raw/raw2,/dev/raw/raw3  -diskGroupName DATA -datafileJarLocation /u01/app/oracle/product/10.2.0/db_
    1/assistants/dbca/templates  -responseFile NO_VALUE  -nodeinfo node1,node2    -obfuscatedPasswords true   -oratabLocation /u01/app/o
    racle/product/10.2.0/db_1/install/oratab   -asmSysPassword 05dbb0be38ecf8cca822cf3cf99e675448  -redundancy EXTERNA
    [oracle@node2 bin]$ ./crs_stat -t -v
    Name           Type           R/RA   F/FT   Target    State     Host       
    ora....SM1.asm application    0/5    0/0    ONLINE    ONLINE    node1      
    ora....E1.lsnr application    0/5    0/0    ONLINE    ONLINE    node1      
    ora.node1.gsd  application    0/5    0/0    ONLINE    ONLINE    node1      
    ora.node1.ons  application    0/3    0/0    ONLINE    ONLINE    node1      
    ora.node1.vip  application    0/0    0/0    ONLINE    ONLINE    node1      
    ora....SM2.asm application    0/5    0/0    OFFLINE   OFFLINE              
    ora....E2.lsnr application    0/5    0/0    ONLINE    ONLINE    node2      
    ora.node2.gsd  application    0/5    0/0    ONLINE    ONLINE    node2      
    ora.node2.ons  application    0/3    0/0    ONLINE    ONLINE    node2      
    ora.node2.vip  application    0/0    0/0    ONLINE    ONLINE    node2  
    i checked the status , asm is able to start on both nodes if not at the same time ,
    when trying to start the second node , with srvctl or sqlplus , each give the error 03113
    can anyone suggest me of how to bring up both instances ,
    thanks~
    [oracle@node2 bin]$ srvctl stop asm -n node1
    [oracle@node2 bin]$ srvctl start asm -n node1
    [oracle@node2 bin]$ srvctl start asm -n node2
    PRKS-1009 : Failed to start ASM instance "+ASM2" on node "node2", [PRKS-1009 : Failed to start ASM instance "+ASM2" on node "node2", [node2:ora.node2.ASM2.asm:
    node2:ora.node2.ASM2.asm:SQL*Plus: Release 10.2.0.1.0 - Production on Wed May 27 16:14:50 2009
    node2:ora.node2.ASM2.asm:
    node2:ora.node2.ASM2.asm:Copyright (c) 1982, 2005, Oracle.  All rights reserved.
    node2:ora.node2.ASM2.asm:
    node2:ora.node2.ASM2.asm:Enter user-name: Connected to an idle instance.
    node2:ora.node2.ASM2.asm:
    node2:ora.node2.ASM2.asm:SQL> ORA-03113: end-of-file on communication channel
    node2:ora.node2.ASM2.asm:SQL> Disconnected
    node2:ora.node2.ASM2.asm:
    [code/]
    Edited by: zs_hzh on May 27, 2009 1:25 AM                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   

    Is it possible to start ASM on second node with SQL*Plus in NOMOUNT state?

  • ORA-27504: IPC error creating OSD context : Unable to start second node

    I have set the DB parameter CLUSTER_INTERCONNECT to point to the Inet addr.
    oifcfg getif
    bondeth0
    172.23.250.128  global  public
    bondib0  192.168.8.0  global
    cluster_interconnect
    When I try to restart the DB services, it is throwing below error while starting the second node.
    These are set of commands I have executed to change the DB Parameter
    alter system set cluster_interconnects =  '192.168.10.6' scope=spfile sid='RAC1' ;
    alter system set cluster_interconnects =  '192.168.10.7' scope=spfile sid='RAC2' ;
    alter system set cluster_interconnects =  '192.168.10.6' scope=spfile sid='ASM1' ;
    alter system set cluster_interconnects =  '192.168.10.7' scope=spfile sid='ASM2' ;
    On second node
    SQL> startup ;
    ORA-27504: IPC error creating OSD context
    ORA-27300: OS system dependent operation:if_not_found failed with status: 0
    ORA-27301: OS failure message: Error 0
    ORA-27302: failure occurred at: skgxpvaddr9
    ORA-27303: additional information: requested interface 192.168.10.6 not found. Check output from ifconfig command
    SQL>
    please let me know whether the proceedure I have followed is wrong
    Thanks

    Node 1:
    [oracle@prdat137db03 etc]$ /sbin/ifconfig bondib0
    bondib0   Link encap:InfiniBand  HWaddr 80:00:00:48:FE:80:00:00:00:00:00:00:00:00:00:00:00:00:00:00
              inet addr:192.168.10.6  Bcast:192.168.11.255  Mask:255.255.252.0
              inet6 addr: fe80::221:2800:1ef:bc4f/64 Scope:Link
              UP BROADCAST RUNNING MASTER MULTICAST  MTU:65520  Metric:1
              RX packets:32550051 errors:0 dropped:0 overruns:0 frame:0
              TX packets:32395961 errors:0 dropped:42 overruns:0 carrier:0
              collisions:0 txqueuelen:0
              RX bytes:19382043590 (18.0 GiB)  TX bytes:17164065360 (15.9 GiB)
    [oracle@prdat137db03 etc]$
    Node 2:
    [oracle@prdat137db04 ~]$ /sbin/ifconfig bondib0
    bondib0   Link encap:InfiniBand  HWaddr 80:00:00:48:FE:80:00:00:00:00:00:00:00:00:00:00:00:00:00:00
              inet addr:192.168.10.7  Bcast:192.168.11.255  Mask:255.255.252.0
              inet6 addr: fe80::221:2800:1ef:abdb/64 Scope:Link
              UP BROADCAST RUNNING MASTER MULTICAST  MTU:65520  Metric:1
              RX packets:29618287 errors:0 dropped:0 overruns:0 frame:0
              TX packets:30769233 errors:0 dropped:12 overruns:0 carrier:0
              collisions:0 txqueuelen:0
              RX bytes:16453595058 (15.3 GiB)  TX bytes:18960175021 (17.6 GiB)
    [oracle@prdat137db04 ~]$

  • Root.sh failed at second node OUL 6.3 Oracle GRID 11.2.0.3

    Hi, im installing a two node cluster mounted on Oracle Linux 6.3 with Oracle DB 11.2.0.3, the installation went smooth up until the execution of the root.sh script on the second node.
    THe script return this final lines:
    CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node nodo1, number 1, and is terminating
    An active cluster was found during exclusive startup, restarting to join the cluster
    Start of resource "ora.crsd" failed
    CRS-2800: Cannot start resource 'ora.asm' as it is already in the INTERMEDIATE state on server 'nodo2'
    CRS-4000: Command Start failed, or completed with errors.
    Failed to start Oracle Grid Infrastructure stack
    Failed to start Cluster Ready Services at /u01/app/11.2.0/grid/crs/install/crsconfig_lib.pm line 1286.
    /u01/app/11.2.0/grid/perl/bin/perl -I/u01/app/11.2.0/grid/perl/lib -I/u01/app/11.2.0/grid/crs/install /u01/app/11.2.0/grid/crs/install/rootcrs.pl execution failed
    In $GRID_HOME/log/node2/alertnode.log It appears to be a Cluster Time Synchronization Service issue, (i didn't synchronyze the nodes..) however the CTSS is running in observer mode, wich i believe it shouldn't affect the installation process. After that i lost it...there's an entry CRS-5018 indicating that an unused HAIP route was removed... and then, out of the blue: CRS-5818:Aborted command 'start' for resource 'ora.asm'. Some clarification will be deeply apreciated.
    Here's the complete log:
    2013-04-01 13:39:35.358
    [client(12163)]CRS-2101:The OLR was formatted using version 3.
    2013-04-01 19:40:19.597
    [ohasd(12338)]CRS-2112:The OLR service started on node nodo2.
    2013-04-01 19:40:19.657
    [ohasd(12338)]CRS-1301:Oracle High Availability Service started on node nodo2.
    [client(12526)]CRS-10001:01-Apr-13 13:41 ACFS-9459: ADVM/ACFS is not supported on this OS version: '2.6.39-400.17.2.el6uek.i686'
    [client(12528)]CRS-10001:01-Apr-13 13:41 ACFS-9201: Not Supported
    [client(12603)]CRS-10001:01-Apr-13 13:41 ACFS-9459: ADVM/ACFS is not supported on this OS version: '2.6.39-400.17.2.el6uek.i686'
    2013-04-01 19:41:17.509
    [ohasd(12338)]CRS-2302:Cannot get GPnP profile. Error CLSGPNP_NO_DAEMON (GPNPD daemon is not running).
    2013-04-01 19:41:17.618
    [gpnpd(12695)]CRS-2328:GPNPD started on node nodo2.
    2013-04-01 19:41:21.363
    [cssd(12755)]CRS-1713:CSSD daemon is started in exclusive mode
    2013-04-01 19:41:23.194
    [ohasd(12338)]CRS-2767:Resource state recovery not attempted for 'ora.diskmon' as its target state is OFFLINE
    2013-04-01 19:41:56.144
    [cssd(12755)]CRS-1707:Lease acquisition for node nodo2 number 2 completed
    2013-04-01 19:41:57.545
    [cssd(12755)]CRS-1605:CSSD voting file is online: /dev/oracleasm/disks/ASM_DISK_1; details in /u01/app/11.2.0/grid/log/nodo2/cssd/ocssd.log.
    [cssd(12755)]CRS-1636:The CSS daemon was started in exclusive mode but found an active CSS daemon on node nodo1 and is terminating; details at (:CSSNM00006:) in /u01/app/11.2.0/grid/log/nodo2/cssd/ocssd.log
    2013-04-01 19:41:58.549
    [ohasd(12338)]CRS-2765:Resource 'ora.cssdmonitor' has failed on server 'nodo2'.
    2013-04-01 19:42:10.025
    [gpnpd(12695)]CRS-2329:GPNPD on node nodo2 shutdown.
    2013-04-01 19:42:11.407
    [mdnsd(12685)]CRS-5602:mDNS service stopping by request.
    2013-04-01 19:42:29.642
    [gpnpd(12947)]CRS-2328:GPNPD started on node nodo2.
    2013-04-01 19:42:33.241
    [cssd(13012)]CRS-1713:CSSD daemon is started in clustered mode
    2013-04-01 19:42:35.104
    [ohasd(12338)]CRS-2767:Resource state recovery not attempted for 'ora.diskmon' as its target state is OFFLINE
    2013-04-01 19:42:44.065
    [cssd(13012)]CRS-1707:Lease acquisition for node nodo2 number 2 completed
    2013-04-01 19:42:45.484
    [cssd(13012)]CRS-1605:CSSD voting file is online: /dev/oracleasm/disks/ASM_DISK_1; details in /u01/app/11.2.0/grid/log/nodo2/cssd/ocssd.log.
    2013-04-01 19:42:52.138
    [cssd(13012)]CRS-1601:CSSD Reconfiguration complete. Active nodes are nodo1 nodo2 .
    2013-04-01 19:42:55.081
    [ctssd(13076)]CRS-2403:The Cluster Time Synchronization Service on host nodo2 is in observer mode.
    2013-04-01 19:42:55.581
    [ctssd(13076)]CRS-2401:The Cluster Time Synchronization Service started on host nodo2.
    2013-04-01 19:42:55.581
    [ctssd(13076)]CRS-2407:The new Cluster Time Synchronization Service reference node is host nodo1.
    2013-04-01 19:43:08.875
    [ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
    2013-04-01 19:43:08.876
    [ctssd(13076)]CRS-2409:The clock on host nodo2 is not synchronous with the mean cluster time. No action has been taken as the Cluster Time Synchronization Service is running in observer mode.
    2013-04-01 19:43:13.565
    [u01/app/11.2.0/grid/bin/orarootagent.bin(13064)]CRS-5018:(:CLSN00037:) Removed unused HAIP route: 169.254.0.0 / 255.255.0.0 / 0.0.0.0 / eth0
    2013-04-01 19:53:09.800
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5818:Aborted command 'start' for resource 'ora.asm'. Details at (:CRSAGF00113:) {0:0:223} in /u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log.
    2013-04-01 19:53:11.827
    [ohasd(12338)]CRS-2757:Command 'Start' timed out waiting for response from the resource 'ora.asm'. Details at (:CRSPE00111:) {0:0:223} in /u01/app/11.2.0/grid/log/nodo2/ohasd/ohasd.log.
    2013-04-01 19:53:12.779
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:53:13.892
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:53:43.877
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:54:13.891
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:54:43.906
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:55:13.914
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:55:43.918
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:56:13.922
    [u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
    2013-04-01 19:56:53.209
    [crsd(13741)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 20:07:01.128
    [crsd(13741)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 20:07:01.278
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 20:07:08.689
    [crsd(15248)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 20:13:10.138
    [ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
    2013-04-01 20:17:13.024
    [crsd(15248)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 20:17:13.171
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 20:17:20.826
    [crsd(16746)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 20:27:25.020
    [crsd(16746)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 20:27:25.176
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 20:27:31.591
    [crsd(18266)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 20:37:35.668
    [crsd(18266)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 20:37:35.808
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 20:37:43.209
    [crsd(19762)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 20:43:11.160
    [ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
    2013-04-01 20:47:47.487
    [crsd(19762)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 20:47:47.637
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 20:47:55.086
    [crsd(21242)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 20:57:59.343
    [crsd(21242)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 20:57:59.492
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 20:58:06.996
    [crsd(22744)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 21:08:11.046
    [crsd(22744)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 21:08:11.192
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 21:08:18.726
    [crsd(24260)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 21:13:12.000
    [ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
    2013-04-01 21:18:22.262
    [crsd(24260)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 21:18:22.411
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 21:18:29.927
    [crsd(25759)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 21:28:34.467
    [crsd(25759)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 21:28:34.616
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 21:28:41.990
    [crsd(27291)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 21:38:45.012
    [crsd(27291)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 21:38:45.160
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 21:38:52.790
    [crsd(28784)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 21:43:12.378
    [ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
    2013-04-01 21:48:56.285
    [crsd(28784)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 21:48:56.435
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 21:49:04.421
    [crsd(30272)]CRS-1012:The OCR service started on node nodo2.
    2013-04-01 21:59:08.183
    [crsd(30272)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
    2013-04-01 21:59:08.318
    [ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
    2013-04-01 21:59:15.860
    [crsd(31772)]CRS-1012:The OCR service started on node nodo2.

    Hi santysharma, thanks for the reply, i have two ethernet interfaces: eth0 (public network 192.168.1.0) and eth1 (private network 10.5.3.0), there is no device using that ip range, here's the output of route command:
    (Sorry for the alignment, i tried to tab it but the editor trims it again)
    Kernel IP routing table
    Destination Gateway Genmask Flags Metric Ref Use Iface
    default 192.168.1.1 0.0.0.0 UG 0 0 0 eth0
    private * 255.255.255.0 U 0 0 0 eth1
    link-local * 255.255.0.0 U 1002 0 0 eth0
    link-local * 255.255.0.0 U 1003 0 0 eth1
    public * 255.255.255.0 U 0 0 0 eth0
    And the /etc/hosts file
    127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
    ::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
    10.5.3.1 nodo1.cluster nodo1
    10.5.3.2 nodo2.cluster nodo2
    192.168.1.13 cluster-scan
    192.168.1.14 nodo1-vip
    192.168.1.15 nodo2-vip
    And the ifconfig -a
    eth0 Link encap:Ethernet HWaddr C8:3A:35:D9:C6:2B
    inet addr:192.168.1.12 Bcast:192.168.1.255 Mask:255.255.255.0
    inet6 addr: fe80::ca3a:35ff:fed9:c62b/64 Scope:Link
    UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
    RX packets:34708 errors:0 dropped:18 overruns:0 frame:0
    TX packets:24693 errors:0 dropped:0 overruns:0 carrier:0
    collisions:0 txqueuelen:1000
    RX bytes:48545969 (46.2 MiB) TX bytes:1994381 (1.9 MiB)
    eth1 Link encap:Ethernet HWaddr 00:0D:87:D0:A3:8E
    inet addr:10.5.3.2 Bcast:10.5.3.255 Mask:255.255.255.0
    inet6 addr: fe80::20d:87ff:fed0:a38e/64 Scope:Link
    UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
    RX packets:0 errors:0 dropped:0 overruns:0 frame:0
    TX packets:44 errors:0 dropped:0 overruns:0 carrier:0
    collisions:0 txqueuelen:1000
    RX bytes:0 (0.0 b) TX bytes:5344 (5.2 KiB)
    Interrupt:23 Base address:0x6000
    lo Link encap:Local Loopback
    inet addr:127.0.0.1 Mask:255.0.0.0
    inet6 addr: ::1/128 Scope:Host
    UP LOOPBACK RUNNING MTU:16436 Metric:1
    RX packets:20 errors:0 dropped:0 overruns:0 frame:0
    TX packets:20 errors:0 dropped:0 overruns:0 carrier:0
    collisions:0 txqueuelen:0
    RX bytes:1320 (1.2 KiB) TX bytes:1320 (1.2 KiB)
    Now that i'm thinking i've read somewhere that ipv6 was no supported...yet there's no relation with the 169.254.x.x ip range.

  • Root.sh fails on second node

    I already posted this issue on database installation forum, and was suggested to post it on this forum.
    Here are the details.
    I am running Linux 64bit on ESx clients. Installing Oracle 11gR2.
    It passed all the per-requisite. Run root.sh on first node. It finished with no errorrs.
    On second node I got the following:
    Running Oracle 11g root.sh script...
    The following environment variables are set as:
    ORACLE_OWNER= oracle
    ORACLE_HOME= /u01/app/11.2.0/grid
    Enter the full pathname of the local bin directory: [usr/local/bin]:
    The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n)
    [n]:
    The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
    [n]:
    The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
    [n]:
    Entries will be added to the /etc/oratab file as needed by
    Database Configuration Assistant when a database is created
    Finished running generic part of root.sh script.
    Now product-specific root actions will be performed.
    2010-07-13 12:51:28: Parsing the host name
    2010-07-13 12:51:28: Checking for super user privileges
    2010-07-13 12:51:28: User has super user privileges
    Using configuration parameter file: /u01/app/11.2.0/grid/crs/install/crsconfig_params
    Creating trace directory
    LOCAL ADD MODE
    Creating OCR keys for user 'root', privgrp 'root'..
    Operation successful.
    Adding daemon to inittab
    CRS-4123: Oracle High Availability Services has been started.
    ohasd is starting
    CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node fred0224, number 1, and is terminating
    An active cluster was found during exclusive startup, restarting to join the cluster
    CRS-2672: Attempting to start 'ora.mdnsd' on 'fred0225'
    CRS-2676: Start of 'ora.mdnsd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.gipcd' on 'fred0225'
    CRS-2676: Start of 'ora.gipcd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.gpnpd' on 'fred0225'
    CRS-2676: Start of 'ora.gpnpd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.cssdmonitor' on 'fred0225'
    CRS-2676: Start of 'ora.cssdmonitor' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.cssd' on 'fred0225'
    CRS-2672: Attempting to start 'ora.diskmon' on 'fred0225'
    CRS-2676: Start of 'ora.diskmon' on 'fred0225' succeeded
    CRS-2676: Start of 'ora.cssd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.ctssd' on 'fred0225'
    Start action for octssd aborted
    CRS-2676: Start of 'ora.ctssd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.drivers.acfs' on 'fred0225'
    CRS-2672: Attempting to start 'ora.asm' on 'fred0225'
    CRS-2676: Start of 'ora.drivers.acfs' on 'fred0225' succeeded
    CRS-2676: Start of 'ora.asm' on 'fred0225' succeeded
    CRS-2664: Resource 'ora.ctssd' is already running on 'fred0225'
    CRS-4000: Command Start failed, or completed with errors.
    Command return code of 1 (256) from command: /u01/app/11.2.0/grid/bin/crsctl start resource ora.asm -init
    Start of resource "ora.asm -init" failed
    Failed to start ASM
    Failed to start Oracle Clusterware stack
    In the ocssd.log I found
    [ CSSD][3559689984]clssnmvDHBValidateNCopy: node 1, fred0224, has a disk HB, but no network HB, DHB has rcfg 174483948, wrtcnt, 232, LATS 521702664, lastSeqNo 232, uniqueness 1279039649, timestamp 1279039959/521874274
    In oraagent_oracle.log I found
    [ clsdmc][1212365120]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GPNPD)) with status 9
    2010-07-13 12:54:07.234: [ora.gpnpd][1212365120] [check] Error = error 9 encountered when connecting to GPNPD
    2010-07-13 12:54:07.238: [ora.gpnpd][1212365120] [check] Calling PID check for daemon
    2010-07-13 12:54:07.238: [ora.gpnpd][1212365120] [check] Trying to check PID = 20584
    2010-07-13 12:54:07.432: [ COMMCRS][1285794112]clsc_connect: (0x1304d850) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GPNPD))
    [ clsdmc][1222854976]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_MDNSD)) with status 9
    2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Error = error 9 encountered when connecting to MDNSD
    2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Calling PID check for daemon
    2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Trying to check PID = 20571
    2010-07-13 12:54:08.841: [ COMMCRS][1201875264]clsc_connect: (0x12f3b1d0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_MDNSD))
    [ clsdmc][1159915840]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GIPCD)) with status 9
    2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Error = error 9 encountered when connecting to GIPCD
    2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Calling PID check for daemon
    2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Trying to check PID = 20566
    2010-07-13 12:54:10.242: [ COMMCRS][1254324544]clsc_connect: (0x12f35630) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GIPCD))
    In oracssdagent_root.log I found
    2010-07-13 12:52:28.698: [ CSSCLNT][1102481728]clssscConnect: gipc request failed with 29 (0x16)
    2010-07-13 12:52:28.698: [ CSSCLNT][1102481728]clsssInitNative: connect failed, rc 29
    2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clssnsqlnum: RPC failed rc 3
    2010-07-13 12:53:55.222: [ USRTHRD][1102481728] clsnomon_cssini: failed 3 to fetch node number
    2010-07-13 12:53:55.222: [ USRTHRD][1102481728] clsnomon_init: css init done, nodenum -1.
    2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clsssRecvMsg: got a disconnect from the server while waiting for message type 43
    2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clsssGetNLSData: Failure receiving a msg, rc 3
    If you need more info, let me know.

    Well, the error clearly indicates that a communication problem exists on the private interconnect.
    Could this be a setting in ESX, which prevents some communication between the clients on the second network card? Any routing table in ESX not configured correctly?
    Sebastian

  • Root.sh on second node fails

    I am running Linux 64bit. Installing Oracle 11gR2.
    It passed all the per-requisite. Run root.sh on first node. It finished with no errorrs.
    On second node I got the following:
    Running Oracle 11g root.sh script...
    The following environment variables are set as:
    ORACLE_OWNER= oracle
    ORACLE_HOME= /u01/app/11.2.0/grid
    Enter the full pathname of the local bin directory: [usr/local/bin]:
    The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n)
    [n]:
    The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
    [n]:
    The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
    [n]:
    Entries will be added to the /etc/oratab file as needed by
    Database Configuration Assistant when a database is created
    Finished running generic part of root.sh script.
    Now product-specific root actions will be performed.
    2010-07-13 12:51:28: Parsing the host name
    2010-07-13 12:51:28: Checking for super user privileges
    2010-07-13 12:51:28: User has super user privileges
    Using configuration parameter file: /u01/app/11.2.0/grid/crs/install/crsconfig_params
    Creating trace directory
    LOCAL ADD MODE
    Creating OCR keys for user 'root', privgrp 'root'..
    Operation successful.
    Adding daemon to inittab
    CRS-4123: Oracle High Availability Services has been started.
    ohasd is starting
    CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node fred0224, number 1, and is terminating
    An active cluster was found during exclusive startup, restarting to join the cluster
    CRS-2672: Attempting to start 'ora.mdnsd' on 'fred0225'
    CRS-2676: Start of 'ora.mdnsd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.gipcd' on 'fred0225'
    CRS-2676: Start of 'ora.gipcd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.gpnpd' on 'fred0225'
    CRS-2676: Start of 'ora.gpnpd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.cssdmonitor' on 'fred0225'
    CRS-2676: Start of 'ora.cssdmonitor' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.cssd' on 'fred0225'
    CRS-2672: Attempting to start 'ora.diskmon' on 'fred0225'
    CRS-2676: Start of 'ora.diskmon' on 'fred0225' succeeded
    CRS-2676: Start of 'ora.cssd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.ctssd' on 'fred0225'
    Start action for octssd aborted
    CRS-2676: Start of 'ora.ctssd' on 'fred0225' succeeded
    CRS-2672: Attempting to start 'ora.drivers.acfs' on 'fred0225'
    CRS-2672: Attempting to start 'ora.asm' on 'fred0225'
    CRS-2676: Start of 'ora.drivers.acfs' on 'fred0225' succeeded
    CRS-2676: Start of 'ora.asm' on 'fred0225' succeeded
    CRS-2664: Resource 'ora.ctssd' is already running on 'fred0225'
    CRS-4000: Command Start failed, or completed with errors.
    Command return code of 1 (256) from command: /u01/app/11.2.0/grid/bin/crsctl start resource ora.asm -init
    Start of resource "ora.asm -init" failed
    Failed to start ASM
    Failed to start Oracle Clusterware stack
    In the ocssd.log I found
    [    CSSD][3559689984]clssnmvDHBValidateNCopy: node 1, fred0224, has a disk HB, but no network HB, DHB has rcfg 174483948, wrtcnt, 232, LATS 521702664, lastSeqNo 232, uniqueness 1279039649, timestamp 1279039959/521874274
    In oraagent_oracle.log I found
    [  clsdmc][1212365120]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GPNPD)) with status 9
    2010-07-13 12:54:07.234: [ora.gpnpd][1212365120] [check] Error = error 9 encountered when connecting to GPNPD
    2010-07-13 12:54:07.238: [ora.gpnpd][1212365120] [check] Calling PID check for daemon
    2010-07-13 12:54:07.238: [ora.gpnpd][1212365120] [check] Trying to check PID = 20584
    2010-07-13 12:54:07.432: [ COMMCRS][1285794112]clsc_connect: (0x1304d850) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GPNPD))
    [  clsdmc][1222854976]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_MDNSD)) with status 9
    2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Error = error 9 encountered when connecting to MDNSD
    2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Calling PID check for daemon
    2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Trying to check PID = 20571
    2010-07-13 12:54:08.841: [ COMMCRS][1201875264]clsc_connect: (0x12f3b1d0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_MDNSD))
    [  clsdmc][1159915840]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GIPCD)) with status 9
    2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Error = error 9 encountered when connecting to GIPCD
    2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Calling PID check for daemon
    2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Trying to check PID = 20566
    2010-07-13 12:54:10.242: [ COMMCRS][1254324544]clsc_connect: (0x12f35630) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GIPCD))
    In oracssdagent_root.log I found
    2010-07-13 12:52:28.698: [ CSSCLNT][1102481728]clssscConnect: gipc request failed with 29 (0x16)
    2010-07-13 12:52:28.698: [ CSSCLNT][1102481728]clsssInitNative: connect failed, rc 29
    2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clssnsqlnum: RPC failed rc 3
    2010-07-13 12:53:55.222: [ USRTHRD][1102481728] clsnomon_cssini: failed 3 to fetch node number
    2010-07-13 12:53:55.222: [ USRTHRD][1102481728] clsnomon_init: css init done, nodenum -1.
    2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clsssRecvMsg: got a disconnect from the server while waiting for message type 43
    2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clsssGetNLSData: Failure receiving a msg, rc 3
    If anyone needs more info please let me know.

    On all nodes,
    1. Modify the /etc/sysconfig/oracleasm with:
    ORACLEASM_SCANORDER="dm"
    ORACLEASM_SCANEXCLUDE="sd"
    2. restart the asmlib by :
    # /etc/init.d/oracleasm restart
    3. Run root.sh on the 2nd node
    hope this helps you

  • 11gr2 crsd core dump during failover or start attempt on second node

    Hi,
    I installed 11gr2 with ASM on one node (solaris SPARC). Then I added another node to this cluster (via addNode.sh script).
    Than got strange error: If my first node is up, second node is started fine and run well. If I shutdown first node - crsd on second node dump to core and fails to restart. I get the same error if I try to start second node when the first one is down.
    In the crsd.log I see the following:
    [  clsdmt][2]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=mskbkp2DBG_CRSD))
    2010-03-03 17:31:35.330: [  clsdmt][2]PID for the Process [18669], connkey 1
    2010-03-03 17:31:35.331: [  clsdmt][2]Creating PID [18669] file for home /u01/grid/11.2.0 host mskbkp2 bin crs to /u01/grid/11
    .2.0/crs/init/
    2010-03-03 17:31:35.331: [  clsdmt][2]Writing PID [18669] to the file [u01/grid/11.2.0/crs/init/mskbkp2.pid]
    2010-03-03 17:31:35.925: [ default][1] CRS Daemon Starting
    2010-03-03 17:31:35.933: [ default][1] ENV Logging level for Module: AGENT 1
    2010-03-03 17:31:35.934: [ default][1] ENV Logging level for Module: AGFW 0
    2010-03-03 17:31:35.934: [ default][1] ENV Logging level for Module: CLSFRAME 0
    2010-03-03 17:31:35.934: [ default][1] ENV Logging level for Module: CLSVER 0
    2010-03-03 17:31:35.934: [ default][1] ENV Logging level for Module: CLUCLS 0
    2010-03-03 17:31:35.934: [ default][1] ENV Logging level for Module: COMMCRS 0
    2010-03-03 17:31:35.934: [ default][1] ENV Logging level for Module: COMMNS 0
    2010-03-03 17:31:35.936: [ default][1] ENV Logging level for Module: CRSAPP 0
    2010-03-03 17:31:35.936: [ default][1] ENV Logging level for Module: CRSCCL 0
    2010-03-03 17:31:35.936: [ default][1] ENV Logging level for Module: CRSCEVT 0
    2010-03-03 17:31:35.936: [ default][1] ENV Logging level for Module: CRSCOMM 1
    2010-03-03 17:31:35.936: [    CRSD][1] ENV Debug Level(CRSD): 50
    2010-03-03 17:31:35.936: [    CRSD][1] ENV Logging level for Module: CRSD 50
    2010-03-03 17:31:35.937: [    CRSD][1] ENV Debug Level(CRSEVT): 0
    2010-03-03 17:31:35.937: [    CRSD][1] ENV Logging level for Module: CRSEVT 0
    2010-03-03 17:31:35.937: [    CRSD][1] ENV Debug Level(CRSMAIN): 1
    2010-03-03 17:31:35.937: [    CRSD][1] ENV Logging level for Module: CRSMAIN 1
    2010-03-03 17:31:35.937: [    CRSD][1] ENV Debug Level(CRSOCR): 0
    2010-03-03 17:31:35.937: [    CRSD][1] ENV Logging level for Module: CRSOCR 0
    2010-03-03 17:31:35.939: [    CRSD][1] ENV Debug Level(CRSPE): 0
    2010-03-03 17:31:35.939: [    CRSD][1] ENV Logging level for Module: CRSPE 0
    2010-03-03 17:31:35.939: [    CRSD][1] ENV Debug Level(CRSPLACE): 0
    2010-03-03 17:31:35.939: [    CRSD][1] ENV Logging level for Module: CRSPLACE 0
    2010-03-03 17:31:35.940: [    CRSD][1] ENV Debug Level(CRSRES): 0
    2010-03-03 17:31:35.940: [    CRSD][1] ENV Logging level for Module: CRSRES 0
    2010-03-03 17:31:35.940: [    CRSD][1] ENV Debug Level(CRSRPT): 0
    2010-03-03 17:31:35.940: [    CRSD][1] ENV Logging level for Module: CRSRPT 0
    2010-03-03 17:31:35.940: [    CRSD][1] ENV Debug Level(CRSRTI): 0
    2010-03-03 17:31:35.940: [    CRSD][1] ENV Logging level for Module: CRSRTI 0
    2010-03-03 17:31:35.940: [    CRSD][1] ENV Debug Level(CRSSE): 0
    2010-03-03 17:31:35.941: [    CRSD][1] ENV Logging level for Module: CRSSE 0
    2010-03-03 17:31:35.941: [    CRSD][1] ENV Debug Level(CRSSEC): 0
    2010-03-03 17:31:35.941: [    CRSD][1] ENV Logging level for Module: CRSSEC 0
    2010-03-03 17:31:35.941: [    CRSD][1] ENV Debug Level(CRSSHARED): 0
    2010-03-03 17:31:35.941: [    CRSD][1] ENV Logging level for Module: CRSSHARED 0
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Debug Level(CRSTIMER): 0
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Logging level for Module: CRSTIMER 0
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Debug Level(CRSUI): 0
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Logging level for Module: CRSUI 0
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Debug Level(CSSCLNT): 0
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Logging level for Module: CSSCLNT 0
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Debug Level(OCRAPI): 1
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Logging level for Module: OCRAPI 1
    2010-03-03 17:31:35.942: [    CRSD][1] ENV Debug Level(OCRASM): 1
    2010-03-03 17:31:35.943: [    CRSD][1] ENV Logging level for Module: OCRASM 1
    2010-03-03 17:31:35.943: [    CRSD][1] ENV Debug Level(OCRCAC): 1
    2010-03-03 17:31:35.943: [    CRSD][1] ENV Logging level for Module: OCRCAC 1
    2010-03-03 17:31:35.943: [    CRSD][1] ENV Debug Level(OCRCLI): 1
    2010-03-03 17:31:35.943: [    CRSD][1] ENV Logging level for Module: OCRCLI 1
    2010-03-03 17:31:35.943: [    CRSD][1] ENV Debug Level(OCRMAS): 1
    2010-03-03 17:31:35.943: [    CRSD][1] ENV Logging level for Module: OCRMAS 1
    2010-03-03 17:31:35.944: [    CRSD][1] ENV Debug Level(OCRMSG): 1
    2010-03-03 17:31:35.944: [    CRSD][1] ENV Logging level for Module: OCRMSG 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Debug Level(OCROSD): 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Logging level for Module: OCROSD 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Debug Level(OCRRAW): 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Logging level for Module: OCRRAW 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Debug Level(OCRSRV): 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Logging level for Module: OCRSRV 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Debug Level(OCRUTL): 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Logging level for Module: OCRUTL 1
    2010-03-03 17:31:35.945: [    CRSD][1] ENV Debug Level(SuiteTes): 1
    2010-03-03 17:31:35.946: [    CRSD][1] ENV Logging level for Module: SuiteTes 1
    2010-03-03 17:31:35.946: [    CRSD][1] ENV Debug Level(UiServer): 0
    2010-03-03 17:31:35.946: [    CRSD][1] ENV Logging level for Module: UiServer 0
    2010-03-03 17:31:35.946: [ CRSMAIN][1] Checking the OCR device
    2010-03-03 17:31:35.948: [ CRSMAIN][1] Connecting to the CSS Daemon
    2010-03-03 17:31:35.976: [ CRSMAIN][1] Initializing OCR
    2010-03-03 17:31:35.981: [  OCRAPI][1]clsu_get_private_ip_addr: Calling clsu_get_private_ip_addresses to get first private ip
    2010-03-03 17:31:35.981: [  OCRAPI][1]Check namebufs
    2010-03-03 17:31:35.981: [  OCRAPI][1]Finished checking namebufs
    2010-03-03 17:31:35.982: [    GIPC][1] gipcCheckInitialization: possible incompatible non-threaded init from [clsinet.c : 3232
    ], original from [clsss.c : 5026]
    2010-03-03 17:31:36.036: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:405] gpnp tracelevel 3, component tracelevel 0
    2010-03-03 17:31:36.037: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:535] '/u01/grid/11.2.0' in effect as GPnP home base.
    2010-03-03 17:31:36.059: [    GIPC][1] gipcCheckInitialization: possible incompatible non-threaded init from [clsgpnp0.c : 680
    ], original from [clsss.c : 5026]
    2010-03-03 17:31:36.067: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3867] Init gpnp local security key providers (2)
    fatal if both fail
    2010-03-03 17:31:36.068: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3870] Init gpnp local security key proveders 1 o
    f 2: file wallet (LSKP-FSW)
    2010-03-03 17:31:36.068: [    GPnP][1]clsgpnpkwf_initwfloc: [at clsgpnpkwf.c:398] Using FS Wallet Location : /u01/grid/11.2.0/
    gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:36.069: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3892] Init gpnp local security key provider 1 of
    2: file wallet (LSKP-FSW) OK
    2010-03-03 17:31:36.069: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3898] Init gpnp local security key proveders 2 o
    f 2: OLR wallet (LSKP-CLSW-OLR)
    [   CLWAL][1]clsw_Initialize: OLR initlevel [30000]
    2010-03-03 17:31:36.080: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3921] Init gpnp local security key provider 2 of
    2: OLR wallet (LSKP-CLSW-OLR) OK
    2010-03-03 17:31:36.081: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1952] <Get gpnp security keys (wallet) for id:1,typ;7. (2
    providers - fatal if all fail)
    2010-03-03 17:31:36.081: [    GPnP][1]clsgpnpkwf_getWalletPath: [at clsgpnpkwf.c:501] req_id=1 ck_prov_id=1 wallet path: /u01/
    grid/11.2.0/gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:36.152: [    GPnP][1]clsgpnpwu_walletfopen: [at clsgpnpwu.c:496] Opened SSO wallet: '/u01/grid/11.2.0/gpnp/ms
    kbkp2/wallets/peer/cwallet.sso'
    2010-03-03 17:31:36.152: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1968] Result: (0) CLSGPNP_OK. Get gpnp wallet - provider 1
    of 2 (LSKP-FSW(1))
    2010-03-03 17:31:36.152: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1982] Got gpnp security keys (wallet).>
    2010-03-03 17:31:36.172: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1952] <Get gpnp security keys (wallet) for id:1,typ;4. (2
    providers - fatal if all fail)
    2010-03-03 17:31:36.172: [    GPnP][1]clsgpnpkwf_getWalletPath: [at clsgpnpkwf.c:501] req_id=1 ck_prov_id=1 wallet path: /u01/
    grid/11.2.0/gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:36.238: [    GPnP][1]clsgpnpwu_walletfopen: [at clsgpnpwu.c:496] Opened SSO wallet: '/u01/grid/11.2.0/gpnp/ms
    kbkp2/wallets/peer/cwallet.sso'
    2010-03-03 17:31:36.239: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1968] Result: (0) CLSGPNP_OK. Get gpnp wallet - provider 1
    of 2 (LSKP-FSW(1))
    2010-03-03 17:31:36.239: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1982] Got gpnp security keys (wallet).>
    2010-03-03 17:31:36.239: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:840] GPnP client pid=18669, tl=3, f=0
    2010-03-03 17:31:36.541: [GIPCXCPT][1] gipcShutdownF: skipping shutdown, count 2, from [ clsinet.c : 1735], ret gipcretSuccess
    (0)
    2010-03-03 17:31:36.552: [GIPCXCPT][1] gipcShutdownF: skipping shutdown, count 1, from [ clsgpnp0.c : 1021], ret gipcretSucces
    s (0)
    2010-03-03 17:31:36.771: [  OCRRAW][1]proprioo: for disk 0 (+DR2_BIN), id match (1), total id sets, (1) need recover (0), my v
    otes (0), total votes (0), commit_lsn (9), lsn (9)
    2010-03-03 17:31:36.771: [  OCRRAW][1]proprioo: my id set: (833490748, 1028247821, 0, 0, 0)
    2010-03-03 17:31:36.772: [  OCRRAW][1]proprioo: 1st set: (833490748, 1028247821, 0, 0, 0)
    2010-03-03 17:31:36.772: [  OCRRAW][1]proprioo: 2nd set: (0, 0, 0, 0, 0)
    2010-03-03 17:31:36.830: [  OCRSRV][1]th_init: Successfully retrieved CSS misscount [31].
    2010-03-03 17:31:36.830: [  OCRSRV][1]th_init: Successfully query CLSS mode [3].
    [  OCRMAS][20]th_calc_av:5': Rturn persisted AV [186646784] [11.2.0.1.0]
    2010-03-03 17:31:36.920: [  OCRSRV][20]th_not_master_change: Master change callback not registered
    2010-03-03 17:31:36.920: [  OCRMAS][20]th_master:12: I AM THE NEW OCR MASTER at incar 1. Node Number 2
    2010-03-03 17:31:37.134: [  OCRASM][20]proprasmo: ASM cache size is [5MB]
    2010-03-03 17:31:37.142: [  OCRASM][20]proprasmo: ASM cache [5MB] enabled for disk group [DR2_BIN].
    2010-03-03 17:31:37.155: [  OCRRAW][20]proprioo: for disk 0 (+DR2_BIN), id match (1), total id sets, (1) need recover (0), my
    votes (0), total votes (0), commit_lsn (9), lsn (9)
    2010-03-03 17:31:37.155: [  OCRRAW][20]proprioo: my id set: (833490748, 1028247821, 0, 0, 0)
    2010-03-03 17:31:37.155: [  OCRRAW][20]proprioo: 1st set: (833490748, 1028247821, 0, 0, 0)
    2010-03-03 17:31:37.155: [  OCRRAW][20]proprioo: 2nd set: (0, 0, 0, 0, 0)
    2010-03-03 17:31:37.214: [  OCRMAS][20]proath_master:18: Spawned connection mgr thread
    2010-03-03 17:31:37.214: [  OCRMAS][20]proath_master:20: Spawned upgrade thread
    2010-03-03 17:31:37.214: [  OCRMAS][20]th_master:19.1: Wake up upgrade thread
    2010-03-03 17:31:37.216: [  OCRSRV][1]th_snap_local_spawn: Inside snap local spawn. host is [mskbkp2]
    2010-03-03 17:31:37.219: [ CRSMAIN][1] Running as user: root
    2010-03-03 17:31:37.219: [ CRSMAIN][1] CRSD running as the Privileged user
    2010-03-03 17:31:37.219: [  CLSVER][1] Static Version 11.2.0.1.0
    2010-03-03 17:31:37.226: [  OCRMAS][20]th_master:1': Recvd pubdata event from node [2]
    2010-03-03 17:31:37.227: [  OCRMAS][20]th_master:2': Recvd pubdata event for self. Do nothing.
    2010-03-03 17:31:37.227: [  CLSVER][1] Daemon version: 11.2.0.1.0 Software version: 11.2.0.1.0
    2010-03-03 17:31:37.231: [  CLSVER][1] Active Version from OCR:11.2.0.1.0
    2010-03-03 17:31:37.232: [  CLSVER][1] Active Version and Software Version are same
    2010-03-03 17:31:37.232: [  CLSVER][1] Active Version changed to 11.2.0.1.0
    2010-03-03 17:31:37.232: [  OCRSRV][1]th_reg_master_change: Master change callback registered
    2010-03-03 17:31:37.232: [  OCRAPI][1]a_reg_master_change: Registered master change callback
    2010-03-03 17:31:37.232: [  OCRSRV][1]th_not_master_change: Invoking master change callback. Master [2] Inc [1]
    2010-03-03 17:31:37.232: [  OCRAPI][1]a_reg_master_change: Notified master change
    2010-03-03 17:31:37.232: [ CRSMAIN][1] CAA Node Group Pri Data size: 128
    2010-03-03 17:31:37.233: [ CRSMAIN][1] CAA Node Group Pub Data size: 128
    2010-03-03 17:31:37.247: [ CRSMAIN][1] Getting private data of booted nodes
    2010-03-03 17:31:37.247: [ CRSMAIN][1] Checking for booted param on nodenum: 2
    2010-03-03 17:31:37.306: [    CLSE][1]clse_get_auth_loc: Returning default authloc: /u01/grid/11.2.0/auth/crs/mskbkp2
    2010-03-03 17:31:37.306: [ CRSMAIN][1] Using Authorizer location: /u01/grid/11.2.0/auth/crs/mskbkp2
    2010-03-03 17:31:37.314: [  OCRSRV][23]th_upgrade: Starting upgrade calculation
    2010-03-03 17:31:37.364: [  CLSCLU][1]clsclu_init: rc 0
    2010-03-03 17:31:37.381: [  OCRSRV][23]th_upgrade:10.1 AV [186646784]. State [11]. Already upgraded.Updated global data to the
    crs version group. Return [0]
    2010-03-03 17:31:37.385: [ CRSMAIN][1] Initializing RTI
    2010-03-03 17:31:37.433: [ CRSMAIN][1] Initializing ResouceStateListener
    2010-03-03 17:31:37.433: [CRSTIMER][37] Timer Thread Starting.
    2010-03-03 17:31:37.433: [ CRSMAIN][1] Initializing EVMMgr
    2010-03-03 17:31:37.446: [ CRSMAIN][1] Initializing ResourceMap Map
    2010-03-03 17:31:37.461: [ CRSMAIN][1] Subscribing to EVM events for apps
    2010-03-03 17:31:37.504: [ CRSMAIN][1] CRSD locked during state recovery, please wait.
    2010-03-03 17:31:37.516: [ CRSMAIN][1] CRSD recovered, unlocked.
    2010-03-03 17:31:37.525: [ default][1]clsu_get_private_ip_addr: Calling clsu_get_private_ip_addresses to get first private ip
    2010-03-03 17:31:37.525: [ default][1]Check namebufs
    2010-03-03 17:31:37.525: [ default][1]Finished checking namebufs
    2010-03-03 17:31:37.526: [    GIPC][1] gipcCheckInitialization: possible incompatible non-threaded init from [clsinet.c : 3232
    ], original from [clsss.c : 5026]
    2010-03-03 17:31:37.569: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:405] gpnp tracelevel 3, component tracelevel 0
    2010-03-03 17:31:37.569: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:535] '/u01/grid/11.2.0' in effect as GPnP home base.
    2010-03-03 17:31:37.587: [    GIPC][1] gipcCheckInitialization: possible incompatible non-threaded init from [clsgpnp0.c : 680
    ], original from [clsss.c : 5026]
    2010-03-03 17:31:37.595: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3867] Init gpnp local security key providers (2)
    fatal if both fail
    2010-03-03 17:31:37.595: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3870] Init gpnp local security key proveders 1 o
    f 2: file wallet (LSKP-FSW)
    2010-03-03 17:31:37.596: [    GPnP][1]clsgpnpkwf_initwfloc: [at clsgpnpkwf.c:398] Using FS Wallet Location : /u01/grid/11.2.0/
    gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:37.596: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3892] Init gpnp local security key provider 1 of
    2: file wallet (LSKP-FSW) OK
    2010-03-03 17:31:37.596: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3898] Init gpnp local security key proveders 2 o
    f 2: OLR wallet (LSKP-CLSW-OLR)
    [   CLWAL][1]clsw_Initialize: OLR initlevel [30000]
    2010-03-03 17:31:37.607: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3921] Init gpnp local security key provider 2 of
    2: OLR wallet (LSKP-CLSW-OLR) OK
    2010-03-03 17:31:37.607: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1952] <Get gpnp security keys (wallet) for id:1,typ;7. (2
    providers - fatal if all fail)
    2010-03-03 17:31:37.607: [    GPnP][1]clsgpnpkwf_getWalletPath: [at clsgpnpkwf.c:501] req_id=1 ck_prov_id=1 wallet path: /u01/
    grid/11.2.0/gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:37.673: [    GPnP][1]clsgpnpwu_walletfopen: [at clsgpnpwu.c:496] Opened SSO wallet: '/u01/grid/11.2.0/gpnp/ms
    kbkp2/wallets/peer/cwallet.sso'
    2010-03-03 17:31:37.673: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1968] Result: (0) CLSGPNP_OK. Get gpnp wallet - provider 1
    of 2 (LSKP-FSW(1))
    2010-03-03 17:31:37.673: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1982] Got gpnp security keys (wallet).>
    2010-03-03 17:31:37.690: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1952] <Get gpnp security keys (wallet) for id:1,typ;4. (2
    providers - fatal if all fail)
    2010-03-03 17:31:37.690: [    GPnP][1]clsgpnpkwf_getWalletPath: [at clsgpnpkwf.c:501] req_id=1 ck_prov_id=1 wallet path: /u01/
    grid/11.2.0/gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:37.754: [    GPnP][1]clsgpnpwu_walletfopen: [at clsgpnpwu.c:496] Opened SSO wallet: '/u01/grid/11.2.0/gpnp/ms
    kbkp2/wallets/peer/cwallet.sso'
    2010-03-03 17:31:37.754: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1968] Result: (0) CLSGPNP_OK. Get gpnp wallet - provider 1
    of 2 (LSKP-FSW(1))
    2010-03-03 17:31:37.754: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1982] Got gpnp security keys (wallet).>
    2010-03-03 17:31:37.755: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:840] GPnP client pid=18669, tl=3, f=0
    2010-03-03 17:31:37.806: [GIPCXCPT][1] gipcShutdownF: skipping shutdown, count 2, from [ clsinet.c : 1735], ret gipcretSuccess
    (0)
    2010-03-03 17:31:37.817: [GIPCXCPT][1] gipcShutdownF: skipping shutdown, count 1, from [ clsgpnp0.c : 1021], ret gipcretSucces
    s (0)
    2010-03-03 17:31:37.822: [ CRSMAIN][1] CRSD listening on 10 style E2E port (ADDRESS=(PROTOCOL=tcp)(HOST=172.31.25.112)(PORT=38
    983))
    2010-03-03 17:31:37.835: [ CRSMAIN][1] Starting Threads
    2010-03-03 17:31:37.858: [    CLSE][1]clse_get_auth_loc: Returning default authloc: /u01/grid/11.2.0/auth/crs/mskbkp2
    2010-03-03 17:31:37.858: [    CRSD][1] AuthLoc /u01/grid/11.2.0/auth/crs/mskbkp2
    2010-03-03 17:31:37.859: [    CRSD][1] PE active version: 11.2.0.1.0
    2010-03-03 17:31:37.859: [    CRSD][1] PE Engine: NEW
    2010-03-03 17:31:37.859: [    CRSD][1] Using OCR batch ops : ENABLED
    2010-03-03 17:31:37.860: [ CRSMAIN][1] Initializing Node Down Monitor
    2010-03-03 17:31:37.860: [ CRSMAIN][1] CRS Daemon Started.
    2010-03-03 17:31:37.860: [    CRSD][1] Connecting to the CSS Daemon
    2010-03-03 17:31:37.861: [    CRSD][1] Local CSS Node Number is: 2
    2010-03-03 17:31:37.863: [    CRSD][1] Local Css Node Name is: mskbkp2
    2010-03-03 17:31:37.863: [    CRSD][1] CRSDPersonality initialized
    2010-03-03 17:31:37.864: [ CRSMAIN][1] Process member data: CRSD:mskbkp2
    2010-03-03 17:31:37.864: [    CRSD][1][F-ALGO] getIpcPath returning (ADDRESS=(PROTOCOL=IPC)(KEY=CRSD_IPC_SOCKET_11))
    2010-03-03 17:31:37.865: [CLSFRAME][1] Inited lsf context 102b3f670
    2010-03-03 17:31:37.865: [CLSFRAME][1] Initing CLS Framework messaging
    2010-03-03 17:31:37.869: [    CRSD][1][F-ALGO] getIpcPath returning (ADDRESS=(PROTOCOL=IPC)(KEY=CRSD_IPC_SOCKET_11))
    2010-03-03 17:31:37.873: [UiServer][1] UI Comms initalize() 1
    2010-03-03 17:31:37.873: [CLSFRAME][1] New Framework state: 2
    2010-03-03 17:31:37.873: [CLSFRAME][1] M2M is starting...
    2010-03-03 17:31:37.873: [  CRSCCL][1]clsCclInit called by process: 18669
    2010-03-03 17:31:37.885: [  CRSCCL][1]USING CLSC ============
    2010-03-03 17:31:37.895: [ default][1]clsu_get_private_ip_addr: Calling clsu_get_private_ip_addresses to get first private ip
    2010-03-03 17:31:37.895: [ default][1]Check namebufs
    2010-03-03 17:31:37.895: [ default][1]Finished checking namebufs
    2010-03-03 17:31:37.950: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:405] gpnp tracelevel 3, component tracelevel 0
    2010-03-03 17:31:37.951: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:535] '/u01/grid/11.2.0' in effect as GPnP home base.
    2010-03-03 17:31:37.970: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3867] Init gpnp local security key providers (2)
    fatal if both fail
    2010-03-03 17:31:37.970: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3870] Init gpnp local security key proveders 1 o
    f 2: file wallet (LSKP-FSW)
    2010-03-03 17:31:37.970: [    GPnP][1]clsgpnpkwf_initwfloc: [at clsgpnpkwf.c:398] Using FS Wallet Location : /u01/grid/11.2.0/
    gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:37.970: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3892] Init gpnp local security key provider 1 of
    2: file wallet (LSKP-FSW) OK
    2010-03-03 17:31:37.970: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3898] Init gpnp local security key proveders 2 o
    f 2: OLR wallet (LSKP-CLSW-OLR)
    [   CLWAL][1]clsw_Initialize: OLR initlevel [70000]
    2010-03-03 17:31:37.980: [    GPnP][1]clsgpnp_InitCKProviders: [at clsgpnp0.c:3921] Init gpnp local security key provider 2 of
    2: OLR wallet (LSKP-CLSW-OLR) OK
    2010-03-03 17:31:37.980: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1952] <Get gpnp security keys (wallet) for id:1,typ;7. (2
    providers - fatal if all fail)
    2010-03-03 17:31:37.980: [    GPnP][1]clsgpnpkwf_getWalletPath: [at clsgpnpkwf.c:501] req_id=1 ck_prov_id=1 wallet path: /u01/
    grid/11.2.0/gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:38.049: [    GPnP][1]clsgpnpwu_walletfopen: [at clsgpnpwu.c:496] Opened SSO wallet: '/u01/grid/11.2.0/gpnp/ms
    kbkp2/wallets/peer/cwallet.sso'
    2010-03-03 17:31:38.049: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1968] Result: (0) CLSGPNP_OK. Get gpnp wallet - provider 1
    of 2 (LSKP-FSW(1))
    2010-03-03 17:31:38.049: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1982] Got gpnp security keys (wallet).>
    2010-03-03 17:31:38.068: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1952] <Get gpnp security keys (wallet) for id:1,typ;4. (2
    providers - fatal if all fail)
    2010-03-03 17:31:38.068: [    GPnP][1]clsgpnpkwf_getWalletPath: [at clsgpnpkwf.c:501] req_id=1 ck_prov_id=1 wallet path: /u01/
    grid/11.2.0/gpnp/mskbkp2/wallets/peer/
    2010-03-03 17:31:38.134: [    GPnP][1]clsgpnpwu_walletfopen: [at clsgpnpwu.c:496] Opened SSO wallet: '/u01/grid/11.2.0/gpnp/ms
    kbkp2/wallets/peer/cwallet.sso'
    2010-03-03 17:31:38.135: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1968] Result: (0) CLSGPNP_OK. Get gpnp wallet - provider 1
    of 2 (LSKP-FSW(1))
    2010-03-03 17:31:38.135: [    GPnP][1]clsgpnp_getCK: [at clsgpnp0.c:1982] Got gpnp security keys (wallet).>
    2010-03-03 17:31:38.135: [    GPnP][1]clsgpnp_Init: [at clsgpnp0.c:840] GPnP client pid=18669, tl=3, f=3
    2010-03-03 17:31:38.184: [GIPCXCPT][1] gipcShutdownF: skipping shutdown, count 2, from [ clsinet.c : 1735], ret gipcretSuccess
    (0)
    2010-03-03 17:31:38.194: [GIPCXCPT][1] gipcShutdownF: skipping shutdown, count 1, from [ clsgpnp0.c : 1021], ret gipcretSucces
    s (0)
    2010-03-03 17:31:38.200: [  CRSCCL][1]Listening endpoint created sucessfully @ (ADDRESS=(PROTOCOL=tcp)(DEV=54)(HOST=172.31.25.
    112)(PORT=38984)).con = 10359a0d0
    2010-03-03 17:31:38.209: [  CRSCCL][48]CSS Group Registration complete.
    2010-03-03 17:31:38.213: [  CRSCCL][48]cclGetMemberData called
    2010-03-03 17:31:38.215: [  CRSCCL][48]Obtained first membership map.
    2010-03-03 17:31:38.215: [  CRSCCL][48]Dumping member data ------------------
    2010-03-03 17:31:38.215: [  CRSCCL][48]Member (2, 603412550) on node port=.
    2010-03-03 17:31:38.216: [  CRSCCL][48]Done ------------------
    2010-03-03 17:31:38.216: [  CRSCCL][48]Waiting for reconfigs
    2010-03-03 17:31:38.216: [  CRSCCL][49]cclCommunicationHandler started.
    2010-03-03 17:31:38.220: [ CRSCOMM][1] Ipc: m_pClscCtx=1020c4850m_pUgblm=1035b2a50
    2010-03-03 17:31:38.220: [ CRSCOMM][1] Ipc: Starting send thread
    2010-03-03 17:31:38.220: [ CRSCOMM][1] IpcL: Listener instantiated for: (ADDRESS=(PROTOCOL=IPC)(KEY=CRSD_IPC_SOCKET_11))
    2010-03-03 17:31:38.221: [ CRSCOMM][52] Ipc: sendWork thread started.
    2010-03-03 17:31:38.222: [ CRSCOMM][1] IpcL: Listener started listening.
    2010-03-03 17:31:38.223: [ CRSCOMM][53] IpcL: thread started listening
    2010-03-03 17:31:38.223: [CLSFRAME][1] Starting thread model named: AgfwProxySrvTM
    2010-03-03 17:31:38.224: [CLSFRAME][1] Starting thread model named: OcrModuleTM
    2010-03-03 17:31:38.225: [CLSFRAME][1] Starting thread model named: PolicyEngineTM
    2010-03-03 17:31:38.225: [CLSFRAME][1] Starting thread model named: SharedThreadTM
    2010-03-03 17:31:38.225: [CLSFRAME][1] Starting thread model named: UiServerTM
    2010-03-03 17:31:38.225: [CLSFRAME][1] New Framework state: 3
    2010-03-03 17:31:38.227: [  CRSRPT][62] Enabled
    2010-03-03 17:31:38.228: [   CRSPE][61] PE Role|State Update: old role [INVALID] new [INVALID]; old state [Not yet initialized
    ] new [Enabling: waiting for role]
    2010-03-03 17:31:38.229: [   CRSSE][62] Master Change Event; New Master Node ID:2 This Node's ID:2
    2010-03-03 17:31:38.230: [   CRSPE][61] PE Role|State Update: old role [INVALID] new [MASTER]; old state [Enabling: waiting fo
    r role] new [Configuring]
    2010-03-03 17:31:38.230: [   CRSPE][61] PE MASTER NAME: mskbkp2
    2010-03-03 17:31:38.230: [   CRSPE][61] Starting to read configuration
    2010-03-03 17:31:38.260: [   CRSPE][61] Reading (2) servers
    2010-03-03 17:31:38.459: [   CRSPE][61] DM: set global config version to: 150
    2010-03-03 17:31:38.459: [   CRSPE][61] DM: set pool freeze timeout to: 60000
    2010-03-03 17:31:38.459: [   CRSPE][61] DM: Set event seq number to: 13900000
    2010-03-03 17:31:38.459: [   CRSPE][61] DM: Set threshold event seq number to: 13980000
    2010-03-03 17:31:38.460: [   CRSPE][61] Sent request to write event sequence number 14000000 to repository
    2010-03-03 17:31:38.483: [   CRSPE][61] Wrote new event sequence to repository
    2010-03-03 17:31:38.568: [   CRSPE][61] Reading (15) types
    2010-03-03 17:31:38.593: [   CRSPE][61] Reading (3) server pools
    2010-03-03 17:31:38.624: [   CRSPE][61] Reading (21) resources
    2010-03-03 17:31:39.987: [   CRSPE][61] Finished reading configuration. Parsing...
    2010-03-03 17:31:39.988: [   CRSPE][61] Parsing resource types...
    2010-03-03 17:31:40.030: [    CRSD][61] Initializing the config version for type ora.asm.type to: 1
    2010-03-03 17:31:40.035: [    CRSD][61] Initializing the config version for type ora.cluster_resource.type to: 1
    2010-03-03 17:31:40.040: [    CRSD][61] Initializing the config version for type ora.cluster_vip.type to: 1
    2010-03-03 17:31:40.044: [    CRSD][61] Initializing the config version for type ora.cluster_vip_net1.type to: 1
    2010-03-03 17:31:40.048: [    CRSD][61] Dump State Starting ...
    2010-03-03 17:31:40.048: [    CRSD][61] State Dump for RTILock
    2010-03-03 17:31:40.048: [    CRSD][61] Lock State List is busy, skipping ..
    2010-03-03 17:31:40.048: [    CRSD][61] State Dump for Timer
    2010-03-03 17:31:40.049: [    CRSD][61] Timer map size=0
    2010-03-03 17:31:40.049: [   CRSPE][61] Dumping PE Data Model...:DM has [0 resources][0 types][0 servers][0 spools]
    ------------- RESOURCES:
    ------------- TYPES:
    ------------- SERVERS:
    ------------- SERVER POOLS:
    2010-03-03 17:31:40.049: [   CRSPE][61] Dumping ICE contents...:ICE operation count: 0
    2010-03-03 17:31:40.049: [    CRSD][61] Dump State Done.
    I guess that there is some thing wrong in configuration, but cannot find out what.
    Any help would be appreciated.
    Thanks

    Hi,
    Please check your disk attributes and permission of OCR/Voting and other ASM devices. The disk attribute should be changed to be shared among all nodes of cluster. It happened with us in 10.2.0.4 where disk was not shared and we were able to start crs from only one node at a time so please check disk attributes. Please see blog keyurmakwanacrs.blogspot.com for AIX which we faced. Not surle whether you've similar problem or not. We had 10.2.0.4 clusterware.
    thanks,
    Keyur

  • Ora.asm -init failed on second node root.sh

    Hi All,
    Installing Grid Infrastructure for a 11gr2 Cluster on two nodes Oracle Linux 5 + Vsware vSphere v4, shared disk on same host machine. When run root.sh, first node was success but the second node got following error message (actually the first node was cloned from the seoncd):
    CRS-2672: Attempting to start 'ora.ctssd' on 'wandrac2'
    Start action for octssd aborted
    CRS-2676: Start of 'ora.ctssd' on 'wandrac2' succeeded
    CRS-2672: Attempting to start 'ora.drivers.acfs' on 'wandrac2'
    CRS-2672: Attempting to start 'ora.asm' on 'wandrac2'
    CRS-2676: Start of 'ora.drivers.acfs' on 'wandrac2' succeeded
    CRS-2676: Start of 'ora.asm' on 'wandrac2' succeeded
    CRS-2664: Resource 'ora.ctssd' is already running on 'wandrac2'
    CRS-4000: Command Start failed, or completed with errors.
    Command return code of 1 (256) from command: /orapp/racsl/11.2.0/bin/crsctl start resource ora.asm -init
    Start of resource "ora.asm -init" failed
    Failed to start ASM
    Failed to start Oracle Clusterware stack
    Thanks in advance for any information and helps,

    Hi,
    I came across this error and I am about to start a fresh installation of the grid. (ealier one failed because it was unable to read the memory in rac2 )
    Is there anything specific I can change before I start my installation.
    PS - I didnt get what exactly is going on with the hosts file.
    My files are as follows :
    RAC1 - etc/hosts
    [oracle@falcen6a ~]$ cat /etc/hosts
    # Do not remove the following line, or various programs
    # that require network functionality will fail.
    127.0.0.1 localhost.localdomain localhost
    ::1 localhost6.localdomain6 localhost6
    # Public
    192.168.100.218 falcen6a.a.pri falcen6a
    192.168.100.219 falcen6b.a.pri falcen6b
    # Private
    192.168.210.101 falcen6a-priv.a.pri falcen6a-priv
    192.168.210.102 falcen6b-priv.a.pri falcen6b-priv
    # Virtual
    192.168.100.212 falcen6a-vip.a.pri falcen6a-vip
    192.168.100.213 falcen6b-vip.a.pri falcen6b-vip
    # SCAN
    #192.168.100.208 falcen6-scan.a.pri falcen6-scan
    #192.168.100.209 falcen6-scan.a.pri falcen6-scan
    #192.168.100.210 falcen6-scan.a.pri falcen6-scan
    on RAC2
    [oracle@falcen6b ~]$ cat /etc/hosts
    # Do not remove the following line, or various programs
    # that require network functionality will fail.
    127.0.0.1 localhost.localdomain localhost
    ::1 localhost6.localdomain6 localhost6
    #Public
    192.168.100.218 falcen6a.a.pri falcen6a
    192.168.100.219 falcen6b.a.pri falcen6b
    # Private
    192.168.210.101 falcen6a-priv.a.pri falcen6a-priv
    192.168.210.102 falcen6b-priv.a.pri falcen6b-priv
    # Virtual
    192.168.100.212 falcen6a-vip.a.pri falcen6a-vip
    192.168.100.213 falcen6b-vip.a.pri falcen6b-vip
    # SCAN
    #192.168.100.208 falcen6-scan.a.pri falcen6-scan
    #192.168.100.209 falcen6-scan.a.pri falcen6-scan
    #192.168.100.210 falcen6-scan.a.pri falcen6-scan
    Can someone please confirm this??

  • Failed to start resource: Name: ora.racdb.db, node: null, filter: null, ms

    Hi DBA's.
    Im, running
    Finalizing Installation 96% the following Warning:
    [Thread-288] [ 2010-01-21 14:28:57.456 ARST ] [CRSNative.internalStartResource:352] Failed to start resource: Name: ora.racdb.db, node: null, filter: null, msg CRS-2674:
    Start of 'ora.racdb.db' on 'linux2' failed
    CRS-2678: 'ora.racdb.db' on 'linux2' has experienced an unrecoverable failure
    CRS-0267: Human intervention required to resume its availability.
    CRS-5807: Agent failed to process the message
    ORA-01034: ORACLE not available
    ORA-27101: shared memory realm does not exist
    Linux Error: 2: No such file or directory
    Process ID: 0
    Session ID: 0 Serial number: 0
    [Thread-288] [ 2010-01-21 14:28:57.457 ARST ] [PostDBCreationStep.executeImpl:828] Exception while Starting with HA Database Resource PRCR-1079 : Failed to start resourc
    e ora.racdb.db
    CRS-2674: Start of 'ora.racdb.db' on 'linux2' failed
    CRS-2678: 'ora.racdb.db' on 'linux2' has experienced an unrecoverable failure
    CRS-0267: Human intervention required to resume its availability.
    CRS-5807: Agent failed to process the message
    ORA-01034: ORACLE not available
    ORA-27101: shared memory realm does not exist
    Linux Error: 2: No such file or directory
    Process ID: 0
    Session ID: 0 Serial number: 0
    oracle$ dbca

    Hi...
    Now is Ok.
    I did:
    srvctl start instance -d racdb -i racdb2
    [oracle@linux1 oracle]$ su - grid -c "crsctl status resource -w \"TYPE co 'ora'\" -t"
    Password:
    NAME TARGET STATE SERVER STATE_DETAILS
    Local Resources
    ora.CRS.dg
    ONLINE ONLINE linux1
    ONLINE ONLINE linux2
    ora.FRA.dg
    ONLINE ONLINE linux1
    ONLINE ONLINE linux2
    ora.LISTENER.lsnr
    ONLINE ONLINE linux1
    ONLINE ONLINE linux2
    ora.RACDB_DATA.dg
    ONLINE ONLINE linux1
    ONLINE ONLINE linux2
    ora.asm
    ONLINE ONLINE linux1 Started
    ONLINE ONLINE linux2 Started
    ora.eons
    ONLINE ONLINE linux1
    ONLINE ONLINE linux2
    ora.gsd
    OFFLINE OFFLINE linux1
    OFFLINE OFFLINE linux2
    ora.net1.network
    ONLINE ONLINE linux1
    ONLINE ONLINE linux2
    ora.ons
    ONLINE ONLINE linux1
    ONLINE ONLINE linux2
    Cluster Resources
    ora.LISTENER_SCAN1.lsnr
    1 ONLINE ONLINE linux1
    ora.linux1.vip
    1 ONLINE ONLINE linux1
    ora.linux2.vip
    1 ONLINE ONLINE linux2
    ora.oc4j
    1 OFFLINE OFFLINE
    ora.racdb.db
    1 ONLINE ONLINE linux1 Open
    2 ONLINE ONLINE linux2 Open
    ora.scan1.vip
    1 ONLINE ONLINE linux1
    Thanks.

  • Failed to start ASM instance "+ASM1" on node "rac1"

    I have a problem, because when I start RAC and write command crs_stat -t
    column State have 2 wrong parameter..
    Name Type Target State Host
    ora.....CRM.cs application ONLINE ONLINE rac2
    ora....db1.srv application ONLINE ONLINE rac2
    ora.devdb.db application ONLINE ONLINE rac2
    ora....b1.inst application ONLINE OFFLINE
    ora....b2.inst application ONLINE ONLINE rac2
    ora....SM1.asm application ONLINE UNKNOWN rac1
    ora....C1.lsnr application ONLINE ONLINE rac1
    ora.rac1.gsd application ONLINE ONLINE rac1
    ora.rac1.ons application ONLINE ONLINE rac1
    ora.rac1.vip application ONLINE ONLINE rac1
    ora....SM2.asm application ONLINE ONLINE rac2
    ora....C2.lsnr application ONLINE ONLINE rac2
    ora.rac2.gsd application ONLINE ONLINE rac2
    ora.rac2.ons application ONLINE ONLINE rac2
    ora.rac2.vip application ONLINE ONLINE rac2
    When I try
    srvctl start asm -n rac1 then is wrong:
    PRKS-1009 : Failed to start ASM instance "+ASM1" on node "rac1", [PRKS-1009 : Failed to start ASM instance "+ASM1" on node "rac1", [CRS-1028: Dependency analysis failed because of:
    CRS-0223: Resource 'ora.rac1.ASM1.asm' has placement error.]]
    [PRKS-1009 : Failed to start ASM instance "+ASM1" on node "rac1", [CRS-1028: Dependency analysis failed because of:
    CRS-0223: Resource 'ora.rac1.ASM1.asm' has placement error.]]
    and when I try start instance manualy then
    PRKP-1001 : Error starting instance devdb1 on node rac1
    CRS-1028: Dependency analysis failed because of:
    CRS-0223: Resource 'ora.devdb.devdb1.inst' has placement error.
    :( Where is my problem??

    hi, i have exactly the same error
    but your suggestions of remove an recreate the asm resource not working
    ./srvctl remove asm -n dbs2 -i +ASM2 -f
    PRKS-1023 : Failed to remove CRS resource for ASM instance "+ASM2" on node "dbs2", [CRS-0214: Could not unregister resource 'ora.dbs2.ASM2.asm'.]
    ./srvctl start asm -n dbs2
    PRKS-1009 : Failed to start ASM instance "+ASM2" on node "dbs2", [PRKS-1009 : Failed to start ASM instance "+ASM2" on node "dbs2", [CRS-1028: Dependency analysis failed because of:
    CRS-0223: Resource 'ora.dbs2.ASM2.asm' has placement error.]]
    [PRKS-1009 : Failed to start ASM instance "+ASM2" on node "dbs2", [CRS-1028: Dependency analysis failed because of:
    CRS-0223: Resource 'ora.dbs2.ASM2.asm' has placement error.]]
    how do i proceed?
    iam using solaris 10 with t2000 and t5210 server and oracle 10.2.0.4

  • Public Interface not responding after second node is started in the cluster

    Hi
    Has anyone ever experienced the public interface not responding between nodes in the cluster (ping, ssh, scp) after the second nodeapps is started in the cluster?
    This is a new install so all I have installed so far is the base release of CRS 10.2.0. This is on Solaris10. The vipca failed during the installation, however I was able to proceed and manually add the nodeapps using srvctl add nodeaps -n -o -A.
    It seems after the second node is started I loose all connectivity to the public interfaces and to my default gateway.
    Also I'm getting the following messages sometimes after I try and stop the nodeapps and start them back up.
    CRS-1006: No more members to consider
    CRS-0215: Could not start resource 'ora.node1.vip'.
    Any suggestions on where I should start troubleshooting?
    Thanks

    Do you have default GW?
    It can connects with GW, can't it?
    Check metalink
    CRS-0215: Could not start resource 'ora..vip' [ID 356535.1]
    CRS-1006: No more members to consider when starting service [ID 465364.1]
    Good Luck

Maybe you are looking for

  • My contacts won't open so I can add email addresses

    my contacts app won't open for me to add emails can someone please help me

  • How to create ABAP Proxy for SSL secured ABAP Service

    Hi guys, I try to set up transport security for my ABAP web service. The service should be called via a ABAP Proxy. These are my steps to create the ABAP web service: 1. Create function module (se80) 2. Create web service (web service definition) (se

  • RE: (forte-users) memory management

    Brenda, When a partition starts, it reserves the MinimumAllocation. Within this memory space, objects are created and more and more of this memory is actually used. When objects are no longer referenced, they remain in memory and the space they occup

  • QT Pro suddenly won't record.

    I purchased QT pro in order to make quick desktop audio & video recordings of songs I'm working on. When I don't have time to open a Garageband or Cubase project & just want to get a simple idea recorded, QT pro worked great. Then it just stopped wor

  • Itunes runs fine, but as soon as I connect my Ipod Classic it says it's...

    stopped working. I run Windows 7 Home Premium and Itunes v10.2.1.1. Windows tries to say that if it can find a solution it will, but it never does. Very frustating indeed... Any ideas anyone! Stuart