Runcluvfy.sh fails at precheck - nodes unreachable

Hi,
i am trying to run runcluvfy.sh to check Oracle Clusterware Requirements as:
[oracle@raclinux1 clusterware]$ ./runcluvfy.sh stage -pre crsinst -n raclinux1,raclinux2 -r 11gR1 -verbose
The output is the following:
Performing pre-checks for cluster services setup
Checking node reachability...
raclinux1.acme.com: raclinux1.acme.com
Check: Node reachability from node "null"
Destination Node Reachable?
raclinux1 no
raclinux2 no
Result: Node reachability check failed from node "null".
ERROR:
Unable to reach any of the nodes.
Verification cannot proceed.
Pre-check for cluster services setup was unsuccessful on all the nodes.
Although i can use ssh to the other nodes without giving password:
[oracle@raclinux1 clusterware]$ ssh raclinux1 "hostname"
raclinux1.acme.com
[oracle@raclinux1 clusterware]$ ssh raclinux2 "hostname"
raclinux2.acme.com
I checked CV software log in /tmp/bootstrap/cv/log/cvutrace.log.0 and found:
[main] [11:5:26:119] [TaskNodeConnectivity.performTask:311] nw:Performing Node Connectivity verification task...
; Tue Apr 22 11:05:26 CEST 2008
[main] [11:5:26:147] [ResultSet.traceResultSet:239]
Target ResultSet BEFORE Upload===>
Overall Status->UNKNOWN
; Tue Apr 22 11:05:26 CEST 2008
[main] [11:5:26:150] [ResultSet.traceResultSet:239]
Source ResultSet ===>
Overall Status->OPERATION_FAILED
raclinux2-->OPERATION_FAILED
raclinux1-->OPERATION_FAILED
; Tue Apr 22 11:05:26 CEST 2008
[main] [11:5:26:151] [ResultSet.traceResultSet:239]
Target ResultSet AFTER Upload===>
Overall Status->OPERATION_FAILED
raclinux2-->OPERATION_FAILED
raclinux1-->OPERATION_FAILED
; Tue Apr 22 11:05:26 CEST 2008
[main] [11:5:26:154] [CluvfyDriver.main:360] ==== cluvfy exiting normally.; Tue Apr 22 11:05:26 CEST 2008
What does not say too much for me.
Regarding the tech stack:
I am on x86 Enterprise Linux 5 Update 1
cvuqdisk-1.0.1-1
Any idea what i made wrong ?
thanks in advance
szabolcs

Hi All,
thanks for your quick answers :)
I am following this note from OTN:
http://www.oracle.com/technology/pub/articles/hunter_rac11gr1_iscsi_2.html
fzheng :
I am using iscsitarget service. They are visible from all nodes,as:
[oracle@raclinux1 .ssh]$ ls -la /dev/disk/by-path/
total 0
drwxr-xr-x 2 root root 240 Apr 21 19:35 .
drwxr-xr-x 6 root root 120 Apr 21 17:56 ..
lrwxrwxrwx 1 root root 9 Apr 21 17:56 ip-10.0.0.4:3260-iscsi-iqn.2008-04.com.acme:san1.crs -> ../../sdc
lrwxrwxrwx 1 root root 10 Apr 21 17:56 ip-10.0.0.4:3260-iscsi-iqn.2008-04.com.acme:san1.crs-part1 -> ../../sdc1
lrwxrwxrwx 1 root root 9 Apr 21 17:56 ip-10.0.0.4:3260-iscsi-iqn.2008-04.com.acme:san1.disk1 -> ../../sda
lrwxrwxrwx 1 root root 10 Apr 21 19:35 ip-10.0.0.4:3260-iscsi-iqn.2008-04.com.acme:san1.disk1-part1 -> ../../sda1
lrwxrwxrwx 1 root root 9 Apr 21 17:56 ip-10.0.0.4:3260-iscsi-iqn.2008-04.com.acme:san1.disk2 -> ../../sdb
lrwxrwxrwx 1 root root 10 Apr 21 19:35 ip-10.0.0.4:3260-iscsi-iqn.2008-04.com.acme:san1.disk2-part1 -> ../../sdb1
Chris slattery:
from the note above, i did the followings:
[oracle@raclinux1 .ssh]$ /usr/bin/ssh-keygen -t rsa
Generating public/private rsa key pair.
Enter file in which to save the key (/home/oracle/.ssh/id_rsa):
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in /home/oracle/.ssh/id_rsa.
Your public key has been saved in /home/oracle/.ssh/id_rsa.pub.
The key fingerprint is:
06:c1:07:04:07:bd:9c:1e:b5:9f:9a:16:82:31:cd:92 [email protected]
[oracle@raclinux2 ~]$ /usr/bin/ssh-keygen -t rsa
Generating public/private rsa key pair.
Enter file in which to save the key (/home/oracle/.ssh/id_rsa):
/home/oracle/.ssh/id_rsa already exists.
Overwrite (y/n)? y
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in /home/oracle/.ssh/id_rsa.
Your public key has been saved in /home/oracle/.ssh/id_rsa.pub.
The key fingerprint is:
29:e5:11:06:e8:e5:ba:35:3d:ab:52:73:28:8f:42:b7 [email protected]
[oracle@raclinux3 .ssh]$ /usr/bin/ssh-keygen -t rsa
Generating public/private rsa key pair.
Enter file in which to save the key (/home/oracle/.ssh/id_rsa):
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in /home/oracle/.ssh/id_rsa.
Your public key has been saved in /home/oracle/.ssh/id_rsa.pub.
The key fingerprint is:
8f:d1:d0:36:c7:17:fb:3d:53:10:ad:49:c8:55:46:ae [email protected]
Then, from node1:
[oracle@raclinux1 .ssh]$ rm authorized_keys
[oracle@raclinux1 .ssh]$ touch authorized_keys
[oracle@raclinux1 .ssh]$ ls -l *.pub
-rw-r--r-- 1 oracle oinstall 410 Apr 22 14:09 id_rsa.pub
[oracle@raclinux1 .ssh]$ ssh raclinux1 cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
oracle@raclinux1's password:
[oracle@raclinux1 .ssh]$ ssh raclinux2 cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
[oracle@raclinux1 .ssh]$ ssh raclinux3 cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
[oracle@raclinux1 .ssh]$ ssh raclinux3-priv cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
[oracle@raclinux1 .ssh]$ ssh raclinux2-priv cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
[oracle@raclinux1 .ssh]$ ssh raclinux1-priv cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
Enter passphrase for key '/home/oracle/.ssh/id_rsa':
scp ~/.ssh/authorized_keys raclinux2:.ssh/authorized_keys
scp ~/.ssh/authorized_keys raclinux3:.ssh/authorized_keys
[oracle@raclinux1 .ssh]$ scp ~/.ssh/authorized_keys raclinux2:.ssh/authorized_keys
authorized_keys 100% 2460 2.4KB/s 00:00
[oracle@raclinux1 .ssh]$ scp ~/.ssh/authorized_keys raclinux3:.ssh/authorized_keys
authorized_keys 100% 2460 2.4KB/s 00:00
[oracle@raclinux1 .ssh]$ chmod 600 authorized_keys
[oracle@raclinux1 .ssh]$ ssh raclinux1 hostname
Enter passphrase for key '/home/oracle/.ssh/id_rsa':
raclinux1.acme.com
[oracle@raclinux1 .ssh]$ ssh raclinux2 hostname
Enter passphrase for key '/home/oracle/.ssh/id_rsa':
Enter passphrase for key '/home/oracle/.ssh/id_rsa':
raclinux2.acme.com
[oracle@raclinux1 .ssh]$ ssh raclinux3 hostname
Enter passphrase for key '/home/oracle/.ssh/id_rsa':
raclinux3.acme.com
[oracle@raclinux1 .ssh]$ exec /usr/bin/ssh-agent $SHELL
[oracle@raclinux1 .ssh]$ /usr/bin/ssh-add
Enter passphrase for /home/oracle/.ssh/id_rsa:
Identity added: /home/oracle/.ssh/id_rsa (/home/oracle/.ssh/id_rsa)
[oracle@raclinux1 .ssh]$ ssh raclinux1 hostname
raclinux1.acme.com
[oracle@raclinux1 .ssh]$ ssh raclinux2 hostname
raclinux2.acme.com
I am going to chec the documentation..
thanks
szabolcs

Similar Messages

Runcluvfy.sh unable to find nodes

I am trying to install Orcale CRS 10.2.0.2 ON rhas 4.3 64 BIT AND HAVE IMPLEMENTED nic BONDING.
Upon running ./runcluvfy.sh comp nodecon -n <node name> it says nodes unreachable.
Checked the follwoing:
ssh ok,ping ok,bond drivers have same name,proper entires in host file,ifconfig output is ok.
What else do I need to check??
Any help will be much appreciated.
Thx,
Amit

Check that you are able to ssh to the same server without being prompted for a password.
for instance:
from Server1: ssh Server1
If this does not work, runcluvfy.sh will failed to reach the local servers.
If ssh does not work, make sure to include the ssh key got generated correctly and it is include on the ~home/.ssh/authorized_keys file for oracle and root.

Root.sh failed on second node while installing CRS 10g on centos 5.5

root.sh failed on second node while installing CRS 10g
Hi all,
I am able to install Oracle 10g RAC clusterware on first node of the cluster. However, when I run the root.sh script as root
user on second node of the cluster, it fails with following error message:
NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 90 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10
and run cluvfy stage -post hwos -n all -verbose,it show message:
ERROR:
Could not find a suitable set of interfaces for VIPs.
Result: Node connectivity check failed.
Checking shared storage accessibility...
Disk Sharing Nodes (2 in count)
/dev/sda db2 db1
and run cluvfy stage -pre crsinst -n all -verbose,it show message:
ERROR:
Could not find a suitable set of interfaces for VIPs.
Result: Node connectivity check failed.
Checking system requirements for 'crs'...
No checks registered for this product.
and run cluvfy stage -post crsinst -n all -verbose,it show message:
Result: Node reachability check passed from node "DB2".
Result: User equivalence check passed for user "oracle".
Node Name CRS daemon CSS daemon EVM daemon
db2 no no no
db1 yes yes yes
Check: Health of CRS
Node Name CRS OK?
db1 unknown
Result: CRS health check failed.
check crsd.log and show message:
clsc_connect: (0x143ca610) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_db2_crs))
clsssInitNative: connect failed, rc 9
Any help would be greatly appreciated.
Edited by: 868121 on 2011-6-24 上午12:31

Hello, it took a little searching, but I found this in a note in the GRID installation guide for Linux/UNIX:
Public IP addresses and virtual IP addresses must be in the same subnet.
In your case, you are using two different subnets for the VIPs.

Failed to add node to cluster

Hey, I am currently migrating my cluster.
I removed the server pool master according to the metalink note by doing a failover (stopped the agent on the server pool master)
Deleted the old master (node2) from the server pool.
Executed the cleanup script on node2 and switched it off
Modified the cluster.conf on the remaining node and remove the entries for the old master node2.
Replaced the old server with new hardware -
same name - same ip.
Now I try to add this server to the server pool, but I get a timeout message
OVM-1006 Register Oracle VM Server (node2) Failed: errcode=00001, errmsg=CDS accquire lock /etc/ovs-agent/db/srv.lock timeout. locker process is 8339
Where can I look ?
Christian

Lemeunier wrote:
> environment: sles 10 sp3, oes2, cluster services
>
> problem: reconfiguring oes to add a node to the cluster is causing the
> error *failed to add node to cluster*
>
> history: I installed a 4 node cluster in a HP C7000 blade. We had to
> replace the network switch in the blade center by a virtual connect
> flex-10. This resulted in a loss of network connectivity, so I removed 3
> of 4 nodes from cluster and eDirectory.
> This worked fine, replication and time synchronisation was succesfully
> and all server objects belonging to these 3 servers were deleted.
>
> Now the new switch has been configured and network connection
> reestablished. Reconfiguring eDirectory and other oes2 services
> succeeds, alle server objects are recreated, eDirectory is in sync, but
> reconfiguring cluster services does not succeed.
>
> What do I have to do, to reconfigure cluster service and add nodes to
> the cluster?
>
> Thank you for all hints.
>
> Ursula
>
>
Did you remove the cluster rpms and then reinstall the rpms. I would
recommend following TID 3131978 and see if that helps.

Root.sh failed in one node - CLSMON and UDLM

Hi experts.
My enviroment is:
2-node SunCluster Update3
Oracle RAC 10.2.0.1 > planning to upgrade to 10.2.0.4
The problem is: I installed the CRS services on 2 nodes - OK
After that, running root.sh fails in 1 node:
/u01/app/product/10/CRS/root.sh
WARNING: directory '/u01/app/product/10' is not owned by root
WARNING: directory '/u01/app/product' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
Checking to see if Oracle CRS stack is already configured
Checking to see if any 9i GSD is up
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/u01/app/product/10' is not owned by root
WARNING: directory '/u01/app/product' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 0: spodhcsvr10 clusternode1-priv spodhcsvr10
node 1: spodhcsvr12 clusternode2-priv spodhcsvr12
clscfg: Arguments check out successfully.
NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Sep 22 13:34:17 spodhcsvr10 root: Oracle Cluster Ready Services starting by user request.
Startup will be queued to init within 30 seconds.
Sep 22 13:34:20 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Sep 22 13:34:34 spodhcsvr10 last message repeated 3 times
Sep 22 13:34:34 spodhcsvr10 root: Running CRSD with TZ = Brazil/East
Sep 22 13:34:40 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:35:43 spodhcsvr10 last message repeated 9 times
Sep 22 13:36:07 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 22 13:36:07 spodhcsvr10 root: Running CRSD with TZ = Brazil/East
Sep 22 13:36:14 spodhcsvr10 su: libsldap: Status: 85 Mesg: openConnection: simple bind failed - Timed out
Sep 22 13:36:19 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:37:35 spodhcsvr10 last message repeated 11 times
Sep 22 13:37:40 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 22 13:37:40 spodhcsvr10 root: Running CRSD with TZ = Brazil/East
Sep 22 13:37:42 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:38:03 spodhcsvr10 last message repeated 3 times
Sep 22 13:38:10 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:39:12 spodhcsvr10 last message repeated 9 times
Sep 22 13:39:13 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 22 13:39:13 spodhcsvr10 root: Running CRSD with TZ = Brazil/East
Sep 22 13:39:19 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:40:42 spodhcsvr10 last message repeated 12 times
Sep 22 13:40:46 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 22 13:40:46 spodhcsvr10 root: Running CRSD with TZ = Brazil/East
Sep 22 13:40:49 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:42:05 spodhcsvr10 last message repeated 11 times
Sep 22 13:42:11 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 22 13:42:12 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:42:19 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 22 13:42:19 spodhcsvr10 root: Running CRSD with TZ = Brazil/East
Sep 22 13:42:19 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Sep 22 13:43:49 spodhcsvr10 last message repeated 13 times
Sep 22 13:43:51 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 22 13:43:51 spodhcsvr10 root: Running CRSD with TZ = Brazil/East
Sep 22 13:43:56 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 10. Respawning
Failure at final check of Oracle CRS stack.
I traced the ocssd.log and found some informations:
[    CSSD]2010-09-22 14:04:14.739 [6] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/vx/rdsk/racdg/ora_vote1)
[    CSSD]2010-09-22 14:04:14.742 [6] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2478) LATS(0) Disk lastSeqNo(2478)
[    CSSD]2010-09-22 14:04:14.742 [7] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (1//dev/vx/rdsk/racdg/ora_vote2)
[    CSSD]2010-09-22 14:04:14.744 [7] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2478) LATS(0) Disk lastSeqNo(2478)
[    CSSD]2010-09-22 14:04:14.745 [8] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (2//dev/vx/rdsk/racdg/ora_vote3)
[    CSSD]2010-09-22 14:04:14.746 [8] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2478) LATS(0) Disk lastSeqNo(2478)
[    CSSD]2010-09-22 14:04:14.785 [1] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2010-09-22 14:04:14.785 [10] >TRACE: clssnmFatalThread: spawned
[    CSSD]2010-09-22 14:04:14.785 [1] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2010-09-22 14:04:14.786 [11] >TRACE: clssnmconnect: connecting to node 0, flags 0x0001, connector 1
[    CSSD]2010-09-22 14:04:23.075 >USER: Oracle Database 10g CSS Release 10.2.0.1.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
[    CSSD]2010-09-22 14:04:23.075 >USER: CSS daemon log for node spodhcsvr10, number 0, in cluster NET_RAC
[ clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=spodhcsvr10DBG_CSSD))
[    CSSD]2010-09-22 14:04:23.082 [1] >TRACE: clssscmain: local-only set to false
[    CSSD]2010-09-22 14:04:23.096 [1] >TRACE: clssnmReadNodeInfo: added node 0 (spodhcsvr10) to cluster
[    CSSD]2010-09-22 14:04:23.106 [1] >TRACE: clssnmReadNodeInfo: added node 1 (spodhcsvr12) to cluster
[    CSSD]2010-09-22 14:04:23.129 [5] >TRACE: [0]Node monitor: dlm attach failed error LK_STAT_NOTCREATED
[    CSSD]CLSS-0001: skgxn not active
[    CSSD]2010-09-22 14:04:23.129 [5] >TRACE: clssnm_skgxnmon: skgxn init failed, rc 30
[    CSSD]2010-09-22 14:04:23.132 [1] >TRACE: clssnmInitNMInfo: misscount set to 600
[    CSSD]2010-09-22 14:04:23.136 [1] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0//dev/vx/rdsk/racdg/ora_vote1)
[    CSSD]2010-09-22 14:04:23.139 [1] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (1//dev/vx/rdsk/racdg/ora_vote2)
[    CSSD]2010-09-22 14:04:23.143 [1] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (2//dev/vx/rdsk/racdg/ora_vote3)
[    CSSD]2010-09-22 14:04:25.139 [6] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/vx/rdsk/racdg/ora_vote1)
[    CSSD]2010-09-22 14:04:25.142 [6] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2488) LATS(0) Disk lastSeqNo(2488)
[    CSSD]2010-09-22 14:04:25.143 [7] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (1//dev/vx/rdsk/racdg/ora_vote2)
[    CSSD]2010-09-22 14:04:25.144 [7] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2488) LATS(0) Disk lastSeqNo(2488)
[    CSSD]2010-09-22 14:04:25.145 [8] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (2//dev/vx/rdsk/racdg/ora_vote3)
[    CSSD]2010-09-22 14:04:25.148 [8] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2489) LATS(0) Disk lastSeqNo(2489)
[    CSSD]2010-09-22 14:04:25.186 [1] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2010-09-22 14:04:25.186 [10] >TRACE: clssnmFatalThread: spawned
[    CSSD]2010-09-22 14:04:25.186 [1] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2010-09-22 14:04:25.187 [11] >TRACE: clssnmconnect: connecting to node 0, flags 0x0001, connector 1
[    CSSD]2010-09-22 14:04:33.449 >USER: Oracle Database 10g CSS Release 10.2.0.1.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
[    CSSD]2010-09-22 14:04:33.449 >USER: CSS daemon log for node spodhcsvr10, number 0, in cluster NET_RAC
[ clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=spodhcsvr10DBG_CSSD))
[    CSSD]2010-09-22 14:04:33.457 [1] >TRACE: clssscmain: local-only set to false
[    CSSD]2010-09-22 14:04:33.470 [1] >TRACE: clssnmReadNodeInfo: added node 0 (spodhcsvr10) to cluster
[    CSSD]2010-09-22 14:04:33.480 [1] >TRACE: clssnmReadNodeInfo: added node 1 (spodhcsvr12) to cluster
[    CSSD]2010-09-22 14:04:33.498 [5] >TRACE: [0]Node monitor: dlm attach failed error LK_STAT_NOTCREATED
[    CSSD]CLSS-0001: skgxn not active
[    CSSD]2010-09-22 14:04:33.498 [5] >TRACE: clssnm_skgxnmon: skgxn init failed, rc 30
[    CSSD]2010-09-22 14:04:33.500 [1] >TRACE: clssnmInitNMInfo: misscount set to 600
[    CSSD]2010-09-22 14:04:33.505 [1] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0//dev/vx/rdsk/racdg/ora_vote1)
[    CSSD]2010-09-22 14:04:33.508 [1] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (1//dev/vx/rdsk/racdg/ora_vote2)
[    CSSD]2010-09-22 14:04:33.510 [1] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (2//dev/vx/rdsk/racdg/ora_vote3)
[    CSSD]2010-09-22 14:04:35.508 [6] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/vx/rdsk/racdg/ora_vote1)
[    CSSD]2010-09-22 14:04:35.510 [6] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2499) LATS(0) Disk lastSeqNo(2499)
[    CSSD]2010-09-22 14:04:35.510 [7] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (1//dev/vx/rdsk/racdg/ora_vote2)
[    CSSD]2010-09-22 14:04:35.512 [7] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2499) LATS(0) Disk lastSeqNo(2499)
[    CSSD]2010-09-22 14:04:35.513 [8] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (2//dev/vx/rdsk/racdg/ora_vote3)
[    CSSD]2010-09-22 14:04:35.514 [8] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(2499) LATS(0) Disk lastSeqNo(2499)
[    CSSD]2010-09-22 14:04:35.553 [1] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2010-09-22 14:04:35.553 [10] >TRACE: clssnmFatalThread: spawned
[    CSSD]2010-09-22 14:04:35.553 [1] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2010-09-22 14:04:35.553 [11] >TRACE: clssnmconnect: connecting to node 0, flags 0x0001, connector 1
I believe the main error is:
[    CSSD]2010-09-22 14:04:33.498 [5] >TRACE: [0]Node monitor: dlm attach failed error LK_STAT_NOTCREATED
[    CSSD]CLSS-0001: skgxn not active
And the communication between UDLM and CLSMON. But i don't know how to resolve this.
My UDLM version is 3.3.4.9.
Somebody have any ideas about this?
Tks!

Now i finally installed CRS and run root.sh without errors (i think that problem is in some old file from other instalation tries...)
But now i have another problem: When install DB software, in step to copy instalation to remote node, this node have some failure in CLSMON/CSSD daemon and panicking:
Sep 23 16:10:51 spodhcsvr10 root: Oracle CLSMON terminated with unexpected status 138. Respawning
Sep 23 16:10:52 spodhcsvr10 root: Oracle CSSD failure. Rebooting for cluster integrity.
Sep 23 16:10:52 spodhcsvr10 root: [ID 702911 user.alert] Oracle CSSD failure. Rebooting for cluster integrity.
Sep 23 16:10:51 spodhcsvr10 root: [ID 702911 user.error] Oracle CLSMON terminated with unexpected status 138. Respawning
Sep 23 16:10:52 spodhcsvr10 root: [ID 702911 user.alert] Oracle CSSD failure. Rebooting for cluster integrity.
Sep 23 16:10:56 spodhcsvr10 Cluster.OPS.UCMMD: fatal: received signal 15
Sep 23 16:10:56 spodhcsvr10 Cluster.OPS.UCMMD: [ID 770355 daemon.error] fatal: received signal 15
Sep 23 16:10:59 spodhcsvr10 root: Oracle Cluster Ready Services waiting for SunCluster and UDLM to start.
Sep 23 16:10:59 spodhcsvr10 root: Cluster Ready Services completed waiting on dependencies.
Sep 23 16:10:59 spodhcsvr10 root: [ID 702911 user.error] Oracle Cluster Ready Services waiting for SunCluster and UDLM to start.
Sep 23 16:10:59 spodhcsvr10 root: [ID 702911 user.error] Cluster Ready Services completed waiting on dependencies.
Notifying cluster that this node is panicking
The instalation in first node continue and report error in copy to second node.
Any ideas? Tks!

Root.sh fails on 2nd node

AIX 6
Oracle grid infrastructure 11.2.0.3
At the end of the grid install, ran the root.sh on the first node then on the second node, but failed on the second node. Ran deconfig was successfull, but root.sh failed again :
The deconfig worked but not the root.sh:
Successfully deconfigured Oracle clusterware stack on this node
mtnx213:/oracle/app/grid/product/11.2.0/grid/crs/install#/oracle/app/grid/product/11.2.0/grid/root.sh
Performing root user operation for Oracle 11g
The following environment variables are set as:
ORACLE_OWNER= oragrid
ORACLE_HOME= /oracle/app/grid/product/11.2.0/grid
Enter the full pathname of the local bin directory: [/usr/local/bin]:
The contents of "dbhome" have not changed. No need to overwrite.
The contents of "oraenv" have not changed. No need to overwrite.
The contents of "coraenv" have not changed. No need to overwrite.
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root script.
Now product-specific root actions will be performed.
Using configuration parameter file: /oracle/app/grid/product/11.2.0/grid/crs/install/crsconfig_params
User ignored Prerequisites during installation
User oragrid has the required capabilities to run CSSD in realtime mode
OLR initialization - successful
Adding Clusterware entries to inittab
USM driver install actions failed
/oracle/app/grid/product/11.2.0/grid/perl/bin/perl -I/oracle/app/grid/product/11.2.0/grid/perl/lib -I/oracle/app/grid/product/11.2.0/grid/crs/install /oracle/app/grid/product/11.2.0/grid/crs/install/rootcrs.pl execution failed

My answer you can find here (in your duplicate post): root.sh fails on 2nd node Timed out waiting for the CRS stack to start

11G R2 root.sh failed on first node with OLE fetch parameter error

I have successfully installed 11G R2.1 on Centos 5.4 64 bit.
Now it's coming to install 11G R2.2 on Redhat 5.4 64bit with HDS storrage.
[grid@dmdb1 grid]$ uname -a
Linux dmdb1 2.6.18-164.el5 #1 SMP Tue Aug 18 15:51:48 EDT 2009 x86_64 x86_64 x86_64 GNU/Linux
I passed all pre-ins requirements except shared storage. However, I manually verify it with no problems.
[grid@dmdb1 grid]$ ./runcluvfy.sh stage -pre crsinst -fixup -n dmdb1,dmdb2,dmdb3,dmdb4 -verbose|grep -i fail
[grid@dmdb1 grid]$ ./runcluvfy.sh stage -post hwos -n dmdb1,dmdb2,dmdb3,dmdb4 -verbose|grep -i fail
[grid@dmdb1 grid]$ ./runcluvfy.sh comp sys -n dmdb1,dmdb2,dmdb3,dmdb4 -p crs -osdba dba -orainv oinstall
Verifying system requirement
Total memory check passed
Available memory check passed
Swap space check passed
Free disk space check passed for "dmdb4:/tmp"
Free disk space check passed for "dmdb3:/tmp"
Free disk space check passed for "dmdb2:/tmp"
Free disk space check passed for "dmdb1:/tmp"
User existence check passed for "grid"
Group existence check passed for "oinstall"
Group existence check passed for "dba"
Membership check for user "grid" in group "oinstall" [as Primary] passed
Membership check for user "grid" in group "dba" passed
Run level check passed
Hard limits check passed for "maximum open file descriptors"
Soft limits check passed for "maximum open file descriptors"
Hard limits check passed for "maximum user processes"
Soft limits check passed for "maximum user processes"
System architecture check passed
Kernel version check passed
Kernel parameter check passed for "semmsl"
Kernel parameter check passed for "semmns"
Kernel parameter check passed for "semopm"
Kernel parameter check passed for "semmni"
Kernel parameter check passed for "shmmax"
Kernel parameter check passed for "shmmni"
Kernel parameter check passed for "shmall"
Kernel parameter check passed for "file-max"
Kernel parameter check passed for "ip_local_port_range"
Kernel parameter check passed for "rmem_default"
Kernel parameter check passed for "rmem_max"
Kernel parameter check passed for "wmem_default"
Kernel parameter check passed for "wmem_max"
Kernel parameter check passed for "aio-max-nr"
Package existence check passed for "make-3.81"
Package existence check passed for "binutils-2.17.50.0.6"
Package existence check passed for "gcc-4.1"
Package existence check passed for "libaio-0.3.106 (i386)"
Package existence check passed for "libaio-0.3.106 (x86_64)"
Package existence check passed for "glibc-2.5-24 (i686)"
Package existence check passed for "glibc-2.5-24 (x86_64)"
Package existence check passed for "compat-libstdc++-33-3.2.3 (i386)"
Package existence check passed for "compat-libstdc++-33-3.2.3 (x86_64)"
Package existence check passed for "elfutils-libelf-0.125 (x86_64)"
Package existence check passed for "elfutils-libelf-devel-0.125"
Package existence check passed for "glibc-common-2.5"
Package existence check passed for "glibc-devel-2.5 (i386)"
Package existence check passed for "glibc-devel-2.5 (x86_64)"
Package existence check passed for "glibc-headers-2.5"
Package existence check passed for "gcc-c++-4.1.2"
Package existence check passed for "libaio-devel-0.3.106 (i386)"
Package existence check passed for "libaio-devel-0.3.106 (x86_64)"
Package existence check passed for "libgcc-4.1.2 (i386)"
Package existence check passed for "libgcc-4.1.2 (x86_64)"
Package existence check passed for "libstdc++-4.1.2 (i386)"
Package existence check passed for "libstdc++-4.1.2 (x86_64)"
Package existence check passed for "libstdc++-devel-4.1.2 (x86_64)"
Package existence check passed for "sysstat-7.0.2"
Package existence check passed for "unixODBC-2.2.11 (i386)"
Package existence check passed for "unixODBC-2.2.11 (x86_64)"
Package existence check passed for "unixODBC-devel-2.2.11 (i386)"
Package existence check passed for "unixODBC-devel-2.2.11 (x86_64)"
Package existence check passed for "ksh-20060214"
Check for multiple users with UID value 0 passed
Verification of system requirement was successful.
[grid@dmdb1 grid]$ ./runcluvfy.sh comp sys -n dmdb1,dmdb2,dmdb3,dmdb4 -p database -osdba dba -orainv oinstall|grep -i fail
[grid@dmdb1 grid]$ ./runcluvfy.sh comp ssa -n dmdb1,dmdb2,dmdb3,dmdb4
Verifying shared storage accessibility
Checking shared storage accessibility...
Storage operation failed
Shared storage check failed on nodes "dmdb4,dmdb3,dmdb2,dmdb1"
Verification of shared storage accessibility was unsuccessful on all the specified nodes.
I followed below article to verify shared storage issues:
http://www.webofwood.com/rac/oracle-response-to-shared-storage-check-failed-on-nodes/
it's ok.
So I skipped SSA issue and go on install with (./runInstaller -ignoreInternalDriverError).
However, when I ran root.sh with below error:
CRS-2673: Attempting to stop 'ora.mdnsd' on 'dmdb1'
CRS-2677: Stop of 'ora.mdnsd' on 'dmdb1' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'dmdb1'
CRS-2677: Stop of 'ora.gipcd' on 'dmdb1' succeeded
CRS-4000: Command Start failed, or completed with errors.
CRS-2672: Attempting to start 'ora.gipcd' on 'dmdb1'
CRS-2672: Attempting to start 'ora.mdnsd' on 'dmdb1'
CRS-2676: Start of 'ora.gipcd' on 'dmdb1' succeeded
CRS-2676: Start of 'ora.mdnsd' on 'dmdb1' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'dmdb1'
CRS-2676: Start of 'ora.gpnpd' on 'dmdb1' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'dmdb1'
CRS-2676: Start of 'ora.cssdmonitor' on 'dmdb1' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'dmdb1'
CRS-2672: Attempting to start 'ora.diskmon' on 'dmdb1'
CRS-2676: Start of 'ora.diskmon' on 'dmdb1' succeeded
CRS-2674: Start of 'ora.cssd' on 'dmdb1' failed
CRS-2679: Attempting to clean 'ora.cssd' on 'dmdb1'
CRS-2681: Clean of 'ora.cssd' on 'dmdb1' succeeded
CRS-2673: Attempting to stop 'ora.diskmon' on 'dmdb1'
CRS-2677: Stop of 'ora.diskmon' on 'dmdb1' succeeded
CRS-2673: Attempting to stop 'ora.gpnpd' on 'dmdb1'
CRS-2677: Stop of 'ora.gpnpd' on 'dmdb1' succeeded
CRS-2673: Attempting to stop 'ora.mdnsd' on 'dmdb1'
CRS-2677: Stop of 'ora.mdnsd' on 'dmdb1' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'dmdb1'
CRS-2677: Stop of 'ora.gipcd' on 'dmdb1' succeeded
CRS-4000: Command Start failed, or completed with errors.
Command return code of 1 (256) from command: /opt/app/11.2.0/grid/bin/crsctl start resource ora.ctssd -init
Start of resource "ora.ctssd -init" failed
Clusterware exclusive mode start of resource ora.ctssd failed
CRS-2500: Cannot stop resource 'ora.crsd' as it is not running
CRS-4000: Command Stop failed, or completed with errors.
Command return code of 1 (256) from command: /opt/app/11.2.0/grid/bin/crsctl stop resource ora.crsd -init
Stop of resource "ora.crsd -init" failed
Failed to stop CRSD
CRS-2500: Cannot stop resource 'ora.asm' as it is not running
CRS-4000: Command Stop failed, or completed with errors.
Command return code of 1 (256) from command: /opt/app/11.2.0/grid/bin/crsctl stop resource ora.asm -init
Stop of resource "ora.asm -init" failed
Failed to stop ASM
CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'dmdb1'
CRS-2677: Stop of 'ora.cssdmonitor' on 'dmdb1' succeeded
Initial cluster configuration failed. See /opt/app/11.2.0/grid/cfgtoollogs/crsconfig/rootcrs_dmdb1.log for details
I manually ran '/opt/app/11.2.0/grid/bin/crsctl start resource ora.ctssd -init' and got below erros from /opt/app/11.2.0/grid/log/dmdb1/cssd/ocssd.log
Oracle Database 11g Clusterware Release 11.2.0.1.0 - Production Copyright 1996, 2009 Oracle. All rights reserved.
2011-09-23 19:06:41.501: [    CSSD][1812336384]clssscmain: Starting CSS daemon, version 11.2.0.1.0, in (exclusive) mode with uniqueness value 1316776001
2011-09-23 19:06:41.502: [    CSSD][1812336384]clssscmain: Environment is production
2011-09-23 19:06:41.502: [    CSSD][1812336384]clssscmain: Core file size limit extended
2011-09-23 19:06:41.515: [    CSSD][1812336384]clssscGetParameterOLR: OLR fetch for parameter logsize (8) failed with rc 21
2011-09-23 19:06:41.515: [    CSSD][1812336384]clssscSetPrivEnv: IPMI device not installed on this node
2011-09-23 19:06:41.517: [    CSSD][1812336384]clssscGetParameterOLR: OLR fetch for parameter priority (15) failed with rc 21
2011-09-23 19:06:41.539: [    CSSD][1812336384]clssscExtendLimits: The current soft limit for file descriptors is 65536, hard limit is 65536
2011-09-23 19:06:41.539: [    CSSD][1812336384]clssscExtendLimits: The current soft limit for locked memory is 4294967295, hard limit is 4294967295
2011-09-23 19:06:41.541: [    CSSD][1812336384]clssscmain: Running as user grid
anybody can help me fix it?

I opened on SR for this case.
it's ok now.
Below is from Oracle Global Service request:
=== ODM Action Plan ===
Dear customer, after went through the uploaded log files, we found the issue looks like
bug 9732641 : The clusterware gpnpd process crashes when there is more than 1 cluster with the same name.
To narrow down the issue, pls apply the following steps.
1. Pls clean the previous configuration with below steps, then run root.sh script on node1 again.
1.1 remove current configuration.
$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force
1.2 remove other related files.
if $GI_BASE/Clusterware/ckptGridHA_.xml still there, please remove it manually with "rm" command on all nodes
If the gpnp profile is still there, pls clean up them, then rebuild require directories.
$ rm -rf $GRID_HOME/gpnp/*
$ mkdir -p $GRID_HOME/gpnp/profiles/peer $GRID_HOME/gpnp/wallets/peer $GRID_HOME/gpnp/wallets/prdr $GRID_HOME/gpnp/wallets/pa $GRID_HOME/gpnp/wallets/root
2. After the previous configuration was cleaned up, pls rerun the root.sh script again. If the issue still there, pls upload the following:
Everything under <GI_HOME>/log
Everything under <ORACLE_BAES for grid user>/cfgtoollogs
Everything under <GI_HOME>/cfgtolllogs/crsconfig
OS log(/var/log/messages)
3. Pls also make sure there is only one GI running on your cluster.
See /opt/app/11.2.0/grid/cfgtoollogs/crsconfig/rootcrs_dmdb1.log for details

Free diskspace check failed foe ecah node

Hi all
I am tring to install oracle RAC in Windows 2003 srver.I have creared 2 virtual pc.I follwed every step correctly coz the clusterware software installs upto vipca.After that i sudelny see a blue screen stating that the hardware failure .Now i am trng eith 2 virtual pcs.when i run runcluvfy.bat stage -pre crsinst -n RAC1,RAC2 -verbose
The error is
Free disk space in
Check: Free disk space in "C:\DOCUME~1\ADMINI~1.RAC\LOCALS~1\Temp" dir
Node Name Available Required Comment
RAC2 unknown 400MB (409600KB) failed
RAC1 10.79GB (11313436KB) 400MB (409600KB) passed
Result: Free disk space check failed.
System requirement failed for 'crs'.
what should i do.Thanks
Edited by: user12119634(bobs) on Dec 13, 2009 9:02 PM

Define virtual PCs. Provide full description of the technology, vendor, and version number. If this is another attempt to use unsupported technology like VMWare then you are going against what Oracle recommends and you are on your own.
Since you do not have a valid clusterware installation as indicated by the cluster verify tool this is a really good time to stop the installation and think through these two questions:
1. What is your shared storage solution? With Windows you only have a couple of possible choices none of which you have mentioned.
2. What is your cache fusion interconnect strategy?
If you don't get these right you are wasting your time.
SB ... I have no idea why you think this question is in the wrong forum. Can you explain?

Runcluvfy.sh failed on user equivalence check for user "oracle"

I have user oracle set up on both nodes -RHEL5 as:
uid=505(oracle) gid=87(oinstall) groups=87(oinstall),88(dba)
but when run the runcluvfy.sh stage -pre crsinst -n node1,node2 -verbose
get the ERROR:
user equivalence check failed for user "oracle"
user equivalence unavailable on all the nodes.
Help appreciated!

When you test this, are you using the fully qualified domain name (FQDN) like node1.domain.com or just node1? Oracle is most likely using the FQDN, so if you did "ssh node1 date" and that worked fine, I'll bet that "ssh node1.domain.com date" will result in a prompt to authorize the host first and that's probably the issue.
I also usually ssh over the private interface as well. That is, do both of these from the node where the installer runs:
ssh node2.domain.com date
ssh node2-priv.domain.com date ### assuming that node2-priv is the name of the private network interface
Let us know if that's the issue.
Dan

Runcluvfy nodecon fails eventhough ssh and scp doesnt ask for password

Hi all,
this is my first attempt to oracle RAC . I configured the ssh gen as shown in documentation without using passphrase
I tested ssh and scp and it do not ask me password .
xt33db006[oracle:grid10p]/opt/home/oracle$ ssh xt33db007 date
Thu Apr 24 12:15:13 PDT 2008
xt33db007[oracle:grid10p]/opt/home/oracle$ ssh xt33db006 date
Thu Apr 24 12:15:43 PDT 2008
but when I try to do
runcluvfy.sh comp nodecon -n xt33db006,xt33db007 -verbose
Verifying node connectivity
ERROR:
User equivalence unavailable on all the nodes.
Verification cannot proceed.
do I need to put these 2 hostname in hosts.equiv file ? not sure what I am missing here
I will appreciate any pointer .
thanks
-Prasad

xt33db006[oracle:grid10p]/opt/home/oracle$ runcluvfy.sh stage -pre crsinst -n xt33db006,xt33db007 -verbose
Performing pre-checks for cluster services setup
Checking node reachability...
Check: Node reachability from node "xt33db006"
Destination Node Reachable?
xt33db006 yes
xt33db007 yes
Result: Node reachability check passed from node "xt33db006".
Checking user equivalence...
Check: User equivalence for user "oracle"
Node Name Comment
xt33db007 failed
xt33db006 failed
Result: User equivalence check failed for user "oracle".
ERROR:
User equivalence unavailable on all the nodes.
Verification cannot proceed.
Pre-check for cluster services setup was unsuccessful on all the nodes.

Root.sh fails on second node during clusterware installation

I am setting up a test instance of OEL 5.4 using VMware.
I am running the clusterware install and it is failing only on node2. See below.
I followed note 414897.1 on metalink for raw device setup.
Any help would be greatly appreciate.
2010-09-01 11:58:21.084: [ default][1275584]a_init:7!: Backend init unsuccessful : [22]
2010-09-01 11:58:21.091: [ OCRRAW][1275584]propriogid:1: INVALID FORMAT
2010-09-01 11:58:21.091: [ OCRRAW][1275584]ibctx:1:ERROR: INVALID FORMAT
2010-09-01 11:58:21.091: [ OCRRAW][1275584]proprinit:problem reading the bootblock or superbloc 22
2010-09-01 11:58:21.097: [ OCRRAW][1275584]propriogid:1: INVALID FORMAT
2010-09-01 11:58:21.139: [ OCRRAW][1275584]propriowv: Vote information on disk 0 [u01/app/oracle/oradata/ocr] is adjusted from [0/0] to [2/2]
2010-09-01 11:58:21.191: [ OCRRAW][1275584]propriniconfig:No 92 configuration
2010-09-01 11:58:21.192: [ OCRAPI][1275584]a_init:6a: Backend init successful
2010-09-01 11:58:21.299: [ OCRCONF][1275584]Initialized DATABASE keys in OCR
2010-09-01 11:58:21.555: [ OCRCONF][1275584]Successfully set skgfr block 0
2010-09-01 11:58:21.557: [ OCRCONF][1275584]Exiting [status=success]...

Oracle 10gR2 RAC Installation in RedHat 5 Linux Using VMware.
Important points to install 10gR2 oracle RAC in linux5.
1.LINUX 5(Redhat 5) doesn't have /etc/sysconfig/rawdevices file. so we have to configure it.
2. Edit the /etc/redhat-release version to redhat-4 and and to invoke the runInstaller use the command
$runInstaller -ignoreSysPrereqs. //this will bypass the os check //
3. Next during clusterware installation at the end of root.sh in node 2 end with error message.So we have adjust the parameters in vipca and srvctl files.
4. vipca will fail to run. so we have to adjust some parameters and configure it manually.
refer the link, it will be useful to you to complete your installation.
http://oracleinstance.blogspot.com/2010/03/oracle-10g-installation-in-linux-5.html

Root.sh failed at second node OUL 6.3 Oracle GRID 11.2.0.3

Hi, im installing a two node cluster mounted on Oracle Linux 6.3 with Oracle DB 11.2.0.3, the installation went smooth up until the execution of the root.sh script on the second node.
THe script return this final lines:
CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node nodo1, number 1, and is terminating
An active cluster was found during exclusive startup, restarting to join the cluster
Start of resource "ora.crsd" failed
CRS-2800: Cannot start resource 'ora.asm' as it is already in the INTERMEDIATE state on server 'nodo2'
CRS-4000: Command Start failed, or completed with errors.
Failed to start Oracle Grid Infrastructure stack
Failed to start Cluster Ready Services at /u01/app/11.2.0/grid/crs/install/crsconfig_lib.pm line 1286.
/u01/app/11.2.0/grid/perl/bin/perl -I/u01/app/11.2.0/grid/perl/lib -I/u01/app/11.2.0/grid/crs/install /u01/app/11.2.0/grid/crs/install/rootcrs.pl execution failed
In $GRID_HOME/log/node2/alertnode.log It appears to be a Cluster Time Synchronization Service issue, (i didn't synchronyze the nodes..) however the CTSS is running in observer mode, wich i believe it shouldn't affect the installation process. After that i lost it...there's an entry CRS-5018 indicating that an unused HAIP route was removed... and then, out of the blue: CRS-5818:Aborted command 'start' for resource 'ora.asm'. Some clarification will be deeply apreciated.
Here's the complete log:
2013-04-01 13:39:35.358
[client(12163)]CRS-2101:The OLR was formatted using version 3.
2013-04-01 19:40:19.597
[ohasd(12338)]CRS-2112:The OLR service started on node nodo2.
2013-04-01 19:40:19.657
[ohasd(12338)]CRS-1301:Oracle High Availability Service started on node nodo2.
[client(12526)]CRS-10001:01-Apr-13 13:41 ACFS-9459: ADVM/ACFS is not supported on this OS version: '2.6.39-400.17.2.el6uek.i686'
[client(12528)]CRS-10001:01-Apr-13 13:41 ACFS-9201: Not Supported
[client(12603)]CRS-10001:01-Apr-13 13:41 ACFS-9459: ADVM/ACFS is not supported on this OS version: '2.6.39-400.17.2.el6uek.i686'
2013-04-01 19:41:17.509
[ohasd(12338)]CRS-2302:Cannot get GPnP profile. Error CLSGPNP_NO_DAEMON (GPNPD daemon is not running).
2013-04-01 19:41:17.618
[gpnpd(12695)]CRS-2328:GPNPD started on node nodo2.
2013-04-01 19:41:21.363
[cssd(12755)]CRS-1713:CSSD daemon is started in exclusive mode
2013-04-01 19:41:23.194
[ohasd(12338)]CRS-2767:Resource state recovery not attempted for 'ora.diskmon' as its target state is OFFLINE
2013-04-01 19:41:56.144
[cssd(12755)]CRS-1707:Lease acquisition for node nodo2 number 2 completed
2013-04-01 19:41:57.545
[cssd(12755)]CRS-1605:CSSD voting file is online: /dev/oracleasm/disks/ASM_DISK_1; details in /u01/app/11.2.0/grid/log/nodo2/cssd/ocssd.log.
[cssd(12755)]CRS-1636:The CSS daemon was started in exclusive mode but found an active CSS daemon on node nodo1 and is terminating; details at (:CSSNM00006:) in /u01/app/11.2.0/grid/log/nodo2/cssd/ocssd.log
2013-04-01 19:41:58.549
[ohasd(12338)]CRS-2765:Resource 'ora.cssdmonitor' has failed on server 'nodo2'.
2013-04-01 19:42:10.025
[gpnpd(12695)]CRS-2329:GPNPD on node nodo2 shutdown.
2013-04-01 19:42:11.407
[mdnsd(12685)]CRS-5602:mDNS service stopping by request.
2013-04-01 19:42:29.642
[gpnpd(12947)]CRS-2328:GPNPD started on node nodo2.
2013-04-01 19:42:33.241
[cssd(13012)]CRS-1713:CSSD daemon is started in clustered mode
2013-04-01 19:42:35.104
[ohasd(12338)]CRS-2767:Resource state recovery not attempted for 'ora.diskmon' as its target state is OFFLINE
2013-04-01 19:42:44.065
[cssd(13012)]CRS-1707:Lease acquisition for node nodo2 number 2 completed
2013-04-01 19:42:45.484
[cssd(13012)]CRS-1605:CSSD voting file is online: /dev/oracleasm/disks/ASM_DISK_1; details in /u01/app/11.2.0/grid/log/nodo2/cssd/ocssd.log.
2013-04-01 19:42:52.138
[cssd(13012)]CRS-1601:CSSD Reconfiguration complete. Active nodes are nodo1 nodo2 .
2013-04-01 19:42:55.081
[ctssd(13076)]CRS-2403:The Cluster Time Synchronization Service on host nodo2 is in observer mode.
2013-04-01 19:42:55.581
[ctssd(13076)]CRS-2401:The Cluster Time Synchronization Service started on host nodo2.
2013-04-01 19:42:55.581
[ctssd(13076)]CRS-2407:The new Cluster Time Synchronization Service reference node is host nodo1.
2013-04-01 19:43:08.875
[ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
2013-04-01 19:43:08.876
[ctssd(13076)]CRS-2409:The clock on host nodo2 is not synchronous with the mean cluster time. No action has been taken as the Cluster Time Synchronization Service is running in observer mode.
2013-04-01 19:43:13.565
[u01/app/11.2.0/grid/bin/orarootagent.bin(13064)]CRS-5018:(:CLSN00037:) Removed unused HAIP route: 169.254.0.0 / 255.255.0.0 / 0.0.0.0 / eth0
2013-04-01 19:53:09.800
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5818:Aborted command 'start' for resource 'ora.asm'. Details at (:CRSAGF00113:) {0:0:223} in /u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log.
2013-04-01 19:53:11.827
[ohasd(12338)]CRS-2757:Command 'Start' timed out waiting for response from the resource 'ora.asm'. Details at (:CRSPE00111:) {0:0:223} in /u01/app/11.2.0/grid/log/nodo2/ohasd/ohasd.log.
2013-04-01 19:53:12.779
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:53:13.892
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:53:43.877
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:54:13.891
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:54:43.906
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:55:13.914
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:55:43.918
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:56:13.922
[u01/app/11.2.0/grid/bin/oraagent.bin(12922)]CRS-5019:All OCR locations are on ASM disk groups [DATA], and none of these disk groups are mounted. Details are at "(:CLSN00100:)" in "/u01/app/11.2.0/grid/log/nodo2/agent/ohasd/oraagent_oracle/oraagent_oracle.log".
2013-04-01 19:56:53.209
[crsd(13741)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 20:07:01.128
[crsd(13741)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 20:07:01.278
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 20:07:08.689
[crsd(15248)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 20:13:10.138
[ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
2013-04-01 20:17:13.024
[crsd(15248)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 20:17:13.171
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 20:17:20.826
[crsd(16746)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 20:27:25.020
[crsd(16746)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 20:27:25.176
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 20:27:31.591
[crsd(18266)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 20:37:35.668
[crsd(18266)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 20:37:35.808
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 20:37:43.209
[crsd(19762)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 20:43:11.160
[ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
2013-04-01 20:47:47.487
[crsd(19762)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 20:47:47.637
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 20:47:55.086
[crsd(21242)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 20:57:59.343
[crsd(21242)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 20:57:59.492
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 20:58:06.996
[crsd(22744)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 21:08:11.046
[crsd(22744)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 21:08:11.192
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 21:08:18.726
[crsd(24260)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 21:13:12.000
[ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
2013-04-01 21:18:22.262
[crsd(24260)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 21:18:22.411
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 21:18:29.927
[crsd(25759)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 21:28:34.467
[crsd(25759)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 21:28:34.616
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 21:28:41.990
[crsd(27291)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 21:38:45.012
[crsd(27291)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 21:38:45.160
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 21:38:52.790
[crsd(28784)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 21:43:12.378
[ctssd(13076)]CRS-2412:The Cluster Time Synchronization Service detects that the local time is significantly different from the mean cluster time. Details in /u01/app/11.2.0/grid/log/nodo2/ctssd/octssd.log.
2013-04-01 21:48:56.285
[crsd(28784)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 21:48:56.435
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 21:49:04.421
[crsd(30272)]CRS-1012:The OCR service started on node nodo2.
2013-04-01 21:59:08.183
[crsd(30272)]CRS-0810:Cluster Ready Service aborted due to failure to communicate with Event Management Service with error [1]. Details at (:CRSD00120:) in /u01/app/11.2.0/grid/log/nodo2/crsd/crsd.log.
2013-04-01 21:59:08.318
[ohasd(12338)]CRS-2765:Resource 'ora.crsd' has failed on server 'nodo2'.
2013-04-01 21:59:15.860
[crsd(31772)]CRS-1012:The OCR service started on node nodo2.

Hi santysharma, thanks for the reply, i have two ethernet interfaces: eth0 (public network 192.168.1.0) and eth1 (private network 10.5.3.0), there is no device using that ip range, here's the output of route command:
(Sorry for the alignment, i tried to tab it but the editor trims it again)
Kernel IP routing table
Destination Gateway Genmask Flags Metric Ref Use Iface
default 192.168.1.1 0.0.0.0 UG 0 0 0 eth0
private * 255.255.255.0 U 0 0 0 eth1
link-local * 255.255.0.0 U 1002 0 0 eth0
link-local * 255.255.0.0 U 1003 0 0 eth1
public * 255.255.255.0 U 0 0 0 eth0
And the /etc/hosts file
127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
10.5.3.1 nodo1.cluster nodo1
10.5.3.2 nodo2.cluster nodo2
192.168.1.13 cluster-scan
192.168.1.14 nodo1-vip
192.168.1.15 nodo2-vip
And the ifconfig -a
eth0 Link encap:Ethernet HWaddr C8:3A:35:D9:C6:2B
inet addr:192.168.1.12 Bcast:192.168.1.255 Mask:255.255.255.0
inet6 addr: fe80::ca3a:35ff:fed9:c62b/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:34708 errors:0 dropped:18 overruns:0 frame:0
TX packets:24693 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:48545969 (46.2 MiB) TX bytes:1994381 (1.9 MiB)
eth1 Link encap:Ethernet HWaddr 00:0D:87:D0:A3:8E
inet addr:10.5.3.2 Bcast:10.5.3.255 Mask:255.255.255.0
inet6 addr: fe80::20d:87ff:fed0:a38e/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:44 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:0 (0.0 b) TX bytes:5344 (5.2 KiB)
Interrupt:23 Base address:0x6000
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:16436 Metric:1
RX packets:20 errors:0 dropped:0 overruns:0 frame:0
TX packets:20 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:1320 (1.2 KiB) TX bytes:1320 (1.2 KiB)
Now that i'm thinking i've read somewhere that ipv6 was no supported...yet there's no relation with the 169.254.x.x ip range.

Root.sh fails on second node

I already posted this issue on database installation forum, and was suggested to post it on this forum.
Here are the details.
I am running Linux 64bit on ESx clients. Installing Oracle 11gR2.
It passed all the per-requisite. Run root.sh on first node. It finished with no errorrs.
On second node I got the following:
Running Oracle 11g root.sh script...
The following environment variables are set as:
ORACLE_OWNER= oracle
ORACLE_HOME= /u01/app/11.2.0/grid
Enter the full pathname of the local bin directory: [usr/local/bin]:
The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root.sh script.
Now product-specific root actions will be performed.
2010-07-13 12:51:28: Parsing the host name
2010-07-13 12:51:28: Checking for super user privileges
2010-07-13 12:51:28: User has super user privileges
Using configuration parameter file: /u01/app/11.2.0/grid/crs/install/crsconfig_params
Creating trace directory
LOCAL ADD MODE
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Adding daemon to inittab
CRS-4123: Oracle High Availability Services has been started.
ohasd is starting
CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node fred0224, number 1, and is terminating
An active cluster was found during exclusive startup, restarting to join the cluster
CRS-2672: Attempting to start 'ora.mdnsd' on 'fred0225'
CRS-2676: Start of 'ora.mdnsd' on 'fred0225' succeeded
CRS-2672: Attempting to start 'ora.gipcd' on 'fred0225'
CRS-2676: Start of 'ora.gipcd' on 'fred0225' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'fred0225'
CRS-2676: Start of 'ora.gpnpd' on 'fred0225' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'fred0225'
CRS-2676: Start of 'ora.cssdmonitor' on 'fred0225' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'fred0225'
CRS-2672: Attempting to start 'ora.diskmon' on 'fred0225'
CRS-2676: Start of 'ora.diskmon' on 'fred0225' succeeded
CRS-2676: Start of 'ora.cssd' on 'fred0225' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'fred0225'
Start action for octssd aborted
CRS-2676: Start of 'ora.ctssd' on 'fred0225' succeeded
CRS-2672: Attempting to start 'ora.drivers.acfs' on 'fred0225'
CRS-2672: Attempting to start 'ora.asm' on 'fred0225'
CRS-2676: Start of 'ora.drivers.acfs' on 'fred0225' succeeded
CRS-2676: Start of 'ora.asm' on 'fred0225' succeeded
CRS-2664: Resource 'ora.ctssd' is already running on 'fred0225'
CRS-4000: Command Start failed, or completed with errors.
Command return code of 1 (256) from command: /u01/app/11.2.0/grid/bin/crsctl start resource ora.asm -init
Start of resource "ora.asm -init" failed
Failed to start ASM
Failed to start Oracle Clusterware stack
In the ocssd.log I found
[ CSSD][3559689984]clssnmvDHBValidateNCopy: node 1, fred0224, has a disk HB, but no network HB, DHB has rcfg 174483948, wrtcnt, 232, LATS 521702664, lastSeqNo 232, uniqueness 1279039649, timestamp 1279039959/521874274
In oraagent_oracle.log I found
[ clsdmc][1212365120]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GPNPD)) with status 9
2010-07-13 12:54:07.234: [ora.gpnpd][1212365120] [check] Error = error 9 encountered when connecting to GPNPD
2010-07-13 12:54:07.238: [ora.gpnpd][1212365120] [check] Calling PID check for daemon
2010-07-13 12:54:07.238: [ora.gpnpd][1212365120] [check] Trying to check PID = 20584
2010-07-13 12:54:07.432: [ COMMCRS][1285794112]clsc_connect: (0x1304d850) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GPNPD))
[ clsdmc][1222854976]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_MDNSD)) with status 9
2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Error = error 9 encountered when connecting to MDNSD
2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Calling PID check for daemon
2010-07-13 12:54:08.649: [ora.mdnsd][1222854976] [check] Trying to check PID = 20571
2010-07-13 12:54:08.841: [ COMMCRS][1201875264]clsc_connect: (0x12f3b1d0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_MDNSD))
[ clsdmc][1159915840]Fail to connect (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GIPCD)) with status 9
2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Error = error 9 encountered when connecting to GIPCD
2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Calling PID check for daemon
2010-07-13 12:54:10.051: [ora.gipcd][1159915840] [check] Trying to check PID = 20566
2010-07-13 12:54:10.242: [ COMMCRS][1254324544]clsc_connect: (0x12f35630) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=fred0225DBG_GIPCD))
In oracssdagent_root.log I found
2010-07-13 12:52:28.698: [ CSSCLNT][1102481728]clssscConnect: gipc request failed with 29 (0x16)
2010-07-13 12:52:28.698: [ CSSCLNT][1102481728]clsssInitNative: connect failed, rc 29
2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clssnsqlnum: RPC failed rc 3
2010-07-13 12:53:55.222: [ USRTHRD][1102481728] clsnomon_cssini: failed 3 to fetch node number
2010-07-13 12:53:55.222: [ USRTHRD][1102481728] clsnomon_init: css init done, nodenum -1.
2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clsssRecvMsg: got a disconnect from the server while waiting for message type 43
2010-07-13 12:53:55.222: [ CSSCLNT][1102481728]clsssGetNLSData: Failure receiving a msg, rc 3
If you need more info, let me know.

Well, the error clearly indicates that a communication problem exists on the private interconnect.
Could this be a setting in ESX, which prevents some communication between the clients on the second network card? Any routing table in ESX not configured correctly?
Sebastian

Root.ah fails on 2nd node(rac2) with [ ORA-15018,ORA-15017,ORA-15003 ]

Hi All,
I m trying to setup 11gR2 Grid installation on two-node Rac . When it comes to running root.sh on second node (i.e. rac2) it fails with below error. Could please anyone help me out. This is my 3rd attempt and all fails with below errors on node 2.
rac2:
[root@rac2 grid_home]# ./root.sh
Running Oracle 11g root.sh script...
The following environment variables are set as:
    ORACLE_OWNER= grid
    ORACLE_HOME= /u01/grid_home
Enter the full pathname of the local bin directory: [/usr/local/bin]:
   Copying dbhome to /usr/local/bin ...
   Copying oraenv to /usr/local/bin ...
   Copying coraenv to /usr/local/bin ...
Creating /etc/oratab file...
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root.sh script.
Now product-specific root actions will be performed.
2013-07-10 18:53:15: Parsing the host name
2013-07-10 18:53:15: Checking for super user privileges
2013-07-10 18:53:15: User has super user privileges
Using configuration parameter file: /u01/grid_home/crs/install/crsconfig_params
Creating trace directory
LOCAL ADD MODE
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Adding daemon to inittab
CRS-4123: Oracle High Availability Services has been started.
ohasd is starting
CRS-2672: Attempting to start 'ora.gipcd' on 'rac2'
CRS-2672: Attempting to start 'ora.mdnsd' on 'rac2'
CRS-2676: Start of 'ora.gipcd' on 'rac2' succeeded
CRS-2676: Start of 'ora.mdnsd' on 'rac2' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'rac2'
CRS-2676: Start of 'ora.gpnpd' on 'rac2' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'rac2'
CRS-2676: Start of 'ora.cssdmonitor' on 'rac2' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'rac2'
CRS-2672: Attempting to start 'ora.diskmon' on 'rac2'
CRS-2676: Start of 'ora.diskmon' on 'rac2' succeeded
CRS-2676: Start of 'ora.cssd' on 'rac2' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'rac2'
CRS-2676: Start of 'ora.ctssd' on 'rac2' succeeded
DiskGroup CRS creation failed with the following message:
ORA-15018: diskgroup cannot be created
ORA-15017: diskgroup "CRS" cannot be mounted
ORA-15003: diskgroup "CRS" already mounted in another lock name space
Configuration of ASM failed, see logs for details
Did not succssfully configure and start ASM
CRS-2500: Cannot stop resource 'ora.crsd' as it is not running
CRS-4000: Command Stop failed, or completed with errors.
Command return code of 1 (256) from command: /u01/grid_home/bin/crsctl stop resource ora.crsd -init
Stop of resource "ora.crsd -init" failed
Failed to stop CRSD
CRS-2673: Attempting to stop 'ora.asm' on 'rac2'
CRS-2677: Stop of 'ora.asm' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.ctssd' on 'rac2'
CRS-2677: Stop of 'ora.ctssd' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'rac2'
CRS-2677: Stop of 'ora.cssdmonitor' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.cssd' on 'rac2'
CRS-2677: Stop of 'ora.cssd' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.gpnpd' on 'rac2'
CRS-2677: Stop of 'ora.gpnpd' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'rac2'
CRS-2677: Stop of 'ora.gipcd' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.mdnsd' on 'rac2'
CRS-2677: Stop of 'ora.mdnsd' on 'rac2' succeeded
Initial cluster configuration failed. See /u01/grid_home/cfgtoollogs/crsconfig/rootcrs_rac2.log for details
[root@rac2 grid_home]#
rac2 alertrac2.log
[root@rac2 rac2]# cat -n alertrac2.log
     1 Oracle Database 11g Clusterware Release 11.2.0.1.0 - Production Copyright 1996, 2009 Oracle. All rights reserved.
     2 2013-07-10 18:53:16.145
     3 [client(13088)]CRS-2106:The OLR location /u01/grid_home/cdata/rac2.olr is inaccessible. Details in /u01/grid_home/log/rac2/client/ocrconfig_13088.log.
     4 2013-07-10 18:53:16.228
     5 [client(13088)]CRS-2101:The OLR was formatted using version 3.
     6 2013-07-10 18:53:31.734
     7 [ohasd(13132)]CRS-2112:The OLR service started on node rac2.
     8 2013-07-10 18:53:31.893
     9 [ohasd(13132)]CRS-2772:Server 'rac2' has been assigned to pool 'Free'.
    10 2013-07-10 18:53:53.762
    11 [ohasd(13132)]CRS-2302:Cannot get GPnP profile. Error CLSGPNP_NO_DAEMON (GPNPD daemon is not running).
    12 2013-07-10 18:53:55.381
    13 [cssd(14409)]CRS-1713:CSSD daemon is started in exclusive mode
    14 2013-07-10 18:54:01.530
    15 [cssd(14409)]CRS-1709:Lease acquisition failed for node rac2 because no voting file has been configured; Details at (:CSSNM00031:) in /u01/grid_home/log/rac2/cssd/ocssd.log
    16 2013-07-10 18:54:19.113
    17 [cssd(14409)]CRS-1601:CSSD Reconfiguration complete. Active nodes are rac2 .
    18 2013-07-10 18:54:19.910
    19 [ctssd(14465)]CRS-2403:The Cluster Time Synchronization Service on host rac2 is in observer mode.
    20 2013-07-10 18:54:19.920
    21 [ctssd(14465)]CRS-2407:The new Cluster Time Synchronization Service reference node is host rac2.
    22 2013-07-10 18:54:20.903
    23 [ctssd(14465)]CRS-2401:The Cluster Time Synchronization Service started on host rac2.
    24 [client(14715)]CRS-10001:ACFS-9327: Verifying ADVM/ACFS devices.
    25 [client(14719)]CRS-10001:ACFS-9322: done.
    26 2013-07-10 18:54:47.104
    27 [ctssd(14465)]CRS-2405:The Cluster Time Synchronization Service on host rac2 is shutdown by user
    28 2013-07-10 18:54:55.837
    29 [cssd(14409)]CRS-1603:CSSD on node rac2 shutdown by user.
rac2 rootcrs logfile
[root@rac2 rac2]# cat /u01/grid_home/cfgtoollogs/crsconfig/rootcrs_rac2.log
2013-07-10 18:53:15: The configuration parameter file /u01/grid_home/crs/install/crsconfig_params is valid
2013-07-10 18:53:15: Checking for super user privileges
2013-07-10 18:53:15: User has super user privileges
2013-07-10 18:53:15: ### Printing the configuration values from files:
2013-07-10 18:53:15:    /u01/grid_home/crs/install/crsconfig_params
2013-07-10 18:53:15:    /u01/grid_home/crs/install/s_crsconfig_defs
2013-07-10 18:53:15: ASM_DISCOVERY_STRING=
2013-07-10 18:53:15: ASM_DISKS=ORCL:CRS1
2013-07-10 18:53:15: ASM_DISK_GROUP=CRS
2013-07-10 18:53:15: ASM_REDUNDANCY=EXTERNAL
2013-07-10 18:53:15: ASM_SPFILE=
2013-07-10 18:53:15: ASM_UPGRADE=false
2013-07-10 18:53:15: CLSCFG_MISSCOUNT=
2013-07-10 18:53:15: CLUSTER_GUID=
2013-07-10 18:53:15: CLUSTER_NAME=rac-scan
2013-07-10 18:53:15: CRS_NODEVIPS='rac1-vip/255.255.255.0/eth0,rac2-vip/255.255.255.0/eth0'
2013-07-10 18:53:15: CRS_STORAGE_OPTION=1
2013-07-10 18:53:15: CSS_LEASEDURATION=400
2013-07-10 18:53:15: DIRPREFIX=
2013-07-10 18:53:15: DISABLE_OPROCD=0
2013-07-10 18:53:15: EMBASEJAR_NAME=oemlt.jar
2013-07-10 18:53:15: EWTJAR_NAME=ewt3.jar
2013-07-10 18:53:15: EXTERNAL_ORACLE_BIN=/opt/oracle/bin
2013-07-10 18:53:15: GNS_ADDR_LIST=
2013-07-10 18:53:15: GNS_ALLOW_NET_LIST=
2013-07-10 18:53:15: GNS_CONF=false
2013-07-10 18:53:15: GNS_DENY_ITF_LIST=
2013-07-10 18:53:15: GNS_DENY_NET_LIST=
2013-07-10 18:53:15: GNS_DOMAIN_LIST=
2013-07-10 18:53:15: GPNPCONFIGDIR=/u01/grid_home
2013-07-10 18:53:15: GPNPGCONFIGDIR=/u01/grid_home
2013-07-10 18:53:15: GPNP_PA=
2013-07-10 18:53:15: HELPJAR_NAME=help4.jar
2013-07-10 18:53:15: HOST_NAME_LIST=rac1,rac2
2013-07-10 18:53:15: ID=/etc/init.d
2013-07-10 18:53:15: INIT=/sbin/init
2013-07-10 18:53:15: IT=/etc/inittab
2013-07-10 18:53:15: JEWTJAR_NAME=jewt4.jar
2013-07-10 18:53:15: JLIBDIR=/u01/grid_home/jlib
2013-07-10 18:53:15: JREDIR=/u01/grid_home/jdk/jre/
2013-07-10 18:53:15: LANGUAGE_ID=AMERICAN_AMERICA.AL32UTF8
2013-07-10 18:53:15: MSGFILE=/var/adm/messages
2013-07-10 18:53:15: NETCFGJAR_NAME=netcfg.jar
2013-07-10 18:53:15: NETWORKS="eth0"/192.168.0.0:public,"eth1"/192.168.1.0:cluster_interconnect
2013-07-10 18:53:15: NEW_HOST_NAME_LIST=
2013-07-10 18:53:15: NEW_NODEVIPS='rac1-vip/255.255.255.0/eth0,rac2-vip/255.255.255.0/eth0'
2013-07-10 18:53:15: NEW_NODE_NAME_LIST=
2013-07-10 18:53:15: NEW_PRIVATE_NAME_LIST=
2013-07-10 18:53:15: NODELIST=rac1,rac2
2013-07-10 18:53:15: NODE_NAME_LIST=rac1,rac2
2013-07-10 18:53:15: OCFS_CONFIG=
2013-07-10 18:53:15: OCRCONFIG=/etc/oracle/ocr.loc
2013-07-10 18:53:15: OCRCONFIGDIR=/etc/oracle
2013-07-10 18:53:15: OCRID=
2013-07-10 18:53:15: OCRLOC=ocr.loc
2013-07-10 18:53:15: OCR_LOCATIONS=NO_VAL
2013-07-10 18:53:15: OLASTGASPDIR=/etc/oracle/lastgasp
2013-07-10 18:53:15: OLRCONFIG=/etc/oracle/olr.loc
2013-07-10 18:53:15: OLRCONFIGDIR=/etc/oracle
2013-07-10 18:53:15: OLRLOC=olr.loc
2013-07-10 18:53:15: OPROCDCHECKDIR=/etc/oracle/oprocd/check
2013-07-10 18:53:15: OPROCDDIR=/etc/oracle/oprocd
2013-07-10 18:53:15: OPROCDFATALDIR=/etc/oracle/oprocd/fatal
2013-07-10 18:53:15: OPROCDSTOPDIR=/etc/oracle/oprocd/stop
2013-07-10 18:53:15: ORACLE_BASE=/u01/11.2.0
2013-07-10 18:53:15: ORACLE_HOME=/u01/grid_home
2013-07-10 18:53:15: ORACLE_OWNER=grid
2013-07-10 18:53:15: ORA_ASM_GROUP=asmadmin
2013-07-10 18:53:15: ORA_DBA_GROUP=oinstall
2013-07-10 18:53:15: PRIVATE_NAME_LIST=
2013-07-10 18:53:15: RCALLDIR=/etc/rc.d/rc0.d /etc/rc.d/rc1.d /etc/rc.d/rc2.d /etc/rc.d/rc3.d /etc/rc.d/rc4.d /etc/rc.d/rc5.d /etc/rc.d/rc6.d
2013-07-10 18:53:15: RCKDIR=/etc/rc.d/rc0.d /etc/rc.d/rc1.d /etc/rc.d/rc2.d /etc/rc.d/rc4.d /etc/rc.d/rc6.d
2013-07-10 18:53:15: RCSDIR=/etc/rc.d/rc3.d /etc/rc.d/rc5.d
2013-07-10 18:53:15: RC_KILL=K19
2013-07-10 18:53:15: RC_KILL_OLD=K96
2013-07-10 18:53:15: RC_START=S96
2013-07-10 18:53:15: SCAN_NAME=rac-scan.naveed.com
2013-07-10 18:53:15: SCAN_PORT=1521
2013-07-10 18:53:15: SCRBASE=/etc/oracle/scls_scr
2013-07-10 18:53:15: SHAREJAR_NAME=share.jar
2013-07-10 18:53:15: SILENT=false
2013-07-10 18:53:15: SO_EXT=so
2013-07-10 18:53:15: SRVCFGLOC=srvConfig.loc
2013-07-10 18:53:15: SRVCONFIG=/var/opt/oracle/srvConfig.loc
2013-07-10 18:53:15: SRVCONFIGDIR=/var/opt/oracle
2013-07-10 18:53:15: VNDR_CLUSTER=false
2013-07-10 18:53:15: VOTING_DISKS=NO_VAL
2013-07-10 18:53:15: ### Printing other configuration values ###
2013-07-10 18:53:15: CLSCFG_EXTRA_PARMS=
2013-07-10 18:53:15: CRSDelete=0
2013-07-10 18:53:15: CRSPatch=0
2013-07-10 18:53:15: DEBUG=
2013-07-10 18:53:15: DOWNGRADE=
2013-07-10 18:53:15: HAS_GROUP=oinstall
2013-07-10 18:53:15: HAS_USER=root
2013-07-10 18:53:15: HOST=rac2
2013-07-10 18:53:15: IS_SIHA=0
2013-07-10 18:53:15: OLR_DIRECTORY=/u01/grid_home/cdata
2013-07-10 18:53:15: OLR_LOCATION=/u01/grid_home/cdata/rac2.olr
2013-07-10 18:53:15: ORA_CRS_HOME=/u01/grid_home
2013-07-10 18:53:15: SUPERUSER=root
2013-07-10 18:53:15: UPGRADE=
2013-07-10 18:53:15: VF_DISCOVERY_STRING=
2013-07-10 18:53:15: addfile=/u01/grid_home/crs/install/crsconfig_addparams
2013-07-10 18:53:15: crscfg_trace=1
2013-07-10 18:53:15: crscfg_trace_file=/u01/grid_home/cfgtoollogs/crsconfig/rootcrs_rac2.log
2013-07-10 18:53:15: hosts=
2013-07-10 18:53:15: oldcrshome=
2013-07-10 18:53:15: oldcrsver=
2013-07-10 18:53:15: osdfile=/u01/grid_home/crs/install/s_crsconfig_defs
2013-07-10 18:53:15: parameters_valid=1
2013-07-10 18:53:15: paramfile=/u01/grid_home/crs/install/crsconfig_params
2013-07-10 18:53:15: platform_family=unix
2013-07-10 18:53:15: srvctl_trc_suff=0
2013-07-10 18:53:15: unlock_crshome=
2013-07-10 18:53:15: user_is_superuser=1
2013-07-10 18:53:15: ### Printing of configuration values complete ###
2013-07-10 18:53:15: Oracle CRS stack is not configured yet
2013-07-10 18:53:15: CRS is not yet configured. Hence, will proceed to configure CRS
2013-07-10 18:53:15: Cluster-wide one-time actions... Done!
2013-07-10 18:53:15: Oracle CRS home = /u01/grid_home
2013-07-10 18:53:15: Host name = rac2
2013-07-10 18:53:15: CRS user = grid
2013-07-10 18:53:15: Oracle CRS home = /u01/grid_home
2013-07-10 18:53:15: GPnP host = rac2
2013-07-10 18:53:15: Oracle GPnP home = /u01/grid_home/gpnp
2013-07-10 18:53:15: Oracle GPnP local home = /u01/grid_home/gpnp/rac2
2013-07-10 18:53:15: GPnP directories verified.
2013-07-10 18:53:15: Checking to see if Oracle CRS stack is already configured
2013-07-10 18:53:15: Oracle CRS stack is not configured yet
2013-07-10 18:53:15: ---Checking local gpnp setup...
2013-07-10 18:53:15: The setup file "/u01/grid_home/gpnp/rac2/profiles/peer/profile.xml" does not exist
2013-07-10 18:53:15: The setup file "/u01/grid_home/gpnp/rac2/wallets/peer/cwallet.sso" does not exist
2013-07-10 18:53:15: The setup file "/u01/grid_home/gpnp/rac2/wallets/prdr/cwallet.sso" does not exist
2013-07-10 18:53:15: chk gpnphome /u01/grid_home/gpnp/rac2: profile_ok 0 wallet_ok 0 r/o_wallet_ok 0
2013-07-10 18:53:15: chk gpnphome /u01/grid_home/gpnp/rac2: INVALID (bad profile/wallet)
2013-07-10 18:53:15: ---Checking cluster-wide gpnp setup...
2013-07-10 18:53:15: chk gpnphome /u01/grid_home/gpnp: profile_ok 1 wallet_ok 1 r/o_wallet_ok 1
2013-07-10 18:53:15: gpnptool: run /u01/grid_home/bin/gpnptool verify -p="/u01/grid_home/gpnp/profiles/peer/profile.xml" -w="file:/u01/grid_home/gpnp/wallets/peer" -wu=peer
2013-07-10 18:53:15: Running as user grid: /u01/grid_home/bin/gpnptool verify -p="/u01/grid_home/gpnp/profiles/peer/profile.xml" -w="file:/u01/grid_home/gpnp/wallets/peer" -wu=peer
2013-07-10 18:53:15: s_run_as_user2: Running /bin/su grid -c ' /u01/grid_home/bin/gpnptool verify -p="/u01/grid_home/gpnp/profiles/peer/profile.xml" -w="file:/u01/grid_home/gpnp/wallets/peer" -wu=peer '
2013-07-10 18:53:15: Removing file /tmp/file0qKE0c
2013-07-10 18:53:15: Successfully removed file: /tmp/file0qKE0c
2013-07-10 18:53:15: /bin/su successfully executed
2013-07-10 18:53:15: gpnptool: rc=0
2013-07-10 18:53:15: gpnptool output:
Profile signature is valid.
2013-07-10 18:53:15: Profile "/u01/grid_home/gpnp/profiles/peer/profile.xml" signature is VALID for wallet "file:/u01/grid_home/gpnp/wallets/peer"
2013-07-10 18:53:15: gpnptool: run /u01/grid_home/bin/gpnptool verify -p="/u01/grid_home/gpnp/profiles/peer/profile.xml" -w="file:/u01/grid_home/gpnp/wallets/prdr" -wu=peer
2013-07-10 18:53:15: Running as user grid: /u01/grid_home/bin/gpnptool verify -p="/u01/grid_home/gpnp/profiles/peer/profile.xml" -w="file:/u01/grid_home/gpnp/wallets/prdr" -wu=peer
2013-07-10 18:53:15: s_run_as_user2: Running /bin/su grid -c ' /u01/grid_home/bin/gpnptool verify -p="/u01/grid_home/gpnp/profiles/peer/profile.xml" -w="file:/u01/grid_home/gpnp/wallets/prdr" -wu=peer '
2013-07-10 18:53:16: Removing file /tmp/filebkOtBv
2013-07-10 18:53:16: Successfully removed file: /tmp/filebkOtBv
2013-07-10 18:53:16: /bin/su successfully executed
2013-07-10 18:53:16: gpnptool: rc=0
2013-07-10 18:53:16: gpnptool output:
Profile signature is valid.
2013-07-10 18:53:16: Profile "/u01/grid_home/gpnp/profiles/peer/profile.xml" signature is VALID for wallet "file:/u01/grid_home/gpnp/wallets/prdr"
2013-07-10 18:53:16: chk gpnphome /u01/grid_home/gpnp: OK
2013-07-10 18:53:16: GPnP Wallets ownership/permissions successfully set.
2013-07-10 18:53:16: gpnp setup checked: local valid? 0 cluster-wide valid? 1
2013-07-10 18:53:16: Taking cluster-wide setup as local
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/profiles/peer/profile.xml" => "/u01/grid_home/gpnp/rac2/profiles/peer/profile.xml"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/profiles/peer/profile.xml" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/wallets/peer/cwallet.sso" => "/u01/grid_home/gpnp/rac2/wallets/peer/cwallet.sso"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/wallets/peer/cwallet.sso" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/wallets/prdr/cwallet.sso" => "/u01/grid_home/gpnp/rac2/wallets/prdr/cwallet.sso"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/wallets/prdr/cwallet.sso" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/profiles/peer/profile_orig.xml" => "/u01/grid_home/gpnp/rac2/profiles/peer/profile_orig.xml"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/profiles/peer/profile_orig.xml" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/wallets/root/ewallet.p12" => "/u01/grid_home/gpnp/rac2/wallets/root/ewallet.p12"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/wallets/root/ewallet.p12" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/wallets/pa/cwallet.sso" => "/u01/grid_home/gpnp/rac2/wallets/pa/cwallet.sso"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/wallets/pa/cwallet.sso" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/wallets/root/b64certificate.txt" => "/u01/grid_home/gpnp/rac2/wallets/root/b64certificate.txt"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/wallets/root/b64certificate.txt" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/wallets/peer/cert.txt" => "/u01/grid_home/gpnp/rac2/wallets/peer/cert.txt"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/wallets/peer/cert.txt" => (grid,oinstall)
2013-07-10 18:53:16:   copy "/u01/grid_home/gpnp/wallets/pa/cert.txt" => "/u01/grid_home/gpnp/rac2/wallets/pa/cert.txt"
2013-07-10 18:53:16:   set ownership on "/u01/grid_home/gpnp/rac2/wallets/pa/cert.txt" => (grid,oinstall)
2013-07-10 18:53:16: GPnP Wallets ownership/permissions successfully set.
2013-07-10 18:53:16: gpnp setup: GOTCLUSTERWIDE
2013-07-10 18:53:16: Validating for SI-CSS configuration
2013-07-10 18:53:16: Retrieving OCR main disk location
2013-07-10 18:53:16: Opening file OCRCONFIG
2013-07-10 18:53:16: Value () is set for key=ocrconfig_loc
2013-07-10 18:53:16: Unable to retrieve ocr disk info
2013-07-10 18:53:16: Checking to see if any 9i GSD is up
2013-07-10 18:53:16: libskgxnBase_lib = /etc/ORCLcluster/oracm/lib/libskgxn2.so
2013-07-10 18:53:16: libskgxn_lib = /opt/ORCLcluster/lib/libskgxn2.so
2013-07-10 18:53:16: SKGXN library file does not exists
2013-07-10 18:53:16: OLR location = /u01/grid_home/cdata/rac2.olr
2013-07-10 18:53:16: Oracle CRS Home = /u01/grid_home
2013-07-10 18:53:16: Validating /etc/oracle/olr.loc file for OLR location /u01/grid_home/cdata/rac2.olr
2013-07-10 18:53:16: /etc/oracle/olr.loc already exists. Backing up /etc/oracle/olr.loc to /etc/oracle/olr.loc.orig
2013-07-10 18:53:16: Oracle CRS home = /u01/grid_home
2013-07-10 18:53:16: Oracle cluster name = rac-scan
2013-07-10 18:53:16: OCR locations = +CRS
2013-07-10 18:53:16: Validating OCR
2013-07-10 18:53:16: Retrieving OCR location used by previous installations
2013-07-10 18:53:16: Opening file OCRCONFIG
2013-07-10 18:53:16: Value () is set for key=ocrconfig_loc
2013-07-10 18:53:16: Opening file OCRCONFIG
2013-07-10 18:53:16: Value () is set for key=ocrmirrorconfig_loc
2013-07-10 18:53:16: Opening file OCRCONFIG
2013-07-10 18:53:16: Value () is set for key=ocrconfig_loc3
2013-07-10 18:53:16: Opening file OCRCONFIG
2013-07-10 18:53:16: Value () is set for key=ocrconfig_loc4
2013-07-10 18:53:16: Opening file OCRCONFIG
2013-07-10 18:53:16: Value () is set for key=ocrconfig_loc5
2013-07-10 18:53:16: Checking if OCR sync file exists
2013-07-10 18:53:16: No need to sync OCR file
2013-07-10 18:53:16: OCR_LOCATION=+CRS
2013-07-10 18:53:16: OCR_MIRROR_LOCATION=
2013-07-10 18:53:16: OCR_MIRROR_LOC3=
2013-07-10 18:53:16: OCR_MIRROR_LOC4=
2013-07-10 18:53:16: OCR_MIRROR_LOC5=
2013-07-10 18:53:16: Current OCR location=
2013-07-10 18:53:16: Current OCR mirror location=
2013-07-10 18:53:16: Current OCR mirror loc3=
2013-07-10 18:53:16: Current OCR mirror loc4=
2013-07-10 18:53:16: Current OCR mirror loc5=
2013-07-10 18:53:16: Verifying current OCR settings with user entered values
2013-07-10 18:53:16: Setting OCR locations in /etc/oracle/ocr.loc
2013-07-10 18:53:16: Validating OCR locations in /etc/oracle/ocr.loc
2013-07-10 18:53:16: Checking for existence of /etc/oracle/ocr.loc
2013-07-10 18:53:16: Backing up /etc/oracle/ocr.loc to /etc/oracle/ocr.loc.orig
2013-07-10 18:53:16: Setting ocr location +CRS
2013-07-10 18:53:16: Creating or upgrading Oracle Local Registry (OLR)
2013-07-10 18:53:16: OLR successfully created or upgraded
2013-07-10 18:53:16: /u01/grid_home/bin/clscfg -localadd
2013-07-10 18:53:16: Keys created in the OLR successfully
2013-07-10 18:53:16: GPnP setup state: new-cluster-wide
2013-07-10 18:53:16: GPnP cluster configuration already performed
2013-07-10 18:53:16: Registering ohasd
2013-07-10 18:53:16: init file = /u01/grid_home/crs/init/init.ohasd
2013-07-10 18:53:16: Copying file /u01/grid_home/crs/init/init.ohasd to /etc/init.d directory
2013-07-10 18:53:16: Setting init.ohasd permission in /etc/init.d directory
2013-07-10 18:53:16: init file = /u01/grid_home/crs/init/ohasd
2013-07-10 18:53:16: Copying file /u01/grid_home/crs/init/ohasd to /etc/init.d directory
2013-07-10 18:53:16: Setting ohasd permission in /etc/init.d directory
2013-07-10 18:53:16: Removing "/etc/rc.d/rc3.d/S96ohasd"
2013-07-10 18:53:16: Removing file /etc/rc.d/rc3.d/S96ohasd
2013-07-10 18:53:16: Failure with return code 1 from command rm /etc/rc.d/rc3.d/S96ohasd
2013-07-10 18:53:16: Failed to remove file:
2013-07-10 18:53:16: Creating a link "/etc/rc.d/rc3.d/S96ohasd" pointing to /etc/init.d/ohasd
2013-07-10 18:53:16: Removing "/etc/rc.d/rc5.d/S96ohasd"
2013-07-10 18:53:16: Removing file /etc/rc.d/rc5.d/S96ohasd
2013-07-10 18:53:16: Failure with return code 1 from command rm /etc/rc.d/rc5.d/S96ohasd
2013-07-10 18:53:16: Failed to remove file:
2013-07-10 18:53:16: Creating a link "/etc/rc.d/rc5.d/S96ohasd" pointing to /etc/init.d/ohasd
2013-07-10 18:53:16: Removing "/etc/rc.d/rc0.d/K19ohasd"
2013-07-10 18:53:16: Removing file /etc/rc.d/rc0.d/K19ohasd
2013-07-10 18:53:16: Failure with return code 1 from command rm /etc/rc.d/rc0.d/K19ohasd
2013-07-10 18:53:16: Failed to remove file:
2013-07-10 18:53:16: Creating a link "/etc/rc.d/rc0.d/K19ohasd" pointing to /etc/init.d/ohasd
2013-07-10 18:53:16: Removing "/etc/rc.d/rc1.d/K19ohasd"
2013-07-10 18:53:16: Removing file /etc/rc.d/rc1.d/K19ohasd
2013-07-10 18:53:16: Failure with return code 1 from command rm /etc/rc.d/rc1.d/K19ohasd
2013-07-10 18:53:16: Failed to remove file:
2013-07-10 18:53:16: Creating a link "/etc/rc.d/rc1.d/K19ohasd" pointing to /etc/init.d/ohasd
2013-07-10 18:53:16: Removing "/etc/rc.d/rc2.d/K19ohasd"
2013-07-10 18:53:16: Removing file /etc/rc.d/rc2.d/K19ohasd
2013-07-10 18:53:16: Failure with return code 1 from command rm /etc/rc.d/rc2.d/K19ohasd
2013-07-10 18:53:16: Failed to remove file:
2013-07-10 18:53:16: Creating a link "/etc/rc.d/rc2.d/K19ohasd" pointing to /etc/init.d/ohasd
2013-07-10 18:53:16: Removing "/etc/rc.d/rc4.d/K19ohasd"
2013-07-10 18:53:16: Removing file /etc/rc.d/rc4.d/K19ohasd
2013-07-10 18:53:16: Failure with return code 1 from command rm /etc/rc.d/rc4.d/K19ohasd
2013-07-10 18:53:16: Failed to remove file:
2013-07-10 18:53:16: Creating a link "/etc/rc.d/rc4.d/K19ohasd" pointing to /etc/init.d/ohasd
2013-07-10 18:53:16: Removing "/etc/rc.d/rc6.d/K19ohasd"
2013-07-10 18:53:16: Removing file /etc/rc.d/rc6.d/K19ohasd
2013-07-10 18:53:16: Failure with return code 1 from command rm /etc/rc.d/rc6.d/K19ohasd
2013-07-10 18:53:16: Failed to remove file:
2013-07-10 18:53:16: Creating a link "/etc/rc.d/rc6.d/K19ohasd" pointing to /etc/init.d/ohasd
2013-07-10 18:53:16: The file ohasd has been successfully linked to the RC directories
2013-07-10 18:53:16: Starting ohasd
2013-07-10 18:53:16: itab entries=
2013-07-10 18:53:21: Created backup /etc/inittab.no_crs
2013-07-10 18:53:21: Appending to /etc/inittab.tmp:
2013-07-10 18:53:21: h1:35:respawn:/etc/init.d/init.ohasd run >/dev/null 2>&1 </dev/null
2013-07-10 18:53:21: Done updating /etc/inittab.tmp
2013-07-10 18:53:21: Saved /etc/inittab.crs
2013-07-10 18:53:21: Installed new /etc/inittab
2013-07-10 18:53:36: ohasd is starting
2013-07-10 18:53:36: Checking ohasd
2013-07-10 18:53:37: ohasd started successfully
2013-07-10 18:53:37: Creating CRS resources and dependencies
2013-07-10 18:53:37: Configuring HASD
2013-07-10 18:53:37: Registering type ora.daemon.type
2013-07-10 18:53:37: Registering type ora.mdns.type
2013-07-10 18:53:37: Registering type ora.gpnp.type
2013-07-10 18:53:38: Registering type ora.gipc.type
2013-07-10 18:53:38: Registering type ora.cssd.type
2013-07-10 18:53:38: Registering type ora.cssdmonitor.type
2013-07-10 18:53:39: Registering type ora.crs.type
2013-07-10 18:53:39: Registering type ora.evm.type
2013-07-10 18:53:39: Registering type ora.ctss.type
2013-07-10 18:53:40: Registering type ora.asm.type
2013-07-10 18:53:40: Registering type ora.drivers.acfs.type
2013-07-10 18:53:40: Registering type ora.diskmon.type
2013-07-10 18:53:51: ADVM/ACFS is configured
2013-07-10 18:53:51: Successfully created CRS resources for cluster daemon and ASM
2013-07-10 18:53:51: Checking if initial configuration has been performed
2013-07-10 18:53:51: Starting CSS in exclusive mode
2013-07-10 18:54:19: CRS-2672: Attempting to start 'ora.gipcd' on 'rac2'
2013-07-10 18:54:19: CRS-2672: Attempting to start 'ora.mdnsd' on 'rac2'
2013-07-10 18:54:19: CRS-2676: Start of 'ora.gipcd' on 'rac2' succeeded
2013-07-10 18:54:19: CRS-2676: Start of 'ora.mdnsd' on 'rac2' succeeded
2013-07-10 18:54:19: CRS-2672: Attempting to start 'ora.gpnpd' on 'rac2'
2013-07-10 18:54:19: CRS-2676: Start of 'ora.gpnpd' on 'rac2' succeeded
2013-07-10 18:54:19: CRS-2672: Attempting to start 'ora.cssdmonitor' on 'rac2'
2013-07-10 18:54:19: CRS-2676: Start of 'ora.cssdmonitor' on 'rac2' succeeded
2013-07-10 18:54:19: CRS-2672: Attempting to start 'ora.cssd' on 'rac2'
2013-07-10 18:54:19: CRS-2672: Attempting to start 'ora.diskmon' on 'rac2'
2013-07-10 18:54:19: CRS-2676: Start of 'ora.diskmon' on 'rac2' succeeded
2013-07-10 18:54:19: CRS-2676: Start of 'ora.cssd' on 'rac2' succeeded
2013-07-10 18:54:19: Querying for existing CSS voting disks
2013-07-10 18:54:19: Performing initial configuration for cluster
2013-07-10 18:54:21: Start of resource "ora.ctssd -init" Succeeded
2013-07-10 18:54:21: Configuring ASM via ASMCA
2013-07-10 18:54:21: Executing as grid: /u01/grid_home/bin/asmca -silent -diskGroupName CRS -diskList ORCL:CRS1 -redundancy EXTERNAL -configureLocalASM
2013-07-10 18:54:21: Running as user grid: /u01/grid_home/bin/asmca -silent -diskGroupName CRS -diskList ORCL:CRS1 -redundancy EXTERNAL -configureLocalASM
2013-07-10 18:54:21:   Invoking "/u01/grid_home/bin/asmca -silent -diskGroupName CRS -diskList ORCL:CRS1 -redundancy EXTERNAL -configureLocalASM" as user "grid"
2013-07-10 18:54:40: Configuration of ASM failed, see logs for details
2013-07-10 18:54:40: Did not succssfully configure and start ASM
2013-07-10 18:54:40: Exiting exclusive mode
2013-07-10 18:54:40: Command return code of 1 (256) from command: /u01/grid_home/bin/crsctl stop resource ora.crsd -init
2013-07-10 18:54:40: Stop of resource "ora.crsd -init" failed
2013-07-10 18:54:40: Failed to stop CRSD
2013-07-10 18:55:04: Initial cluster configuration failed. See /u01/grid_home/cfgtoollogs/crsconfig/rootcrs_rac2.log for details
Also below are some of the configs related to rac2 node
[root@rac2 rac2]# rpm -qa | grep oracleasm
oracleasmlib-2.0.4-1.el5
oracleasm-support-2.1.8-1.el5
oracleasm-2.6.18-274.el5xen-2.0.5-1.el5
oracleasm-2.6.18-274.el5-2.0.5-1.el5
oracleasm-2.6.18-274.el5debug-2.0.5-1.el5
oracleasm-2.6.18-274.el5-debuginfo-2.0.5-1.el5
[root@rac2 rac2]# /usr/sbin/oracleasm configure
ORACLEASM_ENABLED=true
ORACLEASM_UID=grid
ORACLEASM_GID=asmadmin
ORACLEASM_SCANBOOT=true
ORACLEASM_SCANORDER=""
ORACLEASM_SCANEXCLUDE=""
ORACLEASM_USE_LOGICAL_BLOCK_SIZE="false"
[root@rac2 rac2]# /usr/sbin/oracleasm status
Checking if ASM is loaded: yes
Checking if /dev/oracleasm is mounted: yes
[root@rac2 rac2]# /usr/sbin/oracleasm listdisks
CRS1
DATA1
FRA1
[root@rac2 rac2]# ls -l /dev/oracleasm/disks/
total 0
brw-rw---- 1 grid asmadmin 8, 17 Jul 10 18:35 CRS1
brw-rw---- 1 grid asmadmin 8, 33 Jul 10 18:36 DATA1
brw-rw---- 1 grid asmadmin 8, 49 Jul 10 18:36 FRA1
[root@rac2 rac2]# cat /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               localhost.localdomain localhost
::1             localhost6.localdomain6 localhost6
#Public IP's(eth0)
192.168.0.101    rac1.naveed.com    rac1
192.168.0.102    rac2.naveed.com    rac2
#Private IP's(eth1)
192.168.1.101    rac1-prv.naveed.com   rac1-prv
192.168.1.102    rac2-prv.naveed.com   rac2-prv
#VIPS
192.168.0.221    rac1-vip.naveed.com   rac1-vip
192.168.0.222    rac2-vip.naveed.com   rac2-vip
#DNS server IP
192.168.0.10    naveeddns.naveed.com   naveeddns
[root@rac2 rac2]#
Thanks in advance

Hi,
First of all thanks a lot for the response. You wont't beleive this is my 7th fresh installation and everytime in node 2 i m hit with this same error.
Also i tried below procedure instead of fresh installation
once i deconfig & rerun (./rootcrs.pl -verbose -deconfig -force) on node 2
Using configuration parameter file: ./crsconfig_params
PRCR-1119 : Failed to look up CRS resources of ora.cluster_vip_net1.type type
PRCR-1068 : Failed to query resources
Cannot communicate with crsd
PRCR-1070 : Failed to check if resource ora.gsd is registered
Cannot communicate with crsd
PRCR-1070 : Failed to check if resource ora.ons is registered
Cannot communicate with crsd
CRS-4535: Cannot communicate with Cluster Ready Services
CRS-4000: Command Stop failed, or completed with errors.
CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'rac2'
CRS-2673: Attempting to stop 'ora.drivers.acfs' on 'rac2'
CRS-2673: Attempting to stop 'ora.asm' on 'rac2'
CRS-2673: Attempting to stop 'ora.ctssd' on 'rac2'
CRS-2673: Attempting to stop 'ora.mdnsd' on 'rac2'
CRS-2677: Stop of 'ora.mdnsd' on 'rac2' succeeded
CRS-2677: Stop of 'ora.asm' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.cluster_interconnect.haip' on 'rac2'
CRS-2677: Stop of 'ora.ctssd' on 'rac2' succeeded
CRS-2677: Stop of 'ora.cluster_interconnect.haip' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.cssd' on 'rac2'
CRS-2677: Stop of 'ora.cssd' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'rac2'
CRS-2677: Stop of 'ora.drivers.acfs' on 'rac2' succeeded
CRS-2677: Stop of 'ora.gipcd' on 'rac2' succeeded
CRS-2673: Attempting to stop 'ora.gpnpd' on 'rac2'
CRS-2677: Stop of 'ora.gpnpd' on 'rac2' succeeded
CRS-2793: Shutdown of Oracle High Availability Services-managed resources on 'rac2' has completed
CRS-4133: Oracle High Availability Services has been stopped.
Successfully deconfigured Oracle clusterware stack on this node
[root@rac2 grid_home]# ./root.sh
Performing root user operation for Oracle 11g
The following environment variables are set as:
    ORACLE_OWNER= grid
    ORACLE_HOME= /u01/grid_home
Enter the full pathname of the local bin directory: [/usr/local/bin]:
The contents of "dbhome" have not changed. No need to overwrite.
The contents of "oraenv" have not changed. No need to overwrite.
The contents of "coraenv" have not changed. No need to overwrite.
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root script.
Now product-specific root actions will be performed.
Using configuration parameter file: /u01/grid_home/crs/install/crsconfig_params
User ignored Prerequisites during installation
OLR initialization - successful
Adding Clusterware entries to inittab
CRS-2672: Attempting to start 'ora.mdnsd' on 'rac2'
CRS-2676: Start of 'ora.mdnsd' on 'rac2' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'rac2'
CRS-2676: Start of 'ora.gpnpd' on 'rac2' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'rac2'
CRS-2672: Attempting to start 'ora.gipcd' on 'rac2'
CRS-2676: Start of 'ora.cssdmonitor' on 'rac2' succeeded
CRS-2676: Start of 'ora.gipcd' on 'rac2' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'rac2'
CRS-2672: Attempting to start 'ora.diskmon' on 'rac2'
CRS-2676: Start of 'ora.diskmon' on 'rac2' succeeded
CRS-2676: Start of 'ora.cssd' on 'rac2' succeeded
ASM created and started successfully.
Disk Group CRS mounted successfully.
clscfg: -install mode specified
Successfully accumulated necessary OCR keys.
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Successful addition of voting disk 636af26485ef4f27bfec31523aaa0660.
Successfully replaced voting disk group with +CRS.
CRS-4266: Voting file(s) successfully replaced
## STATE    File Universal Id                File Name Disk group
1. ONLINE   636af26485ef4f27bfec31523aaa0660 (ORCL:CRS1) [CRS]
Located 1 voting disk(s).
Start of resource "ora.crsd" failed
CRS-2800: Cannot start resource 'ora.asm' as it is already in the INTERMEDIATE state on server 'rac2'
CRS-4000: Command Start failed, or completed with errors.
Failed to start Oracle Grid Infrastructure stack
Failed to start Cluster Ready Services at /u01/grid_home/crs/install/crsconfig_lib.pm line 1286.
/u01/grid_home/perl/bin/perl -I/u01/grid_home/perl/lib -I/u01/grid_home/crs/install /u01/grid_home/crs/install/rootcrs.pl execution failed

ASM install fails on one node

I have been trying to install 10gRAC on a two virtual node cluster. I installed clusterware and it was successful. Before I started ASM install:
[oracle@rac1 bin]$ ./crs_stat -t
Name Type Target State Host
ora.rac1.gsd application ONLINE ONLINE rac1
ora.rac1.ons application ONLINE ONLINE rac1
ora.rac1.vip application ONLINE ONLINE rac1
ora.rac2.gsd application ONLINE ONLINE rac2
ora.rac2.ons application ONLINE ONLINE rac2
ora.rac2.vip application ONLINE ONLINE rac2
[oracle@rac1 logs]$ /u01/crs/oracle/product/10.2.0/crs/bin/crsctl check crs
CSS appears healthy
CRS appears healthy
EVM appears healthy
[oracle@rac1 logs]$ ps -ef|grep d.bin
root 3795 1 0 13:57 ? 00:00:35 /u01/crs/oracle/product/10.2.0/crs/bin/crsd.bin reboot
oracle 4966 3793 0 13:59 ? 00:00:06 /u01/crs/oracle/product/10.2.0/crs/bin/evmd.bin
oracle 5082 5059 0 13:59 ? 00:01:06 /u01/crs/oracle/product/10.2.0/crs/bin/ocssd.bin
oracle 30520 4813 0 16:23 pts/3 00:00:00 grep d.bin
During thhe ASM install here is what I got:
WARNING: Error while copying directory /u01/app/oracle/product/10.2.0/db_1 with exclude file list 'null' to nodes 'rac2'. [PRKC-1073 : Failed to transfer directory "/u01/app/oracle/product/10.2.0/db_1" to any of the given nodes "rac2 ".
Error on node rac2:Read from remote host rac2: Connection reset by peer]
Refer to '/u01/app/oracle/oraInventory/logs/installActions2009-04-18_01-30-26PM.log' for details. You may fix the errors on the required remote nodes. Refer to the install guide for error recovery. Click 'Yes' if you want to proceed. Click 'No' to exit the install. Do you want to continue?
INFO: User Selected: Yes/OK
It appears to me as though the installer was not able to copy over the "/u01/app/oracle/product/10.2.0/db_1" directory to the rac2 node. I do not see any reason for that, I have setup ssh user equivalence for both oracle and root users, ssh and scp seem to work both ways. Permissions should not be an issue on one node and not the other as I replicated the permissions.
I continued the installation and ASM is working fine on rac1 node and not on the second node. I tried using the dbca to setup the ASM on the second node and it errors out with a "crs-0223 resource placement error". Here is what I did next:
[oracle@rac1 bin]$ ./srvctl status asm -n rac1
ASM instance +ASM1 is running on node rac1.
[oracle@rac1 bin]$ ./srvctl status asm -n rac2
ASM instance +ASM2 is not running on node rac2.
[oracle@rac1 bin]$ ./crs_sta
crs_start crs_start.bin crs_stat crs_stat.bin
[oracle@rac1 bin]$ ./crs_stat -t
Name Type Target State Host
ora....SM1.asm application ONLINE ONLINE rac1
ora....C1.lsnr application ONLINE ONLINE rac1
ora.rac1.gsd application ONLINE ONLINE rac1
ora.rac1.ons application ONLINE ONLINE rac1
ora.rac1.vip application ONLINE ONLINE rac1
ora....SM2.asm application ONLINE UNKNOWN rac2
ora....C2.lsnr application ONLINE UNKNOWN rac2
ora.rac2.gsd application ONLINE ONLINE rac2
ora.rac2.ons application ONLINE ONLINE rac2
ora.rac2.vip application ONLINE ONLINE rac2
[oracle@rac1 bin]$ ./crs_start ora.rac2.ASM2.asm
CRS-1028: Dependency analysis failed because of:
'Resource in UNKNOWN state: ora.rac2.ASM2.asm'
CRS-0223: Resource 'ora.rac2.ASM2.asm' has placement error.
I would like to get the ASM instance extended to the second node (rac2) and ofcourse, continue with the database instance creation. How can I accomplish this?
Thanks!

Hi orafun,
this message:
+WARNING: Error while copying directory /u01/app/oracle/product/10.2.0/db_1 with exclude file list 'null' to nodes 'rac2'. [PRKC-1073 : Failed to transfer directory "/u01/app/oracle/product/10.2.0/db_1" to any of the given nodes "rac2 ".+
+Error on node rac2:Read from remote host rac2: Connection reset by peer]+
Refer to '/u01/app/oracle/oraInventory/logs/installActions2009-04-18_01-30-26PM.log' for details. You may fix the errors on the required remote nodes. Refer to the install guide for error recovery. Click 'Yes' if you want to proceed. Click 'No' to exit the install. Do you want to continue?
INFO: User Selected: Yes/OK
Tells you that the Oracle Home could not be copied onto the remote node. The logs mentioned might tell you more, but this is the reason why ASM cannot be started on the other node - there is no software that could be used to start an ASM instance. Now you said:
"+It appears to me as though the installer was not able to copy over the "/u01/app/oracle/product/10.2.0/db_1" directory to the rac2 node. I do not see any reason for that, I have setup ssh user equivalence for both oracle and root users, ssh and scp seem to work both ways. Permissions should not be an issue on one node and not the other as I replicated the permissions+."
My question would be: What do you try to achieve? IF it is your only interest to "get it done and over with", then you can TAR up the Oracle Database home from which you want to run ASM and un-TAR on the remote node. Given that the paths are all correct, the registration already took place and hence, you can try starting the ASM instance on node2. IF you want to know the reason for the issue, further investigation and more information would be required.
Hope that helps. Thanks,
Markus

Runcluvfy.sh fails at precheck - nodes unreachable

Similar Messages

Maybe you are looking for