Oracle RAC: CRS fails to start

This two-node RAC serves as a Data Guard standby database.
Versions: Red Hat Linux 5.6, Oracle 10.2.0.3.0
node1->$ crsctl check crs
CSS appears healthy
Cannot communicate with CRS
EVM appears healthy
node1->$ crsctl query css votedisk
0. 0 /dev/raw/raw1
located 1 votedisk(s).
node1->$ ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 497744
Used space (kbytes) : 3820
Available space (kbytes) : 493924
ID : 1682116375
Device/File Name : /dev/raw/raw4
Device/File integrity check succeeded
Device/File not configured
Cluster registry integrity check succeeded
# ./oifcfg getif
eth0 10.17.19.0 global cluster_interconnect
eth1 172.17.19.0 global public
# /etc/init.d/init.crs start
node1->$ ps -ef|grep crs
root 5083 1 0 15:10 ? 00:00:00 /bin/su -l oracle -c sh -c 'ulimit -c unlimited; cd /app/oracle/product/10.2.0/crs_1/log/node1/evmd; exec /app/oracle/product/10.2.0/crs_1/bin/evmd '
oracle 17459 4769 0 16:09 pts/1 00:00:00 grep crs
oracle 26397 5083 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/evmd.bin
root 26619 26370 0 15:51 ? 00:00:00 /bin/su -l oracle -c /bin/sh -c 'cd /app/oracle/product/10.2.0/crs_1/log/node1/cssd/oclsomon; ulimit -c unlimited; /app/oracle/product/10.2.0/crs_1/bin/oclsomon || exit $?'
oracle 26626 26619 0 15:51 ? 00:00:00 /bin/sh -c cd /app/oracle/product/10.2.0/crs_1/log/node1/cssd/oclsomon; ulimit -c unlimited; /app/oracle/product/10.2.0/crs_1/bin/oclsomon || exit $?
oracle 26672 26626 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/oclsomon.bin
oracle 26691 26371 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/ocssd.bin
oracle 27094 26397 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/evmlogger.bin -o /app/oracle/product/10.2.0/crs_1/evm/log/evmlogger.info -l /app/oracle/product/10.2.0/crs_1/evm/log/evmlogger.log
Excerpt from alertnode1.log:
2012-11-13 15:51:07.152
[cssd(26691)]CRS-1605:CSSD voting file is online: /dev/raw/raw1. Details in /app/oracle/product/10.2.0/crs_1/log/node1/cssd/ocssd.log.
2012-11-13 15:51:08.084
[cssd(26691)]CRS-1601:CSSD Reconfiguration complete. Active nodes are node1 node2 .
2012-11-13 15:51:08.320
[evmd(26397)]CRS-1401:EVMD started on node node1.
Contents of ocssd.log:
[    CSSD]2012-11-13 15:51:05.037 >USER: Oracle Database 10g CSS Release 10.2.0.3.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
[    CSSD]2012-11-13 15:51:05.037 >USER: CSS daemon log for node node1, number 1, in cluster crs
[    CSSD]2012-11-13 15:51:05.040 [2246605696] >TRACE: clssscmain: local-only set to false
[ clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=node1DBG_CSSD))
[    CSSD]2012-11-13 15:51:05.065 [2246605696] >TRACE: clssnmReadNodeInfo: added node 1 (node1) to cluster
[    CSSD]2012-11-13 15:51:05.074 [2246605696] >TRACE: clssnmReadNodeInfo: added node 2 (node2) to cluster
[    CSSD]2012-11-13 15:51:05.077 [1120115008] >TRACE: clssnm_skgxnmon: skgxn init failed
[    CSSD]2012-11-13 15:51:05.077 [2246605696] >TRACE: clssnm_skgxnonline: Using vacuous skgxn monitor
[    CSSD]2012-11-13 15:51:05.079 [2246605696] >TRACE: clssnmNMInitialize: misscount set to (60), impending reconfig threshold set to (56000)
[    CSSD]2012-11-13 15:51:05.079 [2246605696] >TRACE: clssnmNMInitialize: diskShortTimeout set to (57000)ms
[    CSSD]2012-11-13 15:51:05.080 [2246605696] >TRACE: clssnmNMInitialize: diskLongTimeout set to (200000)ms
[    CSSD]2012-11-13 15:51:05.082 [2246605696] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0//dev/raw/raw1)
[    CSSD]2012-11-13 15:51:05.082 [1120115008] >TRACE: clssnmvDPT: spawned for disk 0 (/dev/raw/raw1)
[    CSSD]2012-11-13 15:51:07.127 [1120115008] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/raw/raw1)
[    CSSD]2012-11-13 15:51:07.153 [1130604864] >TRACE: clssnmvKillBlockThread: spawned for disk 0 (/dev/raw/raw1) initial sleep interval (1000)ms
[    CSSD]2012-11-13 15:51:07.161 [2246605696] >TRACE: clssnmFatalInit: fatal mode enabled
[    CSSD]2012-11-13 15:51:07.161 [1151584576] >TRACE: clssnmconnect: connecting to node 1, flags 0x0001, connector 1
[    CSSD]2012-11-13 15:51:07.161 [1120115008] >TRACE: clssnmReadDskHeartbeat: node(2) is down. rcfg(12) wrtcnt(78619) LATS(1830084) Disk lastSeqNo(78619)
[    CSSD]2012-11-13 15:51:07.162 [1151584576] >TRACE: clssnmClusterListener: Listening on (ADDRESS=(PROTOCOL=tcp)(HOST=node1-priv)(PORT=49895))
[    CSSD]2012-11-13 15:51:07.162 [1151584576] >TRACE: clssnmconnect: connecting to node 0, flags 0x0000, connector 1
[    CSSD]2012-11-13 15:51:07.162 [1151584576] >TRACE: clssnmClusterListener: Probing node 2, con (0x2aaaac10c320)
[    CSSD]2012-11-13 15:51:07.171 [1162074432] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=Oracle_CSS_LclLstnr_crs_1))
[    CSSD]2012-11-13 15:51:07.171 [1162074432] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_node1_crs))
[    CSSD]2012-11-13 15:51:07.172 [1193544000] >TRACE: clssgmPeerListener: Listening on (ADDRESS=(PROTOCOL=tcp)(DEV=19)(HOST=10.17.19.20)(PORT=18701))
[    CSSD]2012-11-13 15:51:07.198 [1151584576] >TRACE: clssnmConnComplete: connected to node 2 (con 0x2aaaac163b50), state 3 birth 0, unique 1352712566/1352712566 prevConuni(0)
[    CSSD]2012-11-13 15:51:07.673 [1204033856] >TRACE: clssnmPollingThread: Connection complete
[    CSSD]2012-11-13 15:51:07.673 [1214523712] >TRACE: clssnmSendingThread: Connection complete
[    CSSD]2012-11-13 15:51:07.673 [1225013568] >TRACE: clssnmRcfgMgrThread: Connection complete
[    CSSD]2012-11-13 15:51:08.003 [1151584576] >TRACE: clssnmHandleSync: Acknowledging sync: src[2] srcName[node2] seq[45] sync[12]
[    CSSD]2012-11-13 15:51:08.003 [1151584576] >TRACE: clssnmHandleSync: diskTimeout set to (57000)ms
[    CSSD]2012-11-13 15:51:08.003 [1151584576] >TRACE: clssnmSendVoteInfo: node(2) syncSeqNo(12)
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmUpdateNodeState: node 0, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmDeactivateNode: node 0 () left cluster
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmUpdateNodeState: node 1, state (1/2) unique (1352793064/1352793064) prevConuni(0) birth (0/12) (old/new)
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmUpdateNodeState: node 2, state (4/3) unique (1352712566/1352712566) prevConuni(0) birth (0/1) (old/new)
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >USER: clssnmHandleUpdate: SYNC(12) from node(2) completed
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >USER: clssnmHandleUpdate: NODE 1 (node1) IS ACTIVE MEMBER OF CLUSTER
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >USER: clssnmHandleUpdate: NODE 2 (node2) IS ACTIVE MEMBER OF CLUSTER
[    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmHandleUpdate: diskTimeout set to (200000)ms
[    CSSD]2012-11-13 15:51:08.081 [2246605696] >USER: NMEVENT_SUSPEND [00][00][00][00]
[    CSSD]2012-11-13 15:51:08.081 [1235503424] >TRACE: clssgmReconfigThread: started for reconfig (12)
[    CSSD]2012-11-13 15:51:08.081 [1235503424] >USER: NMEVENT_RECONFIG [00][00][00][06]
[    CSSD]2012-11-13 15:51:08.081 [1235503424] >TRACE: clssgmEstablishConnections: 2 nodes in cluster incarn 12
[    CSSD]2012-11-13 15:51:08.082 [1193544000] >TRACE: clssgmInitialRecv: (0xd9ae050) accepted a new connection from node 2 born at 1 active (2, 2), vers (10,3,1,2)
[    CSSD]2012-11-13 15:51:08.082 [1193544000] >TRACE: clssgmInitialRecv: conns done (2/2)
[    CSSD]2012-11-13 15:51:08.082 [1235503424] >TRACE: clssgmEstablishMasterNode: MASTER for 12 is node(2) birth(1)
[    CSSD]2012-11-13 15:51:08.082 [1235503424] >TRACE: clssgmChangeMasterNode: requeued 0 RPCs
[    CSSD]2012-11-13 15:51:08.083 [1193544000] >TRACE: clssgmHandleDBDone(): src/dest (2/65535) size(72) incarn 12
[    CSSD]CLSS-3000: reconfiguration successful, incarnation 12 with 2 nodes
[    CSSD]CLSS-3001: local node number 1, master node number 2
[    CSSD]2012-11-13 15:51:08.084 [1235503424] >TRACE: clssgmReconfigThread: completed for reconfig(12), with status(1)
[    CSSD]2012-11-13 15:51:08.268 [1162074432] >TRACE: clssgmClientConnectMsg: Connect from con(0xd9b4d50) proc(0xd9b9d50) pid() proto(10:2:1:1)
[    CSSD]2012-11-13 15:51:08.268 [1193544000] >TRACE: clssgmCommonAddMember: clsomon joined (1/0x1000000/#CSS_CLSSOMON)
[    CSSD]2012-11-13 15:51:08.269 [1162074432] >TRACE: clssgmClientConnectMsg: Connect from con(0xd9b7910) proc(0xd9ba0a0) pid() proto(10:2:1:1)
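I checked the OCR, the voting disk, the storage, the network, and the raw device permissions and found no problems. Sometimes running /etc/init.d/init.crs start even causes the server to reboot. The log looks like this:
/var/log/messages around the time of the reboot: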
Nov 13 15:51:03 node1 logger: Cluster Ready Services completed waiting on dependencies.
Nov 13 15:51:03 node1 logger: Cluster Ready Services completed waiting on dependencies.
Nov 13 16:10:54 node1 auditd[3667]: Audit daemon rotating log files
Nov 13 16:49:14 node1 auditd[3667]: Audit daemon rotating log files
Nov 13 16:50:37 node1 root: Cluster Ready Services completed waiting on dependencies.
Nov 13 16:52:07 node1 logger: Oracle CSS family monitor shutting down. 3
Nov 13 16:52:07 node1 root: Oracle CRSD 5797 set to stop
Nov 13 16:52:07 node1 root: Oracle CRSD 5797 shutdown completed
Nov 13 16:52:07 node1 root: Oracle EVMD set to stop
Nov 13 16:52:07 node1 root: Oracle CSSD being stopped
Nov 13 16:52:17 node1 root: Oracle CSSD being stopped
Nov 13 16:52:27 node1 root: Oracle EVMD set to stop
Nov 13 16:52:45 node1 root: Oracle CSSD being stopped
Nov 13 17:03:14 node1 root: Oracle CRSD 5797 set to stop
Nov 13 17:03:14 node1 root: Oracle CRSD 5797 shutdown completed
Nov 13 17:03:14 node1 root: Oracle EVMD set to stop
Nov 13 17:03:14 node1 root: Oracle CSSD being stopped
Nov 13 17:03:26 node1 root: Oracle Cluster Ready Services starting by user request.
Nov 13 17:03:35 node1 logger: Cluster Ready Services completed waiting on dependencies.
Nov 13 17:03:36 node1 logger: Oracle CSSD shell script failure. Duplicate CSSD.
Nov 13 17:03:36 node1 kernel: md: stopping all md devices.
Nov 13 17:21:49 node1 syslogd 1.4.1: restart.
Nov 13 17:21:49 node1 kernel: klogd 1.4.1, log source = /proc/kmsg started.
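The server rebooted right after this line appeared: Nov 13 17:03:36 node1 logger: Oracle CSSD shell script failure. Duplicate CSSD.
I searched online and found quite a few similar problems. The reasons other users could not start CRS mostly fall into a few areas:
1. Incorrect permissions on /tmp
2. Files under /var/tmp/.oracle that had to be deleted before restarting
3. Network interface configuration problems visible in oifcfg
In my case, however, all three of these are fine. My problem looks similar to this one: http://www.itpub.net/thread-1330782-1-1.html
What could be causing this problem?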
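For anyone who wants to re-check the first two items quickly, here is a minimal Python sketch. It assumes the usual 1777 (rwxrwxrwt) mode is what "correct" means for /tmp, which the posts above do not state explicitly. The helper names tmp_mode_ok and list_socket_files are made up for this illustration. It only reads and reports; it does not delete anything.

```python
import os
import stat


def tmp_mode_ok(mode: int) -> bool:
    """Return True if mode describes a directory with permissions 1777."""
    # Assumption: 1777 (world-writable with the sticky bit) is the expected mode.
    return stat.S_ISDIR(mode) and stat.S_IMODE(mode) == 0o1777


def list_socket_files(directory: str) -> list:
    """Return the sorted entries under directory, or [] if it cannot be read."""
    try:
        return sorted(os.listdir(directory))
    except OSError:
        return []


if __name__ == "__main__":
    print("/tmp permissions OK:", tmp_mode_ok(os.stat("/tmp").st_mode))
    print("/var/tmp/.oracle entries:", list_socket_files("/var/tmp/.oracle"))
```

Run it as a user who can read /var/tmp/.oracle. An empty list can mean either that the directory is missing or that it is unreadable.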

I shut down node 1 and restarted node 2.
crsd.log still has nothing written to it.
Excerpt from alertnode2.log:
2012-11-14 12:57:53.568
[cssd(10296)]CRS-1605:CSSD voting file is online: /dev/raw/raw1. Details in /app/oracle/product/10.2.0/crs_1/log/node2/cssd/ocssd.log.
2012-11-14 13:01:13.616
[cssd(10296)]CRS-1601:CSSD Reconfiguration complete. Active nodes are node2 .
2012-11-14 13:01:13.776
[evmd(10080)]CRS-1401:EVMD started on node node2.
Excerpt from ocssd.log:
[    CSSD]2012-11-14 12:57:51.475 >USER: Oracle Database 10g CSS Release 10.2.0.3.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
[ clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=node2DBG_CSSD))
[    CSSD]2012-11-14 12:57:51.475 >USER: CSS daemon log for node node2, number 2, in cluster crs
[    CSSD]2012-11-14 12:57:51.482 [1618381696] >TRACE: clssscmain: local-only set to false
[    CSSD]2012-11-14 12:57:51.496 [1618381696] >TRACE: clssnmReadNodeInfo: added node 1 (node1) to cluster
[    CSSD]2012-11-14 12:57:51.500 [1618381696] >TRACE: clssnmReadNodeInfo: added node 2 (node2) to cluster
[    CSSD]2012-11-14 12:57:51.503 [1105389888] >TRACE: clssnm_skgxnmon: skgxn init failed
[    CSSD]2012-11-14 12:57:51.503 [1618381696] >TRACE: clssnm_skgxnonline: Using vacuous skgxn monitor
[    CSSD]2012-11-14 12:57:51.505 [1618381696] >TRACE: clssnmNMInitialize: misscount set to (60), impending reconfig threshold set to (56000)
[    CSSD]2012-11-14 12:57:51.505 [1618381696] >TRACE: clssnmNMInitialize: diskShortTimeout set to (57000)ms
[    CSSD]2012-11-14 12:57:51.506 [1618381696] >TRACE: clssnmNMInitialize: diskLongTimeout set to (200000)ms
[    CSSD]2012-11-14 12:57:51.508 [1618381696] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0//dev/raw/raw1)
[    CSSD]2012-11-14 12:57:51.508 [1105389888] >TRACE: clssnmvDPT: spawned for disk 0 (/dev/raw/raw1)
[    CSSD]2012-11-14 12:57:53.552 [1105389888] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/raw/raw1)
[    CSSD]2012-11-14 12:57:53.575 [1128057152] >TRACE: clssnmvKillBlockThread: spawned for disk 0 (/dev/raw/raw1) initial sleep interval (1000)ms
[    CSSD]2012-11-14 12:57:53.587 [1105389888] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(15) wrtcnt(59321) LATS(3927324) Disk lastSeqNo(59321)
[    CSSD]2012-11-14 12:57:53.589 [1618381696] >TRACE: clssnmFatalInit: fatal mode enabled
[    CSSD]2012-11-14 12:57:53.589 [1149036864] >TRACE: clssnmconnect: connecting to node 2, flags 0x0001, connector 1
[    CSSD]2012-11-14 12:57:53.590 [1149036864] >TRACE: clssnmClusterListener: Listening on (ADDRESS=(PROTOCOL=tcp)(HOST=node2-priv)(PORT=49895))
[    CSSD]2012-11-14 12:57:53.590 [1149036864] >TRACE: clssnmconnect: connecting to node 0, flags 0x0000, connector 1
[    CSSD]2012-11-14 12:57:53.590 [1149036864] >TRACE: clssnmconnect: connecting to node 1, flags 0x0001, connector 0
[    CSSD]2012-11-14 12:57:53.595 [1149036864] >TRACE: clsc_send_msg: (0x108cf430) NS err (12571, 12560), transport (530, 111, 0)
[    CSSD]2012-11-14 12:57:53.600 [1159526720] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=Oracle_CSS_LclLstnr_crs_2))
[    CSSD]2012-11-14 12:57:53.600 [1159526720] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_node2_crs))
[    CSSD]2012-11-14 12:57:53.601 [1190996288] >TRACE: clssgmPeerListener: Listening on (ADDRESS=(PROTOCOL=tcp)(DEV=19)(HOST=10.17.19.21)(PORT=52492))
[    CSSD]2012-11-14 12:57:53.601 [1201486144] >TRACE: clssnmPollingThread: Connection complete
[    CSSD]2012-11-14 12:57:53.601 [1211976000] >TRACE: clssnmSendingThread: Connection complete
[    CSSD]2012-11-14 12:57:53.601 [1222465856] >TRACE: clssnmRcfgMgrThread: Connection complete
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmRcfgMgrThread: Local Join
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmDoSyncUpdate: Initiating sync 1
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmDoSyncUpdate: diskTimeout set to (57000)ms
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: Ack message type (11)
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: node(2) is ALIVE
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSendSync: syncSeqNo(1)
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: Ack message type(11), ackCount(1)
[    CSSD]2012-11-14 12:58:00.616 [1149036864] >TRACE: clssnmHandleSync: Acknowledging sync: src[2] srcName[node2] seq[1] sync[1]
[    CSSD]2012-11-14 12:58:00.616 [1149036864] >TRACE: clssnmHandleSync: diskTimeout set to (57000)ms
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: done, msg type(11)
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: Ack message type (13)
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: node(2) is ACTIVE
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSendVote: syncSeqNo(1)
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: Ack message type(13), ackCount(1)
[    CSSD]2012-11-14 12:58:00.616 [1149036864] >TRACE: clssnmSendVoteInfo: node(2) syncSeqNo(1)
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: done, msg type(13)
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmCheckDskInfo: Checking disk info...
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmCheckDskInfo: diskTimeout set to (200000)ms
[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(7030) state_network(0) state_disk(3) misstime(3934354)
[    CSSD]2012-11-14 12:58:00.671 [1618381696] >USER: NMEVENT_SUSPEND [00][00][00][00]
[    CSSD]2012-11-14 12:58:01.616 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(8030) state_network(0) state_disk(3) misstime(3934354)
[    CSSD]2012-11-14 12:58:02.618 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(9030) state_network(0) state_disk(3) misstime(3935354)
[    CSSD]2012-11-14 12:58:03.619 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(10030) state_network(0) state_disk(3) misstime(3936354)
[    CSSD]2012-11-14 12:58:04.620 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(11030) state_network(0) state_disk(3) misstime(3937354)
[    CSSD]2012-11-14 12:58:05.620 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(12030) state_network(0) state_disk(3) misstime(3938354)
[    CSSD]2012-11-14 12:58:06.621 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(13030) state_network(0) state_disk(3) misstime(3939364)
[    CSSD]2012-11-14 12:58:07.622 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(14030) state_network(0) state_disk(3) misstime(3940364)
There are a lot of clssnmCheckDskInfo lines in between. They exceeded the reply length limit, so I removed some of them here.
[    CSSD]2012-11-14 13:01:11.813 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(198200) state_network(0) state_disk(3) misstime(4124704)
[    CSSD]2012-11-14 13:01:12.814 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(199200) state_network(0) state_disk(3) misstime(4125704)
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmEvict: Start
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmWaitOnEvictions: Start
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmSetupAckWait: Ack message type (15)
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmSetupAckWait: node(2) is ACTIVE
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmSendUpdate: syncSeqNo(1)
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmWaitForAcks: Ack message type(15), ackCount(1)
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmUpdateNodeState: node 0, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmDeactivateNode: node 0 () left cluster
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmUpdateNodeState: node 1, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmDeactivateNode: node 1 (node1) left cluster
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmUpdateNodeState: node 2, state (2/2) unique (1352869071/1352869071) prevConuni(0) birth (1/1) (old/new)
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >USER: clssnmHandleUpdate: SYNC(1) from node(2) completed
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >USER: clssnmHandleUpdate: NODE 2 (node2) IS ACTIVE MEMBER OF CLUSTER
[    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmHandleUpdate: diskTimeout set to (200000)ms
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmWaitForAcks: done, msg type(15)
[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmDoSyncUpdate: Sync Complete!
[    CSSD]2012-11-14 13:01:13.615 [1232955712] >TRACE: clssgmReconfigThread: started for reconfig (1)
[    CSSD]2012-11-14 13:01:13.615 [1232955712] >USER: NMEVENT_RECONFIG [00][00][00][04]
[    CSSD]2012-11-14 13:01:13.615 [1232955712] >TRACE: clssgmEstablishConnections: 1 nodes in cluster incarn 1
[    CSSD]2012-11-14 13:01:13.616 [1190996288] >TRACE: clssgmPeerListener: connects done (1/1)
[    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmEstablishMasterNode: MASTER for 1 is node(2) birth(1)
[    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmChangeMasterNode: requeued 0 RPCs
[    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmMasterCMSync: Synchronizing group/lock status
[    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmMasterSendDBDone: group/lock status synchronization complete
[    CSSD]CLSS-3000: reconfiguration successful, incarnation 1 with 1 nodes
[    CSSD]CLSS-3001: local node number 2, master node number 2
[    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmReconfigThread: completed for reconfig(1), with status(1)
[    CSSD]2012-11-14 13:01:13.732 [1159526720] >TRACE: clssgmClientConnectMsg: Connect from con(0x10a0b9a0) proc(0x10a10980) pid() proto(10:2:1:1)
[    CSSD]2012-11-14 13:01:13.732 [1159526720] >TRACE: clssgmClientConnectMsg: Connect from con(0x10a0e540) proc(0x10a10c50) pid() proto(10:2:1:1)
[    CSSD]2012-11-14 13:01:13.733 [1159526720] >TRACE: clssgmCommonAddMember: clsomon joined (2/0x1000000/#CSS_CLSSOMON)
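In the node2 log above, the clssnmCheckDskInfo timeout for node 1 climbs from 7030 to 199200, and clssnmEvict starts right afterwards, close to the 200000 ms diskTimeout. That reading of the trace is my interpretation, not something the log states. The Python sketch below pulls the first and last timeout per node out of such lines. The function name disk_wait_summary is invented for this example, and the regular expression is written only against the line format shown in this thread.

```python
import re

# Matches lines such as:
#   clssnmCheckDskInfo: node(1) timeout(7030) state_network(0) state_disk(3) ...
CHECK_RE = re.compile(
    r"clssnmCheckDskInfo: node\((\d+)\) timeout\((\d+)\) "
    r"state_network\((\d+)\) state_disk\((\d+)\)"
)


def disk_wait_summary(lines):
    """Return {node: (first_timeout_ms, last_timeout_ms, sample_count)}."""
    summary = {}
    for line in lines:
        match = CHECK_RE.search(line)
        if not match:
            continue  # skip every other kind of trace line
        node, timeout = int(match.group(1)), int(match.group(2))
        first, _, count = summary.get(node, (timeout, timeout, 0))
        summary[node] = (first, timeout, count + 1)
    return summary


sample = [
    "[    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(7030) state_network(0) state_disk(3) misstime(3934354)",
    "[    CSSD]2012-11-14 13:01:12.814 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(199200) state_network(0) state_disk(3) misstime(4125704)",
    "[    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmEvict: Start",
]
print(disk_wait_summary(sample))  # {1: (7030, 199200, 2)}
```

Feeding it the full ocssd.log (for example via open(path).readlines()) should show how long the surviving node waited on the other node's disk heartbeat before evicting it.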

Similar Messages

Load Balance Problem in Oracle RAC 10.2.0.4 (two nodes with CRS)

Hi all,
I have an Oracle RAC 10.2.0.4 with two nodes and I'm having problems with session load balancing.
The problem is that all connections go to the node that acts as 'master' in RAC, that is, the node where the .db resource is located in CRS.
If the .db resource is located on node1, all connections are made to node1. In the same way, all connections are made to node2 when the .db resource from the CRS stack is located on node2.
The connections are made by a pool in Tomcat 6 that uses the RAC service to connect.
Any idea?
Thanks in advance.
Here is an example of my net configuration:
listener.ora in node1:
LISTENER_RAC_NODE1 =
  (DESCRIPTION_LIST =
    (DESCRIPTION =
      (ADDRESS = (PROTOCOL = TCP)(HOST = node1_vip)(PORT = 1525)(IP = FIRST))
      (ADDRESS = (PROTOCOL = TCP)(HOST = node1)(PORT = 1525)(IP = FIRST))
    )
  )
tnsnames in node1:
LISTENERS_RAC =
  (ADDRESS_LIST =
    (ADDRESS = (PROTOCOL = TCP)(HOST = node1_vip)(PORT = 1525))
    (ADDRESS = (PROTOCOL = TCP)(HOST = node2_vip)(PORT = 1525))
  )
NODE1 =
  (DESCRIPTION =
    (ADDRESS = (PROTOCOL = TCP)(HOST = node1_vip)(PORT = 1525))
    (ADDRESS = (PROTOCOL = TCP)(HOST = node1)(PORT = 1525))
    (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = rac)
      (INSTANCE_NAME = rac1)
    )
  )
RAC =
  (DESCRIPTION =
    (ADDRESS = (PROTOCOL = TCP)(HOST = node1_vip)(PORT = 1525))
    (ADDRESS = (PROTOCOL = TCP)(HOST = node2_vip)(PORT = 1525))
    (LOAD_BALANCE = yes)
    (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = rac)
    )
  )
********************************************************

It is my fault.
It was the connection service. I had one service with the preferred option and one without it. Now it is working well.

Where to download Oracle RAC 11g2 CRS/clusterware?

Hello experts.
I'm building a home/test Oracle RAC environment on 11g2 and I can't seem to find the CRS/Clusterware for download?
I've been able to download the CVU and verify all of the server/OS/systems configuration work that was needed upfront.
I'm running on CentOS 5.5 (RHEL 5.5), on a Sun Sunfire v20z, that has FC disk array and FC HBA's connected over fiber (LC/LC connections).
I've downloaded the Oracle 11g2 software 1&2, and also the Grid software, and the CVU software.
CVU shows all configurations passed... can anyone point me in the right direction? I'm assuming Oracle allows people to download CRS/Clusterware for non-commercial (personal education) purposes without license?
Please advise.
Thank you.

Hi,
"I'm building a home/test Oracle RAC environment on 11g2 and I can't seem to find the CRS/Clusterware for download?"
Here:
https://edelivery.oracle.com/
Or:
http://www.oracle.com/technetwork/database/enterprise-edition/downloads/index.html
"CVU shows all configurations passed... can anyone point me in the right direction? I'm assuming Oracle allows people to download CRS/Clusterware for non-commercial (personal education) purposes without license?"
Oracle Clusterware and Oracle ASM binaries are installed together as a single product called Grid Infrastructure. So if you downloaded Grid Infrastructure, you already have the CRS and ASM installation.
All software downloads are free, and most come with a Developer License that allows you to use full versions of the products at no charge while developing and prototyping your applications, or for strictly self-educational purposes.
Regards,
Levi Pereira

CRS, CSS and EVM daemons in Oracle RAC

Can anyone please explain the basic functionality and purpose of the crs, css and evm daemons in Oracle RAC?
I read somewhere that the ocssd daemon handles cluster configuration information on the OCR disk. Is ocssd just another name for css?

The Oracle Clusterware processes on UNIX-based systems are:
■ crsd—Performs high availability recovery and management operations such as
maintaining the OCR and managing application resources. This process runs as
the root user, or by a user in the admin group on Mac OS X-based systems. This
process restarts automatically upon failure.
■ evmd—Event manager daemon. This process also starts the racgevt process to
manage FAN server callouts.
■ ocssd—Manages cluster node membership and runs as the oracle user; failure
of this process results in cluster restart.
■ oprocd—Process monitor for the cluster. Note that this process only appears on
platforms that do not use vendor clusterware with Oracle Clusterware.
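
To see which of these daemons is actually up on a node, one option is to scan `ps -ef` output for the daemon binaries. Below is a rough Python sketch. The names evmd.bin and ocssd.bin are taken from the ps listing in the first thread on this page; crsd.bin is assumed by analogy and may differ by platform or version. The function name running_daemons is made up for illustration.

```python
# Binary names to look for; crsd.bin is an assumption by analogy with the others.
DAEMONS = ("crsd.bin", "evmd.bin", "ocssd.bin")


def running_daemons(ps_output: str) -> dict:
    """Map each daemon binary name to True if some process line runs it."""
    found = {name: False for name in DAEMONS}
    for line in ps_output.splitlines():
        for name in DAEMONS:
            # Compare against the last path component of every token on the line.
            if any(tok.rsplit("/", 1)[-1] == name for tok in line.split()):
                found[name] = True
    return found


sample_ps = """\
oracle 26397 5083 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/evmd.bin
oracle 26691 26371 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/ocssd.bin
oracle 17459 4769 0 16:09 pts/1 00:00:00 grep crs
"""
print(running_daemons(sample_ps))
```

With the sample lines above, crsd.bin comes back False while the other two are True, which would be consistent with a "Cannot communicate with CRS" result from crsctl check crs.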

Linux Oracle RAC Issues with CRS

Using the Linux Oracle RAC install guide, I've finally completed my Oracle RAC install on Linux. Just one problem: after 10 minutes or so, one of my nodes loses communication, and a status check with the srvctl status nodeapps -n <servername> command returns a PRKH-1010 error.
Any clues or ways to diagnose this?

Hi!
The gsd relies on some files residing in /tmp/.oracle
Are they still there?
GSD is known to stop if they are absent.
cu
Andreas

Install Oracle RAC 10g (10.2.0.1) on HP-UX B.11.31 U ia64 failed

Hi All
I am installing Oracle RAC 10g 10.2.0.1 on HP-UX B.11.31 U ia64 but cannot complete the installation.
hosts file
#Public IPs
10.144.1.111 spgdb01
10.144.1.112 spgdb02
#Private IPs
10.144.2.2 spgdb01p
10.144.2.3 spgdb02p
#Virtual IPs
10.144.1.113 spgdb01v
10.144.1.114 spgdb02v
I ran the installation with runInstaller without error. The copy and link steps were OK. When I run root.sh, it does not complete, as shown below:
Checking to see if Oracle CRS stack is already configured
Checking to see if any 9i GSD is up
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/oracle/product/10.2.0' is not owned by root
WARNING: directory '/oracle/product' is not owned by root
WARNING: directory '/oracle' is not owned by root
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 0: spgdb01 spgdb01p spgdb01
node 1: spgdb02 spgdb02p spgdb02
Creating OCR keys for user 'root', privgrp 'sys'..
Operation successful.
Now formatting voting device: /ora/crs/votedisk01
waitpid(-1, 0x7fffdf50, WUNTRACED) .................................................................................................... [sleeping]
Now formatting voting device: /oracle/oradata1/crs/votedisk02
Now formatting voting device: /oracle/oradata2/crs/votedisk03
Format of 3 voting devices complete.
Startup will be queued to init within 30 seconds.
====================
I waited for 10 minutes, but it still did not complete.
Additionally, this is what I got in the runInstaller log:
Preparing to launch Oracle Universal Installer from /tmp/OraInstall2011-04-28_12-13-31AM. Please wait ...-bash-4.2$ Oracle Universal Installer, Version 10.2.0.1.0 Production
Copyright (C) 1999, 2005, Oracle. All rights reserved.
Private Interconnect : null
Private Interconnect : null
Private Interconnect : null
Private Interconnect : null
So, please help me fix this issue
Thank you

I had this problem and resolved it by transporting the file to the installation server with the correct ftp datatype (binary).
On page 54 of the install guide (..Server\Oracle_Business_Intelligence\doc\doc\bi.1013\b31765.pdf) that comes with the installation files, there is an instruction to make sure that any ftp activity is done in binary.
This may not have occurred with the license.xml file if you use a tool which offers the "feature" of automatic datatype recognition.
Hope this helps.

Oracle rac templates 11g R2 buildcluster.sh error

Hi All,
I am facing the error below while creating Oracle RAC templates. Kindly let us know how to resolve it.
===error=========================
Oracle RAC 11gR2 OneCommand (v1.2) for Oracle VM - (c) 2010-2011 Oracle Corporation
   Cksum: [1170221909 255000 racovm.sh] at Sun Jan 5 04:15:14 EST 2014
   Kernel: 2.6.18-194.0.0.0.3.el5xen (i686) [1 processor(s)] 1700 MB
2014-01-05 04:15:14:[printparams:Time :racnode1] Completed successfully in 4 seconds (0h:00m:04s)
2014-01-05 04:15:14:[setsshora:Start:racnode1] SSH Setup for the Oracle user(s)...
INFO (node:racnode1): Running as oracle: /u01/racovm/ssh/setssh-Linux.sh -s -x -c NO -h nodelist -p ***   (setup on 2 node(s): racnode1 racnode2)
ERROR: Failed to create temporary file /tmp/setssh-cretmpQY3958 on localhost, can not proceed
Exiting...
ERROR (node:racnode1): Failed to configure passwordless SSH for the oracle user
2014-01-05 04:15:17:[setsshora:Time :racnode1] Completed with errors in 3 seconds (0h:00m:03s), status: 1
2014-01-05 04:15:17:[buildcluster:Time :racnode1] Completed with errors in 58 seconds (0h:00m:58s), status: 1
thanks,
Mike.

Try this. It worked for me.
Please keep in mind that you need to wait until each step finishes successfully before moving to the next one.
For Steps 1 and 2, you can skip any node(s) on which you haven't executed root.sh yet.
Step 1: As root, run "$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force" on all nodes except the last one.
Step 2: As root, run "$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force -lastnode" on the last node. This command will also zero out the OCR and voting disks.
Step 3: As root, run $GRID_HOME/root.sh on all nodes, one by one.
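To make the ordering of the three steps explicit, here is a small Python sketch that only builds the list of (node, command) pairs and prints it. It does not run anything. The function name reconfig_commands is hypothetical. The node names come from the error output earlier in this thread, and the commands are copied from the steps above.

```python
def reconfig_commands(nodes, grid_home="$GRID_HOME"):
    """Return (node, command) pairs in the order the steps describe."""
    if not nodes:
        return []
    deconfig = f"{grid_home}/crs/install/rootcrs.pl -verbose -deconfig -force"
    # Step 1: deconfig on every node except the last one.
    plan = [(node, deconfig) for node in nodes[:-1]]
    # Step 2: deconfig with -lastnode on the last node.
    plan.append((nodes[-1], deconfig + " -lastnode"))
    # Step 3: root.sh on all nodes, one by one.
    plan.extend((node, f"{grid_home}/root.sh") for node in nodes)
    return plan


for node, command in reconfig_commands(["racnode1", "racnode2"]):
    print(f"{node}: {command}")
```

The sketch does not model the note about skipping nodes where root.sh was never executed. Drop those nodes from the Step 1 and Step 2 entries by hand if that applies.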

10g RAC: How to Clean Up After a Failed CRS Install

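Product: ORACLE SERVER
Date written: 2004-11-30
10g RAC: How to Clean Up After a Failed CRS Install
=====================================
PURPOSE
This document is intended to give DBAs and technical support engineers the information needed
to remove a partially installed CRS (Cluster Ready Services) after a 10g RAC CRS installation fails.
Explanation
A failed CRS installation can cause problems such as node reboots.
To clean up a failed CRS installation, follow these steps:
1. From the $ORA_CRS_HOME/install directory, run the rootdelete.sh script and then the rootdeinstall.sh
script. If you have trouble running these scripts, or want to verify that all components were removed
successfully, go to step 2:
2. Stop the node applications on all nodes: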
srvctl stop nodeapps -n <node name>
3. Prevent CRS from starting when the node boots. To do this, perform the following
as root:
Sun:
rm /etc/init.d/init.cssd
rm /etc/init.d/init.crs
rm /etc/init.d/init.crsd
rm /etc/init.d/init.evmd
rm /etc/rc3.d/K96init.crs
rm /etc/rc3.d/S96init.crs
rm -Rf /var/opt/oracle/scls_scr
rm -Rf /var/opt/oracle/oprocd
rm /etc/inittab.crs
cp /etc/inittab.orig /etc/inittab
Linux:
rm -f /etc/init.d/init.cssd
rm -f /etc/init.d/init.crs
rm -f /etc/init.d/init.crsd
rm -f /etc/init.d/init.evmd
rm -f /etc/rc2.d/K96init.crs
rm -f /etc/rc2.d/S96init.crs
rm -f /etc/rc3.d/K96init.crs
rm -f /etc/rc3.d/S96init.crs
rm -f /etc/rc5.d/K96init.crs
rm -f /etc/rc5.d/S96init.crs
rm -Rf /etc/oracle/scls_scr
rm -f /etc/inittab.crs
cp /etc/inittab.orig /etc/inittab
HP-UX:
rm /sbin/init.d/init.cssd
rm /sbin/init.d/init.crs
rm /sbin/init.d/init.crsd
rm /sbin/init.d/init.evmd
rm /sbin/rc3.d/K960init.crs
rm /sbin/rc3.d/S960init.crs
rm -Rf /var/opt/oracle/scls_scr
rm -Rf /var/opt/oracle/oprocd
rm /etc/inittab.crs
cp /etc/inittab.orig /etc/inittab
HP Tru64:
rm /sbin/init.d/init.cssd
rm /sbin/init.d/init.crs
rm /sbin/init.d/init.crsd
rm /sbin/init.d/init.evmd
rm /sbin/rc3.d/K96init.crs
rm /sbin/rc3.d/S96init.crs
rm -Rf /var/opt/oracle/scls_scr
rm -Rf /var/opt/oracle/oprocd
rm /etc/inittab.crs
cp /etc/inittab.orig /etc/inittab
IBM AIX:
rm /etc/init.cssd
rm /etc/init.crs
rm /etc/init.crsd
rm /etc/init.evmd
rm /etc/rc.d/rc2.d/K96init.crs
rm /etc/rc.d/rc2.d/S96init.crs
rm -Rf /etc/oracle/scls_scr
rm -Rf /etc/oracle/oprocd
rm /etc/inittab.crs
cp /etc/inittab.orig /etc/inittab
4. If the processes are still alive, kill the EVM, CRS and CSS processes, or
reboot the node:
ps -ef | grep crs
kill <crs pid>
ps -ef | grep evm
kill <evm pid>
ps -ef | grep css
kill <css pid>
5. Remove the CRS installation directory:
rm -Rf <CRS Install Location>/*
6. De-install the CRS home in the Oracle Universal Installer.
7. Remove the OCR and voting files with the dd command. For example:
dd if=/dev/zero of=/dev/rdsk/V1064_vote_01_20m.dbf bs=8192 count=2560
dd if=/dev/zero of=/dev/rdsk/ocrV1064_100m.ora bs=8192 count=12800
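If you are also removing the RDBMS installation, clean up any ASM disks that were in use as well.
8. If you want to reinstall CRS, follow the installation steps again in the order described in the RAC installation manual.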
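As a quick check on the sizes in step 7: dd writes bs × count bytes, so the two example commands zero 20 MiB and 100 MiB, which matches the 20m and 100m in the example file names. A minimal shell sketch of that arithmetic follows; the helper name dd_mib is made up for illustration and is not part of the note.

```shell
#!/bin/sh
# dd_mib BS COUNT
# Prints how many MiB a "dd bs=BS count=COUNT" run writes (integer division).
# dd_mib is a hypothetical helper name, not an Oracle or dd feature.
dd_mib() {
    echo $(( $1 * $2 / 1024 / 1024 ))
}

dd_mib 8192 2560     # voting file example, prints 20
dd_mib 8192 12800    # OCR example, prints 100
```

Working the numbers out before running dd against a raw device makes it easier to confirm that the write covers the whole file and nothing beyond it.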
Example
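The note leaves this section empty. As a hedged illustration only, the sketch below checks a Linux node after steps 3 and 4 without deleting anything: it lists which of the Linux paths from step 3 still exist, and filters process listings for CRS, EVM, and CSS daemons. The function names and the optional root-prefix argument are assumptions made for this example.

```shell
#!/bin/sh
# Read-only check after a CRS cleanup on Linux; nothing is removed.
# Function names and the ROOT prefix argument are hypothetical.

# Paths taken from the Linux list in step 3.
CRS_PATHS="/etc/init.d/init.cssd /etc/init.d/init.crs /etc/init.d/init.crsd
/etc/init.d/init.evmd /etc/rc2.d/K96init.crs /etc/rc2.d/S96init.crs
/etc/rc3.d/K96init.crs /etc/rc3.d/S96init.crs /etc/rc5.d/K96init.crs
/etc/rc5.d/S96init.crs /etc/oracle/scls_scr /etc/inittab.crs"

# list_crs_leftovers [ROOT]
# Prints every step 3 path that still exists under ROOT (default: none,
# meaning the paths are checked as written).
list_crs_leftovers() {
    root="${1:-}"
    for p in $CRS_PATHS; do
        if [ -e "$root$p" ]; then
            echo "$root$p"
        fi
    done
}

# filter_crs_procs
# Reads "ps -ef" output on stdin and keeps lines that mention the
# CRS, EVM or CSS daemons, dropping the grep command itself.
filter_crs_procs() {
    grep -E 'crsd|evmd|cssd' | grep -v grep
}
```

On a node, this could be used as `list_crs_leftovers` followed by `ps -ef | filter_crs_procs`; empty output from both suggests steps 3 and 4 are complete.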
Reference Documents
<Note:239998.1> 10g RAC: How to Clean Up After a Failed CRS Install

Oracle RAC with ASM install failure

Hi guys,
I've just rebuilt an Oracle RAC system. I've built this system many times and have the build documented; I usually follow the build doc to the letter and everything works fine. I have now been asked to rebuild the system to go into the production environment. This is a standard two-server RAC setup using ASM, with HP MSA500 shared storage. I've run the cluster verification tool and all the checks came out fine. While installing the clusterware, I get a failure when it tries to run the Oracle Clusterware Configuration Assistant.
The error message i get is:
Command = C:\Windows\system 32\cmd\C Call E:\Oracle\Product\10.2.0\crs\install\crssetup.config.bat
PROT-1:Failed to intialize OCR Config
STEP 1 Checking status of CRS cluster
STEP 2 Creating directories (E:\Oracle\Product\10.2.0\Crs
STEP 3 Configuring OCR repository
ocr upgrade failed with (-1)
I've done this build many times and never seen this error. From what I've read on the internet, it looks like it could be something wrong with my shared storage, but I don't know what.
Any help would be greatly appreciated.
Thanks
Lee

Try running the 'dd' command against the OCR and voting disks to make sure their headers are completely overwritten and contain no data from the old installation:
dd if=/dev/zero of=YOUR_OCR_DISK bs=1024 count=10000
dd if=/dev/zero of=YOUR_VOTE_DISK bs=1024 count=10000

Error in ONS logs while implementing FCF on Oracle RAC from a Java program

I have a Java program on a client machine that reads its settings from a property file. While connecting to the ONS port on the Oracle RAC server to implement FCF, the program throws the following error:
java.sql.SQLException: Io exception: The Network Adapter could not establish the connection
When I checked the ONS logs for that node, they showed:
Connection 5,199.xxx.xxxxxx,8200 header RCV failed (Connection reset by peer) coFlags=1002a
These log entries are generated only when the Java program tries to connect; otherwise the daemon starts without any errors.
Sometimes, however, it connects and gives the desired output.
Please advise, and let me know if you need more information.
The Java program on the client machine is as follows:
/*
 * Oracle Support Services
 */
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.Enumeration;
import java.util.Properties;
import java.util.ResourceBundle;
import oracle.jdbc.pool.OracleConnectionCacheManager;
import oracle.jdbc.pool.OracleDataSource;

public class FCFConnectionCacheExample
{
    private OracleDataSource ods = null;
    private OracleConnectionCacheManager occm = null;
    private Properties cacheProperties = null;

    public FCFConnectionCacheExample() throws SQLException
    {
        // create a cache manager
        occm = OracleConnectionCacheManager.getConnectionCacheManagerInstance();
        // settings come from fcfcache.properties on the classpath
        Properties props = loadProperties("fcfcache");
        cacheProperties = new java.util.Properties();
        cacheProperties.setProperty("InitialLimit", (String)props.get("InitialLimit"));
        cacheProperties.setProperty("MinLimit", (String)props.get("MinLimit"));
        cacheProperties.setProperty("MaxLimit", (String)props.get("MaxLimit"));
        ods = new OracleDataSource();
        ods.setUser((String)props.get("username"));
        ods.setPassword((String)props.get("password"));
        // FCF requires the implicit connection cache plus an ONS configuration
        ods.setConnectionCachingEnabled(true);
        ods.setFastConnectionFailoverEnabled(true);
        ods.setConnectionCacheName("MyCache");
        ods.setONSConfiguration((String)props.get("onsconfig"));
        ods.setURL((String)props.get("url"));
        occm.createCache("MyCache", ods, cacheProperties);
    }

    // copy every key of the resource bundle into a Properties object
    private Properties loadProperties (String file)
    {
        Properties prop = new Properties();
        ResourceBundle bundle = ResourceBundle.getBundle(file);
        Enumeration enumlist = bundle.getKeys();
        String key = null;
        while (enumlist.hasMoreElements())
        {
            key = (String) enumlist.nextElement();
            prop.put(key, bundle.getObject(key));
        }
        return prop;
    }

    public void run() throws Exception
    {
        Connection conn = null;
        Statement stmt = null;
        ResultSet rset = null;
        String sQuery =
            "select sys_context('userenv', 'instance_name'), " +
            "sys_context('userenv', 'server_host'), " +
            "sys_context('userenv', 'service_name') " +
            "from dual";
        try
        {
            conn = ods.getConnection();
            stmt = conn.createStatement();
            rset = stmt.executeQuery(sQuery);
            rset.next();
            System.out.println("-----------");
            System.out.println("Instance -> " + rset.getString(1));
            System.out.println("Host -> " + rset.getString(2));
            System.out.println("Service -> " + rset.getString(3));
            System.out.println("NumberOfAvailableConnections: " +
                occm.getNumberOfAvailableConnections("MyCache"));
            System.out.println("NumberOfActiveConnections: " +
                occm.getNumberOfActiveConnections("MyCache"));
            System.out.println("-----------");
        }
        catch (SQLException sqle)
        {
            // walk the chain of SQL exceptions and print each cause
            while (sqle != null)
            {
                System.out.println("SQL State: " + sqle.getSQLState());
                System.out.println("Vendor Specific code: " +
                    sqle.getErrorCode());
                Throwable te = sqle.getCause();
                while (te != null) {
                    System.out.print("Throwable: " + te);
                    te = te.getCause();
                }
                sqle.printStackTrace();
                sqle = sqle.getNextException();
            }
        }
        finally
        {
            try
            {
                // any of these can still be null if getConnection() failed
                if (rset != null) rset.close();
                if (stmt != null) stmt.close();
                if (conn != null) conn.close();
            }
            catch (SQLException sqle2)
            {
                System.out.println("Error during close");
            }
        }
    }

    public static void main(String[] args)
    {
        System.out.println(">> PROGRAM using JDBC thin driver no oracle client required");
        System.out.println(">> ojdbc14.jar and ons.jar must be in the CLASSPATH");
        System.out.println(">> Press CNTRL C to exit running program\n");
        try
        {
            FCFConnectionCacheExample test = new FCFConnectionCacheExample();
            while (true)
            {
                test.run();
                Thread.sleep(10000);
            }
        }
        catch (InterruptedException e)
        {
            System.out.println("PROGRAM Ended by user");
        }
        catch (Exception ex)
        {
            System.out.println("Error Occurred in MAIN");
            ex.printStackTrace();
        }
    }
}
I have intentionally removed some of the information because it is confidential.
The property file is as follows:
# properties required for test
username=test
password=test
InitialLimit=10
MinLimit=10
MaxLimit=20
onsconfig=nodes=RAC-node1:port,RAC-node2:port
url=jdbc:oracle:thin:@(DESCRIPTION= \
(LOAD_BALANCE=yes) \
(ADDRESS=(PROTOCOL=TCP)(HOST=RAC-node1)(PORT=1521)) \
(ADDRESS=(PROTOCOL=TCP)(HOST=RAC-node1)(PORT=1521)) \
(CONNECT_DATA=(service_name=RAC_SERVICE)))

Hi,
Please check the notes below:
Link Errors While Installing CRS & RAC Database software [ID 438747.1]
Codeword File $TIMEBOMB_CWD,/opt/aCC/newconfig/aCC.cwd Missing Or Empty [ID 552893.1]
Regards,
Helios

Oracle RAC 10g on Solaris x86 using vmware

Guys,
I am practising 10g RAC installation on my laptop, on which I have VMware Workstation with
Solaris 10 x86 installed.
I am trying to install Oracle 10g Clusterware.
I have followed the steps for the clusterware installation.
I am facing an error on one of the nodes while running root.sh.
On the first node, root.sh runs fine.
The following is the log.
bash-3.00# ./root.sh
WARNING: directory '/u01/app/oracle/product' is not owned by root
WARNING: directory '/u01/app/oracle' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
Checking to see if Oracle CRS stack is already configured
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/u01/app/oracle/product' is not owned by root
WARNING: directory '/u01/app/oracle' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: xsan001 xsan001-priv xsan001
node 2: xsan002 xsan002-priv xsan002
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /dev/rdsk/c0d0s4
Format of 1 voting devices complete.
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
CSS is active on these nodes.
xsan001
CSS is inactive on these nodes.
xsan002
Local node checking complete.
Run root.sh on remaining nodes to start CRS daemons.
=============================
On the second node, the root.sh gives error.
bash-3.00# ./root.sh
WARNING: directory '/u01/app/oracle/product' is not owned by root
WARNING: directory '/u01/app/oracle' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
Checking to see if Oracle CRS stack is already configured
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/u01/app/oracle/product' is not owned by root
WARNING: directory '/u01/app/oracle' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: xsan001 xsan001-priv xsan001
node 2: xsan002 xsan002-priv xsan002
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /dev/rdsk/c0d0s4
Format of 1 voting devices complete.
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10
Can you provide some sort of clue what could be done to resolve the above error?

Refer to http://www.oracleracsig.org/. Under the Documents section, select OS=Solaris; you will find a document on Oracle RAC on Solaris 10 using VMware.

Does /sapmnt need to be on a cluster file system (SAP ECC 6.0 with Oracle RAC)?

We are going to be installing SAP with Oracle 10.2.0.4 RAC on Linux SuSE 10 and OCFS2. The Oracle RAC documentation states:
You must store the following components in the cluster file system when you use RAC
in the SAP environment:
- Oracle Clusterware (CRS) Home
- Oracle RDBMS Home
- SAP Home (also /sapmnt)
- Voting Disks
- OCR
- Database
What I want to ask is whether I really need to put SAP Home (also /sapmnt) on a cluster file system. I will build a two-node Oracle 10g RAC, and I also have another two nodes on which to install the SAP CI and DI. My original idea is that /sapmnt is an NFS share mounted on all four nodes (the RAC nodes and CI/DI), and that all the Oracle components are on OCFS2 (only the two RAC nodes use OCFS2). Can anybody tell me whether SAP Home (also /sapmnt) can be an NFS mount rather than OCFS2? Thanks.
Best regards,
Peter

Hi Peter,
I don't think you need to keep /sapmnt on OCFS2. The reason a file system needs to be clustered in a RAC environment is that the instances share the same database files: data stored in the cache of one Oracle instance can be accessed by any other instance by transferring it across the private network, and data integrity and cache coherency are preserved by transmitting locking and other synchronization information across the cluster nodes.
As this applies only to redo log files, datafiles, and control files, you should be fine with an NFS mount of /sapmnt shared across the nodes, without OCFS2.
-SV

Failover not happening in Oracle RAC 10g

Hi All,
I am new to RAC.
I have installed Oracle RAC 10g on Red Hat Linux 4.0. Until yesterday failover was working: when I stopped one instance on node01, the VIP of node01 was transferred to node02, as shown by ifconfig -a. Now that is not happening, and I don't know what has changed. Can you please help me out?
Below information is given:
[oracle@node01 ~]$ crs_stat -t
Name Type Target State Host
ora.hitesh.db application ONLINE ONLINE node02
ora....h1.inst application ONLINE ONLINE node01
ora....h2.inst application OFFLINE OFFLINE
ora....SM1.asm application ONLINE ONLINE node01
ora....01.lsnr application ONLINE ONLINE node01
ora.node01.gsd application ONLINE ONLINE node01
ora.node01.ons application ONLINE ONLINE node01
ora.node01.vip application ONLINE ONLINE node01
ora....SM2.asm application ONLINE ONLINE node02
ora....02.lsnr application ONLINE ONLINE node02
ora.node02.gsd application ONLINE ONLINE node02
ora.node02.ons application ONLINE ONLINE node02
ora.node02.vip application ONLINE ONLINE node02
Listener status on node01 is given below:
[oracle@node01 ~]$ lsnrctl status
LSNRCTL for Linux: Version 10.2.0.1.0 - Production on 06-APR-2013 12:59:29
Copyright (c) 1991, 2005, Oracle. All rights reserved.
Connecting to (ADDRESS=(PROTOCOL=tcp)(HOST=)(PORT=1521))
STATUS of the LISTENER
Alias LISTENER_NODE01
Version TNSLSNR for Linux: Version 10.2.0.1.0 - Production
Start Date 06-APR-2013 11:59:03
Uptime 0 days 1 hr. 0 min. 25 sec
Trace Level off
Security ON: Local OS Authentication
SNMP OFF
Listener Parameter File /home/oracle/oracle/product/10.2.0/db_1/network/admin/listener.ora
Listener Log File /home/oracle/oracle/product/10.2.0/db_1/network/log/listener_node01.log
Listening Endpoints Summary...
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=192.168.1.131)(PORT=1521)))
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=127.0.0.1)(PORT=1521)))
Services Summary...
Service "+ASM" has 1 instance(s).
Instance "+ASM1", status BLOCKED, has 1 handler(s) for this service...
Service "+ASM_XPT" has 1 instance(s).
Instance "+ASM1", status BLOCKED, has 1 handler(s) for this service...
Service "PLSExtProc" has 1 instance(s).
Instance "PLSExtProc", status UNKNOWN, has 1 handler(s) for this service...
Service "hitesh" has 2 instance(s).
Instance "hitesh1", status READY, has 2 handler(s) for this service...
Instance "hitesh2", status READY, has 1 handler(s) for this service...
Service "hiteshXDB" has 2 instance(s).
Instance "hitesh1", status READY, has 1 handler(s) for this service...
Instance "hitesh2", status READY, has 1 handler(s) for this service...
Service "hitesh_XPT" has 2 instance(s).
Instance "hitesh1", status READY, has 2 handler(s) for this service...
Instance "hitesh2", status READY, has 1 handler(s) for this service...
The command completed successfully
[root@node01 oracle]# crsctl check crs
CSS appears healthy
CRS appears healthy
EVM appears healthy
[root@node01 oracle]# ps -ef | grep lmon
oracle 5741 1 0 12:07 ? 00:00:03 ora_lmon_hitesh1
root 22582 20805 0 13:01 pts/2 00:00:00 grep lmon
oracle 23643 1 0 11:58 ? 00:00:01 asm_lmon_+ASM1
Please let me know what other information is required.
Edited by: user12924280 on Apr 6, 2013 12:36 AM

Since you didn't say "thank you", I assumed my time was of no value to you.
However, I shall try again.
There is no relationship between instance failure and VIP failover. How can there be? What if you are running ten instances on each node, and one fails? Would you want the VIP to relocate? And I've already told you how to test it: kill the node. Just reboot it.
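To see whether a VIP has actually relocated, it can help to look at which host each VIP resource is running on rather than at the instance state. A minimal sketch, assuming the `crs_stat -t` column layout shown in the question above (name, type, target, state, host); the function name vip_hosts is made up for this example.

```shell
#!/bin/sh
# vip_hosts
# Reads "crs_stat -t" output on stdin and prints "<resource> <host>" for
# every VIP resource that reports a host. Hypothetical helper; it assumes
# the five-column layout shown in the question.
vip_hosts() {
    awk '$1 ~ /\.vip$/ && NF >= 5 { print $1, $NF }'
}
```

It would be run as `crs_stat -t | vip_hosts`. If the VIP has failed over after a node goes down, the host printed for that node's VIP resource should be the surviving node.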

rconfig fails when migrating a single instance with ASM to Oracle RAC 11.2.0.3

We have completed the following steps so far to migrate a single instance with ASM 11.2.0.3 on Oracle Linux to RAC:
1. Installed RAC CRS 11.2.0.3 in GRID_HOME
2. Installed the RAC database 11.2.0.3 software in a new ORACLE_HOME
3. Added the ASM disk groups to the two-node ASM instances
4. Ran rconfig from the target RAC ORACLE_HOME
It fails on:
[RACConvertStep.execute:193] STEP Result=Oracle home is not a cluster database home. Make sure that oracle binary is linked with RAC option and the Oracle home entry in Oracle inventory is updated with cluster node list.
From rconfig:
$ rconfig convert.xml
<?xml version="1.0" ?>
<RConfig version="1.1" >
<ConvertToRAC>
<Convert>
<Response>
<Result code="1" >
Operation Failed
</Result>
<ErrorDetails>
Oracle home is not a cluster database home. Make sure that oracle binary is linked with RAC option and the Oracle home entry in Oracle inventory is updated with cluster node list.
</ErrorDetails>
</Response>
</Convert>
</ConvertToRAC></RConfig>
5. the orainventory inventory xml file has:
6. We have this:
Source database
ORACLE_BASE=/oracle
ORACLE_HOME=/oracle/db/112
3- target database configuration
ORACLE_BASE=/oracle/db
ORACLE_HOME=/oracle/db/11.2.0.3
4- RAC GRID_HOME
ORACLE_BASE=/oracle
GRID_HOME=/oracle/11.2.0/grid
Has anyone seen this issue before with migrating with rconfig and if so, how did you resolve it?
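The error text points at two things to check: whether the oracle binary is linked with the RAC option, and whether the home's entry in the central inventory carries a node list. For the second, a rough sketch follows. It assumes the usual inventory.xml layout, in which each HOME element has a LOC attribute and contains NODE NAME entries inside a NODE_LIST; the function name and the inventory path in the usage line are assumptions, so check them against your own system.

```shell
#!/bin/sh
# list_home_nodes INVENTORY_XML HOME_LOC
# Prints the node names recorded for the Oracle home at HOME_LOC.
# Hypothetical helper; it assumes one XML element per line, with
# <HOME ... LOC="..."> followed by <NODE NAME="..."/> lines.
list_home_nodes() {
    awk -v loc="$2" '
        /<HOME / { in_home = (index($0, "LOC=\"" loc "\"") > 0) }
        in_home && /<NODE NAME=/ {
            n = $0
            sub(/.*<NODE NAME="/, "", n)
            sub(/".*/, "", n)
            print n
        }
        /<\/HOME>/ { in_home = 0 }
    ' "$1"
}
```

A call might look like `list_home_nodes /oracle/oraInventory/ContentsXML/inventory.xml /oracle/db/11.2.0.3`, where the inventory location is a placeholder. No output would mean the home has no node list in the inventory, which is consistent with the rconfig message.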

Good to know. Thanks for updating the thread with what was wrong; this will help the community if someone else faces this issue.

Oracle RAC 10g on VMware

I am installing Oracle RAC on two virtual machines running RHEL 3 U7. I have been able to install RAC and the database software. However, when I run dbca to configure ASM, I get:
Error When starting ASM instance on node node1
PRKS-1009: Failed to start ASM instance +ASM2 on node1
last line repeats
CRS-1028: Dependency analysis failed because of:
CRS-0223: Resource ora.node1.ASM2.asm has placement errors.
I ran ifconfig and there appears to be a VIP on both servers. Is there a reason, solution, or workaround for this issue? Thank you in advance for your help.

Hello testing33,
I am trying to install Oracle 10g on VMware (RHEL 3 guest).
I am using the cookbook instructions provided on the web site.
I am having a problem at stage 4, at the very end of installing the Oracle 10g CRS CD.
What instructions did you use to install Oracle 10g?
Thanks for your help,
Natalie
