Oracle10g RAC 구성에서 CRS 오류로 Oracle이 재시작 되는 원인

oracle10g Release 10.2.0.1.0, SunOS(sparc, Sun-Fire-V890) 2대로 RAC 구성,
Oracle CRS를 사용하고 있습니다.
시스템이 정상적으로 운영되고 있어, alert 로그 점검을 소홀히 하고 있었는데,
오늘 아래와 같이 alert 로그를 점검 결과 최근 3개월 동안 약 3회정도 오라클이 CRS에 의하여 Shutdown 되었다가 Startup 되었습니다.
CRS 에러가 왜 발생되었는지, 어찌 하면 해결할 수 있는지 알고 싶습니다.
고수님의 조언 부탁드립니다.
/oracle/app/oracle/admin/POPORA/bdump/alert_POPORA2.log
Wed Jan 17 06:49:17 2007
Shutting down instance (abort)
Starting ORACLE instance (normal)
Fri Mar 2 08:35:50 2007
Shutting down instance (abort)
Starting ORACLE instance (normal)
Thu Apr 12 03:15:16 2007
Shutting down instance (abort)
Starting ORACLE instance (normal)
/oracle/app/oracle/product/10g/crs/log/popsvr2/cssd/ocssd.log
2007-01-17 06:49:12.834: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [27591024] retval lht [-27] Signal CV.
2007-01-17 06:49:15.577: [ CRSAPP][66882] CheckResource error for ora.POPORA.POPORA2.inst error code = 138
2007-01-17 06:49:15.601: [ CRSRES][66882] In stateChanged, ora.POPORA.POPORA2.inst target is ONLINE
2007-01-17 06:49:15.601: [ CRSRES][66882] ora.POPORA.POPORA2.inst on popsvr2 went OFFLINE unexpectedly
2007-01-17 06:49:15.602: [ CRSRES][66882] StopResource: setting CLI values
2007-01-17 06:49:15.609: [ CRSRES][66882] Attempting to stop `ora.POPORA.POPORA2.inst` on member `popsvr2`
2007-01-17 06:49:16.021: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [27036080] retval lht [-27] Signal CV.
2007-01-17 06:49:19.202: [ CRSRES][66882] Stop of `ora.POPORA.POPORA2.inst` on member `popsvr2` succeeded.
2007-01-17 06:49:19.202: [ CRSRES][66882] ora.POPORA.POPORA2.inst RESTART_COUNT=0 RESTART_ATTEMPTS=5
2007-01-17 06:49:19.202: [ CRSRES][66882] Restarting ora.POPORA.POPORA2.inst on popsvr2
2007-01-17 06:49:19.209: [ CRSRES][66882] startRunnable: setting CLI values
2007-01-17 06:49:19.210: [ CRSRES][66882] Attempting to start `ora.POPORA.POPORA2.inst` on member `popsvr2`
2007-01-17 06:49:19.431: [ OCRUTL][37]u_freem: mem passed is null
2007-01-17 06:49:19.882: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [27589600] retval lht [-27] Signal CV.
2007-01-17 06:49:19.931: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [26869536] retval lht [-27] Signal CV.
2007-01-17 06:49:19.966: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [26844992] retval lht [-27] Signal CV.
2007-01-17 06:49:21.179: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [27080608] retval lht [-27] Signal CV.
2007-01-17 06:49:21.247: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [27604336] retval lht [-27] Signal CV.
2007-01-17 06:49:25.027: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [26856384] retval lht [-27] Signal CV.
2007-01-17 06:49:25.477: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [26830032] retval lht [-27] Signal CV.
2007-01-17 06:49:25.546: [ OCRUTL][36]u_freem: mem passed is null
2007-01-17 06:49:33.042: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [28019888] retval lht [-27] Signal CV.
2007-01-17 06:49:37.179: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [23224304] retval lht [-27] Signal CV.
2007-01-17 06:49:37.477: [ CRSRES][66882] Start of `ora.POPORA.POPORA2.inst` on member `popsvr2` succeeded.
2007-01-17 06:49:37.478: [ CRSRES][66882] Successfully restarted ora.POPORA.POPORA2.inst on popsvr2, RESTART_COUNT=1
2007-01-17 06:49:37.504: [ CRSRES][66882] ora.POPORA.POPORA2.inst Updated LAST_RESTART time in ocr
2007-01-17 06:49:42.472: [ OCRUTL][33]u_freem: mem passed is null
2007-01-17 06:50:21.720: [ OCRSRV][23]th_select_handler: Failed to retrieve procctx from ht. constr = [26800336] retval lht [-27] Signal CV.
2007-03-02 08:35:44.002: [ CRSAPP][63658] CheckResource error for ora.POPORA.POPORA2.inst error code = 138
2007-03-02 08:35:44.021: [ CRSRES][63658] In stateChanged, ora.POPORA.POPORA2.inst target is ONLINE
2007-03-02 08:35:44.027: [ CRSRES][63658] ora.POPORA.POPORA2.inst on popsvr2 went OFFLINE unexpectedly
2007-03-02 08:35:44.028: [ CRSRES][63658] StopResource: setting CLI values
2007-03-02 08:35:44.035: [ CRSRES][63658] Attempting to stop `ora.POPORA.POPORA2.inst` on member `popsvr2`
2007-03-02 08:35:44.479: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [28011088] retval lht [-27] Signal CV.
2007-03-02 08:35:53.136: [ CRSRES][63658] Stop of `ora.POPORA.POPORA2.inst` on member `popsvr2` succeeded.
2007-03-02 08:35:53.137: [ CRSRES][63658] ora.POPORA.POPORA2.inst RESTART_COUNT=0 RESTART_ATTEMPTS=5
2007-03-02 08:35:53.137: [ CRSRES][63658] Restarting ora.POPORA.POPORA2.inst on popsvr2
2007-03-02 08:35:53.144: [ CRSRES][63658] startRunnable: setting CLI values
2007-03-02 08:35:53.145: [ CRSRES][63658] Attempting to start `ora.POPORA.POPORA2.inst` on member `popsvr2`
2007-03-02 08:35:53.797: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [20704688] retval lht [-27] Signal CV.
2007-03-02 08:35:53.852: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [20704688] retval lht [-27] Signal CV.
2007-03-02 08:35:53.889: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [20704688] retval lht [-27] Signal CV.
2007-03-02 08:35:58.120: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27603248] retval lht [-27] Signal CV.
2007-03-02 08:36:03.561: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27092528] retval lht [-27] Signal CV.
2007-03-02 08:36:03.666: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27029472] retval lht [-27] Signal CV.
2007-03-02 08:36:10.412: [ CRSRES][63658] Start of `ora.POPORA.POPORA2.inst` on member `popsvr2` succeeded.
2007-03-02 08:36:10.413: [ CRSRES][63658] Successfully restarted ora.POPORA.POPORA2.inst on popsvr2, RESTART_COUNT=1
2007-03-02 08:36:10.429: [ CRSRES][63658] ora.POPORA.POPORA2.inst Updated LAST_RESTART time in ocr
2007-03-02 08:36:15.363: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27101328] retval lht [-27] Signal CV.
2007-04-12 03:15:10.436: [ CRSAPP][184389] CheckResource error for ora.POPORA.POPORA2.inst error code = 138
2007-04-12 03:15:10.452: [ CRSRES][184389] In stateChanged, ora.POPORA.POPORA2.inst target is ONLINE
2007-04-12 03:15:10.455: [ CRSRES][184389] ora.POPORA.POPORA2.inst on popsvr2 went OFFLINE unexpectedly
2007-04-12 03:15:10.455: [ CRSRES][184389] StopResource: setting CLI values
2007-04-12 03:15:10.462: [ CRSRES][184389] Attempting to stop `ora.POPORA.POPORA2.inst` on member `popsvr2`
2007-04-12 03:15:10.932: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27571696] retval lht [-27] Signal CV.
2007-04-12 03:15:19.646: [ CRSRES][184389] Stop of `ora.POPORA.POPORA2.inst` on member `popsvr2` succeeded.
2007-04-12 03:15:19.647: [ CRSRES][184389] ora.POPORA.POPORA2.inst RESTART_COUNT=1 RESTART_ATTEMPTS=5
2007-04-12 03:15:19.647: [ CRSRES][184389] ora.POPORA.POPORA2.inst Uptime exceeds uptime_threshold, resetting RC
2007-04-12 03:15:19.647: [ CRSRES][184389] Restarting ora.POPORA.POPORA2.inst on popsvr2
2007-04-12 03:15:19.663: [ CRSRES][184389] startRunnable: setting CLI values
2007-04-12 03:15:19.664: [ CRSRES][184389] Attempting to start `ora.POPORA.POPORA2.inst` on member `popsvr2`
2007-04-12 03:15:20.330: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27106752] retval lht [-27] Signal CV.
2007-04-12 03:15:20.383: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27106752] retval lht [-27] Signal CV.
2007-04-12 03:15:20.414: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27106752] retval lht [-27] Signal CV.
2007-04-12 03:15:21.610: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27106752] retval lht [-27] Signal CV.
2007-04-12 03:15:22.431: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27945248] retval lht [-27] Signal CV.
2007-04-12 03:15:26.226: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [26854336] retval lht [-27] Signal CV.
2007-04-12 03:15:26.702: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27089648] retval lht [-27] Signal CV.
2007-04-12 03:15:31.927: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27089472] retval lht [-27] Signal CV.
2007-04-12 03:15:32.049: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [26866656] retval lht [-27] Signal CV.
2007-04-12 03:15:32.145: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27425536] retval lht [-27] Signal CV.
2007-04-12 03:15:32.339: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [27030912] retval lht [-27] Signal CV.
2007-04-12 03:15:33.450: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [26824976] retval lht [-27] Signal CV.
2007-04-12 03:15:34.500: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [26825120] retval lht [-27] Signal CV.
2007-04-12 03:15:40.034: [ CRSRES][184389] Start of `ora.POPORA.POPORA2.inst` on member `popsvr2` succeeded.
2007-04-12 03:15:40.035: [ CRSRES][184389] Successfully restarted ora.POPORA.POPORA2.inst on popsvr2, RESTART_COUNT=1
2007-04-12 03:15:40.055: [ CRSRES][184389] ora.POPORA.POPORA2.inst Updated LAST_RESTART time in ocr
2007-04-12 03:16:33.965: [ OCRSRV][19]th_select_handler: Failed to retrieve procctx from ht. constr = [26840832] retval lht [-27] Signal CV.
2
........................

inst error code = 138로 보면 Bug 4556989 인 것 같네요..
10.2.0.2 이상으로 patch 하시면 됩니다.

Similar Messages

  • Oracle10g RAC with Oracle clusterware

    Hi,
    We are installing Oracle10g RAC with Oracle clusterware with ASM on Solaris 10. Does anybody have a installation/configuration manual for this combination.
    Thanks,
    Murtuja

    Sun would not recommend using Oracle 10g RAC without Sun Cluster. Setup for that configuration would be found in the Sun Cluster Data Service for Oracle 10g RAC and the Oracle installation manuals.
    The installation of Oracle Clusterware alone would be documented in Oracle's manuals. If you have any problems with that, you are better off asking Oracle.
    Regards,
    Tim
    ---

  • Why we need CRS in Oracle10g RAC

    When i am having third party cluster solution like Veritas(SFRAC for Oracle), why its not possible to configure Oracle10g RAC with Veritas without CRS ?????
    Its possible in Oracle9i but why not in Oracle10g ????

    because ASM is there !!!
    in 10g you do not have to use raw devices to store data files : you just use the ASM instance which comes free and provides the performance of the raw devices with the management capabilities of any filesystem
    you only need to configure raw devices for the OCR and Voting Disk (but this is really simple !!! compared to the management problem of storing datafiles on raw (no autoextend , mapping and so on)

  • Oracle RAC crs无法启动的问题

    这两个节点的RAC是做为DataGuard备库。
    版本:Red Linux 5.6,Oracle 10.2.0.3.0
    node1->$ crsctl check crs
    CSS appears healthy
    Cannot communicate with CRS
    EVM appears healthy
    node1->$ crsctl query css votedisk
    0. 0 /dev/raw/raw1
    located 1 votedisk(s).
    node1->$ ocrcheck
    Status of Oracle Cluster Registry is as follows :
    Version : 2
    Total space (kbytes) : 497744
    Used space (kbytes) : 3820
    Available space (kbytes) : 493924
    ID : 1682116375
    Device/File Name : /dev/raw/raw4
    Device/File integrity check succeeded
    Device/File not configured
    Cluster registry integrity check succeeded
    # *./oifcfg getif*
    eth0 10.17.19.0 global cluster_interconnect
    eth1 172.17.19.0 global public
    # */etc/init.d/init.crs start*
    node1->$ ps -ef|grep crs
    root 5083 1 0 15:10 ? 00:00:00 /bin/su -l oracle -c sh -c 'ulimit -c unlimited; cd /app/oracle/product/10.2.0/crs_1/log/node1/evmd; exec /app/oracle/product/10.2.0/crs_1/bin/evmd '
    oracle 17459 4769 0 16:09 pts/1 00:00:00 grep crs
    oracle 26397 5083 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/evmd.bin
    root 26619 26370 0 15:51 ? 00:00:00 /bin/su -l oracle -c /bin/sh -c 'cd /app/oracle/product/10.2.0/crs_1/log/node1/cssd/oclsomon; ulimit -c unlimited; /app/oracle/product/10.2.0/crs_1/bin/oclsomon || exit $?'
    oracle 26626 26619 0 15:51 ? 00:00:00 /bin/sh -c cd /app/oracle/product/10.2.0/crs_1/log/node1/cssd/oclsomon; ulimit -c unlimited; /app/oracle/product/10.2.0/crs_1/bin/oclsomon || exit $?
    oracle 26672 26626 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/oclsomon.bin
    oracle 26691 26371 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/ocssd.bin
    oracle 27094 26397 0 15:51 ? 00:00:00 /app/oracle/product/10.2.0/crs_1/bin/evmlogger.bin -o /app/oracle/product/10.2.0/crs_1/evm/log/evmlogger.info -l /app/oracle/product/10.2.0/crs_1/evm/log/evmlogger.log
    alertnode1.log 文件部份内容:
    2012-11-13 15:51:07.152
    [cssd(26691)]CRS-1605:CSSD voting file is online: /dev/raw/raw1. Details in /app/oracle/product/10.2.0/crs_1/log/node1/cssd/ocssd.log.
    2012-11-13 15:51:08.084
    [cssd(26691)]CRS-1601:CSSD Reconfiguration complete. Active nodes are node1 node2 .
    2012-11-13 15:51:08.320
    [evmd(26397)]CRS-1401:EVMD started on node node1.
    ocssd.log 文件内容:
    [    CSSD]2012-11-13 15:51:05.037 >USER: Oracle Database 10g CSS Release 10.2.0.3.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
    [    CSSD]2012-11-13 15:51:05.037 >USER: CSS daemon log for node node1, number 1, in cluster crs
    [    CSSD]2012-11-13 15:51:05.040 [2246605696] >TRACE: clssscmain: local-only set to false
    [  clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=node1DBG_CSSD))
    [    CSSD]2012-11-13 15:51:05.065 [2246605696] >TRACE: clssnmReadNodeInfo: added node 1 (node1) to cluster
    [    CSSD]2012-11-13 15:51:05.074 [2246605696] >TRACE: clssnmReadNodeInfo: added node 2 (node2) to cluster
    [    CSSD]2012-11-13 15:51:05.077 [1120115008] >TRACE: clssnm_skgxnmon: skgxn init failed
    [    CSSD]2012-11-13 15:51:05.077 [2246605696] >TRACE: clssnm_skgxnonline: Using vacuous skgxn monitor
    [    CSSD]2012-11-13 15:51:05.079 [2246605696] >TRACE: clssnmNMInitialize: misscount set to (60), impending reconfig threshold set to (56000)
    [    CSSD]2012-11-13 15:51:05.079 [2246605696] >TRACE: clssnmNMInitialize: diskShortTimeout set to (57000)ms
    [    CSSD]2012-11-13 15:51:05.080 [2246605696] >TRACE: clssnmNMInitialize: diskLongTimeout set to (200000)ms
    [    CSSD]2012-11-13 15:51:05.082 [2246605696] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0//dev/raw/raw1)
    [    CSSD]2012-11-13 15:51:05.082 [1120115008] >TRACE: clssnmvDPT: spawned for disk 0 (/dev/raw/raw1)
    [    CSSD]2012-11-13 15:51:07.127 [1120115008] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/raw/raw1)
    [    CSSD]2012-11-13 15:51:07.153 [1130604864] >TRACE: clssnmvKillBlockThread: spawned for disk 0 (/dev/raw/raw1) initial sleep interval (1000)ms
    [    CSSD]2012-11-13 15:51:07.161 [2246605696] >TRACE: clssnmFatalInit: fatal mode enabled
    [    CSSD]2012-11-13 15:51:07.161 [1151584576] >TRACE: clssnmconnect: connecting to node 1, flags 0x0001, connector 1
    [    CSSD]2012-11-13 15:51:07.161 [1120115008] >TRACE: clssnmReadDskHeartbeat: node(2) is down. rcfg(12) wrtcnt(78619) LATS(1830084) Disk lastSeqNo(78619)
    [    CSSD]2012-11-13 15:51:07.162 [1151584576] >TRACE: clssnmClusterListener: Listening on (ADDRESS=(PROTOCOL=tcp)(HOST=node1-priv)(PORT=49895))
    [    CSSD]2012-11-13 15:51:07.162 [1151584576] >TRACE: clssnmconnect: connecting to node 0, flags 0x0000, connector 1
    [    CSSD]2012-11-13 15:51:07.162 [1151584576] >TRACE: clssnmClusterListener: Probing node 2, con (0x2aaaac10c320)
    [    CSSD]2012-11-13 15:51:07.171 [1162074432] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=Oracle_CSS_LclLstnr_crs_1))
    [    CSSD]2012-11-13 15:51:07.171 [1162074432] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_node1_crs))
    [    CSSD]2012-11-13 15:51:07.172 [1193544000] >TRACE: clssgmPeerListener: Listening on (ADDRESS=(PROTOCOL=tcp)(DEV=19)(HOST=10.17.19.20)(PORT=18701))
    [    CSSD]2012-11-13 15:51:07.198 [1151584576] >TRACE: clssnmConnComplete: connected to node 2 (con 0x2aaaac163b50), state 3 birth 0, unique 1352712566/1352712566 prevConuni(0)
    [    CSSD]2012-11-13 15:51:07.673 [1204033856] >TRACE: clssnmPollingThread: Connection complete
    [    CSSD]2012-11-13 15:51:07.673 [1214523712] >TRACE: clssnmSendingThread: Connection complete
    [    CSSD]2012-11-13 15:51:07.673 [1225013568] >TRACE: clssnmRcfgMgrThread: Connection complete
    [    CSSD]2012-11-13 15:51:08.003 [1151584576] >TRACE: clssnmHandleSync: Acknowledging sync: src[2] srcName[node2] seq[45] sync[12]
    [    CSSD]2012-11-13 15:51:08.003 [1151584576] >TRACE: clssnmHandleSync: diskTimeout set to (57000)ms
    [    CSSD]2012-11-13 15:51:08.003 [1151584576] >TRACE: clssnmSendVoteInfo: node(2) syncSeqNo(12)
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmUpdateNodeState: node 0, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmDeactivateNode: node 0 () left cluster
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmUpdateNodeState: node 1, state (1/2) unique (1352793064/1352793064) prevConuni(0) birth (0/12) (old/new)
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmUpdateNodeState: node 2, state (4/3) unique (1352712566/1352712566) prevConuni(0) birth (0/1) (old/new)
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >USER: clssnmHandleUpdate: SYNC(12) from node(2) completed
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >USER: clssnmHandleUpdate: NODE 1 (node1) IS ACTIVE MEMBER OF CLUSTER
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >USER: clssnmHandleUpdate: NODE 2 (node2) IS ACTIVE MEMBER OF CLUSTER
    [    CSSD]2012-11-13 15:51:08.004 [1151584576] >TRACE: clssnmHandleUpdate: diskTimeout set to (200000)ms
    [    CSSD]2012-11-13 15:51:08.081 [2246605696] >USER: NMEVENT_SUSPEND [00][00][00][00]
    [    CSSD]2012-11-13 15:51:08.081 [1235503424] >TRACE: clssgmReconfigThread: started for reconfig (12)
    [    CSSD]2012-11-13 15:51:08.081 [1235503424] >USER: NMEVENT_RECONFIG [00][00][00][06]
    [    CSSD]2012-11-13 15:51:08.081 [1235503424] >TRACE: clssgmEstablishConnections: 2 nodes in cluster incarn 12
    [    CSSD]2012-11-13 15:51:08.082 [1193544000] >TRACE: clssgmInitialRecv: (0xd9ae050) accepted a new connection from node 2 born at 1 active (2, 2), vers (10,3,1,2)
    [    CSSD]2012-11-13 15:51:08.082 [1193544000] >TRACE: clssgmInitialRecv: conns done (2/2)
    [    CSSD]2012-11-13 15:51:08.082 [1235503424] >TRACE: clssgmEstablishMasterNode: MASTER for 12 is node(2) birth(1)
    [    CSSD]2012-11-13 15:51:08.082 [1235503424] >TRACE: clssgmChangeMasterNode: requeued 0 RPCs
    [    CSSD]2012-11-13 15:51:08.083 [1193544000] >TRACE: clssgmHandleDBDone(): src/dest (2/65535) size(72) incarn 12
    [    CSSD]CLSS-3000: reconfiguration successful, incarnation 12 with 2 nodes
    [    CSSD]CLSS-3001: local node number 1, master node number 2
    [    CSSD]2012-11-13 15:51:08.084 [1235503424] >TRACE: clssgmReconfigThread: completed for reconfig(12), with status(1)
    [    CSSD]2012-11-13 15:51:08.268 [1162074432] >TRACE: clssgmClientConnectMsg: Connect from con(0xd9b4d50) proc(0xd9b9d50) pid() proto(10:2:1:1)
    [    CSSD]2012-11-13 15:51:08.268 [1193544000] >TRACE: clssgmCommonAddMember: clsomon joined (1/0x1000000/#CSS_CLSSOMON)
    [    CSSD]2012-11-13 15:51:08.269 [1162074432] >TRACE: clssgmClientConnectMsg: Connect from con(0xd9b7910) proc(0xd9ba0a0) pid() proto(10:2:1:1)
    查看ocr,表决磁盘,存储,网络,裸设备权限,都没有发现问题,有时候执行/etc/init.d/init.crs start还会导致服务器重启,日志内容如下:
    /var/log/message重启时的日志
    Nov 13 15:51:03 node1 logger: Cluster Ready Services completed waiting on dependencies.
    Nov 13 15:51:03 node1 logger: Cluster Ready Services completed waiting on dependencies.
    Nov 13 16:10:54 node1 auditd[3667]: Audit daemon rotating log files
    Nov 13 16:49:14 node1 auditd[3667]: Audit daemon rotating log files
    Nov 13 16:50:37 node1 root: Cluster Ready Services completed waiting on dependencies.
    Nov 13 16:52:07 node1 logger: Oracle CSS family monitor shutting down. 3
    Nov 13 16:52:07 node1 root: Oracle CRSD 5797 set to stop
    Nov 13 16:52:07 node1 root: Oracle CRSD 5797 shutdown completed
    Nov 13 16:52:07 node1 root: Oracle EVMD set to stop
    Nov 13 16:52:07 node1 root: Oracle CSSD being stopped
    Nov 13 16:52:17 node1 root: Oracle CSSD being stopped
    Nov 13 16:52:27 node1 root: Oracle EVMD set to stop
    Nov 13 16:52:45 node1 root: Oracle CSSD being stopped
    Nov 13 17:03:14 node1 root: Oracle CRSD 5797 set to stop
    Nov 13 17:03:14 node1 root: Oracle CRSD 5797 shutdown completed
    Nov 13 17:03:14 node1 root: Oracle EVMD set to stop
    Nov 13 17:03:14 node1 root: Oracle CSSD being stopped
    Nov 13 17:03:26 node1 root: Oracle Cluster Ready Services starting by user request.
    Nov 13 17:03:35 node1 logger: Cluster Ready Services completed waiting on dependencies.
    Nov 13 17:03:36 node1 logger: Oracle CSSD shell script failure. Duplicate CSSD.
    Nov 13 17:03:36 node1 kernel: md: stopping all md devices.
    Nov 13 17:21:49 node1 syslogd 1.4.1: restart.
    Nov 13 17:21:49 node1 kernel: klogd 1.4.1, log source = /proc/kmsg started.
    出现 Nov 13 17:03:36 node1 logger: Oracle CSSD shell script failure. Duplicate CSSD. 之后,服务器就重启了
    在网上查了不少类似问题,其他网友无法启动CRS主要集中在几个方面:
    1、/tmp权限不正确
    2、删除/var/tmp/.oracle下的文件,再重启
    3、oifcfg查看到网卡设置问题
    但我遇到的问题,以上3项都是正常的,跟这个http://www.itpub.net/thread-1330782-1-1.html 问题类似。
    请问这个问题是什么原因导致的?
    帖子经 user1738965编辑过
    帖子经 user1738965编辑过

    关掉第1个节点,重启第2个节点,
    crsd.log文件还是没有写入任何信息
    alertnode2.log 部份日志
    2012-11-14 12:57:53.568
    [cssd(10296)]CRS-1605:CSSD voting file is online: /dev/raw/raw1. Details in /app/oracle/product/10.2.0/crs_1/log/node2/cssd/ocssd.log.
    2012-11-14 13:01:13.616
    [cssd(10296)]CRS-1601:CSSD Reconfiguration complete. Active nodes are node2 .
    2012-11-14 13:01:13.776
    [evmd(10080)]CRS-1401:EVMD started on node node2.
    ocssd.log 部份日志
    [    CSSD]2012-11-14 12:57:51.475 >USER: Oracle Database 10g CSS Release 10.2.0.3.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
    [  clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=node2DBG_CSSD))
    [    CSSD]2012-11-14 12:57:51.475 >USER: CSS daemon log for node node2, number 2, in cluster crs
    [    CSSD]2012-11-14 12:57:51.482 [1618381696] >TRACE: clssscmain: local-only set to false
    [    CSSD]2012-11-14 12:57:51.496 [1618381696] >TRACE: clssnmReadNodeInfo: added node 1 (node1) to cluster
    [    CSSD]2012-11-14 12:57:51.500 [1618381696] >TRACE: clssnmReadNodeInfo: added node 2 (node2) to cluster
    [    CSSD]2012-11-14 12:57:51.503 [1105389888] >TRACE: clssnm_skgxnmon: skgxn init failed
    [    CSSD]2012-11-14 12:57:51.503 [1618381696] >TRACE: clssnm_skgxnonline: Using vacuous skgxn monitor
    [    CSSD]2012-11-14 12:57:51.505 [1618381696] >TRACE: clssnmNMInitialize: misscount set to (60), impending reconfig threshold set to (56000)
    [    CSSD]2012-11-14 12:57:51.505 [1618381696] >TRACE: clssnmNMInitialize: diskShortTimeout set to (57000)ms
    [    CSSD]2012-11-14 12:57:51.506 [1618381696] >TRACE: clssnmNMInitialize: diskLongTimeout set to (200000)ms
    [    CSSD]2012-11-14 12:57:51.508 [1618381696] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0//dev/raw/raw1)
    [    CSSD]2012-11-14 12:57:51.508 [1105389888] >TRACE: clssnmvDPT: spawned for disk 0 (/dev/raw/raw1)
    [    CSSD]2012-11-14 12:57:53.552 [1105389888] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/raw/raw1)
    [    CSSD]2012-11-14 12:57:53.575 [1128057152] >TRACE: clssnmvKillBlockThread: spawned for disk 0 (/dev/raw/raw1) initial sleep interval (1000)ms
    [    CSSD]2012-11-14 12:57:53.587 [1105389888] >TRACE: clssnmReadDskHeartbeat: node(1) is down. rcfg(15) wrtcnt(59321) LATS(3927324) Disk lastSeqNo(59321)
    [    CSSD]2012-11-14 12:57:53.589 [1618381696] >TRACE: clssnmFatalInit: fatal mode enabled
    [    CSSD]2012-11-14 12:57:53.589 [1149036864] >TRACE: clssnmconnect: connecting to node 2, flags 0x0001, connector 1
    [    CSSD]2012-11-14 12:57:53.590 [1149036864] >TRACE: clssnmClusterListener: Listening on (ADDRESS=(PROTOCOL=tcp)(HOST=node2-priv)(PORT=49895))
    [    CSSD]2012-11-14 12:57:53.590 [1149036864] >TRACE: clssnmconnect: connecting to node 0, flags 0x0000, connector 1
    [    CSSD]2012-11-14 12:57:53.590 [1149036864] >TRACE: clssnmconnect: connecting to node 1, flags 0x0001, connector 0
    [    CSSD]2012-11-14 12:57:53.595 [1149036864] >TRACE: clsc_send_msg: (0x108cf430) NS err (12571, 12560), transport (530, 111, 0)
    [    CSSD]2012-11-14 12:57:53.600 [1159526720] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=Oracle_CSS_LclLstnr_crs_2))
    [    CSSD]2012-11-14 12:57:53.600 [1159526720] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_node2_crs))
    [    CSSD]2012-11-14 12:57:53.601 [1190996288] >TRACE: clssgmPeerListener: Listening on (ADDRESS=(PROTOCOL=tcp)(DEV=19)(HOST=10.17.19.21)(PORT=52492))
    [    CSSD]2012-11-14 12:57:53.601 [1201486144] >TRACE: clssnmPollingThread: Connection complete
    [    CSSD]2012-11-14 12:57:53.601 [1211976000] >TRACE: clssnmSendingThread: Connection complete
    [    CSSD]2012-11-14 12:57:53.601 [1222465856] >TRACE: clssnmRcfgMgrThread: Connection complete
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmRcfgMgrThread: Local Join
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmDoSyncUpdate: Initiating sync 1
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmDoSyncUpdate: diskTimeout set to (57000)ms
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: Ack message type (11)
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: node(2) is ALIVE
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSendSync: syncSeqNo(1)
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: Ack message type(11), ackCount(1)
    [    CSSD]2012-11-14 12:58:00.616 [1149036864] >TRACE: clssnmHandleSync: Acknowledging sync: src[2] srcName[node2] seq[1] sync[1]
    [    CSSD]2012-11-14 12:58:00.616 [1149036864] >TRACE: clssnmHandleSync: diskTimeout set to (57000)ms
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: done, msg type(11)
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: Ack message type (13)
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSetupAckWait: node(2) is ACTIVE
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmSendVote: syncSeqNo(1)
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: Ack message type(13), ackCount(1)
    [    CSSD]2012-11-14 12:58:00.616 [1149036864] >TRACE: clssnmSendVoteInfo: node(2) syncSeqNo(1)
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmWaitForAcks: done, msg type(13)
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmCheckDskInfo: Checking disk info...
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmCheckDskInfo: diskTimeout set to (200000)ms
    [    CSSD]2012-11-14 12:58:00.616 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(7030) state_network(0) state_disk(3) misstime(3934354)
    [    CSSD]2012-11-14 12:58:00.671 [1618381696] >USER: NMEVENT_SUSPEND [00][00][00][00]
    [    CSSD]2012-11-14 12:58:01.616 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(8030) state_network(0) state_disk(3) misstime(3934354)
    [    CSSD]2012-11-14 12:58:02.618 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(9030) state_network(0) state_disk(3) misstime(3935354)
    [    CSSD]2012-11-14 12:58:03.619 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(10030) state_network(0) state_disk(3) misstime(3936354)
    [    CSSD]2012-11-14 12:58:04.620 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(11030) state_network(0) state_disk(3) misstime(3937354)
    [    CSSD]2012-11-14 12:58:05.620 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(12030) state_network(0) state_disk(3) misstime(3938354)
    [    CSSD]2012-11-14 12:58:06.621 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(13030) state_network(0) state_disk(3) misstime(3939364)
    [    CSSD]2012-11-14 12:58:07.622 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(14030) state_network(0) state_disk(3) misstime(3940364)
    中间这个clssnmCheckDskInfo日志有点多,超出回复字数限制,这里就去掉了一部份。
    [    CSSD]2012-11-14 13:01:11.813 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(198200) state_network(0) state_disk(3) misstime(4124704)
    [    CSSD]2012-11-14 13:01:12.814 [1222465856] >TRACE: clssnmCheckDskInfo: node(1) timeout(199200) state_network(0) state_disk(3) misstime(4125704)
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmEvict: Start
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmWaitOnEvictions: Start
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmSetupAckWait: Ack message type (15)
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmSetupAckWait: node(2) is ACTIVE
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmSendUpdate: syncSeqNo(1)
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmWaitForAcks: Ack message type(15), ackCount(1)
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmUpdateNodeState: node 0, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmDeactivateNode: node 0 () left cluster
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmUpdateNodeState: node 1, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmDeactivateNode: node 1 (node1) left cluster
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmUpdateNodeState: node 2, state (2/2) unique (1352869071/1352869071) prevConuni(0) birth (1/1) (old/new)
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >USER: clssnmHandleUpdate: SYNC(1) from node(2) completed
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >USER: clssnmHandleUpdate: NODE 2 (node2) IS ACTIVE MEMBER OF CLUSTER
    [    CSSD]2012-11-14 13:01:13.615 [1149036864] >TRACE: clssnmHandleUpdate: diskTimeout set to (200000)ms
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmWaitForAcks: done, msg type(15)
    [    CSSD]2012-11-14 13:01:13.615 [1222465856] >TRACE: clssnmDoSyncUpdate: Sync Complete!
    [    CSSD]2012-11-14 13:01:13.615 [1232955712] >TRACE: clssgmReconfigThread: started for reconfig (1)
    [    CSSD]2012-11-14 13:01:13.615 [1232955712] >USER: NMEVENT_RECONFIG [00][00][00][04]
    [    CSSD]2012-11-14 13:01:13.615 [1232955712] >TRACE: clssgmEstablishConnections: 1 nodes in cluster incarn 1
    [    CSSD]2012-11-14 13:01:13.616 [1190996288] >TRACE: clssgmPeerListener: connects done (1/1)
    [    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmEstablishMasterNode: MASTER for 1 is node(2) birth(1)
    [    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmChangeMasterNode: requeued 0 RPCs
    [    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmMasterCMSync: Synchronizing group/lock status
    [    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmMasterSendDBDone: group/lock status synchronization complete
    [    CSSD]CLSS-3000: reconfiguration successful, incarnation 1 with 1 nodes
    [    CSSD]CLSS-3001: local node number 2, master node number 2
    [    CSSD]2012-11-14 13:01:13.616 [1232955712] >TRACE: clssgmReconfigThread: completed for reconfig(1), with status(1)
    [    CSSD]2012-11-14 13:01:13.732 [1159526720] >TRACE: clssgmClientConnectMsg: Connect from con(0x10a0b9a0) proc(0x10a10980) pid() proto(10:2:1:1)
    [    CSSD]2012-11-14 13:01:13.732 [1159526720] >TRACE: clssgmClientConnectMsg: Connect from con(0x10a0e540) proc(0x10a10c50) pid() proto(10:2:1:1)
    [    CSSD]2012-11-14 13:01:13.733 [1159526720] >TRACE: clssgmCommonAddMember: clsomon joined (2/0x1000000/#CSS_CLSSOMON)

  • 10G RAC: CRS 설치 실패 후 정리 방법

    제품 : ORACLE SERVER
    작성날짜 : 2004-11-30
    10G RAC: CRS 설치 실패 후 정리 방법
    =====================================
    PURPOSE
    이 문서는, DBA와 기술 지원 엔지니어가 10g RAC의 CRS (Cluster Ready Services)
    실패시, 일부 설치된 CRS를 제거하는데 필요한 정보를 제공하는 것을 목적으로 한다.
    Explanation
    설치에 실패한 CRS는 노드 리부팅과 같은 문제를 야기 시킬 수 있다.
    실패한 CRS 설치본을 정리하기 위해서는 다음과 같은 절차를 따른다:
    1. $ORA_CRS_HOME/install 디렉토리에서 rootdelete.sh 스크립트를 실행한 후 rootdeinstall.sh 스크립트를
    실행시킨다. 만약 이 스크립트를 실행시키는데 문제가 있거나, 모든 콤포넌트가 성공적으로 제거되었는지
    여부를 확인하려면 step 2로 간다:
    2. 모든 노드로 부터 노드 애플리케이션을 중단시킨다:
    srvctl stop nodeapps -n <node name>
    3. 노드 부팅시 CRS가 구동되는 것을 예방한다. 이를 위해 root 계정에서 다음과 같은 작업을
    수행한다 :
    Sun:
    rm /etc/init.d/init.cssd
    rm /etc/init.d/init.crs
    rm /etc/init.d/init.crsd
    rm /etc/init.d/init.evmd
    rm /etc/rc3.d/K96init.crs
    rm /etc/rc3.d/S96init.crs
    rm -Rf /var/opt/oracle/scls_scr
    rm -Rf /var/opt/oracle/oprocd
    rm /etc/inittab.crs
    cp /etc/inittab.orig /etc/inittab
    Linux:
    rm -f /etc/init.d/init.cssd
    rm -f /etc/init.d/init.crs
    rm -f /etc/init.d/init.crsd
    rm -f /etc/init.d/init.evmd
    rm -f /etc/rc2.d/K96init.crs
    rm -f /etc/rc2.d/S96init.crs
    rm -f /etc/rc3.d/K96init.crs
    rm -f /etc/rc3.d/S96init.crs
    rm -f /etc/rc5.d/K96init.crs
    rm -f /etc/rc5.d/S96init.crs
    rm -Rf /etc/oracle/scls_scr
    rm -f /etc/inittab.crs
    cp /etc/inittab.orig /etc/inittab
    HP-UX:
    rm /sbin/init.d/init.cssd
    rm /sbin/init.d/init.crs
    rm /sbin/init.d/init.crsd
    rm /sbin/init.d/init.evmd
    rm /sbin/rc3.d/K960init.crs
    rm /sbin/rc3.d/S960init.crs
    rm -Rf /var/opt/oracle/scls_scr
    rm -Rf /var/opt/oracle/oprocd
    rm /etc/inittab.crs
    cp /etc/inittab.orig /etc/inittab
    HP Tru64:
    rm /sbin/init.d/init.cssd
    rm /sbin/init.d/init.crs
    rm /sbin/init.d/init.crsd
    rm /sbin/init.d/init.evmd
    rm /sbin/rc3.d/K96init.crs
    rm /sbin/rc3.d/S96init.crs
    rm -Rf /var/opt/oracle/scls_scr
    rm -Rf /var/opt/oracle/oprocd
    rm /etc/inittab.crs
    cp /etc/inittab.orig /etc/inittab
    IBM AIX:
    rm /etc/init.cssd
    rm /etc/init.crs
    rm /etc/init.crsd
    rm /etc/init.evmd
    rm /etc/rc.d/rc2.d/K96init.crs
    rm /etc/rc.d/rc2.d/S96init.crs
    rm -Rf /etc/oracle/scls_scr
    rm -Rf /etc/oracle/oprocd
    rm /etc/inittab.crs
    cp /etc/inittab.orig /etc/inittab
    4. 만약 프로세스가 살아 있다면 EVM, CRS 및 CRS 프로세스를 kill 시키거나
    노드를 리부팅 한다:
    ps -ef | grep crs
    kill <crs pid>
    ps -ef | grep evm
    kill <evm pid>
    ps -ef | grep css
    kill <css pid>
    5. CRS 설치 디렉토리를 제거한다:
    rm -Rf <CRS Install Location>/*
    6. Oracle Universal Installer에서 CRS home을 De-install 한다.
    7. dd 명령으로 OCR 및 Voting File을 제거한다. 예 :
    dd if=/dev/zero of=/dev/rdsk/V1064_vote_01_20m.dbf bs=8192 count=2560
    dd if=/dev/zero of=/dev/rdsk/ocrV1064_100m.ora bs=8192 count=12800
    만약 RDBMS 설치를 제거한다면, 사용중이던 ASM 디스크도 정리한다.
    8. 만약 CRS를 재 설치하고자 하면, RAC 설치 매뉴얼에 기술된 순서대로 설치를 다시 진행한다.
    Example
    Reference Documents
    <Note:239998.1> 10g RAC: How to Clean Up After a Failed CRS Install

  • Oracle10g RAC Cluster Interconnect issues

    Hello Everybody,
    Just a brief overview as to what i am currently doing. I have installed Oracle10g RAC database on a cluster of two Windows 2000 AS nodes.These two nodes are accessing an external SCSI hard disk.I have used Oracle cluster file system.
    Currently i am facing some performance issues when it comes to balancing workload on both the nodes.(Single instance database load is faster than a parallel load using two database instances).
    I feel the performance issues could be due to IPC using public Ethernet IP instead of private interconnect.
    (During a parallel load large amount of packets of data are sent over the Public IP and not Private interconnect).
    How can i be sure that the Private interconnect is used for transferring cluster traffic and not the Public IP? (Oracle mentions that for a Oracle10g RAC database, private IP should be used for heart beat as well as transferring cluster traffic).
    Thanks in advance,
    Regards,
    Salil

    You find the answers here:
    RAC: Frequently Asked Questions
    Doc ID: NOTE:220970.1
    At least crossover interconnect is completely unsupported.
    Werner

  • Solaris cluster, Oracle10g RAC

    I just want to understand, which one is more favorable, most popular combination used by sun customers for Oracle10g RAC.
    1) Solaris cluster + VERITAS Storage Foundation
    2) Solaris cluster + QFS

    Please refer to http://docs.sun.com/app/docs/doc/820-2574/fmnyo?a=view for the supported options for Oracle RAC data storage. This is important because if you stray outside these you will not be on a jointly Sun/Oracle supported configuration.
    Therefore, if you want to put Oracle RAC (tablespace) data files on a cluster file system, you must use shared QFS as that is the only supported option open to you. Furthermore, you can only run sQFS on top of SVM/Oban or h/w RAID - we do not support running it on VxVM/CVM.
    Regards,
    Tim
    ---

  • Oracle10g RAC vs Oracle9i RAC with WebLogic

    Hello, I am planning on installing Oracle RAC on Redhat's Linux AS 3.0, while BEA's WebLogic will the application server.
    What would you choose as the back end, Oracle9i RAC or Oracle10g RAC and why ?
    Also, what release/patch-level (sp ??) of Linux A.S should be used ?
    EMC's storage disk arrays sould be used. Is anyone aware of any I/O issues between Linux AS and EMC disks ?
    Thank you for your thoughts.
    Regards,
    Tom

    Sun would not recommend using Oracle 10g RAC without Sun Cluster. Setup for that configuration would be found in the Sun Cluster Data Service for Oracle 10g RAC and the Oracle installation manuals.
    The installation of Oracle Clusterware alone would be documented in Oracle's manuals. If you have any problems with that, you are better off asking Oracle.
    Regards,
    Tim
    ---

  • Oracle10g rac installation steps in windows2008R2

    Hello All ,
    Grettings....
    Is it possible to have oracle10g RAC in windows2008R2?
    can i have the steps to install Oracle10g RAC in windows2008R2...
    Thanks,
    Edited by: user4487322 on Mar 30, 2011 2:53 AM

    Hi,
    a.) RTFM: otn.oracle.com/documentation
    b.) RAC Assurance Support Team: RAC Starter Kit and Best Practices (Windows) (Doc ID 811271.1)
    https://support.oracle.com/oip/faces/secure/km/DocumentDisplay.jspx?id=811271.1
    Regards
    Sebastian

  • Benchmark of Oracle10g RAC

    Hi,
    Does anyone know where I can find the benchmark of Oracle10g RAC (2 nodes or more ) on Sun Solaris?
    Thanks

    You may find this posting helpful for using free benchmark tool for oracle.
    http://www.oraclepoint.com/topic.php?filename=166&extra=page%3D1 (register and then log in because it's only available for members.)
    There are step-by-step installation guide and sample report of TPC-C.
    Hope it helps.

  • Recover Oracle10g RAC Database

    Hi,
    Anyone know whether the below backup and restoration works for oracle10g RAC database. I can;t test it out as i don;t have an development environment. Please correct me if i am wrong.
    1.) stop all oracle services running on the server
    2.) use dd command to backup the raw partition to tape + backup oracle homes on all nodes
    3.) Encountered Media failure, change the hardisks
    4.) create the raw partitions
    5.) restore the raw partitions backup from tape + restore oracle binaries from tape
    6.) startup all oracle services
    7.) Done.
    Thank You, anyone advice and comments are greatly appreciated..

    Many thanks for your responses . I finally got it working . It does not require additionalconfigurational change apart from updating the ocr with oifcfg commands(metalink-Note 283684.1 ) i.e. deleting the old entry and setting he new one .
    Where I got it wrong was I did not shutdown the nodeapps (which was not included in the metalink note ) before updating the ocr with oifcfg commands
    Thanks ..
    Edited by: Tai Shebby on Jan 22, 2009 8:12 AM

  • Oracle10g RAC

    Hi,
    Can anyone help me,
    Is there any problem for Oracle10g RAC running on Sun Solaris 10 using the latest SUN Containers feature?
    regards,
    Dilip.

    Have a look at the Certify & Availability in Metalink for your Platform. Also the RAC Readme and documentation for Sun Solaris 10 will indicate if there are any known problems.
    If in doubt and you are having any specific problems, I would suggest you open a TAR with Oracle Support.

  • Upgrade oracle 9i RAC to Oracle 11g RAC

    Hi all,
    db version:oracle 9.2.0.5 RAC
    OS:Linux
    My project team and client is planning to upgrade oracle 9i RAC to Oracle 11g RAC.
    please suggest any MOS notes any links.
    thanks,
    Visu.
    Edited by: visu996253 on Apr 8, 2013 5:30 AM

    db version:oracle 9.2.0.5 RAC
    OS:Linux
    My project team and client is planning to upgrade oracle 9i RAC to Oracle 11g RAC.
    please suggest any MOS notes any links.Master Note For Oracle Database Upgrades and Migrations [ID 1152016.1]
    http://www.oracle.com/technetwork/products/upgrade/index-088044.html
    785351.1 - Oracle 11g Upgrade Companion
    http://download.oracle.com/docs/cd/E11882_01/server.112/e17222.pdf - Oracle® Database Upgrade Guide 11g Release 2 (11.2) E10819-02
    Edited by: KR10822864 on Apr 8, 2013 5:52 AM
    added more info.

  • Step By Step Implementation of RAC in Oracle EBS R12 (version 12.0.4)

    Hi,
    Can anyne suggest me any Document in metalink or any other useful document on the
    *"Step By Step Implementation of RAC in Oracle EBS R12 (version 12.0.4)"*.
    My Database Version is 10g Enterprise Edition Release 10.2.0.4.0 and my Platform  is HP-UX  ia64.

    How To Find Out The Example of The LOCAL_LISTENER and REMOTE_LISTENER Defined In The init.ora When configuring the 11i or R12 on RAC ? [ID 744508.1]
    1072636.1 - Oracle E-Business Suite Release 12 High Availability Documentation Roadmap
    Please check Hussein's discussion before
    Rac and R12
    R12 with RAC
    Rac and R12
    Moving from NON-RAC to RAC
    R12 and RAC
    Re: R12.1-RAC
    EBS R12 on Linux with RAC

  • How can i do the RAC with Oracle 9i ?

    How can i do the RAC with Oracle 9i ?
    The Oracle 9i has a RAC(Real Application Cluster)module , please who can tell me how can i let it working .
    Which hardware the RAC need's
    Thank All ,
    [email protected]

    That is right you need atleast 2 boxes. The two servers will use the same hard disks.
    The concept is an extension of OPS(Oracle Parallel Server).
    Basically, you install Oracle Server Software on both the boxes, One database will be shared by both the servers. You can access your database through the Server1 or Server2. If Server1 fails then the Server2 will take over all the connections. You can add or remove any servers to and from the cluster any time you want with out impacting your production.
    They share load, reliable, scalable....

Maybe you are looking for