Deallock detection in rac

Hi All,
We have a 2 node rac database. client used to run a load everyday on PROD database.
Everyday they are getting a deadlock issue because of which they have rerun a job.
The load takes more time as expected.
When i checked the alert log i found the below error.
Global Enqueue Services Deadlock detected. More info in file
/u02/admin/EDWXPRD/bdump/edwxprd1_lmd0_10662.trc.
Tue Nov 2 07:02:54 2010
Global Enqueue Services Deadlock detected. More info in file
/u02/admin/EDWXPRD/bdump/regprd1_lmd0_10662.trc.
Trace files logs
DRM(2870) ignoring dissolve of 558909
* kjdrchkdrm: found an RM request in the request queue
Dissolve pkey 558910
DRM(2870) ignoring dissolve of 558910
*** 2010-11-02 07:00:36.256
stale cvak fr 1:0xade80f58([0x286b8][0x0],[AF])[h=KJUSERNL,n=KJUSEREX,b=KJUSERNL,ls=KJUSERSTAT_NOVALUE]:0x2 < 0x0
stale cvak fr 1:0xade80f58([0x286b8][0x0],[AF])[h=KJUSERNL,n=KJUSEREX,b=KJUSEREX,ls=KJUSERSTAT_NOVALUE]:0x3 < 0x0
*** 2010-11-02 07:01:25.070
user session for deadlock lock 0xbd8a20e0
pid=52 serial=49701 audsid=726380511 user: 58/ICOM_VW
O/S info: user: svc-ch-bo-sso, term: SOBSREP00, ospid: 8012:1884, machine: CORP\SHBAREP00
program: wireportserver.exe
application name: wireportserver.exe, hash value=1663395875
Current SQL Statement:
SELECT
V_TARGET_LST_D.LST_NM,
V_RUST_TARGET_D.ACTL_RNK,
DECODE(V_RUST_TARGET_D.TARGET_IND,:"SYS_B_00",:"SYS_B_01",:"SYS_B_02",:"SYS_B_03"),
count(V_RUST_D.SHIRE_RUST_ID),
V_TARGET_SLSFRC_DH.SLS_ORG_NM,
V_TARGET_SLSFRC_DH.PAR_SLSFRC_NM,
V_TARGET_SLSFRC_DH.SLSFRC_NM,
V_RUST_D.SHIRE_RUST_ID,
V_RUST_D.RUST_NM,
application name: SQL*Plus, hash value=3669949024
Current SQL Statement:
ALTER TABLE RUST_TARGET_D ENABLE CONSTRAINT R_303
user session for deadlock lock 0xbd7dd210
pid=50 serial=62712 audsid=726380510 user: 57/ICOM_DM
O/S info: user: svc-etl-icbi, term: , ospid: 19433, machine: shbaetl00.corp.shire.com
program: [email protected] (TNS V1-V3)
application name: SQL*Plus, hash value=3669949024
Current SQL Statement:
ALTER TABLE RUST_TARGET_D ENABLE CONSTRAINT R_303
user session for deadlock lock 0xbd8a13a8
pid=52 serial=49701 audsid=726380511 user: 58/ICOM_VW
O/S info: user: svc-ch-bo-sso, term: SHBAREP00, ospid: 8012:1884, machine: CORP\SHBAREP00
program: wireportserver.exe
application name: wireportserver.exe, hash value=1663395875
Current SQL Statement:
SELECT
V_TARGET_LST_D.LST_NM,
V_RUST_TARGET_D.ACTL_RNK,
DECODE(V_RUST_TARGET_D.TARGET_IND,:"SYS_B_00",:"SYS_B_01",:"SYS_B_02",:"SYS_B_03"),
count(V_RUST_D.SHIRE_RUST_ID),
V_TARGET_SLSFRC_DH.SLS_ORG_NM,
V_TARGET_SLSFRC_DH.PAR_SLSFRC_NM,
V_TARGET_SLSFRC_DH.SLSFRC_NM,
V_RUST_D.SHIRE_RUST_ID,
V_RUST_D.RUST_NM,
V_RUST_CON_D.FST_NM,
V_RUST_CON_D.LAST_NM,
V_RUST_TARGET_D.EXT_RNK
FROM
V_RUST_D,
The error which they are getting from application side is
ORA-06502: PL/SQL: numeric or value error: character to number conversion error
ORA-06512: at line 30
ORA-04020: deadlock detected while trying to lock object BCOM_DM.DATE_D
They complained that they get the deallock issue when they try to enable constraint as you can see in trace log.

Hi,
GLOBAL ENQUEUE SERVICES DEADLOCK DETECTED [ID 973178.1]
http://www.dba-oracle.com/t_ora_04020_deadlock_detected_while_trying_to_lock_object_string.htm
Thanks

Similar Messages

Race detection in child processes

Hi,
the program given below creates a child process with two threads accessing a shared variable. Unfortunately, Thread Analyzer does not detect the race (neither version 12.0 nor 12.3). The race is only detected if it occurs in the parent process. Option "-F on/all" to instrument child processes does not work in conjunction with option "-r all" (the collect tool emits the error message "
Race-detection data cannot be collected with any of -F -j -J -x"). Is this an inherent limitation of Thread Analyzer or is there a way to detect races and deadlocks in child processes?
Thank you very much in advance for your help.
Regards
#include <unistd.h>
#include <stdlib.h>
#include <assert.h>
#include <pthread.h>
int x;
void* fun(void*)
    x++;
    return NULL;
int main ()
    x = 0;
    int pid = fork();
    assert(pid >= 0);
    if (pid == 0)
        /* Child process */
        pthread_t thread;
        assert(pthread_create(&thread, NULL, fun, NULL) == 0);
        x++;
        assert(pthread_join(thread, NULL) == 0);
        _exit(0);
    else
        /* Parent process */
        sleep (1);
    return 0;

Just an update in case someone has seen this issue. The problem I am seeing is that some reports will spawn these child processes (that's how the appear in toad). At the DB level it is just another database process which shows up from v$session. I don't know how the application is calling to use as many processes as possible but this is what I am trying to limit on. If I run the same query in SQLPlus, only one process SID is created.

10g grid agent installation and configuration on oracle RAC

Hi All,
I have one two node RAC setup on HP-UX itanium platform and I want to install oracle 10g grid agent on both the RAC nodes to monitor the RAC instances and databases and OMS repository server is ready and running. Only I have installed agents on both the nodes, but tns entry and hosts files on OMS server and both the nodes modified.
But, nodes are not getting automatically discovered by OMS server. And what is the command to configure the grid agent on RAC instances.
I have to install grid agent on both the nodes individually or it can be installed on single node and can be integrated with other node.
Please help me out in this regard.......
Thanks in Advance,
Sukanta Paul.
Edited by: sukanta paul on Aug 18, 2009 10:16 PM

Similar problem and installation does not detect the rac nodes and installation is done locally which does not help the situation..By the way we all consult documentation and the whole point of having a forum is so we have a potential fix or work around and not link from here to documentation. Just to mention :)

Static variable in openmp

Hello all,
I'd like to receive some hints about how to parallelize the code below that Thread Analyser has detected races for static variables:
#pragma omp parallel for private(i)
for (i=0; i<n; i++){
x = calc(a);
int calc(int a){
static int x,y,z=0;
x = 2 * a / 2.2345; // Thread Analyser detected write race here
y = x * 3.4567; // Thread Analyser detected write race here
z += x * y; // Thrd Analyser detected write and read races here
return z;
Best Regards,
Glauber

You can either declare the static variables as "threadprivate", or put the accesses to these variables in critical sections. Like
int calc(int a){
static int x,y,z=0;
#pragma omp threadprivate(x,y,z)
x = 2 * a / 2.2345;
y = x * 3.4567;
z += x * y;
return z;
or
int calc(int a){
static int x,y,z=0;
int t;
#pragma omp critical
x = 2 * a / 2.2345;
y = x * 3.4567;
z += x * y;
t = z;
return t;
Notice the use of 't' in the above code, as you cannot put a 'return' in a critical section.
While the above techniques may get rid of the data races, they may not fix the problem you are facing. Making a code thread safe is more than merely getting rid of the data races. If you can show in more details how the static variables are used, we may be able to give more specific helps.
-- Yuan

The er_print crashes while analyzing datarace

Dear all,
I have installed the latest Sun Studio12 and use its DRDT tools to detect the race condition.
After the execution of "collect -r on a.out", I can see that the experiement data has been successfully collected. But when I execute "er_print <tha.name>", er_print crashes.
The following is the related crash info:
# ls -lrht
total 12773670
drwxr-xr-x 2 rtp99 dba 512 Apr 10 13:56 MgmtFctAPI
drwxr-x--- 4 rtp99 dba 512 May 9 14:24 RtpEvtHdl01
-rw------- 1 rtp99 dba 900M Jun 8 14:09 core.pcs.19549.1181282948
-rw------- 1 zhf mobile 37M Jun 8 15:09 core.pcs.27127.1181286589
-rw------- 1 zhf mobile 547M Jun 8 15:18 core.pcs.27825.1181287097
-rw------- 1 rtp99 dba 911M Jun 8 16:34 core.pcs.4208.1181291656
drwxr-xr-x 33 rtp99 dba 1.0K Jun 8 16:45 pcs
-rw------- 1 rtp99 dba 371M Jun 8 16:49 core.pcs.6244.1181292550
-rw------- 1 rtp99 dba 239M Jun 8 16:57 core.pcs.7122.1181293055
-rw------- 1 rtp99 dba 235M Jun 8 17:11 core.pcs.8759.1181293881
-rw------- 1 zhf mobile 307M Jun 8 18:49 core.er_print.14083.1181299747
-rw------- 1 zhf mobile 307M Jun 8 18:52 core.er_print.14339.1181299906
-rw------- 1 zhf mobile 306M Jun 8 18:58 core.er_print.14882.1181300314
-rw------- 1 zhf mobile 237M Jun 8 19:00 core.er_print.14927.1181300421
-rw------- 1 zhf mobile 530M Jun 8 19:10 core.er_print.15816.1181300989
-rw------- 1 yyj mobile 237M Jun 11 09:40 core.er_print.24839.1181526011
-rw------- 1 yyj mobile 237M Jun 11 09:45 core.er_print.24927.1181526327
-rw------- 1 yyj mobile 306M Jun 11 10:33 core.er_print.26727.1181529189
-rw------- 1 yyj mobile 290M Jun 11 10:41 core.er_print.27057.1181529676
-rw------- 1 yyj mobile 237M Jun 11 11:41 core.er_print.2877.1181533293
# dbx /usr/bin/er_print core.er_print.27057.1181529676
For information about new features see `help changes'
To remove this message, put `dbxenv suppress_startup_message 7.6' in your .dbxrc
Reading er_print
core file header read successfully
Reading ld.so.1
Reading liber_dbe.so
Reading libCstd.so.1
Reading libCrun.so.1
Reading libc.so.1
Reading libdl.so.1
Reading libnsl.so.1
Reading libCstd_isa.so.1
Reading libc_psr.so.1
program terminated by signal SEGV (no mapping at the fault address)
0xfeed512c: freeunlocked+0x004c: ld [%i5 - 8], %o3
(dbx) threads
dbx: thread related commands not available
(dbx) where
=>[1] freeunlocked(0x61697473, 0x0, 0x931a8, 0xfef6fad4, 0xfef68298, 0x61697473), at 0xfeed512c
[2] free(0x61697473, 0xfeed50a8, 0x931e8, 0x1ff18, 0xfef68298, 0xfeed50a8), at 0xfeed50cc
[3] dbeGetRaceData(0x1, 0x6a42ea8, 0x6a30340, 0xff30e0ce, 0x1, 0x1), at 0xff276218
(dbx)
And I find that It alwasy crash in this race:
Race #2, Vaddr: 0xfcb24
Access 1: Read, __rwstd::__rb_tree<std::basic_string<char,std::char_traits<char>,std::allocator<char> >,std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*>,__rwstd::__select1st<std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*>,std::basic_string<char,std::char_traits<char>,std::allocator<char> > >,std::less<std::basic_string<char,std::char_traits<char>,std::allocator<char> > >,std::allocator<std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*> > >::iterator::operator*()const + 0x00000098,
line 321 in "tree"
Access 2: Write, __rwstd::__rb_tree<std::basic_string<char,std::char_traits<char>,std::allocator<char> >,std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*>,__rwstd::__select1st<std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*>,std::basic_string<char,std::char_traits<char>,std::allocator<char> > >,std::less<std::basic_string<char,std::char_traits<char>,std::allocator<char> > >,std::allocator<std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*> > >::iterator::operator=(const __rwstd::__rb_tree<std::basic_string<char,std::char_traits<char>,std::allocator<char> >,std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*>,__rwstd::__select1st<std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*>,std::basic_string<char,std::char_traits<char>,std::allocator<char> > >,std::less<std::basic_string<char,std::char_traits<char>,std::allocator<char> > >,std::allocator<std::pair<const std::basic_string<char,std::char_traits<char>,std::allocator<char> >,pcsia::CIaBGFData*> > >::iterator&) + 0x000000D4
Total Traces: 7
Segmentation fault (core dumped)
I asked a developer in SUN and he told me that "This could be a problem cause by long function name of C++ class member function"
Could developer of er_print have a look at this and give me some hints?
many thanks!
Cheers
Shen

Hi, Shen
This bug has been fixed in the first patch for Sun Studio 12. We will update the forum when the patch is released.
Thanks a lot!
-Xi

Lock_lint for c++

I want to use lock_lint for deallock detection in my c++ code. Before starting please let me know if it can be used for c++ at all or is it only possible to use with C code? I am using posix thread APIs.
Regards
Mandeep

Unfortunately, lock_lint is based on Sun C compiler and thus unable to handle C++ code at all.
regards,
__Fedor.

Allocator race detected, Adobe Flash Pro CS5 on Mac OS X 10.5.8 using network accounts

Hello,
I manage a number of academic computer labs, three of which are Mac OS X 10.5.8 units with Adobe Flash CS5 installed. Users log in using AD credentials and the client computers are managed with settings in OD Workgroup Manager (MCX records). The AD plugin is set to not force local home directories and users' home directories are located on network storage. ~/Library/Caches/* has been redirected locally, but no other folders are redirected.
Users have been reporting an issue where Flash sometimes appears to freeze during loading. I've been told this has been happening for about two semesters, but I was not aware of the issue until today. It sounds like students have been powering off the machines and logging back in, and hoping Flash will work on that attempt. When a 'freeze' is encountered the Flash splash screen appears and the status below the "Adobe Flash Professional CS5" text states it is "Copying First Run Files..."
Checking the console, I can see that Flash appears to be attempting to access files, but is encountering issues. Dozens of times, I see errors like the following:
11/3/11 3:17:41 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt10.svg
11/3/11 3:17:45 PM /System/Library/CoreServices/coreservicesd[62] removing dropped transaction for -112,Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt10.svg
11/3/11 3:17:45 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt11.svg
11/3/11 3:17:45 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt11.svg
11/3/11 3:17:49 PM /System/Library/CoreServices/coreservicesd[62] removing dropped transaction for -112,Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt11.svg
11/3/11 3:17:49 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt12.svg
11/3/11 3:17:49 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt12.svg
11/3/11 3:17:53 PM /System/Library/CoreServices/coreservicesd[62] removing dropped transaction for -112,Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt12.svg
11/3/11 3:17:53 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt13.svg
11/3/11 3:17:57 PM /System/Library/CoreServices/coreservicesd[62] removing dropped transaction for -112,Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt13.svg
11/3/11 3:17:57 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt14.svg
11/3/11 3:17:57 PM /Applications/Adobe Flash CS5/Adobe Flash CS5.app/Contents/MacOS/Adobe Flash CS5[2801] Allocator race detected: transaction is not verified for -112/2 - Library/Application Support/Adobe/Flash CS5/en_US/Configuration/ProcScripts/svg/LineBrushArt14.svg
Messages like these, but listing different files, have been going on since I launched Flash at 11/3/11 3:06 PM.
If you have encountered this problem or have any information for me, please let me know.
Thank you,
Branden Ohlinger

Based on some searching and testing, it seems that if you redirect ~/Library/Application Support/Adobe/Flash CS5 to a location on the local hard drive, rather than the network account storage, the problem ceases.
I hope this helps anyone else who may run into this problem.
Take care,
Branden

OUI does not detect RAC installation while installing 10.2.0.3

Hello all,
I have a 2 node rac on 10.2.0.2 on Solaris 10.
The other day when i tried to upgrade them to 10.2.0.3, the installer was not detecting
installation as a RAC(the cluster nodes were not shown), but as single installation. Even 'opatch lsinventory' didnt show the rac details which it was supposed to show in a RAC. So i have left my CRS with 10.2.0.3 and RDBMS as 10.2.0.2.. Seems like inventory got screwed, but not sure.
I am not sure how this installation was done.
A friend of mine said that there could be another means of installing RAC other than letting the OUI itself to copy the files to the node2 while installation and that is why it doesn't show the RAC details.
Is it possible to have Oracle Single instances on 2 machines to act as a rac by doing some changes?
Could someone let me know the different methods of setting up RAC (other than the default method of letting the OUI to do the part.) ?
Will there be some problem if i proceed installing 10.2.0.3 even if the OUI does not identify it as a rac installation?
TIA,
JJ

Hi,
myuolicdbd01 [SHCL1N01]$ ./opatch lsinventory -oh $ORA_CRS_HOME
Invoking OPatch 10.2.0.2.0
Oracle interim Patch Installer version 10.2.0.2.0
Copyright (c) 2005, Oracle Corporation. All rights reserved..
Oracle Home       : /u01/app/oracle/product/crs
Central Inventory : /u01/app/oracle/oraInventory
   from           : /u01/app/oracle/product/crs/oraInst.loc
OPatch version    : 10.2.0.2.0
OUI version       : 10.2.0.3.0
OUI location      : /u01/app/oracle/product/crs/oui
Log file location : /u01/app/oracle/product/crs/cfgtoollogs/opatch/opatch-2007_Dec_04_15-33-47-JST_Tue.log
Lsinventory Output file location : /u01/app/oracle/product/crs/cfgtoollogs/opatch/lsinv/lsinventory-2007_Dec_04_15-33-47-JST_Tue.txt
Installed Top-level Products (2):
Oracle Clusterware                                                   10.2.0.1.0
Oracle Database 10g Release 2 Patch Set 2                            10.2.0.3.0
There are 2 products installed in this Oracle Home.
There are no Interim patches installed in this Oracle Home.
Rac system comprising of multiple nodes
Local node = myuolicdbd01
Remote node = myuolicdbd02
OPatch succeeded.
myuolicdbd01 [SHCL1N01]$The below is for the RDBMS home and it does not show the node list.
myuolicdbd01 [SHCL1N01]$ ./opatch lsinventory -oh $ORACLE_HOME
Invoking OPatch 10.2.0.2.0
Oracle interim Patch Installer version 10.2.0.2.0
Copyright (c) 2005, Oracle Corporation. All rights reserved..
Oracle Home       : /u01/app/oracle/product/10.2.0/db_1
Central Inventory : /u01/app/oracle/oraInventory
   from           : /u01/app/oracle/product/10.2.0/db_1/oraInst.loc
OPatch version    : 10.2.0.2.0
OUI version       : 10.2.0.2.0
OUI location      : /u01/app/oracle/product/10.2.0/db_1/oui
Log file location : /u01/app/oracle/product/10.2.0/db_1/cfgtoollogs/opatch/opatch-2007_Dec_04_15-34-22-JST_Tue.log
Lsinventory Output file location : /u01/app/oracle/product/10.2.0/db_1/cfgtoollogs/opatch/lsinv/lsinventory-2007_Dec_04_15-34-22-JST_Tue.txt
Installed Top-level Products (2):
Oracle Database 10g                                                  10.2.0.1.0
Oracle Database 10g Release 2 Patch Set 1                            10.2.0.2.0
There are 2 products installed in this Oracle Home.
Interim patches (2) :
Patch 5490848      : applied on Thu Nov 16 01:26:37 JST 2006
   Created on 5 Oct 2006, 10:16:33 hrs US/Eastern
   Bugs fixed:
     5407466, 5092134, 4925103, 5099995, 5079037, 5242650, 5079038, 4604970
     4763768, 4523125, 5225799, 5490937, 5490848, 4669305
Patch 5117016      : applied on Wed Jul 19 19:48:22 JST 2006
   Created on 30 Mar 2006, 12:52:17 hrs US/Pacific
   Bugs fixed:
     5117016
OPatch succeeded.
myuolicdbd01 [SHCL1N01]$Details of the Cluster Verification Utility
==============================
myjpsuolicdbd01 [SHCL1N01]$ cluvfy stage -post crsinst -n myuolicdbd01,myuolicdbd02
Performing post-checks for cluster services setup
Checking node reachability...
Node reachability check passed from node "myuolicdbd01".
Checking user equivalence...
User equivalence check passed for user "oraprd".
Checking Cluster manager integrity...
Checking CSS daemon...
Daemon status check passed for "CSS daemon".
Cluster manager integrity check passed.
Checking cluster integrity...
Cluster integrity check passed
Checking OCR integrity...
Checking the absence of a non-clustered configuration...
All nodes free of non-clustered, local-only configurations.
Uniqueness check for OCR device passed.
Checking the version of OCR...
OCR of correct Version "2" exists.
Checking data integrity of OCR...
Data integrity check for OCR passed.
OCR integrity check passed.
Checking CRS integrity...
Checking daemon liveness...
Liveness check passed for "CRS daemon".
Checking daemon liveness...
Liveness check passed for "CSS daemon".
Checking daemon liveness...
Liveness check passed for "EVM daemon".
Checking CRS health...
CRS health check passed.
CRS integrity check passed.
Checking node application existence...
Checking existence of VIP node application (required)
Check passed.
Checking existence of ONS node application (optional)
Check passed.
Checking existence of GSD node application (optional)
Check passed.
Post-check for cluster services setup was successful.TIA,
J J

Programmatically detect if working in RAC mode?

How can I programmatically detect if database is running in RAC mode. I found solution with dbms_utility.is_cluster_database but my boss wants to know is there any other solution. So is there :)

Hi,
If it is a RAC database, you should have several instances in v$thread :SQL> select count(distinct(instance)) from gv$thread;
COUNT(DISTINCT(INSTANCE))
                        1Or check the value of the cluster_database parameter :SQL> select name, value
2 from v$parameter
3 where name ='cluster_database';
NAME                           VALUE
cluster_database               FALSEOr count the number of instances in gv$instance:SQL> select count(*) from gv$instance;
COUNT(*)
         1

System.log - Allocator race detected

I'm trying to sort out some problems on an Xserve box. I'm new to OS X server administration so I'm trying to figure out what messages are important. The following message comes up a lot and I'm wondering what it means. I haven't been able to find any references on Google.
Apr 30 16:23:34 coreone /usr/sbin/AppleFileServer[173]: Allocator race detected: transaction is not verified for -102/2 - FILESERVER/files/working.psd
Thanks,
Levi

Based on some searching and testing, it seems that if you redirect ~/Library/Application Support/Adobe/Flash CS5 to a location on the local hard drive, rather than the network account storage, the problem ceases.
I hope this helps anyone else who may run into this problem.
Take care,
Branden

Data race detection tool

Is there an easy way to inquire whether any error has been detected?
It would be nice if in a makefile this would be possible:
rdt -check race.er
echo $?
regards,
Dieter

Yes, that's would I just programmed.
So I am answering my own stupid question with this little script called check_races
#!/bin/ksh
rc=$(echo races | er_print $1 2> /dev/null | grep 'Total Races: 0' > /dev/null 2>&1 )
return $rc
and then in the Makefile:
check_races races.er || rdt races.er
Thanks
Dieter

Allocator race detected ...

Hi,
A user hit the following error yesterday and I am trying to work out what is wrong.
Jul 1 17:45:49 mac-01 com.apple.launchd[132] (0x10fed0.Locum[5687]): Exited: Terminated
Jul 1 17:48:04 mac-01 /System/Library/CoreServices/Finder.app/Contents/MacOS/Finder[153]: Allocator race detected: transaction is not verified for -102/2 - Compressed/k0025337-h.264.mov
The user was trying to copy a number of files from her Desktop to a remote server. All the files made it but 2 files would not copy correctly. The files made it to the remote server but instead of being 1.7MB they were 1.8MB (ok they are different files systems so the size may be reported differently) and when you attempted to play the files - the files were h264 movies - they were reported as corrupted.
The only other odd think about the files is that when these particular files arrive on the remote server their permissions are different from all the files that made it to their destination ok.
EG:
-rw-rw-r-- 1 asomerve samba 1354503 2010-07-01 16:56 k0025343-H.264 watermark.mov
-rw-rw-r-- 1 asomerve samba 2984461 2010-07-01 17:01 k0025333-H.264.mov
-rw-rw-r-- 1 asomerve samba 3044265 2010-07-01 17:06 k0025333-H.264 watermark.mov
-rwx---rw- 1 asomerve samba 2680045 2010-07-01 17:30 k0025335-h.264.mov
-rw-rw-rw- 1 asomerve samba 2797236 2010-07-01 17:31 k0025335-h.264watermarked.mov
-rwx---rw- 1 asomerve samba 1721173 2010-07-01 17:31 k0025337-h.264.mov
-rw-rw-rw- 1 asomerve samba 1651866 2010-07-01 17:31 k0025337-h.264watermarked.mov
The files k0025337-h.264.mov and k0025335-h.264.mov are the culprits. The other files all made it over without any problems.
Does anyone have any ideas on what the problem might be?
Thanx in advance.
Dp.

Hiya mm...
No-one's followed up on this, and you haven't reposted,.. did your problem get resolved or just go away?
I'm asking, because we had an AFP hicup this morning (that went away all by his lonesome) but digging, reveals the same "Allocator race detected" messages you mention. So,.. any news?
shawn

[代发帖子]10.2.0.4.0版本rac 报IPC Send timeout detected.Sender 错误问题

以下为代网友发帖，内容为
环境：db：10.2.0.4.0 system：aix 6.1
请各位大师帮忙确定是什么原因导致的
node1
Wed Aug 22 09:59:17 2012
IPC Send timeout detected.Sender: ospid 25821502
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:17 2012
IPC Send timeout detected.Sender: ospid 24838548
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:18 2012
IPC Send timeout detected.Sender: ospid 18743888
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:18 2012
IPC Send timeout detected.Sender: ospid 8586514
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:19 2012
IPC Send timeout detected.Sender: ospid 21365006
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:20 2012
IPC Send timeout detected.Sender: ospid 18088896
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:20 2012
IPC Send timeout detected.Sender: ospid 15467098
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:20 2012
Wed Aug 22 09:59:44 2012
IPC Send timeout detected.Sender: ospid 9635304
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 09:59:44 2012
IPC Send timeout detected.Sender: ospid 1704160
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 09:59:44 2012
IPC Send timeout detected.Sender: ospid 2687916
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 09:59:44 2012
IPC Send timeout detected.Sender: ospid 1836306
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 09:59:45 2012
IPC Send timeout detected.Sender: ospid 11273456
Wed Aug 22 10:00:03 2012
IPC Send timeout detected.Sender: ospid 4982212
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 10:00:03 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 629
Wed Aug 22 10:00:03 2012
IPC Send timeout detected.Sender: ospid 15270298
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:03 2012
IPC Send timeout to 1.5 inc 28 for msg type 36 from opid 374
Wed Aug 22 10:00:03 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 619
Wed Aug 22 10:00:03 2012
IPC Send timeout detected.Sender: ospid 16712906
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:03 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 626
Wed Aug 22 10:00:03 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 376
Wed Aug 22 10:00:03 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 260
Wed Aug 22 10:00:03 2012
IPC Send timeout detected.Sender: ospid 12125594
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:04 2012
IPC Send timeout detected.Sender: ospid 22086174
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:04 2012
IPC Send timeout detected.Sender: ospid 8979076
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 10:00:04 2012
IPC Send timeout detected.Sender: ospid 23004052
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:04 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 640
Wed Aug 22 10:00:04 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 114
Wed Aug 22 10:00:04 2012
IPC Send timeout to 1.5 inc 28 for msg type 36 from opid 642
Wed Aug 22 10:00:04 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 644
Wed Aug 22 10:00:04 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 228
Wed Aug 22 10:00:04 2012
IPC Send timeout detected.Sender: ospid 13436000
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:05 2012
IPC Send timeout detected.Sender: ospid 4850956
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 10:00:05 2012
IPC Send timeout detected.Sender: ospid 21627618
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 10:00:05 2012
IPC Send timeout to 1.1 inc 28 for msg type 36 from opid 637
Wed Aug 22 10:00:05 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 648
Wed Aug 22 10:00:05 2012
IPC Send timeout to 1.1 inc 28 for msg type 36 from opid 628
Wed Aug 22 10:00:05 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 647
Wed Aug 22 10:00:05 2012
IPC Send timeout detected.Sender: ospid 23724892
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:05 2012
IPC Send timeout detected.Sender: ospid 14353416
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:05 2012
IPC Send timeout detected.Sender: ospid 15270628
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:06 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 634
Wed Aug 22 10:00:06 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 235
Wed Aug 22 10:00:06 2012
IPC Send timeout detected.Sender: ospid 10749150
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:06 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 72
Wed Aug 22 10:00:06 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 654
Wed Aug 22 10:00:06 2012
IPC Send timeout detected.Sender: ospid 16123646
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 10:00:06 2012
IPC Send timeout detected.Sender: ospid 19005890
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:06 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 655
Wed Aug 22 10:00:06 2012
IPC Send timeout detected.Sender: ospid 1311244
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:06 2012
IPC Send timeout to 1.5 inc 28 for msg type 36 from opid 658
Wed Aug 22 10:00:07 2012
IPC Send timeout detected.Sender: ospid 14091918
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:07 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 424
Wed Aug 22 10:00:07 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 631
Wed Aug 22 10:00:07 2012
IPC Send timeout detected.Sender: ospid 14876874
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:07 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 661
Wed Aug 22 10:00:07 2012
IPC Send timeout detected.Sender: ospid 18613208
Receiver: inst 2 binc 1412232506 ospid 27066946
Wed Aug 22 10:00:07 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 664
Wed Aug 22 10:00:07 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 603
Wed Aug 22 10:00:07 2012
IPC Send timeout detected.Sender: ospid 2818720
Receiver: inst 2 binc 1412232544 ospid 25362610
Wed Aug 22 10:03:54 2012
IPC Send timeout to 1.1 inc 28 for msg type 36 from opid 1219
Wed Aug 22 10:04:05 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 2256
Wed Aug 22 10:04:08 2012
IPC Send timeout to 1.1 inc 28 for msg type 36 from opid 1908
Wed Aug 22 10:04:10 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 1476
Wed Aug 22 10:04:12 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 1653
Wed Aug 22 10:04:12 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 1266
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 1345
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 1202
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 1203
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 1239
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 1237
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 1207
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 1197
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 1313
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.5 inc 28 for msg type 32 from opid 1274
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.1 inc 28 for msg type 32 from opid 1140
Wed Aug 22 10:04:13 2012
IPC Send timeout to 1.1 inc Wed Aug 22 10:25:55 2012Node2:
Wed Aug 22 09:47:57 2012
Error 2056 trapped in 2PC on transaction 20.16.21521114. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Wed Aug 22 09:48:28 2012
Error 2056 trapped in 2PC on transaction 78.44.4484418. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Error 2056 trapped in 2PC on transaction 17.42.11081684. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Wed Aug 22 09:48:42 2012
Error 2056 trapped in 2PC on transaction 75.15.10618335. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Wed Aug 22 09:48:58 2012
Error 2056 trapped in 2PC on transaction 62.28.20411900. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Wed Aug 22 09:49:04 2012
Error 2056 trapped in 2PC on transaction 81.34.7287924. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Wed Aug 22 09:50:01 2012
Thread 2 advanced to log sequence 53611 (LGWR switch)
Current log# 5 seq# 53611 mem# 0: /dev/rredo05_rzdb
Wed Aug 22 09:50:34 2012
Error 2056 trapped in 2PC on transaction 71.15.14657575. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Wed Aug 22 09:50:46 2012
Error 2056 trapped in 2PC on transaction 84.12.11181366. Cleaning up.
Error stack returned to user:
ORA-02056: 2PC: k2lpre: bad two-phase command number 0 from coord
Wed Aug 22 09:50:48 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/udump/crmrz2_ora_23856588.trc:
ORA-00600: internal error code, arguments: [ksqcmi:res2], [], [], [], [], [], [], []
Wed Aug 22 09:50:49 2012
Trace dumping is performing id=[cdmp_20120822095049]
Wed Aug 22 09:59:16 2012
IPC Send timeout detected. Receiver ospid 27066946
Receiver is waiting for a latch dumping latch state for receiver -17356
Wed Aug 22 09:59:16 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms4_25362610.trc:
Wed Aug 22 09:59:38 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms0_27066946.trc:
Wed Aug 22 09:59:38 2012
Trace dumping is performing id=[cdmp_20120822095938]
Wed Aug 22 10:02:09 2012
kkjcre1p: unable to spawn jobq slave process
Wed Aug 22 10:02:09 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_cjq0_25100728.trc:
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lmon_25493740.trc:
ORA-29740: evicted by member 0, group incarnation 30
Wed Aug 22 10:02:32 2012
LMON: terminating instance due to error 29740
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms2_24904066.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms6_25626088.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms3_25559280.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms5_22873874.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms1_24905100.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_arc1_26543176.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_arc1_26543176.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_arc1_26543176.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Master background archival failure: 29740
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lgwr_26477376.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lck0_22677374.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lmd0_27132530.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:32 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_pmon_25494950.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:33 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_mman_21235492.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:33 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_arc0_25036272.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:33 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_arc0_25036272.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:33 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_arc0_25036272.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:33 2012
Master background archival failure: 29740
Wed Aug 22 10:02:33 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_smon_22743004.trc:
ORA-29740: evicted by member , group incarnation
Wed Aug 22 10:02:34 2012
Shutting down instance (abort)
License high water mark = 1722
Wed Aug 22 10:02:38 2012
Instance terminated by LMON, pid = 25493740
Wed Aug 22 10:02:39 2012
Instance terminated by USER, pid = 22873880
Wed Aug 22 10:03:09 2012
Starting ORACLE instance (normal)
sskgpgetexecname failed to get name
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Interface type 1 en10 192.100.100.0 configured from OCR for use as a cluster interconnect
Interface type 1 en9 134.224.106.0 configured from OCR for use as a public interface
Picked latch-free SCN scheme 3
Autotune of undo retention is turned on.
LICENSE_MAX_USERS = 0
SYS auditing is disabled
ksdpec: called for event 13740 prior to event group initialization
Starting up ORACLE RDBMS Version: 10.2.0.4.0.
System parameters with non-default values:
processes                = 3000
sessions                 = 3305
resource_limit           = TRUE
event                    =
sga_max_size             = 51539607552
__shared_pool_size       = 4227858432
__large_pool_size        = 16777216
__java_pool_size         = 33554432
__streams_pool_size      = 0
spfile                   = /dev/rspfile_rzdb
sga_target               = 51539607552
control_files            = /dev/rcntrl01_rzdb, /dev/rcntrl02_rzdb, /dev/rcntrl03_rzdb
db_block_size            = 8192
__db_cache_size          = 46707769344
db_keep_cache_size       = 536870912
compatible               = 10.2.0.3.0
log_archive_dest_1       = location=/arch02
db_files                 = 1000
db_file_multiblock_read_count= 16
cluster_database         = TRUE
cluster_database_instances= 2
thread                   = 2
instance_number          = 2
undo_management          = AUTO
undo_tablespace          = UNDOTBS2
undo_retention           = 900
recyclebin               = OFF
remote_login_passwordfile= EXCLUSIVE
db_domain                =
dispatchers              = (PROTOCOL=TCP) (SERVICE=crmrzXDB)
local_listener           = (ADDRESS = (PROTOCOL =TCP)(HOST = 134.224.106.126)(PORT = 1521))
remote_listener          = LISTENERS_CRMRZ
utl_file_dir             = /arch01
job_queue_processes      = 20
cursor_sharing           = SIMILAR
background_dump_dest     = /oracle/opt/product/10.2.0/db/admin/crmrz/bdump
user_dump_dest           = /oracle/opt/product/10.2.0/db/admin/crmrz/udump
core_dump_dest           = /oracle/opt/product/10.2.0/db/admin/crmrz/cdump
audit_file_dest          = /oracle/opt/product/10.2.0/db/admin/crmrz/adump
db_name                  = crmrz
open_cursors             = 1500
pga_aggregate_target     = 10726932480
Cluster communication is configured to use the following interface(s) for this instance
192.100.100.2
Wed Aug 22 10:03:11 2012
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
PMON started with pid=2, OS id=24379686
DIAG started with pid=3, OS id=22743022
PSP0 started with pid=4, OS id=24773700
LMON started with pid=5, OS id=25429322
LMD0 started with pid=6, OS id=25494130
LMS0 started with pid=7, OS id=24904068
LMS1 started with pid=8, OS id=9437246
LMS2 started with pid=9, OS id=13304862
LMS3 started with pid=10, OS id=24969498
LMS4 started with pid=11, OS id=26936330
LMS5 started with pid=12, OS id=26935868
LMS6 started with pid=13, OS id=25167108
MMAN started with pid=14, OS id=27132604
DBW0 started with pid=15, OS id=26739208
DBW1 started with pid=16, OS id=23790482
DBW2 started with pid=17, OS id=25626104
DBW3 started with pid=18, OS id=25494956
LGWR started with pid=19, OS id=26673758
CKPT started with pid=20, OS id=26542676
SMON started with pid=21, OS id=20383626
RECO started with pid=22, OS id=26477394
CJQ0 started with pid=23, OS id=26411882
MMON started with pid=24, OS id=26346430
Wed Aug 22 10:03:14 2012
starting up 1 dispatcher(s) for network address '(ADDRESS=(PARTIAL=YES)(PROTOCOL=TCP))'...
MMNL started with pid=25, OS id=25624582
Wed Aug 22 10:03:14 2012
starting up 1 shared server(s) ...
Wed Aug 22 10:03:19 2012
lmon registered with NM - instance id 2 (internal mem no 1)
Wed Aug 22 10:03:26 2012
Reconfiguration started (old inc 0, new inc 33)
List of nodes:
0 1
Global Resource Directory frozen
* allocate domain 0, invalid = TRUE
Communication channels reestablished
* domain 0 valid = 0 according to instance 0
Wed Aug 22 10:03:26 2012
Master broadcasted resource hash value bitmaps
Non-local Process blocks cleaned out
Wed Aug 22 10:03:26 2012
LMS 4: 0 GCS shadows cancelled, 0 closed
Wed Aug 22 10:03:26 2012
LMS 6: 0 GCS shadows cancelled, 0 closed
Wed Aug 22 10:03:26 2012
LMS 2: 0 GCS shadows cancelled, 0 closed
Wed Aug 22 10:03:26 2012
LMS 1: 0 GCS shadows cancelled, 0 closed
Wed Aug 22 10:03:26 2012
LMS 5: 0 GCS shadows cancelled, 0 closed
Wed Aug 22 10:03:26 2012
LMS 3: 0 GCS shadows cancelled, 0 closed
Wed Aug 22 10:03:26 2012
LMS 0: 0 GCS shadows cancelled, 0 closed
Set master node info
Submitted all remote-enqueue requests
Dwn-cvts replayed, VALBLKs dubious
All grantable enqueues granted
Wed Aug 22 10:03:28 2012
LMS 5: 0 GCS shadows traversed, 0 replayed
Wed Aug 22 10:03:28 2012
LMS 6: 0 GCS shadows traversed, 0 replayed
Wed Aug 22 10:03:28 2012
LMS 4: 0 GCS shadows traversed, 0 replayed
Wed Aug 22 10:03:28 2012
LMS 2: 0 GCS shadows traversed, 0 replayed
Wed Aug 22 10:03:28 2012
LMS 1: 0 GCS shadows traversed, 0 replayed
Wed Aug 22 10:03:28 2012
LMS 3: 0 GCS shadows traversed, 0 replayed
Wed Aug 22 10:03:28 2012
LMS 0: 0 GCS shadows traversed, 0 replayed
Wed Aug 22 10:03:28 2012
Submitted all GCS remote-cache requests
Fix write in gcs resources
Reconfiguration complete
LCK0 started with pid=28, OS id=26215276
Wed Aug 22 10:03:33 2012
ALTER DATABASE   MOUNT
Wed Aug 22 10:03:37 2012
Setting recovery target incarnation to 1
Wed Aug 22 10:03:37 2012
Successful mount of redo thread 2, with mount id 3477695295
Wed Aug 22 10:03:37 2012
Database mounted in Shared Mode (CLUSTER_DATABASE=TRUE)
Completed: ALTER DATABASE   MOUNT
Wed Aug 22 10:03:38 2012
ALTER DATABASE OPEN
Picked broadcast on commit scheme to generate SCNs
Wed Aug 22 10:03:59 2012
Trace dumping is performing id=[cdmp_20120822100344]
IPC Send timeout detected. Receiver ospid 24904068
Wed Aug 22 10:04:03 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms0_24904068.trc:
Wed Aug 22 10:04:59 2012
Trace dumping is performing id=[cdmp_20120822100458]
Wed Aug 22 10:05:19 2012
LGWR: STARTING ARCH PROCESSES
ARC0 started with pid=32, OS id=26739814
Wed Aug 22 10:05:19 2012
ARC0: Archival started
ARC1: Archival started
LGWR: STARTING ARCH PROCESSES COMPLETE
ARC1 started with pid=31, OS id=24445226
Wed Aug 22 10:05:20 2012
IPC Send timeout detected. Receiver ospid 24904068
Wed Aug 22 10:05:20 2012
Errors in file /oracle/opt/product/10.2.0/db/admin/crmrz/bdump/crmrz2_lms0_24904068.trc:
Wed Aug 22 10:05:21 2012
Thread 2 opened at log sequence 53612
Current log# 6 seq# 53612 mem# 0: /dev/rredo06_rzdb
Wed Aug 22 10:05:21 2012
ARC1: Becoming the 'no FAL' ARCH
ARC1: Becoming the 'no SRL' ARCH
Wed Aug 22 10:05:21 2012
ARC0: Becoming the heartbeat ARCH
Wed Aug 22 10:05:21 2012
Successful open of redo thread 2
Wed Aug 22 10:05:21 2012
MTTR advisory is disabled because FAST_START_MTTR_TARGET is not set
Wed Aug 22 10:05:21 2012
SMON: enabling cache recovery
Wed Aug 22 10:06:31 2012
WARNING: inbound connection timed out (ORA-3136)
Wed Aug 22 10:06:32 2012
Successfully onlined Undo Tablespace 4.
Wed Aug 22 10:06:32 2012
SMON: enabling tx recovery
Wed Aug 22 10:06:32 2012
Database Characterset is ZHS16GBK
Opening with internal Resource Manager plan
where NUMA PG = 1, CPUs = 28
replication_dependency_tracking turned off (no async multimaster replication found)
Starting background process QMNC
QMNC started with pid=42, OS id=25887794
Wed Aug 22 10:06:43 2012
Completed: ALTER DATABASE OPEN
Wed Aug 22 10:07:00 2012
WARNING: inbound connection timed out (ORA-3136)
Wed Aug 22 10:07:13 2012
WARNING: inbound connection timed out (ORA-3136)
Wed Aug 22 10:07:25 2012
WARNING: inbound connection timed out (ORA-3136)
Wed Aug 22 10:07:31 2012
Trace dumping is performing id=[cdmp_20120822100731]
Wed Aug 22 10:09:20 2012
Trace dumping is performing id=[cdmp_20120822100732]
Wed Aug 22 10:25:54 2012

2012-08-22 10:01:23.571: [ CRSEVT][11527]32CAAMonitorHandler :: 0:Could not join /oracle/opt/product/10.2.0/crs/bin/racgwrap(check)
category: 1234, operation: scls_process_join, loc: childcrash, OS error: 0, other: Abnormal termination of the child
2012-08-22 10:01:31.473: [ CRSEVT][11527]32CAAMonitorHandler :: 0:Action Script /oracle/opt/product/10.2.0/crs/bin/racgwrap(check) timed out for ora.crmrz01.vip! (timeout=60)
2012-08-22 10:01:31.474: [ CRSAPP][11527]32CheckResource error for ora.crmrz01.vip error code = -2
2012-08-22 10:03:04.553: [ CRSEVT][11530]32CAAMonitorHandler :: 0:Could not join /oracle/opt/product/10.2.0/crs/bin/racgwrap(check)
category: 1234, operation: scls_process_join, loc: childcrash, OS error: 0, other: Abnormal termination of the child
2012-08-22 10:03:04.553: [ CRSEVT][11530]32CAAMonitorHandler :: 0:Action Script /oracle/opt/product/10.2.0/crs/bin/racgwrap(check) timed out for ora.crmrz01.vip! (timeout=60)
2012-08-22 10:03:04.554: [ CRSAPP][11530]32CheckResource error for ora.crmrz01.vip error code = -2
2012-08-22 10:03:39.055: [ COMMCRS][3866]Authorization failed, network error
2012-08-22 10:03:39.138: [ OCRSRV][3866]th_select_answer: Failure in answer. clsc ret [3]
2012-08-22 10:03:39.630: [ COMMCRS][3866]Authorization failed, network error
2012-08-22 10:03:39.814: [ OCRSRV][3866]th_select_answer: Failure in answer. clsc ret [3]
2012-08-22 10:04:13.506: [ COMMCRS][3866]Authorization failed, network error
2012-08-22 10:04:13.522: [ OCRSRV][3866]th_select_answer: Failure in answer. clsc ret [3]
2012-08-22 10:04:13.610: [ COMMCRS][3866]Authorization failed, network error
2012-08-22 10:04:13.618: [ OCRSRV][3866]th_select_answer: Failure in answer. clsc ret [3]
2012-08-22 10:04:44.509: [ COMMCRS][3866]Authorization failed, network error
2012-08-22 10:04:44.525: [ OCRSRV][3866]th_select_answer: Failure in answer. clsc ret [3]
2012-08-22 10:04:44.898: [ COMMCRS][3866]Authorization failed, network error
2012-08-22 10:04:44.915: [ OCRSRV][3866]th_select_answer: Failure in answer. clsc ret [3]
2012-08-22 10:04:58.959: [ COMMCRS][3866]Authorization failed, network error
2012-08-22 10:04:58.963: [ OCRSRV][3866]th_select_answer: Failure in answer. clsc ret [3]
2012-08-22 10:04:59.047: [ COMMCRS][3866]Authorization failed, network 2012-08-22 10:12:25.310: [ CRSRES][11569]32startRunnable: setting CLI values
crs 日志network error 基本上可以确定当时的网络是存在问题

MULTIPLE USERS 10G RAC ORACLE_HOME INSTALL WITH ASM/CRS

Hi,
We need to install multiple 10g RAC databases on a two node Sun servers. Below is our configuration:
1) Sun Solaris (ver 10) with Sun Cluster 3.2
2) One ASM/CRS install (by 1 OS account)
3) Four ORACLE_HOME 10g database install (by 4 different OS user accounts)
We would like to use one ASM instance for all four databases with appropriate privileges.
OS User:           OS Group
========      =========
oraasm           dbaasm - (ASM and CRS install owner)
ora1           dbaora1 - first db owner
ora2           dbaora2 - second db owner
ora3           dbaora3 - third db owner
ora4           dbaora4 - fourth db owner
I understand that certain privileges need to be shared between ASM/CRS and DB owners. Please let me know the steps to be followed to complete this install.
Thanks in advance.

Hi
Please read that: Documentation http://download.oracle.com/docs/html/B10766_08/intro.htm
- You can install and operate multiple Oracle homes and different versions of Oracle cluster database software on the same computer as described in the following points:
-You can install multiple Oracle Database 10g RAC homes on the same node. The multiple homes feature enables you to install one or more releases on the same machine in multiple Oracle home directories. However, each node can have only one CRS home.
-In addition, you cannot install Oracle Database 10g RAC into an existing single-instance Oracle home. If you have an Oracle home for Oracle Database 10g, then use a different Oracle home, and one that is available across the entire cluster for your new installation. Similarly, if you have an Oracle home for an earlier Oracle cluster database software release, then you must also use a different home for the new installation.
If the OUI detects an earlier version of a database, then the OUI asks you about your upgrade preferences. You have the option to upgrade one of the previous-version databases with DBUA or to create a new database using DBCA. The information collected during this dialog is passed to DBUA or DBCA after the software is installed.
- You can use the OUI to complete some of the de-install and re-install steps for Oracle Database 10g Real Application Clusters if needed.
Note:
Do not move Oracle binaries from one Oracle home to another because this causes dynamic link failures.
. If you are using ASM with Oracle database instances from multiple database homes on the same node, then Oracle recommends that you run the ASM instance from an Oracle home that is distinct from the database homes. In addition, the ASM home should be installed on every cluster node. This prevents the accidental removal of ASM instances that are in use by databases from other homes during the de-installation of a database's Oracle home.

APEX Application behaviour in a RAC setup

Hi
Caveat first: I'm pretty new to Oracle RAC and just looking into it as an option. We have an APEX application currently running in Oracle 11gR2 single node currently and are considering HA for this.
My question is: What would be the expected behaviour seen by a User of an APEX application, in the event of a node failure, when running with an OHS / RAC configuration? Will they get "transparent fail-over" and see nothing or will they see an error?
I appreciate I could post in the APEX forum, but feel that is probably more of a development forum and possibly someone here has had to look at things at this level.
I have read what I think may be the definitive reference for this:
http://www.oracle.com/technetwork/developer-tools/apex/learnmore/apex-rac-wp-133532.pdf
but while it covers most of what I want I don't believe I have found an answer to my question
This states:
"The Transparent Application Failover (TAF) feature of Oracle Net Services is a runtime failover for high-availability environments. It enables client applications to automatically reconnect to the database if the connection fails and, optionally, resume a SELECT statement that was in progress. The reconnection happens automatically from within the Oracle Call Interface (OCI) library. For applications that do insert, update or delete transactions, the application must trap the error when the failure occurs, rollback the transaction, and then resubmit. If the application is not written to be TAF aware, the session will get disconnected."
However (as I understand it) APEX runs in the database and would fail with the database, it isn't a typical "client application" connecting to Oracle via a TAF aware connection pool - it is essentially a large pl/sql package and TAF only covers SELECT statements not packages.
May be I'm over-reading this and it's simpler than that: APEX/Mod_plsql might just handle it?
- APEX User/HTTP session state is stored in database APEX: Understanding session state which is available on other nodes
- Mod_plsql in OHS can detect the error returning and reissues the request to good server and APEX on that instance can retrieve Users/HTTP state and process the request (APEX/RAC doc states mod_plsql can see an error from database and cleanup connection up and form a new connection, but not that it will retry the request for the client into other APEX/DB node).
I'm really just after a (transparent/non-transparent) statement based on experience, but an outline of how the components behave would be useful.
Thanks in advance
Dave

Hi
Any chance of getting that link outside of Metalink? - I'm trying to get our customer support id, but no luck at present.
I'm aware that APEX can run with RAC (as per the link I posted) - I'm really after next level info around behaviour in that environment.
Thanks
Dave

Deallock detection in rac

Similar Messages

Maybe you are looking for