RAC instance crash...

1)Hi...I have a 3 node RAC A,B,C...suppose if node A got evicted automatically the survivng nodes will recover the node A in 10.2.0....my question is as a DBA how can we know that the node A got evicted and what files or views do we have to check that the instance got crashed...can the DBA perform any instance recover..if yes,how can we do it...
2) In a stand alone database if the instance got crashed how would we recover the instance..what views or file do we have to see..can anyone explain the process with few steps.
Thanks

798852 wrote:
1)Hi...I have a 3 node RAC A,B,C...suppose if node A got evicted automatically the survivng nodes will recover the node A in 10.2.0....my question is as a DBA how can we know that the node A got evicted and what files or views do we have to check that the instance got crashed...can the DBA perform any instance recover..if yes,how can we do it...
As the answer is given by Pierre, just wanted to share this MOS note, 461662.1 .
2) In a stand alone database if the instance got crashed how would we recover the instance..what views or file do we have to see..can anyone explain the process with few steps.
iEither in a RAC or in a standalone database, DBA does not do anything except issuing startup command. Instance recovery is an automatic phenomenon. Being a DBA ,what you may check is the Alert.log file which would show the progress and also the V$instance_recovery view.
HTH
Aman....

Similar Messages

Rac Instance Crashes

Dear all,
My version is 11.2.0.2.5 one of my rac instance crashes with message ORA-00240: control file enqueue held for more than 120 seconds. Received an instance abort message from instance 1.
here are the contents of alert log file
IPC Send timeout detected. Receiver ospid 27423 [[email protected] (LMON)]
2013-03-22 22:30:05.644000 -07:00
Errors in file /u01/app/oracle/diag/rdbms/lfgoimdb/LFGoimdb2/trace/LFGoimdb2_lmon_27423.trc:
2013-03-22 22:31:08.734000 -07:00
Errors in file /u01/app/oracle/diag/rdbms/lfgoimdb/LFGoimdb2/trace/LFGoimdb2_arc2_27691.trc (incident=15905):
ORA-00240: control file enqueue held for more than 120 seconds
Incident details in: /u01/app/oracle/diag/rdbms/lfgoimdb/LFGoimdb2/incident/incdir_15905/LFGoimdb2_arc2_27691_i15905.trc
2013-03-22 22:31:13.409000 -07:00
Received an instance abort message from instance 1
Please check instance 1 alert and LMON trace files for detail.
LMS0 (ospid: 27427): terminating the instance due to error 481
System state dump requested by (instance=2, osid=27427 (LMS0)), summary=[abnormal instance termination].
System State dumped to trace file /u01/app/oracle/diag/rdbms/lfgoimdb/LFGoimdb2/trace/LFGoimdb2_diag_27413.trc
2013-03-22 22:31:18.376000 -07:00
Dumping diagnostic data in directory=[cdmp_20130322223113], requested by (instance=2, osid=27427 (LMS0)), summary=[abnormal instance termination].
ORA-1092 : opitsk aborting process
Instance terminated by LMS0, pid = 27427

Thanks for reply,
My redo logs size is default 50mb.There is currently no load on the system since we are not using this environment for time being.The log switches are averaged to be 8 per day.I think Increasing the size of redo will further cause the problems since the archiver may again hold lock for more time.
Since there is no dedicated connection between the nodes and storage ,So increasing the hardware and network configuration is only solution to this? Or I am still missing something...
As far as configuration is considered i cannot add more resources to this environment.How can I solve this issue?

Instance crash on RAC will impact goldengate extract process?

Dear All,
DB Version: 11.2.0.3
GG version: 11.2.0.1
Could you please let me know, how gg works when a instance crashes in a RAC cluster. Will extract proess get abend or extract will continue to capture
Thank you very much in advance
Regards
Mvk

If GG is configured properly for the RAC, the GG manager should start extract on the living node and there should not be any data loss.
Please refer to Oracle documentation on how to configure GG in RAC.

Difference between Instance Crash and Database Crash in Standalone and RAC?

I am new to Oracle Environment. I would appreiciate if i get clarified on the "_+Difference between Instance Crash and Database Crash in Standalone and
RAC_".+
Oracle Version: 11.1.0.7.0
O.S Version: HP-UX B.11.31 U

Sivaprasad S wrote:
I am new to Oracle Environment. I would appreiciate if i get clarified on the "_+Difference between Instance Crash and Database Crash in Standalone and
RAC_".+That depends on your definitions of database crash versus instance crash.
Literally, database crash means that the database itself is damaged in some way. A disk is no more. Data files are corrupted. Etc. The database itself thus causes the crash - and a database crash will also cause all database instances (RAC or single) for that database to crash too.
An instance crash will therefore mean that the crash occurs in the instance due to an Oracle bug or similar. In other words a s/w error and not a database error. The database is still "fine", but the instance not. Instance crashes. However, any other instance on that same (RAC) database will still be fine as the database itself is unaffected by that s/w error in the crashed instance.
If you mean something different, please enlighten us. Not all crashes are equal. {noformat};-){noformat}

Rconfig: converting a single instance to RAC instance

Hi,
I am trying to use the "rconfig" utility to convert a single instance to a RAC instance in an existing RAC cluster.
I have modified the .xml file, and am trying to run the conversion from the 1st node in the 2 node cluster (where the single instance resides).
The only error message i seem to be getting is below:
<Response>
<Result code="1" >
Operation Failed
</Result>
<ErrorDetails>
ORCL_DATA_ORCLCLN The specified diskgroup is not mounted.
</ErrorDetails>
</Response>
</Convert>
</ConvertToRAC></RConfig>
Now I dont really understand why I would be getting that message as the instance is up and running and ASM disk group is mounted on node1 at the time i run the rconfig command, though its not clear to me if I also need to somehow mount the ASM disk group on the second node prior to running the rconfig command??
node1:
bash-3.00$ asmcmd -p
ASMCMD [+] > lsdg
State Type Rebal Unbal Sector Block AU Total_MB Free_MB Req_mir_free_MB Usable_file_MB Offline_disks Name
MOUNTED EXTERN N N 512 4096 1048576 10181 7442 0 7442 0 ORCL_DATA_ORCLCLN/
node2:
ASMCMD [+] > lsdg
State Type Rebal Unbal Sector Block AU Total_MB Free_MB Req_mir_free_MB Usable_file_MB Offline_disks Name
I have attached the output of the alert log during the rconfig conversion of the target database, but it all looks pretty standard to me (keep in mind i am an oracle novice!).
alert.log
Completed: ALTER DATABASE OPEN
Thu Jul 23 13:51:55 2009
Shutting down instance (abort)
License high water mark = 2
Instance terminated by USER, pid = 15030
Thu Jul 23 13:51:57 2009
Starting ORACLE instance (normal)
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Interface type 1 e1000g1 10.128.113.0 configured from OCR for use as a cluster interconnect
Interface type 1 e1000g0 10.128.113.0 configured from OCR for use as a public interface
Picked latch-free SCN scheme 2
Using LOG_ARCHIVE_DEST_1 parameter default value as /u01/app/oracle/product/10.2.0/db_1/dbs/arch
Autotune of undo retention is turned on.
IMODE=BR
ILAT =18
LICENSE_MAX_USERS = 0
SYS auditing is disabled
ksdpec: called for event 13740 prior to event group initialization
Starting up ORACLE RDBMS Version: 10.2.0.2.0.
System parameters with non-default values:
processes = 150
__shared_pool_size = 121634816
__large_pool_size = 4194304
__java_pool_size = 4194304
__streams_pool_size = 0
sga_target = 440401920
control_files = +ORCL_DATA_ORCLCLN/control01.ctl
db_block_size = 8192
__db_cache_size = 306184192
compatible = 10.2.0.2.0
log_archive_format = %t_%s_%r.dbf
db_file_multiblock_read_count= 16
cluster_database = FALSE
cluster_database_instances= 1
db_recovery_file_dest_size= 2147483648
norecovery_through_resetlogs= TRUE
undo_management = AUTO
undo_tablespace = UNDOTBS1
remote_login_passwordfile= EXCLUSIVE
db_domain = netapp.com
job_queue_processes = 10
background_dump_dest = /u01/app/oracle/admin/orcldb/bdump/ORCLCLN
user_dump_dest = /u01/app/oracle/admin/orcldb/udump/ORCLCLN
core_dump_dest = /u01/app/oracle/admin/orcldb/cdump/ORCLCLN
db_name = ORCLCLN
open_cursors = 300
pga_aggregate_target = 145752064
Cluster communication is configured to use the following interface(s) for this instance
10.128.113.200
Thu Jul 23 13:51:59 2009
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
PMON started with pid=2, OS id=15085
DIAG started with pid=3, OS id=15091
PSP0 started with pid=4, OS id=15094
LMON started with pid=5, OS id=15097
LMD0 started with pid=6, OS id=15102
MMAN started with pid=7, OS id=15112
DBW0 started with pid=8, OS id=15114
LGWR started with pid=9, OS id=15116
CKPT started with pid=10, OS id=15125
SMON started with pid=11, OS id=15128
RECO started with pid=12, OS id=15130
CJQ0 started with pid=13, OS id=15134
MMON started with pid=14, OS id=15143
MMNL started with pid=15, OS id=15146
Thu Jul 23 13:52:03 2009
lmon registered with NM - instance id 1 (internal mem no 0)
Thu Jul 23 13:52:04 2009
Reconfiguration started (old inc 0, new inc 2)
List of nodes:
0
Global Resource Directory frozen
* allocate domain 0, invalid = TRUE
Communication channels reestablished
Master broadcasted resource hash value bitmaps
Non-local Process blocks cleaned out
Resources and enqueues cleaned out
Resources remastered 0
Set master node info
Submitted all remote-enqueue requests
Dwn-cvts replayed, VALBLKs dubious
All grantable enqueues granted
Post SMON to start 1st pass IR
Submitted all GCS remote-cache requests
Post SMON to start 1st pass IR
Reconfiguration complete
Thu Jul 23 13:52:04 2009
ALTER DATABASE MOUNT
Thu Jul 23 13:52:04 2009
Starting background process ASMB
ASMB started with pid=17, OS id=15157
Starting background process RBAL
RBAL started with pid=18, OS id=15169
Thu Jul 23 13:52:09 2009
SUCCESS: diskgroup ORCL_DATA_ORCLCLN was mounted
Thu Jul 23 13:52:13 2009
Setting recovery target incarnation to 2
Thu Jul 23 13:52:13 2009
Successful mount of redo thread 1, with mount id 4437636
Thu Jul 23 13:52:13 2009
Database mounted in Exclusive Mode
Completed: ALTER DATABASE MOUNT
Thu Jul 23 13:52:14 2009
ALTER DATABASE OPEN
Thu Jul 23 13:52:14 2009
Beginning crash recovery of 1 threads
Thu Jul 23 13:52:14 2009
Started redo scan
Thu Jul 23 13:52:14 2009
Completed redo scan
105 redo blocks read, 32 data blocks need recovery
Thu Jul 23 13:52:14 2009
Started redo application at
Thread 1: logseq 2, block 929
Thu Jul 23 13:52:15 2009
Recovery of Online Redo Log: Thread 1 Group 2 Seq 2 Reading mem 0
Mem# 0 errs 0: +ORCL_DATA_ORCLCLN/redo_2_1.log
Mem# 1 errs 0: +ORCL_DATA_ORCLCLN/redo_2_0.log
Thu Jul 23 13:52:15 2009
Completed redo application
Thu Jul 23 13:52:15 2009
Completed crash recovery at
Thread 1: logseq 2, block 1034, scn 613579
32 data blocks read, 25 data blocks written, 105 redo blocks read
Thu Jul 23 13:52:15 2009
Thread 1 advanced to log sequence 3
Thread 1 opened at log sequence 3
Current log# 1 seq# 3 mem# 0: +ORCL_DATA_ORCLCLN/redo_1_1.log
Current log# 1 seq# 3 mem# 1: +ORCL_DATA_ORCLCLN/redo_1_0.log
Successful open of redo thread 1
Thu Jul 23 13:52:15 2009
MTTR advisory is disabled because FAST_START_MTTR_TARGET is not set
Thu Jul 23 13:52:15 2009
SMON: enabling cache recovery
Thu Jul 23 13:52:17 2009
Successfully onlined Undo Tablespace 1.
Thu Jul 23 13:52:17 2009
SMON: enabling tx recovery
Thu Jul 23 13:52:17 2009
Database Characterset is WE8ISO8859P1
replication_dependency_tracking turned off (no async multimaster replication found)
Starting background process QMNC
QMNC started with pid=21, OS id=15328
Thu Jul 23 13:52:23 2009
Completed: ALTER DATABASE OPEN
Any help would be greatly appreciated!!!!

Ok,
So I managed to get the disk group mounted on the second node, and re-ran the rconfig process.
I got a little further, but encountered another error which is displayed below:
-bash-3.00$ rconfig racconv.xml
<?xml version="1.0" ?>
<RConfig>
<ConvertToRAC>
<Convert>
<Response>
<Result code="1" >
Operation Failed
</Result>
<ErrorDetails>
/u01/app/oracle/product/10.2.0/db_1/dbs Data File is not shared across all nodes in the cluster
</ErrorDetails>
</Response>
</Convert>
</ConvertToRAC></RConfig>
I am not using a shared oracle home, each node in the cluster has its own oracle installation residing on local disk. Is a shared oracle home a pre-requisite for usin rconfig?
I have provided the .xml file I am using below:
-bash-3.00$ cat racconv.xml
<?xml version="1.0" encoding="UTF-8"?>
<n:RConfig xmlns:n="http://www.oracle.com/rconfig"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.oracle.com/rconfig">
<n:ConvertToRAC>

<n:Convert verify="YES">

<n:SourceDBHome>/u01/app/oracle/product/10.2.0/db_1</n:SourceDBHome>

<n:TargetDBHome>/u01/app/oracle/product/10.2.0/db_1</n:TargetDBHome>

<n:SourceDBInfo SID="ORCLCLN">
<n:Credentials>
<n:User>oracle</n:User>
<n:Password>password</n:Password>
<n:Role>sysdba</n:Role>
</n:Credentials>
</n:SourceDBInfo>

<n:ASMInfo SID="+ASM1">
<n:Credentials>
<n:User>oracle</n:User>
<n:Password>password</n:Password>
<n:Role>sysdba</n:Role>
</n:Credentials>
</n:ASMInfo>

<n:NodeList>
<n:Node name="sol002"/>
<n:Node name="sol003"/>
</n:NodeList>

<n:InstancePrefix>ORCLCLN</n:InstancePrefix>

<n:Listener port=""/>

<n:SharedStorage type="ASM">

<n:TargetDatabaseArea></n:TargetDatabaseArea>

<n:TargetFlashRecoveryArea></n:TargetFlashRecoveryArea>
</n:SharedStorage>
</n:Convert>
</n:ConvertToRAC>
</n:RConfig>

Why my instance crashed down

HI,everybody,My instance crashed down last day.
Is there anybody can tell me why ?
oracle version：10.2.0.1.0 (Only Patch 4612267）
OS version：redhat as 4.5
Sorry, I am in china,so there is some chinese character in the alter file,but I think everybody can guess the meanings depending on the location.
alter file：Thu Apr 8 17:02:38 2010
Hex dump of (file 3, block 85680) in trace file /u01/app/oracle/admin/WENDING/bdump/wending_dbw0_22490.trc
Corrupt block relative dba: 0x00c14eb0 (file 3, block 85680)
Bad header found during preparing block for write
Data in bad block:
type: 6 format: 2 rdba: 0x00c14eb0
last change scn: 0x0001.e55fdf1f seq: 0x1 flg: 0x02
spare1: 0x0 spare2: 0x0 spare3: 0x2000
consistency value in tail: 0xdf1f0601
check value in block header: 0xcf0
block checksum disabled
Thu Apr 8 17:02:49 2010
Errors in file /u01/app/oracle/admin/WENDING/bdump/wending_dbw0_22490.trc:
ORA-00600: 内部错误代码, 参数: [kcbzpbuf_1], [4], [1], [], [], [], [], []
Thu Apr 8 17:02:51 2010
Errors in file /u01/app/oracle/admin/WENDING/bdump/wending_dbw0_22490.trc:
ORA-00600: 内部错误代码, 参数: [kcbzpbuf_1], [4], [1], [], [], [], [], []
Thu Apr 8 17:02:51 2010
DBW0: terminating instance due to error 471
Termination issued to instance processes. Waiting for the processes to exit
Thu Apr 8 17:03:02 2010
Instance termination failed to kill one or more processes
Instance terminated by DBW0, pid = 22490
trace file：
BH (0x85fa400c) file#: 3 rdba: 0x00c14eb0 (3/85680) class: 1 ba: 0x85000000
set: 3 blksize: 8192 bsi: 0 set-flg: 0 pwbcnt: 0
dbwrid: 0 obj: 147678 objn: 147678 tsn: 2 afn: 3
hash: [63ff817c,9d1f6d1c] lru-req: [9d465708,9d465708]
lru-flags: on_auxiliary_list object_write_list
ckptq: [26fdb504,30ff0114] fileq: [26fdb50c,6bfbf4cc] objq: [93e35c90,93e35c90]
st: XCURRENT md: NULL tch: 2
flags: buffer_dirty being_written gotten_in_current_mode being_evicted
block_written_once redo_since_read
LRBA: [0x516a.1a2bd.0] HSCN: [0x1.e55fdf1f] HSUB: [1]
Dump of buffer cache at level 10 for tsn=2, rdba=12668592
BH (0x85fa400c) file#: 3 rdba: 0x00c14eb0 (3/85680) class: 1 ba: 0x85000000
set: 3 blksize: 8192 bsi: 0 set-flg: 0 pwbcnt: 0
dbwrid: 0 obj: 147678 objn: 147678 tsn: 2 afn: 3
hash: [63ff817c,9d1f6d1c] lru-req: [9d465708,9d465708]
lru-flags: on_auxiliary_list object_write_list
ckptq: [26fdb504,30ff0114] fileq: [26fdb50c,6bfbf4cc] objq: [93e35c90,93e35c90]
st: XCURRENT md: NULL tch: 2
flags: buffer_dirty being_written gotten_in_current_mode being_evicted
block_written_once redo_since_read
LRBA: [0x516a.1a2bd.0] HSCN: [0x1.e55fdf1f] HSUB: [1]
buffer tsn: 2 rdba: 0x00c14eb0 (3/85680)
scn: 0x0001.e55fdf1f seq: 0x01 flg: 0x02 tail: 0xdf1f0601
frmt: 0x02 chkval: 0x0cf0 type: 0x06=trans data
Hex dump of corrupt header 4 = CORRUPT
Edited by: seven qi on 2010-4-9 上午1:30

thank you for so quick response.
I don't understand what happened when the error appeared.
The trace file is so difficult to read, I don't know what it want to show.
Would you like to tell me what happened with simple language?
Edited by: seven qi on 2010-4-9 上午1:42

Communication between RAC instances

Hi,
I see different answers for this Communication between RAC instances in google.
1. UDP
2.Distributed Lock Manager (DLM)
which one of is correct?

both of them are correct on different layers. The IP protocol used over the interconnect is udp and one of the protocols/mechanism implemented above that is DLM.

Communication between RAC instances in google.

Hi,
I see different answers for this Communication between RAC instances in google.
1. UDP
2.Distributed Lock Manager (DLM)
which one of is correct?

Pl do not post duplicate threads - Communication between RAC instances
Srini

Oracle RAC instance related

We have Oracle Databases 11g Enterprise Edition Release 11.2.0.1.0 - 64bit Production running on Linux x86 64-bit
It is a two instance RAC running on servers, let's say - node1 and node2 . We are using ASM
Node 1 has an ASM instance ASM1 and Node 2 has an ASM instance ASM2.
There are 3 11g rdbms databases running on these nodes.
Instances db11,db21,db31 are running on node 1 and corresponding RAC instances db12,db22,db32 are running on node 2
The listeners are configured exactly the same on both nodes.
On Node 2, when I do
[oracle@node2 admin]$ lsnrctl status
LSNRCTL for Linux: Version 11.2.0.1.0 - Production on 19-NOV-2010 14:34:34
Copyright (c) 1991, 2009, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=IPC)(KEY=LISTENER)))
STATUS of the LISTENER
Alias                     LISTENER
Version                   TNSLSNR for Linux: Version 11.2.0.1.0 - Production
Start Date                15-NOV-2010 13:33:49
Uptime                    4 days 1 hr. 0 min. 44 sec
Trace Level               off
Security                  ON: Local OS Authentication
SNMP                      OFF
Listener Parameter File   /opt/oracle/product/11.2.0/grid/network/admin/listener.ora
Listener Log File         /opt/app/oracle/diag/tnslsnr/node2/listener/alert/log.xml
Listening Endpoints Summary...
(DESCRIPTION=(ADDRESS=(PROTOCOL=ipc)(KEY=LISTENER)))
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=10.10.7.42)(PORT=1521)))
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=10.10.7.52)(PORT=1521)))
Services Summary...
Service "+ASM" has 1 instance(s).
Instance "+ASM2", status READY, has 1 handler(s) for this service...
Service "db1" has 2 instance(s).
Instance "db11", status READY, has 1 handler(s) for this service...
Instance "db12", status READY, has 2 handler(s) for this service...
Service "db2" has 2 instance(s).
Instance "db21", status READY, has 1 handler(s) for this service...
Instance "db22", status READY, has 2 handler(s) for this service...
Service "db3" has 2 instance(s).
Instance "db31", status READY, has 1 handler(s) for this service...
Instance "db32", status READY, has 2 handler(s) for this service...
The command completed successfullyThe above looks good which is what should be the case.
Now, if I try doing the same on node 1 (and this is where I am concerned)
[oracle@node1 admin]$ lsnrctl status
LSNRCTL for Linux: Version 11.2.0.1.0 - Production on 19-NOV-2010 14:41:45
Copyright (c) 1991, 2009, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=IPC)(KEY=LISTENER)))
STATUS of the LISTENER
Alias                     LISTENER
Version                   TNSLSNR for Linux: Version 11.2.0.1.0 - Production
Start Date                19-NOV-2010 03:20:44
Uptime                    0 days 11 hr. 21 min. 1 sec
Trace Level               off
Security                  ON: Local OS Authentication
SNMP                      OFF
Listener Parameter File   /opt/oracle/product/11.2.0/grid/network/admin/listener.ora
Listener Log File         /opt/app/oracle/diag/tnslsnr/node1/listener/alert/log.xml
Listening Endpoints Summary...
(DESCRIPTION=(ADDRESS=(PROTOCOL=ipc)(KEY=LISTENER)))
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=10.10.7.41)(PORT=1521)))
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=10.10.7.51)(PORT=1521)))
Services Summary...
Service "+ASM" has 1 instance(s).
Instance "+ASM1", status READY, has 1 handler(s) for this service...
Service "db1" has 1 instance(s).
Instance "db11", status READY, has 2 handler(s) for this service...
Service "db2" has 1 instance(s).
Instance "db21", status READY, has 2 handler(s) for this service...
Service "db3" has 1 instance(s).
Instance "db31", status READY, has 2 handler(s) for this service...
The command completed successfullyThe node 1 does not seem to report the fact that each of these 3 databases have 2 instances and also does not list its other instances besides the ones running on it. Any ideas or suggestions as to where to look?

One problem is here
The listeners are configured exactly the same on both nodes.This is incorrect, as the listeners need to have different names.
They need to have different names as listener_node1 is the remote_listener for node 2 and vice versa.
The correct set up is:
The listener name is node dependent.
The listener definition
listener_<node>=(host=)(protocol=)(port=)
is included in tnsnames.ora
the remote_listener parameter is set to the listener of the other node.
Using hardcoded IPs in listener.ora and tnsnames.ora is a bad idea.
Not sure why you don't ask this question in the RAC forum.
Sybrand Bakker
Senior Oracle DBA

RAC instance name - Can i choose my OWN?

Can i choose RAC instance name rather than defined by DBCA?
for example, i want to name my instance as <DBNAME>_1

yes you can
Cheers

How many 11gR2 RAC instance servers would be needed?

On the shared SAN drive - there will be as many as 6 11gR2 databases. For 2 node RAC instance servers - how many instance servers would work fine - are two separate servers needed for each database?

user7107816 wrote:
On the shared SAN drive - there will be as many as 6 11gR2 databases. For 2 node RAC instance servers - how many instance servers would work fine - are two separate servers needed for each database???? That depends. Does each database require high availability? Your question seems to indicate a lack of understanding of what RAC provides and how it achieves that goal... High Availability.
RAC can also be used to provide additional horsepower - on the fly. I have added a new node - start to finish in < 2hrs. (new hardware+install/configure OS+addnode to RAC and have it online and operational.) No other database solution offers this flexability.

How to determine which RAC-instance the appl. is logged onto?

Dear all,
I need to have my application server determine which RAC-
instance is currently active (logged onto). I have a
tnsnames.ora file with a primary-, and secondary RAC-
instance configured, and Failover/Failback between the
instances work fine. However, I would be interested in
determining which instance I am curently using.
Does the Oracle Net Protocol have support for letting me
"read" this out, or...?
Thanks.
Regards, Eldor R.

Thank you for the prompt reply.
Is there, in the Oracle Net Protocol, available
function(s) for reading out this information
directly without "parsing" the trace file?
I would like to read out this information from my
application run-time.
Thanks.

OC4J_SOA INSTANCE CRASH INADVERTENTLY WHILE A ESB SYSTEM IS PROCESSING

We are using OAS 10.1.3.3.0....
We have a ESB system which reads changes in a table (DB Adapter) and enqueue that changes in an AQ queue with an ADT (AQ Adapter)...
Sometimes that ESB process thousands of records very well but sometimes when is processing the oc4j_soa instances
crash and its log show...
07/11/08 17:03:57 at java.lang.Thread.run(Thread.java:595)
# An unexpected error has been detected by HotSpot Virtual Machine:
# SIGSEGV (0xb) at pc=0xb7a0010c, pid=28819, tid=831085488
# Java VM: Java HotSpot(TM) Server VM (1.5.0_06-b05 mixed mode)
# Problematic frame:
# V [libjvm.so+0x3d910c]
# An error report file with more information is saved as hs_err_pid28819.log
# If you would like to submit a bug report, please visit:
# http://java.sun.com/webapps/bugreport/crash.jsp
When I see the AppServer Control the instance is automatically restarted....
I need your help......!
Message was edited by:
user604794

Thanks for your answer....
About it.. let me talk to you the following... I have experienced this crash in Windows XP in a development enviroment, and I think that in an enviroment more robust like Linux this problem woulbe be solved... but no... the same error with less frecuency but the same error...
About Metalink... they told me....
"...You should be able to log a bug with SUN. The JVM is a SUN product. From the error is looks like an issue with one
of the JVM libraries. Once you submit the hs_err_pid28819.log file, SUN should b
e able to search and find a match in there library regarding the issue...."
But In SUN when I try to register the crash I read the following...
" ...If the crash occurred while running a product developed by a company other than Sun, please report this issue to the vendor of that product. ..."
So I don´t know what to do... Anybody HELP....

RAC instance, trying to recover UNDO datafile, RMAN gives RMAN-06054

Hello all,
This has been a troublesome instance..a quick bit of background. This was created awhile back by someone else, I inherited this 3 mode RAC clusterof instance1.
I'm exporting out of one database (10G) into this instance1 (11G). When I was about to start the import..I found this instance wouldn't start. Turned out no backup had been going on of this empty instance. I backed up the archive logs to tape to free up the FRA..and things fired up.
I began the import, and found a bunch of errors...basically tellling me that I couldn't access one of the undo tablespaces...datafile problems.
I went to look and saw:
SQL> select a.file_name, a.file_id, b.status, a.tablespace_name
     2 from dba_data_files a, v$datafile b
     3 where a.file_id = b.file#
     4 order by a.file_name;
     FILE_NAME FILE_ID STATUS TABLESPACE_NAME
     +DATADG/instance1/datafile/sysaux.270.696702269              2 ONLINE                SYSAUX
     +DATADG/instance1/datafile/system.263.696702253              1 SYSTEM                SYSTEM
     +DATADG/instance1/datafile/undotbs1.257.696702279            3 ONLINE                UNDOTBS1
     +DATADG/instance1/datafile/undotbs2.266.696702305            4 ONLINE                UNDOTBS2
     +DATADG/instance1/datafile/undotbs3.269.696702313            5 RECOVER               UNDOTBS3
     +DATADG/instance1/datafile/users.268.696702321               6 ONLINE                USERS
     +DATADG/instance1/l_data_01_01                              11 ONLINE                L_DATA_01
     +DATADG/instance1/s_data_01_01                               7 ONLINE                S_DATA_01
     +DATADG/instance1/s_data_01_02                               8 ONLINE                S_DATA_01
     +INDEXDG/instance1/l_index_01_01                            12 ONLINE                L_INDEX_01
     +INDEXDG/instance1/s_index_01_01                             9 ONLINE                S_INDEX_01
     FILE_NAME FILE_ID STATUS TABLESPACE_NAME
     +INDEXDG/instance1/s_index_01_02                            10 ONLINE                S_INDEX_01
There is is, file #5.
So, I went into RMAN to try to restore/recover:
RMAN> restore datafile 5;
Starting restore at 06-APR-10
allocated channel: ORA_SBT_TAPE_1
channel ORA_SBT_TAPE_1: SID=222 instance=instance1 device type=SBT_TAPE
channel ORA_SBT_TAPE_1: NMO v4.5.0.0
allocated channel: ORA_DISK_1
channel ORA_DISK_1: SID=222 instance=instance1 device type=DISK
creating datafile file number=5 name=+DATADG/instance1/datafile/undotbs3.269.696702313
restore not done; all files read only, offline, or already restored
Finished restore at 06-APR-10
RMAN> recover datafile 5;
Starting recover at 06-APR-10
using channel ORA_SBT_TAPE_1
using channel ORA_DISK_1
starting media recovery
RMAN-06560: WARNING: backup set with key 343546 will be read 2 times
available space of 8315779 kb needed to avoid reading the backup set multiple times
unable to find archived log
archived log thread=1 sequence=1
RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03002: failure of recover command at 04/06/2010 14:33:07
RMAN-06054: media recovery requesting unknown archived log for thread 1 with sequence 1 and starting SCN of 16016
This is all on ASM, and am a bit of a newb with that. I bascially have no data I'm worried about losing, I just need to get everything 'on the air' so I can import successfully, and let users on this instance. I've set up the backups in GRID now....so, it will be backed up on the future, but what is the quickest, most efficient way to get this UNDO tablespace datafile recovered?
Thank you,
cayenne

Hemant K Chitale wrote:
SET UNTIL SEQUENCE 27wouldn't work if the Recovery requires Sequence 1 and it is missing.
Hemant K ChitaleOops...meant to have start and set until both to "1"
However, I see what you mean. It seems I cannot find the file on tape.
Since the RAC instance hasn't yet had any data put into it, I'm thinking it might be best to just blow it away, and recreate everything.
Trouble is, I'm a bit new at RAC and ASM. I was thinking the best route might be to use DBCA to remove the database...? Would this not take care of removing all the datafiles from all the ASM instances on the RAC..as well as all the other directories, etc on all 3x nodes?
I've already used the dbca to create templates of this instance, so recreation shouldn't be too difficult (although it will be my first RAC creation)...
Thank you in advance for the advice so far,
cayenne

Getting error while starting the RAC instance through SRVCTL utility

Hi DBA's,
We are getting the below error while starting the Orcale 10g RAC instance on one of the node thru SRVCTL:
oracle:/home/oracle]>srvctl start instance -d SQLTSTM2 -i SQLTSTM21
PRKP-1001 : Error starting instance SQLTSTM21 on node r1racd
CRS-1028: Dependency analysis failed because of:
CRS-0223: Resource 'ora.SQLTSTM2.SQLTSTM21.inst' has placement error.
:oracle:/home/oracle]>
srvctl status database -d SQLTSTM2
PRKO-2015 : Error in checking condition of instance on node: r1racd
Instance SQLTSTM22 is running on node r2racd
oracle:/opt/oracle/product/10.2.0/db]>
NOTE: but i am able to start the instance thru SQLPLUS.
After starting the instance thru SQLPLUS i checked the CRS_STAT -t . It shows the status offline... Please find the details below:
Name Type Target State Host
ora....21.inst application OFFLINE OFFLINE
ora....22.inst application ONLINE ONLINE r2racd
ora....VLM2.db application ONLINE ONLINE r2racd
ora....21.inst application ONLINE ONLINE r1racd
ora....22.inst application ONLINE ONLINE r2racd
ora....STM2.db application ONLINE ONLINE r2racd
ora....SM1.asm application ONLINE ONLINE r1racd
ora....CD.lsnr application ONLINE ONLINE r1racd
ora.r1racd.gsd application ONLINE ONLINE r1racd
ora.r1racd.ons application ONLINE ONLINE r1racd
ora.r1racd.vip application ONLINE ONLINE r1racd
ora....SM2.asm application ONLINE ONLINE r2racd
ora....CD.lsnr application ONLINE ONLINE r2racd
ora.r2racd.gsd application ONLINE ONLINE r2racd
ora.r2racd.ons application ONLINE ONLINE r2racd
ora.r2racd.vip application ONLINE ONLINE r2racd
Thanks in advance...
Anil

Use the 'srvctl config database -d ' command to check if the instance has been configured in the Oracle Cluster Registry; make sure that the GSDs are running on each node in the cluster.
Also refere to Metalink note 295328.1

RAC instance crash...

Similar Messages

Maybe you are looking for