Oracle RAC - network problem between nodes
Hi all,
I have the following configuration:
Two guest operating systems running CentOS 4.8 (installed).
The host is Windows XP, where VMware Server 2 is installed.
Oracle 10.2.0.1 and Oracle Clusterware (not installed yet; I am trying to install them).
I've created two VMs (RAC1 and RAC2) on VMware Server 2 for Oracle 10g RAC.
Each VM has 2 network adapters:
eth0: public
eth1: private
But when I ping RAC2 from RAC1, I get the error "Destination Host Unreachable".
The subnet and the netmask are OK, and I've set up /etc/hosts.
From my host PC, I cannot ping RAC1 or RAC2 either.
I've tried setting the Ethernet adapters to host-only and to bridged, but I get the same problem.
Is there any other configuration we need to do?
Thank you very much!
>
From my host PC, I cannot ping RAC1 or RAC2 either.
>
Make sure both nodes are on the network and use the same subnet mask.
>
Is it mandatory that the host's Ethernet adapter is connected to a network?
>
If it is not on the network, how are you going to access it?
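The "same subnet" advice can be checked mechanically before digging into the VMware adapter settings. Below is a minimal sketch in Python. The addresses are hypothetical (the thread does not give the real ones) and same_subnet is an illustrative helper name, not part of any Oracle or VMware tool.

```python
import ipaddress

def same_subnet(ip_a: str, ip_b: str, netmask: str) -> bool:
    """Return True if both addresses fall in the same network for the given netmask."""
    # strict=False lets us pass a host address and get its containing network back.
    net_a = ipaddress.ip_network(f"{ip_a}/{netmask}", strict=False)
    net_b = ipaddress.ip_network(f"{ip_b}/{netmask}", strict=False)
    return net_a == net_b

if __name__ == "__main__":
    # Hypothetical public addresses for RAC1 and RAC2.
    print(same_subnet("192.168.1.101", "192.168.1.102", "255.255.255.0"))
    # Hypothetical misconfiguration: third octet differs under a /24 mask.
    print(same_subnet("192.168.1.101", "192.168.2.102", "255.255.255.0"))
```

If the two guests and the host adapter do not share a network under the configured mask, ping failures like the one described are expected regardless of the host-only or bridged setting.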
Similar Messages
-
Problem when extending Oracle RAC 10g to a new node
Hi everyone,
I need to extend an Oracle RAC, but I have problems when I add a new node. My current environment is:
1) Oracle Grid Infrastructure 11gR2 - 11.2.0.3 (upgraded from Clusterware 10gR2 + ASM 10gR2)
2) Oracle RAC Database - 10.2.0.5
(all on a single node)
The first problem came when I executed the "root.sh" script on the new node, because the script called the old Clusterware home (/oracle/product/10.2.0/crshome). I edited the file and changed this path to /oracle/gridbase/product/11.2.0/gridhome (the current GI home). Then I executed the script.
Next, I tried to extend the RAC through DBCA, but when I choose the new node and click the "Next" button, the following error appears:
"The nodes "[rstatbdbpm02]" are not part of the cluster. Make sure clusterware is active on these nodes before proceeding"
However, when I run the "crsctl" command to view the status of the cluster, the result is correct:
[oracle@rstatbdbpm01] /home/oracle > crsctl status res -t
NAME TARGET STATE SERVER STATE_DETAILS
Local Resources
ora.DATA.dg
ONLINE ONLINE rstatbdbpm01
ONLINE ONLINE rstatbdbpm02
ora.LISTENER.lsnr
ONLINE ONLINE rstatbdbpm01
ONLINE ONLINE rstatbdbpm02
ora.asm
ONLINE ONLINE rstatbdbpm01 Started
ONLINE ONLINE rstatbdbpm02 Started
ora.gsd
OFFLINE OFFLINE rstatbdbpm01
OFFLINE OFFLINE rstatbdbpm02
ora.net1.network
ONLINE ONLINE rstatbdbpm01
ONLINE ONLINE rstatbdbpm02
ora.ons
ONLINE ONLINE rstatbdbpm01
ONLINE ONLINE rstatbdbpm02
ora.registry.acfs
ONLINE ONLINE rstatbdbpm01
ONLINE ONLINE rstatbdbpm02
Cluster Resources
ora.BDBPM.BDBPM1.inst
1 ONLINE ONLINE rstatbdbpm01
ora.BDBPM.BPMVEH.BDBPM1.srv
1 ONLINE ONLINE rstatbdbpm01
ora.BDBPM.BPMVEH.cs
1 ONLINE ONLINE rstatbdbpm01
ora.BDBPM.db
1 ONLINE ONLINE rstatbdbpm01
ora.LISTENER_SCAN1.lsnr
1 ONLINE ONLINE rstatbdbpm02
ora.LISTENER_SCAN2.lsnr
1 ONLINE ONLINE rstatbdbpm02
ora.LISTENER_SCAN3.lsnr
1 ONLINE ONLINE rstatbdbpm01
ora.cvu
1 ONLINE ONLINE rstatbdbpm01
ora.oc4j
1 ONLINE ONLINE rstatbdbpm01
ora.rstatbdbpm01.vip
1 ONLINE ONLINE rstatbdbpm01
ora.rstatbdbpm02.vip
1 ONLINE ONLINE rstatbdbpm02
ora.scan1.vip
1 ONLINE ONLINE rstatbdbpm02
ora.scan2.vip
1 ONLINE ONLINE rstatbdbpm02
ora.scan3.vip
1 ONLINE ONLINE rstatbdbpm01
[oracle@rstatbdbpm01] /home/oracle >
Any ideas about this problem?
Thanks,
Luis
Hi,
Please check the DBCA trace logs; they will show which command is being run to check the status of the cluster.
Generally, the first checks should be on the inventory for the RDBMS home and the grid home, and on making sure that no Oracle-related variable is set in the environment.
Regards,
Sharma -
Oracle RAC 10.2G reboots node every 45 minutes
Hello:
- We have installed Oracle RAC 10.2G for Solaris X86 ( 64 bit ).
- On one node, there are no issues. But the other node ( I think )
is being rebooted by CRS every 45 minutes or so.
- Is this issue caused by some misconfiguration I did during the install ?
- Or is there a patch available to fix this ?
- Has anyone else encountered this problem ?
Thanks
jlem
Hello:
- I re-installed Oracle RAC. The nodes have been rebooted only once so far.
So, the second install may be OK. If not, I have provided answers to the first email reply.
- Any help is most welcome. In the meantime, I will continue searching the Oracle forums
for solutions.
- My environment is:
- both nodes are running under vmware ESX server version 3.0.1
- the shared storage for OCR and Voting Disk is a raw shared device under vmware
- both nodes are using Solaris X86 5.10 update 5
- Oracle version is: 10.2.0.3 ( patched from version 10.2.0.1 )
- My public network configuration is:
node 1:
e1000g0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.20.1.74 netmask ffff0000 broadcast 10.20.255.255
ether 0:c:29:3a:45:a9
e1000g0:1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.20.1.77 netmask ffff0000 broadcast 10.20.255.255
node 2:
e1000g0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.20.1.75 netmask ffff0000 broadcast 10.20.255.255
ether 0:c:29:2b:db:90
e1000g0:1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.20.1.78 netmask ffff0000 broadcast 10.20.255.255
- My private network configuration is:
node 1:
e1000g1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 3
inet 192.168.0.1 netmask ffffff00 broadcast 192.168.0.255
ether 0:c:29:3a:45:b3
node 2:
e1000g1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 3
inet 192.168.0.2 netmask ffffff00 broadcast 192.168.0.255
ether 0:c:29:2b:db:9a
- My storage solution is:
- 3 virtual shared SCSI hard disks ( each 500 MB in size )
- My log files are:
- /var/adm/messages
- doesn't report much only the following:
Nov 12 10:57:05 saucer nfs4cbd[328]: [ID 867284 daemon.notice] nfsv4 cannot determine local hostname binding for transport tcp6 - delegations will not be available on this transport
Nov 12 10:57:21 saucer savecore: [ID 570001 auth.error] reboot after panic: forced crash dump initiated at user request
Nov 12 10:57:21 saucer savecore: [ID 748169 auth.error] saving system crash dump in /var/crash/saucer/*.2
Nov 12 10:57:41 saucer root: [ID 702911 user.error] Oracle Cluster Ready Services disabled by administrator.
Nov 12 10:57:54 saucer rootnex: [ID 349649 kern.info] xsvc0 at root
Nov 12 10:57:54 saucer genunix: [ID 936769 kern.info] xsvc0 is /xsvc
- The ocssd.log file for node1 indicates that node2 was evicted for an impending reconfig. Details are:
[ CSSD]2008-11-12 10:55:43.700 [15] >TRACE: clssnmPollingThread: node saucer (2) is impending reconfig
[ CSSD]2008-11-12 10:55:43.700 [15] >WARNING: clssnmPollingThread: node saucer (2) at 90% heartbeat fatal, eviction in 0.973 seconds
[ CSSD]2008-11-12 10:55:44.679 [15] >TRACE: clssnmPollingThread: node saucer (2) is impending reconfig
[ CSSD]2008-11-12 10:55:44.679 [15] >TRACE: clssnmPollingThread: Eviction started for node saucer (2), flags 0x000d, state 3, wt4c 0
[ CSSD]2008-11-12 10:55:44.690 [17] >TRACE: clssnmDoSyncUpdate: Initiating sync 3
[ CSSD]2008-11-12 10:55:44.690 [17] >TRACE: clssnmDoSyncUpdate: diskTimeout set to (27000)ms
[ CSSD]2008-11-12 10:55:44.691 [17] >TRACE: clssnmSetupAckWait: Ack message type (11)
[ CSSD]2008-11-12 10:55:44.691 [17] >TRACE: clssnmSetupAckWait: node(1) is ALIVE
[ CSSD]2008-11-12 10:55:44.691 [17] >TRACE: clssnmSetupAckWait: node(2) is ALIVE
[ CSSD]2008-11-12 10:55:44.691 [17] >TRACE: clssnmSendSync: syncSeqNo(3)
- node2 ocssd.log does not indicate the problem. See below for details:
[ CSSD]2008-11-12 10:52:34.731 [11] >TRACE: clssgmClientConnectMsg: Connect from con(da8410) proc(dab900) pid() proto(10:2:1:1)
[ CSSD]2008-11-12 10:53:37.305 [11] >TRACE: clssgmClientConnectMsg: Connect from con(da8410) proc(dab900) pid() proto(10:2:1:1)
[ CSSD]2008-11-12 10:54:40.515 [11] >TRACE: clssgmClientConnectMsg: Connect from con(da8410) proc(dab900) pid() proto(10:2:1:1)
[ CSSD]2008-11-12 11:18:09.997 >USER: Oracle Database 10g CSS Release 10.2.0.3.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
[ CSSD]2008-11-12 11:18:09.997 >USER: CSS daemon log for node saucer, number 2, in cluster crs
[ clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=saucerDBG_CSSD))
[ CSSD]2008-11-12 11:18:10.016 [1] >TRACE: clssscmain: local-only set to false
[ CSSD]2008-11-12 11:18:10.031 [1] >TRACE: clssnmReadNodeInfo: added node 1 (flying) to cluster
[ CSSD]2008-11-12 11:18:10.042 [1] >TRACE: clssnmReadNodeInfo: added node 2 (saucer) to cluster
[ CSSD]2008-11-12 11:18:10.057 [5] >TRACE: clssnm_skgxnmon: skgxn init failed
[ CSSD]2008-11-12 11:18:10.057 [1] >TRACE: clssnm_skgxnonline: Using vacuous skgxn monitor
- ORACLE VERIFY: cluvfy was run on node2 resulting with the following:
bash-3.00$ ./cluvfy comp ocr -n all -verbose
Verifying OCR integrity
Checking OCR integrity...
Checking the absence of a non-clustered configuration...
All nodes free of non-clustered, local-only configurations.
Uniqueness check for OCR device passed.
Checking the version of OCR...
OCR of correct Version "2" exists.
Checking data integrity of OCR...
Data integrity check for OCR passed.
OCR integrity check passed.
Verification of OCR integrity was successful.
bash-3.00$
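To see whether the reboots line up with CSS heartbeat warnings, the ocssd.log can be scanned for the "heartbeat fatal" lines. The following is a minimal Python sketch. The regular expression is written only against the single warning line quoted in the node1 excerpt (with the wrapped line joined), so it is an assumption that other CSS releases use the same wording; find_heartbeat_warnings is an illustrative name.

```python
import re

# Pattern derived from the one WARNING line quoted in the node1 ocssd.log excerpt.
WARNING_RE = re.compile(
    r"\[\s*CSSD\]\s*(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}) .*"
    r"node (?P<node>\S+) \((?P<num>\d+)\) at (?P<pct>\d+)% heartbeat fatal, "
    r"eviction in (?P<secs>[\d.]+) seconds"
)

def find_heartbeat_warnings(lines):
    """Return (timestamp, node name, node number, percent, seconds to eviction) tuples."""
    hits = []
    for line in lines:
        m = WARNING_RE.search(line)
        if m:
            hits.append(
                (m["ts"], m["node"], int(m["num"]), int(m["pct"]), float(m["secs"]))
            )
    return hits

if __name__ == "__main__":
    sample = [
        "[ CSSD]2008-11-12 10:55:43.700 [15] >TRACE: clssnmPollingThread: "
        "node saucer (2) is impending reconfig",
        "[ CSSD]2008-11-12 10:55:43.700 [15] >WARNING: clssnmPollingThread: "
        "node saucer (2) at 90% heartbeat fatal, eviction in 0.973 seconds",
    ]
    for hit in find_heartbeat_warnings(sample):
        print(hit)
```

Comparing the extracted timestamps with the reboot times in /var/adm/messages would show whether each reboot is preceded by a missed network heartbeat on the private interconnect.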
Thanks
jlem -
Oracle Rac services on second node
Hi all
I have an Oracle RAC database installed on a 2-node cluster. I was not able to bring the 2 services below online on the second node. Any ideas?
ora.racdb.racdb2.inst
ora.racdb.racdb_taf.racdb2.srv
Name Type Target State Host
ora....b2.inst application ONLINE OFFLINE
ora....db2.srv application ONLINE OFFLINE
I checked the +ASM2 instance on the second node, and the asm_diskgroups parameter is not set. Is that the cause of the problem I am encountering?
Thanks all
OS: Oracle Enterprise Linux 5 x86
Oracle: Oracle 10g x86
You should be able to find more information in the instance's alert log file. But not having asm_diskgroups set is a very likely cause: your diskgroups won't get mounted, and that prevents the instance from starting (it won't even find the spfile if that is stored on ASM). You could try to mount them manually like this:
export ORACLE_SID=+ASM2
sqlplus sys/ as sysdba
alter diskgroup MYDISKGROUP mount;
and then start the instance with 'srvctl start instance -d myracdb -i myinstance2'.
Or, of course, you could edit the ASM instance's pfile (or spfile) to include the asm_diskgroups parameter and restart the server (or just ASM).
Bjoern -
Can't install ORACLE RAC on Solaris (specified nodes are not clusterable)
Hi all,
Could you please help with an Oracle CRS issue?
During the Oracle CRS installation, the OUI indicates that the specified nodes are not clusterable.
A window appears and displays:
"The specified nodes are not clusterable.
The following error was returned by the operating system:"
I am using the 10gr2_cluster_sol.cpio.gz file.
My Solaris 10 configuration:
server - sun3
bash-3.00# cat /etc/hosts
# Internet host table
127.0.0.1 localhost
10.160.19.49 sun3 loghost
10.160.19.50 sun4 loghost
10.11.12.13 sun3prv
10.11.12.14 sun4prv
10.160.19.64 sun3pub
10.160.19.65 sun4pub
bash-3.00# ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
inet 127.0.0.1 netmask ff000000
bge0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.160.19.49 netmask fffffe00 broadcast 10.160.19.255
ether 0:14:4f:0:64:82
bge1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 3
inet 10.11.12.13 netmask fffffe00 broadcast 10.11.13.255
ether 0:14:4f:0:64:83
bash-3.00# cat /etc/netmasks
10.160.18.0 255.255.254.0
10.160.19.0 255.255.254.0
10.11.12.0 255.255.254.0
bash-3.00# cat /etc/hostname.bge0
sun3
bash-3.00# cat /etc/hostname.bge1
sun3prv
server - sun4
bash-3.00# cat /etc/hosts
# Internet host table
127.0.0.1 localhost
10.160.19.50 sun4 loghost
10.160.19.49 sun3 loghost
10.11.12.14 sun4prv
10.11.12.13 sun3prv
10.160.19.63 sun4pub
10.160.19.62 sun3pub
bash-3.00# ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
inet 127.0.0.1 netmask ff000000
bge0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.160.19.50 netmask fffffe00 broadcast 10.160.19.255
ether 0:14:4f:0:41:c8
bge1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 3
inet 10.11.12.14 netmask fffffe00 broadcast 10.11.13.255
ether 0:14:4f:0:41:c9
bash-3.00# cat /etc/netmasks
10.160.18.0 255.255.254.0
10.11.12.0 255.255.254.0
10.160.19.0 255.255.254.0
bash-3.00# cat /etc/hostname.bge1
sun4prv
bash-3.00# cat /etc/hostname.bge0
sun4
0) This error occurs when I run ./runInstaller.
All prerequisite checks passed. The error window appears after clicking the Next button in the Specify Cluster Configuration window.
1) I have changed the /etc/hosts file as you mentioned:
SUN3
bash-3.00# cat /etc/hosts
# Internet host table
::1 localhost
127.0.0.1 localhost
10.160.19.49 sun3
10.160.19.50 sun4
10.11.12.13 sun3-vip
10.11.12.14 sun4-vip
10.160.19.64 sun3pub
10.160.19.65 sun4pub
SUN4
bash-3.00# cat /etc/hosts
# Internet host table
::1 localhost
127.0.0.1 localhost
10.160.19.50 sun4
10.160.19.49 sun3
10.11.12.13 sun3-vip
10.11.12.14 sun4-vip
10.160.19.64 sun3pub
10.160.19.65 sun4pub
I have also configured the bge0:1 interface:
bash-3.00# ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
inet 127.0.0.1 netmask ff000000
bge0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.160.19.49 netmask fffffe00 broadcast 10.160.19.255
ether 0:14:4f:0:64:82
bge0:1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 10.160.19.64 netmask ffffff00 broadcast 10.160.19.255
bge1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 3
inet 10.11.12.13 netmask fffffe00 broadcast 10.11.13.255
ether 0:14:4f:0:64:83
2) I have removed loghost from /etc/hosts file
3) Currently I do not have shared storage. I am going to use Storage Foundation to create a shared storage
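The interface settings quoted in this post can be cross-checked for netmask consistency. The sketch below, in Python, converts the Solaris-style hex netmasks from the sun3 ifconfig -a output into networks and flags any alias whose network differs from its parent interface. The helper names network_of and mismatched_aliases are illustrative, and whether such a mismatch contributes to the "not clusterable" error is not established in this thread.

```python
import ipaddress

def network_of(ip: str, hex_mask: str) -> ipaddress.IPv4Network:
    """Build the network from an address and a Solaris-style hex netmask such as 'fffffe00'."""
    mask = ipaddress.IPv4Address(int(hex_mask, 16))
    return ipaddress.ip_network(f"{ip}/{mask}", strict=False)

# Values copied from the ifconfig -a output on sun3 quoted in the post.
interfaces = {
    "bge0": ("10.160.19.49", "fffffe00"),
    "bge0:1": ("10.160.19.64", "ffffff00"),
    "bge1": ("10.11.12.13", "fffffe00"),
}

def mismatched_aliases(ifaces):
    """Return logical interfaces (name:N) whose network differs from the parent interface's."""
    bad = []
    for name, (ip, mask) in ifaces.items():
        parent = name.split(":")[0]
        if parent != name and parent in ifaces:
            if network_of(ip, mask) != network_of(*ifaces[parent]):
                bad.append(name)
    return bad

for name, (ip, mask) in interfaces.items():
    print(name, network_of(ip, mask))
print("aliases on a different network than their parent:", mismatched_aliases(interfaces))
```

With the quoted values, bge0 resolves to a /23 network while bge0:1 was given a /24 mask, so the alias is reported. Aligning the alias netmask with the entry in /etc/netmasks would remove that inconsistency.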
I was also trying to test the machines using the runcluvfy.sh command.
The output is the following:
-bash-3.00$ ./runcluvfy.sh stage -pre crsinst -n sun3,sun4
Performing pre-checks for cluster services setup
Checking node reachability...
Node reachability check passed from node "sun3".
Checking user equivalence...
User equivalence check failed for user "oracle".
Check failed on nodes:
sun4,sun3
ERROR:
User equivalence unavailable on all the nodes.
Verification cannot proceed.
Pre-check for cluster services setup was unsuccessful on all the nodes. -
Latency problems between nodes
I'm looking for some help debugging coherence in a weblogic environment where we are having problems with (what seems to be) the latency of calls between coherence nodes both on the same box and on different boxes.
The application is a standard 3 tier java app which uses hibernate as its persistence provider and coherence as the L2 cache provider.
The target architecture is the application clustered in weblogic (in my test environment it is over a pair of boxes, in production this will scale out horizontally). The managed server JVM's will join the coherence cluster as nodes but set to not store data locally (we would like to reserve all the heap for the app) and then have (multiple?) coherence nodes on each physical server (we need to play with heap sizes and gc to best understand how we leverage the physical memory on the box) to support our caches.
We have separate caches for each cached entity, but to start off with they are all partitioned (as we work through testing we'll obviously play with the configurations for the different groups of entities as appropriate).
The coherence cluster is using well known addresses to find nodes but we haven't got into overriding any of the other values in the tangosol-coherence-override.xml file.
The simplest test case for demonstrating this behaviour is a single user logging in, executing a particular search, logging out, logging back in, executing the search again, and so on.
If I have only one JVM running for the application (the managed server which hosts the application) and, for the sake of these tests, set it to store data locally, the cache is essentially running in process (much like a replicated cache). Then we get a flat line of performance of 5 seconds per search. This is fine as a starting point, but I obviously want to scale it out.
If I now start a Coherence server node (launched from within WebLogic) on the same physical machine (it happily joins the cluster) and run the same test, I get a flat line of performance of 11 seconds. This really surprised me, as I'm only sharing data between 2 processes on the same box, yet we seem to have introduced a latency of 6 seconds.
Finally, if I now start the full cluster, so that I have 2 managed servers in a cluster (not storing data locally for this test) and each physical machine also has a Coherence node on it, the test flatlines at around 18 seconds. So we've introduced some more significant latency where the data may be on another box (but as I understand it, Coherence guarantees it's only ever 1 hop away).
We're running on solaris 10 and the boxes aren't stressed for CPU or memory during the tests. Equally if I monitor the JVM's health they seem fine. I've also bumped up the coherence logging to 9 but don't get anything additional that looks worrying.
Is there anything obvious I am missing? If not, how can I start to understand the problem better?
Thanks in advance,
Paul Fitz.
Thanks for the response. Just to clarify, 5 seconds is too long, but for now it's just my baseline (it should be less than that, but that's within the control of the dev team; in fact someone is checking in a fix to a mistake in the algorithm at the moment, which brings it down to something more sensible). This is still with caches defined as partitioned, but with only one JVM running, which both hosts the application and acts as a Coherence node storing data. It's not the target architecture, but it baselines the quickest this will go, on the assumption that optimum throughput comes when the cache is in the same JVM as the app.
Moving towards the target architecture I then run the same test but with an additional coherence node started on the same physical box as my application JVM (so when I talk about hops I mean when hibernate looks for data in its L2 cache its either going to be in the node running in the same JVM as my app or in the node running as a separate JVM but on the same box). The searches now consistently come back at 11 seconds so as nothing else has changed the extra 6 seconds seems to come from the data coming now from 1 of 2 JVM’s (coherence nodes) running on the same machine as opposed to always being in the same JVM.
The final test (in reality we started with this, the target architecture, and worked backwards to see where we were losing time) is then bringing up nodes on the second box in the cluster, which adds an extra 6 or 7 seconds to searches. In some respects I can understand this more: I can debug the traffic going out on the network between the cluster members, see whether it's being sent around the houses, so to speak, by the network, and try some of the things mentioned above. What makes less sense is the second scenario, where all we've got is inter-process communication on the same box. I'll look at the actual size of the data being passed between the 2 processes, because your post has made me question the cost of the serialization that's happening (nothing more than standard Java at the moment).
I’ve also got a couple of dev windows boxes on a different network so I’m inclined to quickly stand them up with the code and see if I can replicate the same increase in response times between the different configs. This should give me a feeling for if its something to do with my NFR environments and from that if its networks, interaction with a different o/s etc. -
Oracle RAC 10g: is it possible to mix different OS versions?
Dear all,
We have a two-node Oracle RAC 10gR2 on Red Hat 4. The plan now is to add a new node to the existing Oracle RAC. The new node runs Red Hat 5.4.
Is it possible to add this node to the existing Oracle RAC?
The only difference is the OS version: the existing setup is Red Hat 4 and the new node has Red Hat 5.
Thanks & regards,
Sher khan
Hi,
Oracle documentation says the following:
Oracle Clusterware and Oracle RAC do not support heterogeneous platforms in the same cluster. For example, you cannot have one node in the cluster running Oracle Linux and another node in the same cluster running Solaris UNIX. All nodes must run the same operating system; that is, they must be binary compatible. Oracle RAC does not support machines having different chip architectures in the same cluster. However, you can have machines of different speeds and sizes in the same cluster.
And from Metalink article
Comparison Between Features : RAC, Dataguard, Streams, Advanced Replication and Basic Replication [ID 370850.1]
we can see some additional clarification:
Real Application Clusters (RAC) -
Same OS on all nodes including Patchset release
Same version on all nodes including Oracle Patchset release
Good luck!
http://dba-star.blogspot.com/ -
SAP - Oracle RAC - Linux - VMware
Hi,
we would like to install SAP on a virtual environment (VMware) on Linux RedHat platform.
For the database, Oracle RAC is certified with OCFS2 (1.6) only with Oracle Linux and not RedHat Linux 5.
Can we install the Oracle RAC database on 2 nodes with Oracle Linux and OCFS2, and the other SAP components on Red Hat Linux 5?
Are there any issues with SAP matrix compatibility?
Thank you!
Dan
Hi Audun,
Thank you for the links.
I can see the ASM support is new (2.2.2011)
On Note 527843 from 25.01.2011 - Oracle RAC support in the SAP environment
Using raw devices and ASM (Automatic Storage Management) is not supported in the SAP environment. In the case of ASM, there is an exception regarding the Oracle Cluster Registry (OCR) and the voting disks
and
on Note 1550133 - Oracle Automatic Storage Management (ASM) from 2 Feb 2011 they support ASM but with restrictions:
ACFS is required for RAC installations with ASM
To use ASM now, you have to migrate your database to ASM manually
Some important functions of SAP BR*Tools like backup, restore, recovery, tablespace and datafile management are not supported at the moment
Support for tablespace and datafile management is planned for Q2/2011
Also, an ASM instance will introduce another layer in the system, with its own problems, failures and resource consumption. I see a lot of problems related to ASM on Oracle Metalink. From the marketing point of view it is OK to let Oracle do everything, but I think it will be better to keep it simple.
I couldn't figure out whether you can directly install Oracle 11.2.0.2; they say "Allows direct SAP system installations with Oracle 11.2".
We have a vBlock hardware with flash disks as FAST Cache so I believe with 1TB of cache IO will not be a problem.
Thank you!
Dan -
Hello, I have an Oracle RAC 10gR2 with 2 nodes on SUSE Linux Enterprise Server 10. When I log in to Enterprise Manager Database Control (not Grid Control), I see the following message:
"Availability calculations for the cluster database target are disabled. Please enable the DBMS JOB EMD_MAINTENANCE.EXECUTE_EM_DBMS_JOB_PROCS for Database Control."
What is it? How can I clear it?
Thanks in advance.
Check out MOS note:
What is EMD_MAINTENANCE.EXECUTE_EM_DBMS_JOB_PROCS dbms_job and how to Remove / Re-create it [ID 444033.1]
Regards
Rajesh -
Oracle RAC installation failover
Hi,
I have an Oracle RAC installation with 2 nodes, with the data stored on a shared OCFS partition. I had a client test the connection using a JDBC string for RAC failover. I tried shutting down one of the nodes of the RAC installation, and the client could not connect to the Oracle cluster database for the next 5 to 10 minutes.
I understand that the client should fail over to the next available listener (on the next connection retry) if the node it is currently connected to has failed. Is there any configuration I should make to improve failover efficiency?
Thanks for any advice.
Hi,
Server-side failover is arranged by setting the remote_listener parameter.
Client-side failover is set up by using Transparent Application Failover (TAF, 9i and higher)
or Fast Connection Failover (FCF). Both are documented in the Net administrator's manual for the version you didn't care to mention.
As far as I know, both TAF and FCF are not supported by the JDBC thin driver.
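Separately from TAF and FCF, a connect descriptor can list the listener addresses of both nodes so that a new connection attempt tries the next address when the first one fails (connect-time failover). A minimal Python sketch that only assembles such a JDBC URL string follows. The host names rac1-vip and rac2-vip, the service name racdb and the function name rac_jdbc_url are hypothetical, and this does nothing for sessions that are already connected.

```python
def rac_jdbc_url(hosts, service_name, port=1521):
    """Assemble a thin-driver URL whose descriptor lists one ADDRESS per RAC node."""
    addresses = "".join(
        f"(ADDRESS=(PROTOCOL=TCP)(HOST={host})(PORT={port}))" for host in hosts
    )
    return (
        "jdbc:oracle:thin:@(DESCRIPTION="
        f"(ADDRESS_LIST=(LOAD_BALANCE=on)(FAILOVER=on){addresses})"
        f"(CONNECT_DATA=(SERVICE_NAME={service_name})))"
    )

if __name__ == "__main__":
    # Hypothetical VIP host names and service name, for illustration only.
    print(rac_jdbc_url(["rac1-vip", "rac2-vip"], "racdb"))
```

Listing the VIP host names rather than the physical host names is the usual choice here, on the assumption that the VIP relocates to the surviving node and refuses connections quickly instead of leaving the client to wait for a TCP timeout.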
Sybrand Bakker
Senior Oracle DBA -
Multipool with weblogic 8.1 sp3 and Oracle RAC
Hi,
I have an Oracle RAC defined with 2 nodes.
For each node I defined a simple ConnectionPool using the Oracle 10g thin driver.
Then I set up a multipool that contains those connection pools.
This is a good solution: It works... :-)
Now I would like to use an XA driver. The documentation says that it is not supported...
I would like to figure out why.
Does it mean that an EJB that connects to this multipool cannot participate in an XA transaction?
Will it work if I use the parameter KeepXAConnTillTXComplete="true" on the connection pool? That would mean I use the same connection throughout the transaction.
Thank you
Yann.
Hi. The trouble with XA and multipools is that sometimes XA must recover after failures,
and if a multipool is involved, the transaction coordinator cannot know whether the
connection it gets from the multipool for recovery is talking to the same DBMS that was
involved in the transaction that has to be recovered.
Do check the 81sp3 documents on this issue.
Joe -
Hi, can I install Oracle RAC on one PC (nodes, storage, Oracle software)? I want to run some tests in order to learn about Oracle RAC. My computer has CentOS 5.1 and Oracle Database 10g. I searched the Internet, but the tutorials are too advanced. If you know of any sites for beginners, please post the URLs.
Thanks
You can check the following links for installing on Linux:
http://www.oracle.com/technology/pub/articles/smiley_rac10g_install.html
http://www.puschitz.com/InstallingOracle10gRAC.shtml -
Hi Guys,
I'm new to RAC. I have an Oracle RAC database with 2 nodes. I want to know whether it is appropriate to back up both nodes instead of just one, since I believe they both use the same database.
Thanks,
Benjo
>
Currently we are backing up two nodes of the Oracle RAC individually. What is on my mind is to backup only 1 node and that is all.
>
It seems to me that you should first get a clear understanding of what to back up. There is only one database to back up in a RAC, regardless of the number of nodes. If you are happy to connect with RMAN to one node and take a backup there, everything is fine. There is no need to connect to the other node and do the same there.
That is, if you have placed the archive logs on shared storage, which is a best practice. If they are on node-local devices, there is actually something node-specific to back up. Otherwise there is not.
Kind regards
Uwe
http://uhesse.wordpress.com -
TMQFORWARD dies when network problems occur
Hi
I have two Tux 6.5 (patchlev 317) domains A, B interconnected through a WAN. The
two domains communicate with a disk-based queue which resides on domain A and
TMQFORWARD server forwards the messages to a service of domain B. These messages
just log a few things in a database of domain B.
Sometimes during the day there are network problems on the WAN which give errors
like the following in the ULOG of domain A:
100350.kat1126!GWTDOMAIN.4428: LIBGWT_CAT:1249: WARN: Connect to Kentriko address
2F-2F-31-30-2E-31-2E-39-2E-32-30-30-3A-33-31-30-30-30 failed, Network error(0x0)
100350.kat1126!GWTDOMAIN.4428: LIBGWT_CAT:1304: WARN: No more remote domain address
for remote domain Kentriko
Everything is fine until this point. However, on certain occasions, if after
these network problems I try to tmshutdown domain A, tmshutdown gets stuck while trying
to shut down the TMQFORWARD server. The following is the relevant part of the ULOG
of domain A (server QUEGRP/10 is TMQFORWARD):
142051.kat1126!BBL.4321: LIBTUX_CAT:541: WARN: Server QUEGRP/10 terminated
142051.kat1126!BBL.4321: LIBTUX_CAT:550: WARN: Cleaning up restartable server
QUEGRP/10
142051.kat1126!TMSYSEVT.4426: LIBTUX_CAT:1477: ERROR: .SysServerDied: TMQFORWARD,
group QUEGRP, id 10 server died
142051.kat1126!TMSYSEVT.4425: LIBTUX_CAT:1475: ERROR: .SysServerCleaning: TMQFORWARD,
group QUEGRP, id 10 server cleaning
142051.kat1126!cleanupsrv.8592: 09202001: TUXEDO Version 6.5 SCO_SV 3.2 2 i386.
142051.kat1126!cleanupsrv.8592: server QUEGRP/10: CMDTUX_CAT:551: INFO: server
removed
142252.kat1126!BBL.4321: LIBTUX_CAT:216: WARN: Process 4433 died; removing from
BB
I then have to kill the tmshutdown process and use ipcclean in order to clear
the system!
This problem does not happen in all cases I have network problems between the
two domains. Sometimes I have seen TMQFORWARD dying and then restarting during
the day possibly because of these network problems. When this was the case, domain
A was shutdown perfectly well.
Can anyone explain all these? Why would TMQFORWARD die when network problems occur?
Has anyone met such strange situations with TMQFORWARD?
Many thanks
Lazaros Mitsis -
Two lan cards for oracle RAC.
Hi,
For an Oracle RAC installation on two nodes, do I need 2 LAN cards on each node?
Thanks
Hi,
The documentation lists the hardware requirements for a RAC installation:
10gR2 http://download.oracle.com/docs/cd/B19306_01/rac.102/b28759/toc.htm
11gR1 http://download.oracle.com/docs/cd/B28359_01/rac.111/b28252/toc.htm
Regards,
Rodrigo Mufalani
http://mufalani.blogspot.com