Oracle RAC / Logical Data Guard causing network problems on VMware
We have VMWare 5.0 cluster across the 12 blades (6 per chassis) running a mixture of Red Hat and Windows 2008 R2 vms. The Red Hat boxes are two times two node Oracle RAC (primary and secondary), also apache web servers and jboss application servers. The Windows servers are for AV/DC/Management/Monitoring.
The problem is that intermittent network connectivity to random Windows and Red Hat boxes occur when the Oracle RAC builds up archive logs and then ships / applies them to the secondary nodes, between ESX nodes either on different blades in the same chassis or across the chassis and even when all RAC nodes are on the same ESX host.
We are using NFS, Oracle 11g and Red Hat 6.2.
Sorry if this info is a bit vague, im not an Oracle expert! :-)
thanks,
Dave
Hi,
1.) The calculation for Standby RedoLogs is:
(Max Number of Logfiles per thread (Instance) +1) * Max Number of Threads (Instances))
So if you have 4 Redo Log Groups on your primary (which is 2 Redo Log Groups per Instance), then it ends up:
(2 +1) * 2 = 6
So actually you will only need 6 standby redo logs, not 8. But 2 more don't harm.
Your primary will need exactly the same number (6 or in your case 8). Which will be 3 per thread/instance or in your case 4.
2.) The SID List in the listener.ora is a listing of SIDs the Listener is listening on. It is not the listener name.
Hence it is not "lsnrctl guard_dgmgrl start" but only "lsnrctl LISTENER start", whereas the LISTENER is the default and "lsnrctl start" would be sufficient.
However since this is grid infrastructure with the listener running out of ASM home, be sure to have set your environment to GI Home not to DB_HOME for the listener.ora entries, but to DB_HOME for the tnsnames.ora entries necessary for data guard.
And since listener is running under clusterware you should use "srvctl stop listener" and start.
Last but not least the SID entries for dataguard have to use DGMGRL not dgmgrl.
3.) Here is the whitepaper you are looking for:
www.oracle.com/goto/maa
Also for client failover best practices.
(Here the direct link to the RAC whitepaper):
http://www.oracle.com/technetwork/database/features/availability/maa-wp-10g-racprimarysingleinstance-131970.pdf
However since this is 10g you should combine this with the 11g RAC standy paper (e.g. SCAN Listener setup).
Sebastian
Similar Messages
-
Oracle11g R2 Active Data guard using ASM Problem?
I have configured oracle11g r2 RAC on 2 notes using ASM Grid ( OS unix).
RAC is up and running.
Now I am configuring Active data Guard.
Under grid user instance +ASM and listener is running.
Under oracle user static listener is running.
All disk is mounted.
Oracle RAC and Data Guard directory and structure I have keeped same.
Now my problem is below:
$ ./rman target sys/HPinvent123nbl@dcpdb AUXILIARY sys/HPinvent123nbl@drpdb
Recovery Manager: Release 11.2.0.1.0 - Production on Wed Jan 16 16:28:32 2013
Copyright (c) 1982, 2009, Oracle and/or its affiliates. All rights reserved.
connected to target database: DCPDB (DBID=316773134)
connected to auxiliary database: DRPDB (not mounted)
RMAN> duplicate target database for standby from active database;
Starting Duplicate Db at 16-JAN-13
using target database control file instead of recovery catalog
allocated channel: ORA_AUX_DISK_1
channel ORA_AUX_DISK_1: SID=5644 device type=DISK
contents of Memory Script:
backup as copy reuse
targetfile '/u02/app/oracle/product/11.2.0/dbhome_1/dbs/orapwdcpdb1' auxiliary format
'/u02/app/oracle/product/11.2.0/dbhome_1/dbs/orapwdrpdb' ;
executing Memory Script
Starting backup at 16-JAN-13
allocated channel: ORA_DISK_1
channel ORA_DISK_1: SID=1897 instance=dcpdb1 device type=DISK
Finished backup at 16-JAN-13
RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03002: failure of Duplicate Db command at 01/16/2013 16:28:48
RMAN-06136: ORACLE error from auxiliary database: ORA-00200: control file could not be created
ORA-00202: control file: '+data'
ORA-17502: ksfdcre:4 Failed to create file +data
ORA-15001: diskgroup "DATA" does not exist or is not mounted
ORA-15055: unable to connect to ASM instance
ORA-01031: insufficient privileges
RMAN>
Please help.\
Thanks
Solaimanroot@drpdb1 []# id oracle
uid=108(oracle) gid=700(oinstall) groups=701(dba)
root@drpdb1 []# id grid
uid=109(grid) gid=700(oinstall) groups=701(dba),702(asmdba)
Edited by: 876149 on Jan 16, 2013 3:19 AM -
Any Benefits of Extended RAC Over Data Guard?
Hi,
My company is in the process of setting a second data center, a bit far from the current one (about 20 KM).
This Data Center will be used as a DR site, as well as to accommodate additional servers since the current Data Center is already stretched.
We're currently running about 5 RAC clusters, two nodes each on Oracle 11g and on AIX plaforms. It's not yet decided what type of technology will be employed with the databases - whether RAC with Data Guard (DG) or Extended RAC.The network link will be fairly good, a dark fiber link.
Does anyone have a suggestion as to which of the above technology would be preferable? With Extended RAC I think we are able to continue operating from the second site without needing to 'failover' as such, while with DG, we probably will need to put in place an elaborate failover procedure, even though we can use the Fast-Start-Automatic failover feature of 10g Rel2 and above.
Any thoughts/suggestions/clarifications?
Dula
Edited by: dula on Aug 28, 2009 2:47 PMHi Dula,
Even some time back we were also consdering the same to use remote DG for disaster recovery solution (active-passive) to create extended RAC on a remote site and then make use of those remote servers for the purpose of disaster reocovery and also make those servers connect to the PROD application servers thus making a (active-active) clustering solution.
Management wanted to look into the feasibility of using active-active nodes on remote sites which they thought will help them make use of remote servers to connect to the LIVE applicatio. With the active-passive mode where there was just standby servers in recovery mode on remote site they were of the opinion that why to waste power for remote servers and why not use them in LIVE site.
We thought of using the dense optic fiber for the DB replication to remote RAC nodes and also for our 30+ application servers to connect from site 1 to site 2( site 2 will be remote site with extented rac nodes hosted).
However there were many reservations that came which making decission. First was that we could not find any reliable source of group who had successfully implemented this active-active remote extended rac nodes in production. The cost of dense optic fiber was another consideration. How the interconnects will perform in remote sites was another grey area for us. Also since the application servers uses TAF to connect to DB servers it was not known that how the sessions from the same application servers(web based applications) to local and remote nodes will have performance impact. With so many grey areas we dropped the idea of using extended rac nodes and went ahead with DG solution.
Amar -
Implemeting 11gR2 RAC with Data Guard
Hi ,
Could any one provide the steps on how to setup 11gR2 two node RAC With Dataguard . Could the 11R2 Active database duplication can be used in setting up the standby ?
I just need the order of steps to be followed to set up the environment.
Thanks,
shashi.Hi Fiedi ,
Thanks for the reply .
I know how to build the oracle dataguard . But , I'm looking for the order of steps that I need to follow to build 11gR2 RAC with data guard.
1] Set up the Grid Infrsatructure for the 2 node RAC .
2] Create the database .
3] Modify the init.ora prameter to chage the above created database as primary .
4] Set up the grid infrastructure for the 2 node RAC on the DR site.
5] Create the standby database using 11gR2 active database dupication.
Is the above order correct ? If not , let me know the correct order of steps that needs to be followed to setup 11gR2 RAC with dataguard. -
Oracle 11g Active Data Guard and SAP R3
Hi All,
I have a query regarding Oracle 11g Active Data Guard and SAP R3.
Does the Oracle 11g R1/R2 Active Data guard feature supported with SAP R3?
I appreciate your help to provide any link or document for the same.
Thanks,
VihangI have a query regarding Oracle 11g Active Data Guard and SAP R3.
Does the Oracle 11g R1/R2 Active Data guard feature supported with SAP R3?
I appreciate your help to provide any link or document for the same.
Oracle database 11g functionality certified by SAP, check below link
http://www.oracle.com/us/solutions/sap/oradb11g-article-upd-1-323074.pdf
http://www.oracle.com/technetwork/middleware/ias/downloads/osb-11gr1certmatrix.xls -
Urgent : ORA-01426: numeric overflow on oracle 11g Active Data Guard
Hi
I have configured Active Data Guard on oracle 11g, for reporting purpose we will select mutliple querry on target side(10 users). we are getting 'numeric overflow erro'r on alert log file When we issuing multiple query on target side. PLeae let me know is this error will cause performance degrad. if it will degrade performance mean please tell me how to resolve this problem. Why the numeric overflow is comming . and it is not comming in the primary database, it is comming in standby database only. please any one help it is very urgent
is there any parameter To overcome this problme
Please please it is very important to me and very urgent .
Thanks
nafees
Edited by: Nafees on Jan 1, 2009 3:44 AM
Edited by: Nafees on Jan 1, 2009 3:54 AMThere is no one drowning.
Your house is not on fire.
The volcano has not exploded.
Please apologize for abusing this forum by claiming your issue is more urgent than other people's requests.
Then, and only then, should anyone help you. I know I certainly won't until I read your sincere apology and promise not to be abusive in the future. -
Oracle 11g Active Data Guard是怎么收费的?
原来在Oracle 10g中,DG是免费使用的,想请问一下,在Oracle 11g中,性能得到加强的ADG是怎么收费的?谢谢
FYI
Use customary EE Option License Practices
Active Data Guard license required if using either Real-time Query or RMAN block-change tracking on a standby
Example with primary and 5 separate standby databases
S1 = physical - real-time query
S2 = physical - real-time query + RMAN block change tracking
S3 = physical - RMAN block change tracking
S4 = physical - neither real-time query nor RMAN block change tracking
S5 = logical
Active Data Guard must be licensed for primary and for S1, S2 and S3.
Active Data Guard is not relevant to logical standby -
Oracle RAC 11g R1 Release Connection Failover Problem
Hi All,
In our Architecture we are using Oracle RAC 11g R1. Below is the JDBC URL :
JDBCURL = jdbc:oracle:thin:@(DESCRIPTION =(ADDRESS = (PROTOCOL = TCP)(HOST = Host1-vip)(PORT = 1521))(ADDRESS = (PROTOCOL = TCP)(HOST = Host2-vi
p)(PORT = 1521))(LOAD_BALANCE = ON)(FAILOVER=ON)(CONNECT_DATA =(SERVER = DEDICATED)(SERVICE_NAME = <Service_name>)))
We are using two node RAC. The problem is whenever we are rebooting a Node and rejoin the cluster, Application Servers are not able to recognize that.
Suppose we have node1 and node2, I will take down node1 (freeze the cluster) and then reboot node1 and bring it back up( and join the cluster). At this point, My application servers are not able to recognize that some new DBserver(node1) had joined the cluster until I restart my application servers.
Please Provide me a solution for this. Thanks alot to everyone in advance.
Edited by: 877010 on Aug 4, 2011 2:00 PM
Edited by: 877010 on Aug 8, 2011 10:19 AMPlease try using this
JDBCURL = jdbc:oracle:thin:@(DESCRIPTION =(ADDRESS = (PROTOCOL = TCP)(HOST = Host1-vip)(PORT = 1521))(ADDRESS = (PROTOCOL = TCP)(HOST = Host2-vi
p)(PORT = 1521))(LOAD_BALANCE = YES)(FAILOVER=YES)(CONNECT_DATA =(SERVER = DEDICATED)(SERVICE_NAME = <Service_name>))) -
Data guard role transition problem
Hi,
I am trying to do a switchover using the data guard broker cli and get the following error:
DGMGRL> switchover to "TGDRDB01"
Performing switchover NOW. Please wait...
Error: ORA-16775: Target standby in switchover operation has missing redo logs.
Failed.
Can not proceed to switchover. Primary is still "TGDB01".
The drc log file shows a bit more info:
DG 2005-12-01-17:17:43 2000000 3 574466141 DMON: chief lock convert for switchover
DG 2005-12-01-17:17:43 0 2 0 Executing SQL [ALTER SYSTEM ARCHIVE LOG CURRENT]
DG 2005-12-01-17:17:54 0 2 0 SQL [ALTER SYSTEM ARCHIVE LOG CURRENT] Executed successfully
DG 2005-12-01-17:18:17 0 2 0 ORA-16775 Error: the target standby database has 338 redo log(s) missing. Cannot proceed with the switchover operation.
DG 2005-12-01-17:18:17 2000000 3 574466141 Operation CTL_SWITCH cancelled during phase 0, error = ORA-16775
What i don't understand is the bit about 338 redo logs missing becuase the output from "archive log list" on both databases is as follows:
Primary
SQL> archive log list
Database log mode Archive Mode
Automatic archival Enabled
Archive destination /home/oracle/admin/TGPROD/archive
Oldest online log sequence 3091
Next log sequence to archive 3093
Current log sequence 3093
Standby
SQL> archive log list
Database log mode Archive Mode
Automatic archival Enabled
Archive destination /home/oracle/admin/TGPROD/archive
Oldest online log sequence 3091
Next log sequence to archive 0
Current log sequence 3093
Any help will be gratefully recieved as I'm stumped! Found a note for the original oracle error on here that said try switching logfiles a couple of times then trying again - which suffices to say didn't work. I don't understand where it gets 338 missing redo logs from!?!?
Oh and oracle version is 10.1.0.3....
thanks in advance.
regards,
MarkYes, it'll be waiting for these archive till he gets them.
If you see the files at system level, then cancel the MRP and apply them manually. If those files are missiong, then stop MRP and propagate hot backups from your primary db to the standby db, then reactivate the MRP. This will resync the databases.
Regards,
Yoann. -
Could anybody share me Oracle modules Logical Data Model?
Who have Logical Data Model charts on oracle financials modules like AR and FA? I need it. Thank you!
yes,
i have 12 years experience of Computer Expert Accounting System. I have gained good knowledge on System Design
[email protected] -
Recomendation: RAC + ASM + Data Guard
Hi all:
I need to implement a Data Guard in my 2 nodes RAC Database.
How I make the ASM instance in the remote host?? I'm not found any documentation...
Please, any sugestion.
Thanks a lot.Use the same installation procedure as you used for RAC.
Check these example instructions for Linux.
:p -
Oracle 11g Active Data Guard help ?
Hi Friends,
I successfully setup an Active data guard environment(11g). But, I dont know when the PROD database is highly utilize , its read only tasks like reporting and backup are doing in STANDBY. How can I know which db (prod or stand by) is used for these readonly operations ?
Regards
VishIt is not so simple to direct reports to the Physical Standby as you seem to assume.
You need to do some work for the setup.
See here for a description:
http://uhesse.com/downloads/real-time-query-presentation/
Kind regards
Uwe Hesse
Don't believe it, test it!"
http://uhesse.com -
I'm posting this note because I just solved my network issue/crash problems. I started experiencing lots of disconnects and page not found on my office computer over the past few days. Only thing I could think of that was different lately was that I installed the Blackberry Desktop Software. Then a huge lightbulb went on over my head. I remembered when it was installing, it said it was installing "Roxio" software and at that moment shivers went down my spine from past experiences.
Well, I have lots of good "bad" experiences with Roxio CD software which had caused these same problems on two of my home computers. It caused blue screens when used in conjunction with my wireless Netgear card and also standard network card. The same symptoms including very slow internet access, general machine sluggishness, etc. I was trying to chase this down for months til I finally removed the Roxio crapware and ALL problems went away. No more slow machine, no more page not found and no more blue screens on two of my computers. Now, on a completely different Toshiba work laptop and different network adaptor, same problems.
So, here we are again with Roxio as part of Blackberry Desktop software. I like my new Blackberrry 8830, but I have to pass this along to the Blackberry folks because I KNOW I'M RIGHT about this. I only installed the software so that I could type my contacts into Outlook and then download to my Blackberry. So, after realizing the common symptoms and after having completed my download (which worked great by the way), I removed the Blackberry Desktop Software and I'm back to speedy everything, no more pages not found, no more putty disconnects, and no more blue screens.
I hope someone gets good use of this post because this drove me crazy the first time around.
-- JeffWe had the same problem.
We did extensive troubleshooting, and found the problem was effecting our new HP Windows XP .Laptops.
We did not have any trouble with the Older Dell Latitude we have.
We found the BSOD still happened after removing the Black Berry Desktop Manager Software.
We traced it to a memroy / resource leak in the USB subsystem.
The workaround we found is to go into the Device manager, and remove all the drivers from the USB section.
** Disclaimer : This works for us, but may make your computer unusable. Backup your data to external storage.
Back up your computer
Start -> control Panel -> system ->Hardware tab -> Device manager.
Expand "Universal Serial Bus Controller" at bottom of list.
Right click on each Item, and select "uninstall"
Don't reboot after each one.
* Remove ALL the entries under Universal Serial Bus Controllers *
Repeat until all the entries under "Universe Serial Bus Controller" are removed
(there may be one that sticks, thats ok. so long as it has been uninstalled.
Reboot.
The OS will rebuild the USB stack and drivers to default.
This has worked for us so far.
We are not sure if this wont break something else. -
Oracle VM Server 3.1.1 Network problem
I installed Oracle VM Server 3.1.1 on HP ProLiant BL460c G1 Blade Server.
Installation had no problem, but network doesn't work at all.
If I write "ipconfig -a" I have this response:
bond0 Link encap:Ethernet HWaddr 00:1E:0B:8D:5E:5E
inet addr:192.168.99.160 Bcast:192.168.99.255 Mask:255.255.255.0
UP BROADCAST RUNNING MASTER MULTICAST MTU:1500 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:0 (0.0 b) TX bytes:0 (0.0 b)
eth0 Link encap:Ethernet HWaddr 00:1E:0B:8D:5E:5E
UP BROADCAST SLAVE MULTICAST MTU:1500 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:0 (0.0 b) TX bytes:0 (0.0 b)
Interrupt:16 Memory:f6000000-f6012000
eth1 Link encap:Ethernet HWaddr 00:1E:0B:8D:5E:54
BROADCAST MULTICAST MTU:1500 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:0 (0.0 b) TX bytes:0 (0.0 b)
Interrupt:16 Memory:fa000000-fa012800
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
UP LOOPBACK RUNNING MTU:16436 Metric:1
RX packets:46 errors:0 dropped:0 overruns:0 frame:0
TX packets:46 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:6820 (6.6 KiB) TX bytes:6820 (6.6 KiB)
So - I think - my system recognize network cards.
I can ping loopback and server ifself, but I cannot ping nothing more.
Can you help me to solve problem?
Thank youThis is output:
Ethernet Channel Bonding Driver: v3.7.1 (April 27, 2011)
Bonding Mode: fault-tolerance (active-backup)
Primary Slave: eth0 (primary_reselect always)
Currently Active Slave: none
MII Status: down
MII Polling Interval (ms): 250
Up Delay (ms): 500
Down Delay (ms): 500
Slave Interface: eth0
MII Status: down
Speed: 100 Mbps
Duplex: full
Link Failure Count: 0
Permanent HW addr: 00:1e:0b:8d:5e:5e
Slave queue ID: 0
Thank you -
Oracle RAC with operating system virtual network cards
Hi,
I have two servers with four network cards each, two for public and two for private network. Each public card has own IP-address and the both have the virtual IP-address (from OS, not from Oracle Clusterware). Each private card has own IP-address and the both have the virtual IP-address. I have full redundancy for network cards. How should define Oracle Clusterware VIP-address for each machine?
Can somebody write me sample /etc/hosts file for this configuration?
Thanx,
JacekOracle RAC and crossover cable for private network
Maybe you are looking for
-
How do I transfer content in Dashboard Stickies from one Mac to another?
I've got a new MacBook Air and because the hard drive on it is smaller than the hard drive on the MacBook Pro I had before, I've opted not to use Migration Assistant. But I have a good bit of content in Dashboard Stickies. Does anyone know how I can
-
Data Manager not available in the BPC 7.0 Action Pane
Hi, I have just installed BPC 7.0 and everything is working fine, except I don't have access to the Data Manager. According to the BPC guide, this is how to start Data Manager: 1. Click the Business Planning and Consolidation icon on your desktop. 2.
-
17inch macbook pro keeps restarting!
Hi, I have just got my work macbook pro(late 2009) back from the Apple genius' after taking it in to repair the screen. It now won't shut down properly. If I go to the Apple menu in the top left corner and select shut down, it shuts down but restarts
-
How do I get iTunes movies to play on the Nook Tablet?
How do you down load iTunes movies to the Nook Tablet?
-
Allow or Block websites disabled
I posted this a little while back to no avail. Within Preferences/Trust Manager/Manage Internet Access the Specifiy Web Sites to Allow or Block is greyed out. I have checked all of my admin settings in Windows settings and everything is set to admi