Backup & Restore Fail-over Cluster
I am asking for the best practice for backing up and restoring a SQL Server failover cluster in an Active-Active configuration.
Hi Sir,
Here is an article about backing up and recovering the cluster configuration:
http://blogs.msdn.com/b/clustering/archive/2008/01/20/7176982.aspx
Best Regards,
Elton Ji
Similar Messages
-
Which role do I need DFS or File server on fail over cluster server 2012 R2?
What I want to achieve is to share all my user data files from a central location and keep them highly available at all times, whether it is a general share or folder redirection data. But I'm a bit confused. I have a failover cluster set up on Server 2012, and I would like to add DFS as a role, but there is another role called File Server that seems to do virtually the same thing as DFS: it creates a namespace share that can still be accessed even if one of the nodes goes down. My understanding is that DFS replicates between two physical locations, whereas a failover cluster works slightly differently, and the File Server role does much the same thing except for replicating data from one drive to another. What do you suggest I do, or did I get the concept wrong like a noob?
DFS and Failover Clustering for file shares provide a similar end result for file access, but they are significantly different implementations.
Clustering provides high availability for files by presenting shared access to a set of files served from a cluster. With Windows Server 2012, Microsoft added the ability to create a Scale-Out File Server, which even allows all nodes of the cluster to serve access to the files for a higher level of performance and other great things. The bottom line with Failover Clusters for files is that there is a single copy of the file presented from the cluster.
DFS, on the other hand, provides high availability for files by keeping copies of each file in two or more locations and presenting a namespace that allows access to the file through any of the network paths. DFS works very well for files that are primarily read-only. When there is a lot of updating of the shared files, DFS is not a very good solution. There are ways to implement DFS for read/write files, but it generally requires a good knowledge of how the files are used and how you want to manage them.
The key to answering your question comes in your first sentence "I want to share all my user data files in a central location and to be highly available all the time". My initial reaction to this is that central location means Failover Cluster
- there is only a single copy of the file. However, "all the time" can be compromised by network failures to the central site. Remote sites would not have access if they can't access the central site. DFS provides the ability to
have copies remotely, but then if you allow updating at multiple sites, you have to manage the merging of the changes, among other things.
. : | : . : | : . tim -
What hardware is required to set up a failover cluster using Windows 2003 Enterprise Edition?
I want to set up a failover cluster. I have already installed an HP 350 G6 server in my environment. Now I want to know which hardware I may need to set up a failover cluster for a stateful application and, secondly, whether my existing server can be reused.
An update:
The Oracle Universal Installer shows the following in the screen before the error appears:
Starting Oracle Universal Installer...
No pre-requisite checks found in oraparam.ini, no system pre-requisite checks w
ill be executed.
Preparing to launch Oracle Universal Installer from D:\DOCUME~1\ADMINI~1\LOCALS
~1\Temp\OraInstall2011-03-02_04-25-26PM. Please wait ... Oracle Universal Instal
ler, Version 10.1.0.6.0 Production
Copyright (C) 1999, 2007, Oracle. All rights reserved.
...............................................................Val: 0
Val: 0
Val: 0
Val: 2
Val: 0
Val: 0
Val: 0
Val: 2
Val: 0
Val: 0
Val: 0
Val: 0
Val: 0
Val: 0
Val: 2
Val: 0
Val: 0
Val: 0
Val: 0
Val: 2
Val: 0
Val: 0
path: D:\DOCUME~1\ADMINI~1\LOCALS~1\Temp\OraInstall2011-03-02_04-25-26PM\jre\bin
;.;D:\WINDOWS\system32;D:\WINDOWS;D:\StageR12\startCD\Disk1\rapidwiz\unzip\NT;D:
\MVS\VC\bin;D:\cygwin\bin;D:\WINDOWS\system32;D:\WINDOWS;D:\WINDOWS\System32\Wbe
m
toload is D:\DOCUME~1\ADMINI~1\LOCALS~1\Temp\OraInstall2011-03-02_04-25-26PM\Win
dowsGPortQueries.dll
100% Done.
Copying files in progress (Wed Mar 02 16:25:59 IST 2011)
.................................................Val: 0
. 79% Done.
Copy successful
Setup in progress (Wed Mar 02 16:26:05 IST 2011)
.....Oracle JAAS [Wed Mar 02 16:26:28 IST 2011]: exception: 9
opmnctl: opmn started
Please help me.
Thanks and regards,
Adm -
Is my installation of SQL Server Fail Over cluster correct?
I built a 2-node SQL Server 2012 failover cluster but had some problems during installation, so I would like to know whether the steps I performed below are correct.
Hardware
Node1 192.168.1.10
Node2 192.168.1.11
Added following entries in DNS
cluster.domain.local 192.168.1.12 (for Windows Cluster)
msdtc.domain.local 192.168.1.13 (for MSDTC)
sql.domain.local 192.168.1.14 (for SQL Server Cluster)
Cluster Storage
Disk1 (for Quorum)
Disk2 (for MSDTC)
Disk3 (for SQL Server)
Now comes the installation. I am performing all these steps as DOMAIN ADMIN.
1. First I installed the clustering role on both nodes.
2. Then I ran the failover validation wizard on Node1, adding both nodes, which went fine (there were some warnings).
3. Then I created a Windows cluster on Node1 using these two nodes. I gave the cluster the name and IP I wrote above, i.e. cluster.domain.local 192.168.1.12.
4. The cluster was created and both nodes are up.
Now I want to ask a question here. Is it best practice to perform the above operation as DOMAIN ADMIN? Or will it work if I use a standard domain user account with local admin rights? If not, exactly what rights are required to perform this operation?
5. Then I installed the "Application Server" role on both Node1 and Node2 and also added the "Distributed Transaction" feature.
6. Then I right-clicked the Windows cluster I created and added a new role/feature, "DTC".
7. I gave it the name I wrote above, i.e. msdtc.domain.local 192.168.1.13.
8. MSDTC was created, but when it tried to bring its service online it threw an error. Upon investigation, it turned out that the Windows cluster cluster.domain.local didn't have the proper rights to create some objects in AD. I didn't know what rights to give, so I gave it full permission, and after that, when I created MSDTC again, the service came up fine.
So I want to know: what rights does cluster.domain.local require to create MSDTC?
Am I doing well so far?
Hello,
>>Then I made a Windows Cluster on Node1 using these two nodes. I gave the name and IP to this cluster which I wrote above i.e. cluster.domain.local 192.168.1.10
I suppose this IP was a physical node IP. The Windows cluster IP was 192.168.1.12, and I suppose you must have given that as the Windows cluster IP. .10 and .11 are the physical nodes in the cluster, but .12 is the cluster IP. Correct me if I am wrong.
Did you do a failover and failback to check whether the cluster is configured correctly? If not, please do so.
>>Then I ran fail over validation wizard on Node1 adding both nodes which went fine (there were some warnings)
Please resolve the warnings as well, as they might cause issues. I am not sure that holds every time, but make sure cluster validation is free of errors and warnings.
>>Now I want to ask a question here. Is it best practice to perform the above operation using DOMAIN ADMIN?
You can do it with a domain admin account, as this is required to create the Cluster Name Object (CNO) in the domain, and a local account might not have that right, so I would say it's OK.
>>I gave it the same name which I wrote above i.e. msdtc.domain.local 192.168.1.11
Again, this IP is the node 2 IP, so how can you give it to MSDTC? Use the link below for reference:
http://blogs.msdn.com/b/cindygross/archive/2009/02/22/how-to-configure-dtc-for-sql-server-in-a-windows-2008-cluster.aspx
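The address mix-up discussed above can be checked mechanically before the cluster roles are created. Here is a minimal Python sketch, assuming the address plan from the question. The function name `find_ip_conflicts` and the dictionary layout are made up for illustration and are not part of any cluster tooling.

```python
import ipaddress
from collections import defaultdict
from typing import Dict, List


def find_ip_conflicts(plan: Dict[str, str]) -> Dict[str, List[str]]:
    """Return every IP address that is assigned to more than one name."""
    names_by_ip = defaultdict(list)
    for name, address in plan.items():
        # ip_address() raises ValueError on a malformed address.
        names_by_ip[str(ipaddress.ip_address(address))].append(name)
    return {ip: names for ip, names in names_by_ip.items() if len(names) > 1}


# Address plan as written in the question (names are the poster's DNS entries).
plan_from_question = {
    "Node1": "192.168.1.10",
    "Node2": "192.168.1.11",
    "cluster.domain.local": "192.168.1.12",
    "msdtc.domain.local": "192.168.1.13",
    "sql.domain.local": "192.168.1.14",
}

# Variant quoted in the reply, where MSDTC was given the Node2 address.
plan_from_reply = dict(plan_from_question, **{"msdtc.domain.local": "192.168.1.11"})

if __name__ == "__main__":
    print(find_ip_conflicts(plan_from_question))
    print(find_ip_conflicts(plan_from_reply))
```

Every clustered name (Windows cluster, MSDTC, SQL Server) needs its own address that no physical node uses, so an empty result is what you want to see.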
-
Hi All,
We have a Windows failover cluster that has one Windows machine on the local network as one of its nodes.
I want to add a cloud virtual machine on Microsoft Azure as another node to this existing cluster.
Please suggest how to do this.
Thanks to all in advance,
Raghvendra
Before you even start working on the SQL side, you will need to create a Windows Server 2008 R2 cluster with no shared storage. You can actually test that in-house. Create a VM running 2008 R2 and cluster it with your physical (from your description,
I am assuming physical) 2008 R2 machine. Create it with a file share witness for quorum. Then configure your environment to see that it works as expected.
Once you know how to configure the cluster between physical and VM with a file share witness, build it to Azure. The location of the FSW gets to be an interesting choice. To have a FSW in Azure means that you will need another VM in Azure to
host the file share, meaning you have two quorum votes in Azure and one in-house. Or, you could create a file share witness on an in-house system, giving you two quorum votes in-house and one in Azure.
In the FSW in Azure scenario, if you have a loss of the in-house server, automatic failover occurs because two quorum votes exist in Azure. With FSW in-house, depending on the loss you have in-house, you might have to force quorum to get the Azure
single-node cluster to run. Loss of access to Azure reverses those scenarios. Neither one is optimal, but it does provide some level of recoverability.
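The vote counting behind those two scenarios can be sketched in a few lines of Python. This is a simplified model under stated assumptions: three static votes (one in-house node, one Azure node, one file share witness), a plain majority rule, and no dynamic quorum adjustment. The function names and site labels are hypothetical.

```python
def has_quorum(votes_online: int, votes_total: int) -> bool:
    """Plain majority rule: more than half of all votes must be reachable."""
    return votes_online > votes_total // 2


def survives_site_loss(fsw_site: str, lost_site: str) -> bool:
    """True if the remaining site keeps quorum without forcing it."""
    votes = {
        "in-house node": "in-house",
        "azure node": "azure",
        "file share witness": fsw_site,  # the only vote whose location you choose
    }
    online = sum(1 for site in votes.values() if site != lost_site)
    return has_quorum(online, len(votes))


if __name__ == "__main__":
    for fsw_site in ("azure", "in-house"):
        for lost_site in ("in-house", "azure"):
            outcome = "automatic" if survives_site_loss(fsw_site, lost_site) else "force quorum"
            print(f"FSW in {fsw_site}, {lost_site} lost: {outcome}")
```

Whichever site holds the witness keeps running when the other site is lost; the site without the witness has only one vote out of three and needs quorum forced.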
. : | : . : | : . tim -
Weblogic 11g Fail over Cluster
Hi,
I'm using WebLogic Server 11g (10.3.6). I have installed ATG and the Commerce Reference Store on the same machine as WebLogic (Endeca has a separate server). In addition, I have an Oracle DB server and an Apache server.
I did the following:
*I configured one physical machine (WM1) with a WebLogic domain. On the other physical machine (WM2) I installed WebLogic.
*I configured ATG and the Commerce Reference Store on WM1 (using cim.sh).
*I configured the Endeca app for WM1.
*I am using WebLogic for a production environment. I created 3 managed servers according to cim.sh: production, publishing and staging.
I want to create a failover cluster with WM1 and WM2.
Is it possible to create a failover cluster now?
Please give me instructions, suggestions or a guide for configuring a cluster in this environment.
Thanks
Nish.
Check the cluster log; you should see an event that describes the error that causes the cluster resource to fail.
Regards, Samir Farhat, Infrastructure and Virtualization Consultant || Virtualization, Cloud, Azure. Follow and ask here: https://buildwindows.wordpress.com -
Backup Restore Failed After iOS5 Upgrade
I did the iOS 5 install on my iPhone 4 yesterday and it went off perfectly. However, this evening I did the same on my wife's iPhone 4. Prior to doing the download, I did a Backup and a Restore Purchases. When the process came to the restore point, I received an error message that the restore from backup had failed. Each time we plugged the phone back into iTunes, it prompted us to complete the restore process, but with the same message. Consequently, my wife's phone lost all contacts, photos, and apps. I manually re-entered her contacts, plugged back into iTunes, and performed another backup. We then attempted to restore from backup once again, and the failure message followed. I'm almost sure that we cannot recover her photos, but if anyone on the forum has a suggestion, we'd appreciate it. Also, what is the safest and most surefire way to back up photos and contacts on the iPhone 4? JEFF
My husband updated my phone on the 15th... the identical thing happened. I've lost all photos, text messages, and contacts since the last backup. It did download 5.0, but I lost everything. I spent 2.5 hours on the phone with an Apple advisor, and they needed to research it further and would call me back... that hasn't happened yet. If they find a way, I will post it. What error message did you receive? Mine was (-50).
-
Backup Restoration fails on MSSQL2005(SP2) on windows 2003 IA64 in ECC6.0
Hi ,
I am trying to restore a backup of my MSSQL 2005 server from an Ultrium 3 tape drive, but it fails. I am able to take the backup successfully, and I can see the content on tape0 from SQL Server Management Studio. The problem occurs when I try to restore. The OS is Windows 2003 IA64 and SAP is ECC 6.0.
The following error appears in a pop-up:
An exception occurred while executing a Transact-SQL statement or batch.
Additional information:
Timeout expired. The timeout period elapsed prior to completion of the operation or the server is not responding.
So we increased the timeout with the command SET LOCK_TIMEOUT 3600; but it made no difference, and we still get the same error.
I raised an OSS message, but the gentleman suspects the error is in SQL Server 2005 Management Studio; he says it does not support a 64-bit OS. He suggested installing a 32-bit OS and accessing the 64-bit database from there. I tried that: I am able to take a backup and view the content on tape, but when I try to restore it gives SQL error 3201. The Microsoft documentation for this error does not give an exact reason, and it does not match my case.
Thanks & Regards,
Hari.
Dear Hari,
I faced the same problem and have still not found a solution.
To work around this issue, we did the restore through the command-line option.
If I find a solution through Management Studio, I will update it here.
Regards,
Nikunj Thaker. -
Hi Everyone,
I am facing a failure when restoring the WCS database. Below is the error I get. Has anyone out there faced it before?
[root@egwgwcs WCS7.0.220.0]# ./Restore
Please enter the full path of the backup file name: /opt/WCS7.0.220.0/Backup_File/WCS_Aug2012.nmsbackup
Untaring the backup file...
Failed to untar backup file. Exception: invalid stored block lengths
Restore database failed.
Is there any solution to solve this error?
Thanks
Tay Li Tiong
Hi,
It seems the backup file is corrupted.
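The message "invalid stored block lengths" is the kind of error a zlib/gzip decompressor reports on damaged data, which fits that diagnosis. Below is a minimal Python sketch for checking a compressed file before attempting a restore. It assumes, without confirmation from the thread, that the .nmsbackup file is a gzip-compressed archive; `gzip_stream_is_intact` is a hypothetical helper, not a WCS tool.

```python
import gzip
import zlib


def gzip_stream_is_intact(path: str, chunk_size: int = 1 << 20) -> bool:
    """Decompress the whole file and report whether it reads cleanly.

    Reading to the end also makes the gzip module verify the trailing
    CRC and length, so truncated or damaged files are detected.
    """
    try:
        with gzip.open(path, "rb") as stream:
            while stream.read(chunk_size):
                pass
    except (OSError, EOFError, zlib.error):
        # OSError covers "not a gzip file" and CRC failures,
        # EOFError covers truncation, zlib.error covers bad deflate data.
        return False
    return True


if __name__ == "__main__":
    import sys

    for backup_path in sys.argv[1:]:
        status = "OK" if gzip_stream_is_intact(backup_path) else "CORRUPT"
        print(f"{backup_path}: {status}")
```

If the check fails on the copy on the WCS server, compare its size or checksum with the original backup file; a transfer in the wrong mode or an interrupted copy is a common way for an archive to get damaged.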
-
6680 update - backup/restore fails
Hi,
after updating firmware from 2.x to 4.x with Nokia software, I cannot restore my old backup correctly.
After restoring with ContentCopy, the screen does not show the menu entries and the system settings cannot be opened.
Any help appreciated!
Thanks, Oliver
Hey alexamai,
I see that you have an issue with your ability to update and restore your iPhone, and are receiving error code (3) when attempting to update. Here is an article for you that addresses this issue and that error code, specifically:
Resolve iOS update and restore errors - Apple Support
http://support.apple.com/en-us/TS3694
Check for hardware issues
Related errors: 1, 3, 10, 11, 12, 13, 14, 16, 20, 21, 23, 26, 27, 28, 29, 34, 35, 36, 37, 40, 1000, 1002, 1004, 1011, 1012, 1014, 1667, or 1669.
These errors mean that your device or computer may have a hardware issue that's preventing the update or restore from completing.
Check that your security software and settings aren't preventing your device from communicating with the Apple update server.
Then try to restore your iOS device two more times while connected with a cable, computer, and network you know are good.
Confirm that your security software and settings are allowing communication between your device and update servers.
If you still see the error message when you update or restore, contact Apple support.
Thanks for coming to the Apple Support Communities!
Regards,
Braden -
Hi,
I need help, please. After a firmware update, my E61i phone does not restore the backup from the "backup.arc" file. Restoring takes about 1 second and the phone asks to be restarted, but after that no data has been restored. What should I do? I've lost everything.
-
OCR and voting disks on ASM, problems in case of fail-over instances
Hi everybody
in case at your site you:
- have an 11.2 fail-over cluster using Grid Infrastructure (CRS, OCR, voting disks),
where you have created additional CRS resources yourself to handle single-node db instances,
their listeners, their disks and so on (which are started on only one node at a time,
and can fail over from that node and restart on another);
- have put OCR and voting disks into an ASM diskgroup (as strongly suggested by Oracle);
then you might have problems (as we had) because you might:
- reach the maximum number of diskgroups handled by an ASM instance (only 63, above which you get ORA-15068);
- experience delays (especially with multipath), find fake CRS resources, etc.
whenever you dismount disks from one node and mount them on another.
So (if both conditions are true) you might be interested in this story,
in which case please keep reading for the boring details.
One step backward (I'll try to keep it simple).
Oracle Grid Infrastructure is mainly used by RAC db instances,
which means that any db you create usually has one instance started on each node,
and all instances access read / write the same disks from each node.
So, ASM instance on each node will mount diskgroups in Shared Mode,
because the same diskgroups are mounted also by other ASM instances on the other nodes.
ASM instances have a spfile parameter CLUSTER_DATABASE=true (and this parameter implies
that every diskgroup is mounted in Shared Mode, among other things).
In this context, it is quite obvious why Oracle strongly recommends putting OCR and voting disks
inside ASM: this diskgroup (usually called CRS_DATA) will become diskgroup number 1,
and ASM instances will mount it before CRS starts.
Then, additional diskgroups will be added by users, for DATA, REDO, FRA etc of each RAC db,
and will be mounted later when a RAC db instance starts on the specific node.
In case of fail-over cluster, where instances are not RAC type and there is
only one instance running (on one of the nodes) at any time for each db, it is different.
All diskgroups of db instances don't need to be mounted in Shared Mode,
because they are used by one instance only at a time
(on the contrary, they should be mounted in Exclusive Mode).
Yet, if you follow Oracle's advice and put OCR and voting disks inside ASM, then:
- at installation OUI will start the ASM instance on each node with CLUSTER_DATABASE=true;
- the first diskgroup, which contains OCR and voting disks, will be mounted in Shared Mode;
- all other diskgroups, used by each db instance, will be mounted in Shared Mode, too,
even if you take care that they are mounted by only one ASM instance at a time.
At our site, for our three-node cluster, this fact has two consequences.
One consequence is that we hit the ORA-15068 limit (max 63 diskgroups) earlier than expected:
- none of the instances on this cluster are Production (only Test, Dev, etc);
- we planned to have usually 10 instances on each node, each of them with 3 diskgroups (DATA, REDO, FRA),
so 30 diskgroups on each node, for a total of 90 diskgroups (30 instances) on the cluster;
- in case one node failed, the two surviving nodes should take over the resources of the failed node,
in the worst case: one node with 60 diskgroups (20 instances), the other one with 30 diskgroups (10 instances);
- in case two nodes failed, the only surviving node would not be able to mount additional diskgroups
(because of the limit of max 63 diskgroups mounted by an ASM instance), so all the others would remain unmounted
and their db instances stopped (they are not Production instances).
But it didn't work: since ASM has parameter CLUSTER_DATABASE=true, you cannot mount 90 diskgroups,
you can mount 62 globally (once a diskgroup is mounted on one node, it is given a number between 2 and 63,
and other diskgroups mounted on other nodes cannot reuse that number).
So as a matter of fact we can mount only 21 diskgroups (about 7 instances) on each node.
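The arithmetic behind those figures, as a small Python sketch. All constants are taken from the description above; the function names are hypothetical, and the model simply assumes diskgroup numbers 2 to 63 are shared cluster-wide as described.

```python
# Figures from the post; nothing here queries a real ASM instance.
ASM_MAX_DISKGROUP_NUMBER = 63  # limit quoted in the post (ORA-15068 above it)
CRS_DISKGROUPS = 1             # the OCR/voting diskgroup takes number 1
DISKGROUPS_PER_INSTANCE = 3    # DATA, REDO, FRA
NODES = 3


def planned_diskgroups(instances_per_node: int) -> int:
    """Diskgroups the original plan needs across the whole cluster."""
    return instances_per_node * NODES * DISKGROUPS_PER_INSTANCE


def shared_mode_capacity() -> dict:
    """Capacity when diskgroup numbers are allocated cluster-wide."""
    usable = ASM_MAX_DISKGROUP_NUMBER - CRS_DISKGROUPS
    instances = usable // DISKGROUPS_PER_INSTANCE
    return {
        "usable_diskgroups": usable,
        "instances_cluster_wide": instances,
        "diskgroups_per_node": usable / NODES,
        "instances_per_node": instances / NODES,
    }


if __name__ == "__main__":
    print("planned diskgroups:", planned_diskgroups(10))
    for key, value in shared_mode_capacity().items():
        print(f"{key}: {value:.1f}")
```

The per-node averages come out slightly below 21 diskgroups and 7 instances, which matches the rounded figures in the post, against a plan that needed 90 diskgroups.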
The second consequence is that, every time our handmade CRS scripts dismount diskgroups
from one node and mount them on another, there are delays in the range of seconds (especially with multipath).
We also found in the CRS log that, whenever we mounted diskgroups (on one node only),
additional fake resources of type ora*.dg were created on the fly behind the scenes,
maybe to accommodate the fact that on the other nodes those diskgroups were left unmounted
(once again, instances are single-node here, not RAC type).
That's all.
Did anyone run into similar problems?
We opened an SR with Oracle asking what options we have here, and we are disappointed by their answer.
Regards
Oscar
Hi Klaas-Jan
- best practices require that online redo log files also be in a separate diskgroup, in case of ASM logical corruption (we are a little bit paranoid): if the DATA dg gets corrupted, you can restore the Full backup plus Archived RedoLog plus Online Redolog (otherwise you will stop at the latest Archived).
So we have 3 diskgroups for each db instance: DATA, REDO, FRA.
- in case of a fail-over cluster (active-passive), Oracle provides some templates of CRS scripts (in $CRS_HOME/crs/crs/public) that you can edit and change at will; you might also create additional scripts for any additional resources you need (Oracle Agents, backup agents, file systems, monitoring tools, etc).
About our problem, the only solution is to move the OCR and voting disks out of ASM and change the pfile of all ASM instances (parameter CLUSTER_DATABASE from true to false).
Oracle's answers were a little bit odd:
- first they told us to use Grid Standalone (without CRS, OCR, voting at all), but we told them that we needed a fail-over solution;
- then they told us to use RAC Single Node, which actually has some better features (in case of a planned fail-over it might be able to migrate
client sessions without causing a reconnect, for SELECTs only, not in case of a running transaction), but we already have a few fail-over clusters and we cannot change them all.
So we plan to move the OCR and voting disks onto block devices (we think that the other solution, which needs a Shared File System, will take longer).
Thanks Marko for pointing us to OCFS2 pros / cons.
We asked Oracle for confirmation that this is supported; they said yes, but it is discouraged (and also, it doesn't work with OUI or ASMCA).
Anyway, that's the simplest approach. This is a non-Prod cluster; we'll start here and, if everything is fine, after a while we'll do it on the Prod ones too.
- Note 605828.1, paragraph 5, Configuring non-raw multipath devices for Oracle Clusterware 11g (11.1.0, 11.2.0) on RHEL5/OL5
- Note 428681.1: OCR / Vote disk Maintenance Operations: (ADD/REMOVE/REPLACE/MOVE)
-"Grid Infrastructure Install on Linux", paragraph 3.1.6, Table 3-2
Oscar -
Multiple types of database and fail over clustering
Hi,
I have a few questions here.
1) Can I have 2 types of databases (e.g. OLTP and OLAP) running at the same time on the same machine?
2) Can I implement a cross fail-over cluster in this situation? That is, I have 2 machines with both OLAP and OLTP database instances installed on them (replicas of each other), with the 1st machine running OLTP and the 2nd running OLAP. If one of the machines fails, the passive instance on the other machine takes over (which brings us back to the situation in question 1).
Thanks
Regards
Lai Ling
Dear All,
My problem is solved by disabling antivirus.
thanks for the support
Sunil
SUNIL PATEL SYSTEM ADMINISTRATOR -
Two witnesses in a SQL Server fail over group
Is it possible to have two witnesses in a SQL Server AlwaysOn Availability Group failover cluster? Our goal is to have redundant witnesses in an Azure availability set.
Thanks,
Mike
AlwaysOn uses Windows Failover Clustering for quorum. See, e.g., "Understanding Quorum Configurations in a Failover Cluster".
You can do this, but with Dynamic Quorum it's probably not helpful. If you lose your witness vote, the cluster will adjust the quorum requirements.
David
David http://blogs.msdn.com/b/dbrowne/ -
ACE 4710 - 'reverse proxy' in front of serverfarm - fail-over/sorry server design issue
Hi All,
I'm working on a specific config and have an issue in the backup farm/fail-over/sorry server area.
The customer wants the following:
They have an existing serverfarm with X web servers, and they want a single server to act as a reverse proxy in front of the farm,
so that all traffic goes through that server, which then forwards the request to the original serverfarm.
The problem in my design is the fail-over. If I configure the reverse-proxy server in a new serverfarm and use the original (web servers) farm as backup, it has fail-over, but if the reverse proxy AND the original serverfarm fail, there is no nice way to get the users onto a sorry server.
I could give the original serverfarm's rservers a 'backup standby' server, but that won't give the desired effect either.
For maintenance they first take 50% of the servers offline and then switch to the other 50%, so users would see a sorry page even if there were operational servers left in the farm.
The 4710s are running in routed mode, the farms use Sticky Cookie, and some HTTP URL & cookie matching is also done.
Anyone have an idea how to build this?
Hi,
This needs additional testing, but as I understand it, if you list the backups in this order, the last backup server will be chosen first.
In your case it would be: RSERVER1 >> backup sorry server >> backup web content.
As per the example below:
I put test2 as the first backup server and test1 as the second backup server, but if you look at the first part, it took rserver test1 as the first backup.
serverfarm host 1313-GIN-GWAP-SDC-80
  rserver RSERVER1
    backup-rserver test1
    inservice
  rserver test1
    inservice standby
  rserver test2
    inservice standby
regards,
Ajay Kumar