10g RAC database: instances fail after a few days...

Im afraid I'm new to Oracle, so assume total ignorance when replying!
We have two servers, db1 and db2 running Oracle under Windows 2K3 web edition [SP1].
Both have a shared Direct Attached Storage disk, over a Scsi fibre connection.
The database is set up using instructions found online for RAC.
Either instance will stop, and the other will take over, after a few days. The problem is, we aren't using the database yet. [There isn't a problem with it, the development people have checked that they can use the database.]
The two errors [found using the EM console] which seem to be relevant are:
'Metrics "Database Time Spent Waiting (%)" is at 100 for event class "other"'
and 'Failed to connect to database instance ... TNS no listener'.
[Which is obvioulsly the failure]
I have scoured the web, and the rather dense reference books for Oracle I have access to, and to no avail.
Another contributing factor is the 'WMI Performance Adaptor' stops and starts on both machines every thirty seconds.
Any response/guidlines/advise would be greatly appreciated.
Apologies if I have started this thread in the wrong place!
James
Spelling errors - Message was edited by: jmorse

Tada, it failed at 4am last night.
Below is the bottom half of the log file.
2005-09-05 15:27:57.796: Attempting to start `ora.gandlake-db2.ASM2.asm` on member `gandlake-db2`
2005-09-05 15:28:04.703: Start of `ora.gandlake-db2.ASM2.asm` on member `gandlake-db2` succeeded.
2005-09-05 15:28:04.921: Attempting to start `ora.x200.x2002.inst` on member `gandlake-db2`
2005-09-05 15:28:33.359: Start of `ora.x200.x2002.inst` on member `gandlake-db2` succeeded.
2005-09-05 15:28:40.578: Start of `ora.x200.x2001.inst` on member `gandlake-db1` succeeded.
2005-09-05 15:28:40.609: CRS Daemon Started.
2005-09-05 15:28:41.140: Attempting to start >`ora.gandlake-db1.LISTENER_GANDLAKE-DB1.lsnr` on member `gandlake-db1`
2005-09-05 15:30:06.078: Start of `ora.gandlake-db1.LISTENER_GANDLAKE-DB1.lsnr` on member `gandlake-db1` succeeded.
`ora.gandlake-db1.vip` on `gandlake-db1` went OFFLINE unexpectedly
2005-09-06 19:48:35.703: Attempting to stop `ora.gandlake-db1.vip` on member `gandlake-db1`
2005-09-06 19:48:36.250: Stop of `ora.gandlake-db1.vip` on member `gandlake-db1` succeeded.
Restarting `ora.gandlake-db1.vip` on `gandlake-db1`
2005-09-06 19:48:36.312: Attempting to start `ora.gandlake-db1.vip` on member `gandlake-db1`
2005-09-06 19:48:38.843: Start of `ora.gandlake-db1.vip` on member `gandlake-db1` succeeded.
Successfully restarted `ora.gandlake-db1.vip` on `gandlake-db1`
`ora.gandlake-db1.ons` on `gandlake-db1` went OFFLINE unexpectedly
2005-09-06 22:59:36.875: Attempting to stop `ora.gandlake-db1.ons` on member `gandlake-db1`
2005-09-06 22:59:38.218: Stop of `ora.gandlake-db1.ons` on member `gandlake-db1` succeeded.
Restarting `ora.gandlake-db1.ons` on `gandlake-db1`
2005-09-06 22:59:38.296: Attempting to start `ora.gandlake-db1.ons` on member `gandlake-db1`
2005-09-06 22:59:40.109: Start of `ora.gandlake-db1.ons` on member `gandlake-db1` succeeded.
Successfully restarted `ora.gandlake-db1.ons` on `gandlake-db1`
`ora.gandlake-db1.vip` on `gandlake-db1` went OFFLINE unexpectedly
2005-09-08 04:47:35.078: Attempting to stop `ora.gandlake-db1.vip` on member `gandlake-db1`
2005-09-08 04:47:35.734: Stop of `ora.gandlake-db1.vip` on member `gandlake-db1` succeeded.
`ora.gandlake-db1.vip` ran out of restarts on `gandlake-db1`
`ora.gandlake-db1.vip` failed on `gandlake-db1`, relocating.
2005-09-08 04:47:35.875: Attempting to stop `ora.gandlake-db1.LISTENER_GANDLAKE-DB1.lsnr` on member `gandlake-db1`
2005-09-08 04:47:36.718: Stop of `ora.gandlake-db1.LISTENER_GANDLAKE-DB1.lsnr` on member `gandlake-db1` succeeded.
2005-09-08 04:47:36.765: Attempting to stop `ora.gandlake-db1.ASM1.asm` on member `gandlake-db1`
2005-09-08 04:47:37.390: [RUNNABLELISTENER:4648] state change aborted (locked): ora.x200.x2001.inst
2005-09-08 04:47:44.578: Stop of `ora.gandlake-db1.ASM1.asm` on member `gandlake-db1` succeeded.
2005-09-08 04:47:44.625: Attempting to stop `ora.x200.x2001.inst` on member `gandlake-db1`
2005-09-08 04:47:46.531: Stop of `ora.x200.x2001.inst` on member `gandlake-db1` succeeded.
2005-09-08 04:47:46.625: Attempting to start `ora.gandlake-db1.vip` on member `gandlake-db2`
2005-09-08 04:47:50.203: Start of `ora.gandlake-db1.vip` on member `gandlake-db2` succeeded.x200 is the name of the database, there are two instances in total, db1 and db2. The company name is 'gandlake', and the machines are called 'gandlake-db1' and 'gandlake-db2'.
Thanks again.

Similar Messages

  • SQL Azure sync service-- absurdly slow and fails after a few days

    Hello. We have been trying to use Azure Data Sync to replicate an on-premise MSSQL database to an SQL Azure database for read-only access by a customer. This was working for a while, but stopped syncing after a couple months(12hr auto-sync schedule) with
    no errors in the log. I had to re-create the sync group, but now it takes even longer than originally to try to sync, and never actually completes, as it gets interrupted by bi-weekly server restarts... It used to take a few hours to sync the new data in our
    database(which is appended to daily)-- but this time it fails after 4+ days... It was unacceptably slow initially(IMO), but now it's clearly unusable.  The original initialization of the data when I first set it up was less than 2 days of syncing. 
    It seems there is a throttle on the Azure sync service. Is this true?  Would it be best to clear the SQL Azure database now and re-sync? Is there a way to pre-load the SQL Azure database with MSSQL on-premise data via a SQL backup file or something?
    Please advise. Thank you.

    when you re-created the sync group, does the member databases/hub database have pre-existing data?
    when synching a sync group for the first time, make sure databases don't contain the same set of data, otherwise, you will run into conflicts which will completely slow down your sync...
    I deleted the initial sync group because it wasn't syncing(auto or on-demand), nor creating a log entry with an error indicating why.
    So, I simply deleted the sync group and re-created it with the same exact databases and settings. I did not delete all data in the SQL Azure database-- I was under the assumption that the sync service, with the tracking tables were smart enough to not get
    confused with pre-existing data, but apparently that's not how this works?
    I obviously can't delete the data in the source database(MSSQL on-premise), but I could delete the tables in the SQL Azure database if that's supposed to fix the problem-- then we'll just have to wait multiple days for it to be completely re-initialized,
    hopefully without error... Is there a way to seed the data in some way to prevent this extremely log first sync?
    Thank you for your help.

  • NIC ROUTER requirement for 10g RAC database

    I need to know what hardware to order in setting up an 10g RAC database without a single-point-of-failure. My question centers on the networks. If each server has one NIC for the public interface and one NIC for the private interconnect, aren't the routers these NICs are attached to a single point of failure? If each server has two NICs bonded together for the public interface and two NICs bonded together for the private interconnect, does each NIC attach to a different router?

    sayantan chakraborty wrote:
    is RDMA and infiband ar same??Infiniband is a switched fabric layer for high speed communication. RDMA is a protocol for "+remote direct memory access+".
    Infiniband supports the Internet Protocol suite over Infiniband, or IPoIB. It also supports RDMA over Infiniband.
    so do we need only infiband SDR / QDR switch to set infiband OR what else??You need an Infiniband switch - Cisco and Voltaire are two companies that supply this type of hardware. The latter is used by Oracle in its Exadata database machines. Unsure about Cisco's commitment to Infiniband switches as they have discontinued their Infiniband switches with integrated fibre channel gateways. A sore point with us after we bought into this technology about 2 years ago, in part on their very own recommendations - and now own very expensive, somewhat buggy and totally unsupported hardware. Seems like the new Cisco California servers are GigE and not Infiniband, which perhaps explain this (flawed) decision of theirs...
    For each cluster node, you need an Infiniband PCI card (typically has 2 ports). If you are going to use bonding, you will need a pair of Infiniband cables per server. Also remember to get spares - both PCI cards and cables.
    If you want redundancy at the switch, you need to get yourself 2 Infiniband switches.
    But if you're planning to invest this into a RAC cluster, then surely you should also invest money in RAC licensing and support - and with that gain access to very useful documentation like the RAC Starter Kit and so on.

  • How to start Oracle 10g RAC database and clusterware?

    I have steps to stop the 10g RAC Database and clusterware but not sure about starting it.
    I have heard executing
    $crsctl stop crs --as root
    on each node
    will start the database,asm,nodeapps .Is that true?
    or we have to do that step by step like we do in stopping the clusterware and database below
    1.Stop the agent:
    cd to $AGENT_HOME/corpng04.amhc.amhealthways.net/bin, then run: ./emctl stop agent
    2.Stop the full database
    $ oracle_home/bin/srvctl stop database -d db_name
    3.Stop the ASM Instances on node1,node2
    $ oracle_home/bin/srvctl stop asm -n node -- I guess you can't give multiple nodes in one command with comma,you need to give this multiple times with diff node name
    4.Stop the NodeApps :vip,listener,oms and gsd
    $ oracle_home/bin/srvctl stop nodeapps -n node -- I guess you can't give multiple nodes in one command with comma,you need to give this multiple times with diff node name
    5.Stop the CRS cluster processes :those bloody 3 evmd,ocssd,crsd
    $su - root
    $CRS_home/bin/crsctl stop crs

    Paul R @ NL wrote:
    before is shutting down crs i tend to stop the instances and services via srvctl then stop crs via crsctl
    just the way i do it. not saying it's the right way but it is the one i am comfortable with.Good -) If we stop CRS, but forgot shutdown oracle instances ... we'll see shutdown abort in alert log file(that mean instances are shutdowned abort).
    We should shutdown instance before stop CRS anyway.

  • Http get requests fail after a few weeks

    All,
    I have a get request to a servlet that works for a few weeks, then it will suddenly stop.
    I change the code once, works,then it will fail after a few weeks.
    I change the code again, works, then it will fail after a few weeks.
    Servlet works like: send one request, wait, then send a second.
    Here are the last 2 code iterations:
    try {
            // Construct data
            String data = URLEncoder.encode("key1", "UTF-8") + "=" + URLEncoder.encode("value1", "UTF-8");
            data += "&" + URLEncoder.encode("key2", "UTF-8") + "=" + URLEncoder.encode("value2", "UTF-8");
              //String data = "";
            // Send data
            //URL url = new URL("http://localhost:8080/stocks?action=1&date=20080310");
            URL url = new URL("http://localhost:8080/stocks/monitor?action=1&date="+stringDate);
            URLConnection conn = url.openConnection();
            conn.setDoOutput(true);
            OutputStreamWriter wr = new OutputStreamWriter(conn.getOutputStream());
            wr.write(data);
            wr.flush();
            // Get the response
            BufferedReader rd = new BufferedReader(new InputStreamReader(conn.getInputStream()));
            //System.out.println(rd.read());
            String line;
            int count =0;
            while ((line = rd.readLine()) != null) {
                // Process line...
                 System.out.println(count + line);
                 count++;
            wr.close();
            rd.close();
        } catch (Exception e) {
        try {
            // Construct data
            String data = URLEncoder.encode("key1", "UTF-8") + "=" + URLEncoder.encode("value1", "UTF-8");
            data += "&" + URLEncoder.encode("key2", "UTF-8") + "=" + URLEncoder.encode("value2", "UTF-8");
             //String data = "";
            // Send data
            URL url = new URL("http://localhost:8080/stocks/monitor?action=2");
            URLConnection conn = url.openConnection();
            conn.setDoOutput(true);
            OutputStreamWriter wr = new OutputStreamWriter(conn.getOutputStream());
            wr.write(data);
            wr.flush();
            // Get the response
            BufferedReader rd = new BufferedReader(new InputStreamReader(conn.getInputStream()));
            String line;
            int count =0;
            while ((line = rd.readLine()) != null) {
                // Process line...
                 System.out.println(count + line);
                 count++;
            wr.close();
            rd.close();
        } catch (Exception e) {
        }I send this request twice with different params
    public static String sendGetRequest(String endpoint, String requestParameters)
    String result = null;
    if (endpoint.startsWith("http://"))
    // Send a GET request to the servlet
    try
    // Construct data
    StringBuffer data = new StringBuffer();
    // Send data
    String urlStr = endpoint;
    if (requestParameters != null && requestParameters.length () > 0)
    urlStr += "?" + requestParameters;
    URL url = new URL(urlStr);
    URLConnection conn = url.openConnection ();
    // Get the response
    BufferedReader rd = new BufferedReader(new InputStreamReader(conn.getInputStream()));
    StringBuffer sb = new StringBuffer();
    String line;
    while ((line = rd.readLine()) != null)
    sb.append(line);
    rd.close();
    result = sb.toString();
    } catch (Exception e)
    e.printStackTrace();
    return result;
    }Any ideas?
    Edited by: iketurna on Mar 13, 2008 7:21 AM

    You appear to have empty catch blocks. Which means you don't get the error message that would tell you what is failing.
    Put in code that logs the exception and the stack trace of the exception. If you can't figure out the error message, post it here.
    You should be closing streams in finally statements. Otherwise they might not get closed when there is an error -> you leak descriptors -> you run out of descriptors -> every stream open will fail -> more errors -> more descriptors get leaked -> etc -> everything stops working. Always do it like this:
        WhateverStream out = null;
        try {
            out = ...;
            ...use out...;
        } finally {
            try {
                if (out != null) out.close();
           } catch (IOException e) { ...log it... }
        }

  • Migration of  10g RAC database to new sever which includes 11g upgrade

    Hello all,
    I have requirement here to migrate 10g RAC database from one server to another server as part of migration i want to perform upgrade to 11g as well.
    To Help me out.Please post your inputs or way to perform the task.
    Precautions to be taken while doing the activity.
    Please find the below for more info
    Old Box:
    Current environment
    RAC -2node
    ASM
    Database Version 10.2.02
    Os:HP-UX 11.21
    New Box:
    New environment should be
    RAC -2node
    ASM
    Database Version 11.2
    Os:HP-UX 11.31
    Thanks for the help in advance
    Anand

    Pl do not post duplicate threads - Migration of  10g RAC database to new sever which includes 11g upgrade

  • I downloaded CS6 and am having issues with my print driver. It is not compatible with the HP 2600n and have tried to download drivers given to me by adobe ( (Jupiter 3) but it is not working. after a few days. Its a temporary fix and is still looking for

    I downloaded CS6 and am having issues with my print driver. It is not compatible with the HP 2600n and have tried to download drivers given to me by adobe ( (Jupiter 3) but it is not working. after a few days. Its a temporary fix and is still looking for the HP driver when i boot up. It also will not save in any print or postscript format. Does anyone know how to fix?
    Currently use a Mac with the latest Mavericks 10.9.4

        Oh boy! Acting kind of weird seems to be an understatement, aquaequus!
    What type of troubleshooting were we able to do with you? I want to make sure that we can get some sort of resolution for this problem.
    It is quite possible the battery door may get your phone in working order again. I'm not sure if the store has it in stock, but it is available in our warehouse for $14.99 which can be ordered via customer service.
    Tamara H.
    Follow us on Twitter @VZWSupport

  • I just purchased the new ipad. I set up my email and put show 1000 messages. After a few days my email in my inbox disappears. I have gone thought all the settings. Please help.

    I just purchased the new ipad. I set up my email and put show 1000 messages. After a few days my email in my inbox disappears. I have gone thought all the settings. Please help.

    iPad Mail
    http://www.apple.com/support/ipad/mail/
     Cheers, Tom

  • HT4970 I've made entries three times over the past several months in Reminders only to have them dissappear after a few days. What am I doing wrong? Any chance they are being saved elsewhere so I can retreive them?

    Why do all of my Reminders disappear after a few days? Can I retrieve them?

    Why do all of my Reminders disappear after a few days? Can I retrieve them?

  • HT3964 I just opened up my MacBook Pro after a few days of not using it and it was frozen. I turned it off and back on and all I see is a white screen, there was a file with a question mark flashing for a bit but I can't do anything, please help me!!

    I just opened up my MacBook Pro after a few days of not using it and it was frozen. I turned it off and back on and all I see is a white screen, there was a file with a question mark flashing for a bit but I can't do anything, please help me!!

    Jerricayoung,
    you have a 13-inch Mid 2012 MacBook Pro. It’s modern enough that it supports booting into Recovery mode. To do so, hold down a Command key and the R key as you start up. It should eventually show a Mac OS X Utilities menu. Select Disk Utility from that menu; when the Disk Utility window appears, select the bootable volume from the left-hand side of the window. (It’s typically called “Macintosh HD”.) When the volume is selected, some buttons will appear on the right-hand side. If it’s not greyed out, press the Verify Disk button; if it is greyed out, or if it reports on errors that it found, press the Repair Disk button. Once the verification/repair is completed, exit Disk Utility and select Restart from the Apple menu; that will restart your MacBook Pro in its normal mode. With luck, that will be enough to get you to your normal login screen, rather than the white screen.

  • My macbook pro shut off for a few days now after a few days that i tried turning it on it finally turned on its been doing this often what might be the problem?

    So my macbook pro turns off and i try to turn it on but it wont turn on then after a few days it turns back on. its been doing this alot sometimes it turns on after a few days sometimes it turns on after a few hours but i dont know what might be wrong

    Hi oscarqwolf,
    I'm sorry to hear you are having these issues with your MacBook Pro. If you are having intermittent but persistent power or startup issues with your Mac, you may find the troubleshooting steps outlined in the following article helpful:
    If your Mac won't turn on - Apple Support
    Regards,
    - Brenden

  • Purchased music skips after a few days

    My iPod skips my iTunes' purchased music after a few days. When I connect my iPod to the iMac again the songs are back on my iPod but only for a couple of days.
    Can some of you help me out to solve this problem?

    The same thing happens to me, and unfortunately, it almost always seems to happen at the worst times (getting on a long plane flight.)
    I've done a lot of research on this topic in the Apple Discussions groups, and have tried all the suggested remedies:
    (1) Re-setting ipod
    (2) Re-loading music
    (3) Updating firmware, software
    Net/net, when you update your iPod, and connect to iTunes, or do any of above, it will work for a little while, and then it will just start skipping purchased music again.
    I believe this must be result of defective players, or bug in some of players, as not everyone has this problem (my wife's works fine) - but based on number of comments re. this problem I see on this forum, many people do, and Apple should offer exchange.

  • Remote control stops working after a few days

    NW 6 SP3
    ZFD 3.2 SP2
    After importing workstations, remote control works fine. After a few
    days, remote ping might work but NOT remote control. Sometimes even remote ping errors out with the message that the "REMOTE MANAGEMENT
    AGENT
    IS NOT RUNNING"(when it appears it is). If I stop and start the
    "novell
    wuser agent" service, then I can usually remote control the
    workstation
    again. Occasionally, however, I get the message "The Remote
    Management
    agent was unable to locate the workstation in NDS". At which point I
    have
    to delete the workstation object and re-import the workstation to be
    able
    to remote control it again.
    Any help is appreciated.
    Jeff

    > [email protected]
    >
    > >
    > > The login script is running wsreg32.exe with the /s parameter
    > > pointing to the server servicing the import services.
    >
    > Is there a reason you are using this?
    > instead of DNS, hosts, or registry?
    >
    > As I am wondering if maybe at times the workstation manager tries to
    > registr, but can't find the server, even the wsreg is running.
    > Normally onces the workstation has registered it doesn't need the
    > address again, but if for whatever reason cannot find it's
    workstation
    > object it will need to resolve the import services.
    >
    > --
    > Jared L Jennings, CNE
    > Novell Support Forums SysOp.
    > http://support.novell.com/forums/faq_nntp.html
    >
    > Posting with XanaNews Reader 1.15.8.2
    > Geek by Nature, NetWare by Choice.
    I have a different server for each site(city) hosting the import
    services. There is only one variable allowed in DNS so all sites
    would
    try to get services and register to the same server.
    Using the container login script for each site to set the /s parameter
    was
    easier than changing the registry setting for each
    workstation...especially if that server changed. This way I would
    only
    have to change it one place.
    If the workstation has already been registered and imported AND I can
    remote control it, does it NEED to find the import server again if it
    hasn't been rebooted since the last time I remote controlled it?
    Thanks, jg

  • Flash player stops working after a few days.

    I'm recently having issues with flash player on IE and Firefox each time a new version is released. This time the flash player stops working after a few days without any reasons. I have asked two other questions and I have tried the solutions but this time they are not working:
    http://forums.adobe.com/message/4617987#4617987
    http://forums.adobe.com/message/4548211#4548211
    Could anyone tell me what the reason is?
    I also need links providing Flash player for IE and Firefox which are always updated. I need to first download the installer then install it offline not install it directly from adobe website.
    Please help me solve this once forever!
    Message was edited by: sia1989

    I'd suggest trying a clean install: How do I do a clean install of Flash Player?
    For offline installers, please see this help document: http://helpx.adobe.com/content/help/en/flash-player/kb/installation-problems-flash-player- windows.html#main-pars_header

  • I updated to FF4 and after a few days, strange things started happening. How do I recover to normal?

    I updated to FF4 and after a few days, strange things started happening.
    The groups I had set up disappeared and the Group button is gone.
    When I first open the app, it opens 8 - 12 separate windows with 1 blank tab. The orange Firefox button is there with nothing else on that bar. Also, 3 tool bars are blank (i.e. space is there but it's just grey) with NO buttons.
    If I want my previously opened tabs, I have to use the "Recently Closed windows" option to get back to where I was when I last closed down. The orange Firefox button is missing from this window but all the original menu items are back across the top.
    So how do I get back to the way it is supposed to operate?

    Have a look here
    Mac maintenance Quick Assist
    http://support.apple.com/kb/HT1147

Maybe you are looking for

  • Condition type Purchase per delivery and per purchase order??

    Hi Guru's, Please advice. I have 3 questions: 1) Who knows a condition type in a purchase order with the goal that I can register costs per purchase order item ? Example 10 Euro for a certificate per item? 2) The same but now per purchase order 10 Eu

  • Using pre-query with 2 characteristics in rows?

    Hi @all, I need your help regarding the following problem: There are multiple conditions in one query, I'd like to do in a certain sequence. Because these multiple conditions in BEx where processed as "AND", for example companies must be part of the

  • MOULD DIE as a PRT tool in CF01

    Dear All, I have created -MOULD DIE as a PRT tool in CF01.(I dont want to  create the Mould die as a Material it can only be a PRT tool) I need to issue the Mould whenever it is required to production. I need to have track on the PRT issued to produc

  • BEx setting in an ODS

    At the moment I am loading to an Inventory Snapshot ODS which does not have the BEx reporting flag set. After the loads are complete am I able to change the setting and switch the BEx flag on and thereafter be able to report on the ODS or would I nee

  • AAMEE + Acrobat X

    We've just received our copy of Acrobat X PRO and I've found AAMEE does not recognise the installer. After locating the Product Install folder, selecting the OS I want to create a package for and click Next, I'm getting an error stating "A valid inst