Weblogic suspending both datasource in multidatasource when one DB is down

Hi,
I'm trying to connect oracle UCM to multi datasource that has 2 datasources each talking to different databases that are kept in sync with oracle streams. when i stopped database connected to first datasource, the other datasource in the multi datasource is also suspended. Weblogic is not failing over to the second datasource. I'm using weblogic 10.3.4, and UCM 11.1.1.4.0. Please advice.
Thanks,
Raj

Can you turn on JDBC logging and show the SQLExceptions that come when
the DataSources fail to make connections after you shut one of the DBMSes
down? There should be some exceptions from the second datasource as well.
I believe you already have your test-on-reserve set, and that when you kill the
first DBMS, that somehow affects the second DBMS in a way that makes it not
work.

Similar Messages

  • Failover did not happen when one node went down!!! PLEASE HELP

    Hi gurus,
    Yesterday one disaster struck my RAC database. We have two node cluster and it is 10.2.0.2, both of them located in different sites, yesterday suddenly power went down and the one of the network switch went down and got destructed, node one of RAC database was connected to that switch, but the failover did not happen to the node two as this should be the case when one node goes down the other should be available for all the node one sessions/connections.
    when I tried to ping/telnet the node 1, it was not happening because the switch was down, the network guyz connected the cables to other switch available. When I connected to the node 1, it was showing "Oracle is not available" message.
    And when I tried the other node, it was the same case but I did not see any error in alert log file. Then my TL restarted both the nodes and then the database was available.
    I am very confused that how the failover did not happen and how the database went down, PLEASE suggest something to how to identifiy what was happened. Thanks & Regards

    Thanks for your reply,
    after the network switch was replaced we connected to both the nodes and found that the instances are down with no reason given in the Alertlog file. We just restarted both the instances and then the database was up and the clients connected to both the instances with equal sessions on both the instances. I want to know that whether the failover can be done at the application side or it should be done on the database side i,e; in tnsnames.ora file with the required parameters? as in our scenario there is no failover configuration in the tnsnames.ora file.
    Thanks & Regards

  • Both phone numbers ring when one is called

    I have two phone on my account.  I just uploaded iOS8 and now when one phone number is called, both phones ring.  This will certainly cause confusion and lead to frustration.  How can I turn this feature off?

    That setting does not control making a FaceTime call while in cellular. That controls the continuity with answering your iPhone cellular calls on another device. This phenomenon will only occur when both devices are logged into the same Apple ID and are currently on the same Wi-Fi network.

  • Health Service Heartbeat Failure Alert for Generated when one Management Server Down,

    Hi,
    I have Two Management Server, every one manage about 100 server, when one Management Server goes down unexpected, I receive 100 Alert for 100 Server Health Service Heartbeat Failure.
    My Question, why when the Management Server down, it send that all Managed agent Health Service Heartbeat Failure?
    Is there a way to change this?

    SCOM 2012 agent will autofailover when primary server is down. You can check the failover management server by using the following powershell cmdlet:
    #Verify Failover for Agents reporting to MS1
    $Agents = Get-SCOMAgent | where {$_.PrimaryManagementServerName -eq 'MS1.DOMAIN.COM'}
    $Agents | sort | foreach {
    Write-Host "";
    "Agent :: " + $_.Name;
    "--Primary MS :: " + ($_.GetPrimaryManagementServer()).ComputerName;
    $failoverServers = $_.getFailoverManagementServers();
    foreach ($managementServer in $failoverServers) {
    "--Failover MS :: " + ($managementServer.ComputerName);
    Write-Host "";
    http://www.systemcentercentral.com/how-does-the-failover-process-work-in-opsmgr-2012-scom-sysctr/

  • How to select between sequences on different DB links when one DB is down

    Hi!
    I'm running Oracle 11.1.0.7.0 on HPUX v3 in a Stream replicated environment.
    One of the applications would need to connect to both DBs ensuring some consistency across sequences defined locally.
    We cannot use odd/even approach in the sequences so that I thought we may take the greatest value of the two sequences (one on each node).
    The SQL would look like:
    select greatest(user.SEQ_NUMBER.nextval@DB_B,user.SEQ_NUMBER.nextval@DB_A) from dual;
    it works fine but only in a sunny day scenario, i.e. when both nodes are up and running. In case of issues with one of the DBs I would get:
    ERROR at line 1:
    ORA-12541: TNS:no listener
    Is there any way to intercept this error and take the value from the local DB ?
    Thanks,
    Mike

    Hi Mike,
    Maybe something like this would help you:
      1  create or replace function f1 return number as
      2  lv_stmt varchar2(128);
      3  lv_ret  number;
      4  begin
      5   lv_stmt := 'select 1 from dual@dummy_dblnk';
      6   execute immediate lv_stmt into lv_ret;
      7  exception
      8   when others then
      9    if sqlcode = -2019 then
    10      return -1;
    11    else
    12     raise;
    13    end if;
    14* end;
    SQL> /
    Function created.
    SQL> save f1 replace
    Wrote file f1.sql
    SQL> SELECT f1 FROM dual;
            F1
            -1Thanks,
    Lukasz

  • ACE slowness issue when one server goes down

    Hi,
    We are having two application servers.Both are load balanced using ACE.
    When we bring down one server, we find that when we upload some files into the second application server, its too slow.
    But when primary server comes up again the performance increases.This issue happens only when we bring the primary server down.
    We are using cookie based stickiness.Any ideas where we can look into.
    Rgds.,
    Sachin

    Depending on the load-balancing algorithm or predictor that you configure, the ACE performs a series of checks and calculations to determine which server can best service each client request. The ACE bases server selection on several factors including the source or destination address, cookies, URLs, HTTP headers, or the server with the fewest connections with respect to load.
    http://www.cisco.com/en/US/docs/app_ntwk_services/data_center_app_services/ace_appliances/vA1_7_/configuration/slb/guide/classlb.html

  • Change DNS host name automatically when one device is down

    Hi all
    I am first copying below the question which I have posted in a thread related to DNS few days back .
    You cannot vote on your own post                    
    Hi
    our network team has setup Bluecoat proxy server in our main office and DR site . We have configured GPO to populate the details of proxy server hostname in the proxy setting  eg : Proxy server .  IP1 is the IP address for the device in Head office
    and IP2 is the IP address assigned for device in DR site. currently  we have given a DNS entry as
    Proxyserver - IP1.
    So DNS resolves the IP address of device in Head office and authentication and internet access works.
    But we want to use the DR device  to be used automatically   (only )when head office device is not reachable. But how to configure the settings for DNS I am not sure.
    I don't want to use round robin as it will send the traffic to both head office and DR in a load balancing fashion .
    for this request I had received an advice to use a script to monitor the first device continuously and if not reachable  then change the Hostname in DNS to that of second device  .
    Let me know if anyone can help me on this.
    Thanks in advance
    Midhin 

    Hi Midhin,
    Agree with Jrv, I also recommend you can consult proxy vendor for a better solution.
    In addition, if you still want to update host name in DNS via Powershell, please refer to the script to update A record in DNS:
    $Zone = "dns zone"
    $oldobj = Get-DnsServerResourceRecord -ZoneName $Zone -RRType "A" -Name "hostname"
    $newobj = Get-DnsServerResourceRecord -ZoneName $Zone -RRType "A" -Name "hostname"
    $NewObj.RecordData.IPv4Address = "newipaddress"
    Set-DnsServerResourceRecord -NewInputObject $NewObj -OldInputObject $OldObj -ZoneName $Zone
    If there is anything else regarding the powershell script issue, please feel free to post back.
    Best Regards,
    Anna Wang
    Please remember to mark the replies as answers if they help and unmark them if they provide no help. If you have feedback for TechNet Support, contact [email protected]

  • JMS Uniform Distribute Queue Unit Of Order, problem when one node goes down

    Hi ,
    I have the following code which post a message (with Unit of Order set ) to a Uniform Distribute Queue in a cluster with two member servers (server1 and server2).
    --UDQ is targeted to a subdeployment that is mapped to two JMS servers pointing to each member servers
    --Connection Factory is using default targeting ( i tried mapping to Sub deployment also)
    javax.naming.InitialContext serverContext = new javax.naming.InitialContext();
    javax.jms.QueueConnectionFactory qConnFactory = (javax.jms.QueueConnectionFactory)serverContext.lookup(jmsQConnFactoryName);
    javax.jms.QueueConnection qConn = (javax.jms.QueueConnection)qConnFactory.createConnection();
    javax.jms.QueueSession qSession = qConn.createQueueSession(false, Session.AUTO_ACKNOWLEDGE);
    javax.jms.Queue q = ( javax.jms.Queue)serverContext.lookup(jmsQName);
    weblogic.jms.extensions.WLMessageProducer qSender = (weblogic.jms.extensions.WLMessageProducer) qSession.createProducer(q);
    qSender.setUnitOfOrder("MyUnitOfOrder");
    javax.jms.ObjectMessage message = qSession.createObjectMessage();
    HashMap<String, Object> map = new HashMap<String, Object>();
    map.put("something", "SomeObject");
    message.setObject(map);
    qSender.send(message);
    } catch (Exception e) {           
    Steps followed:
    1. Post a message from "server1"
    2. Message picked up by "server2"
    3. Everything fine
    4. Shutdown "server2"
    5. Post a message from "server1"
    6. ERROR: "hashed member of MyAppJMSModule!MyDistributedQ is MyAppJMSModule!MyJMSServer-2@MyDistributedQ which is not available"
    WebLogic version : 10.3.5
    Is there a way (other than configuring Path Service ) to make this code work "with unit of order" for a UDQ even if some member servers go down ?
    Thanks very much for your time.

    If you want to avoid use of the Path Service, then the alternative is to make the destination members highly available. This will help ensure that the host member for a particular UOO is up.
    One approach to HA is to configure "service migration". For more information see the Automatic Service Migration white-paper at
    http://www.oracle.com/technology/products/weblogic/pdf/weblogic-automatic-service-migration-whitepaper.pdf
    In addition, I recommend referencing Best Practices for JMS Beginners and Advanced Users
    http://docs.oracle.com/cd/E17904_01/web.1111/e13738/best_practice.htm#JMSAD455 to help with WL configuration in general.
    Hope this helps,
    Tom

  • Call in progress when one CUCM go down

    Hi everybody
    We have two CUCM 8.6 and we need to poweroff one of these to change a parts in the UCS server. I know that the phone who's registered with the one I will power off will just "blink" and change to the other CUCM. But I don't know if the phone registered with the one I will power off, if they had a call in progress, did the call will be dropped or will stay in progress?
    The answer will guide me to fix a maintenance Windows if it need to be.
    Thanks

    Hi There,
    The calls should be maintained
    Table 11-1 Call Preservation Scenarios 
    Scenario
    Call Preservation Handling
    Cisco Unified Communications Manager fails.
    A Cisco Unified  Communications Manager failure causes the call-processing function for  all calls that were set up through the failed Cisco Unified  Communications Manager to be lost.
    The affected devices recognize that their current Cisco Unified  Communications Manager failed. Similarly, the other Cisco Unified  Communications Managers in the cluster detect the Cisco Unified  Communications Manager failure.
    Cisco Unified Communications Manager maintains affected active calls  until the end user hangs up or until the devices can determine that the  media connection has been released. Users cannot invoke any  call-processing features for calls that are maintained as a result of  this failure.
    Communication failure occurs between Cisco Unified Communications Manager and device.
    When  communication fails between a device and the Cisco Unified  Communications Manager that controls it, the device recognizes the  failure and maintains active connections. The Cisco Unified  Communications Manager recognizes the communication failure and clears  call-processing entities that are associated with calls in the device  where communication was lost.
    The Cisco Unified Communications Managers still maintain control of the  surviving devices that are associated with the affected calls. Cisco  Unified Communications Manager maintains affected active calls until the  end user hangs up or until the devices can determine that the media  connection has been released. Users cannot invoke any call-processing  features for calls that are maintained as a result of this failure.
    Device failure
    (Phone, gateway, conference bridge, transcoder, MTP)
    When a device  fails, the connections that exist through the device stop streaming  media. The active Cisco Unified Communications Manager recognizes the  device failure and clears call-processing entities that are associated  with calls in the failed device.
    The Cisco Unified Communications Managers maintain control of the  surviving devices that are associated with the affected calls. Cisco  Unified Communications Manager maintains the active connections (calls)  that are associated with the surviving devices until the surviving end  users hang up or until the surviving devices can determine that the  media connection has been released.
    http://www.cisco.com/c/en/us/td/docs/voice_ip_comm/cucm/admin/8_0_1/ccmsys/accm-801-cm/a02dvsup.html#wp1021885
    Cheers!
    Rob

  • Automating the process of changing SAP servers when one server is down

    Hi,
    We have observed that sometimes users are unable to run a CRM IC web application due to server being down.
    So we want to automate the process that if a server is down then it will change to another server so that the application can run and the downtime is reduced.
    Can you please let me know if anyone has done anything on this front.
    Kindly send your suggestions.
    Thanks in advance.
    Regards,
    Jaisish Mohanty

    You can use sapwebdispatcher together with logon groups for this purpose:
    http://help.sap.com/saphelp_nw70ehp1/helpdata/en/be/6041418e998739e10000000a1550b0/frameset.htm
    Markus

  • SC 3.2 Solaris 10 x86. When one node reboot, the other one does also

    Configured a two node cluster with a EMC clariion san (Raid 6) for holding a zpool and use as quorum device.
    When one node goes down, the other one does also.
    There seems a problem with the quorum.
    I can not understand or figure out what actually goes wrong.
    When starting up:
    Booting as part of a cluster
    NOTICE: CMM: Node cnode01 (nodeid = 1) with votecount = 1 added.
    NOTICE: CMM: Node cnode02 (nodeid = 2) with votecount = 1 added.
    NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3.
    NOTICE: clcomm: Adapter nge3 constructed
    NOTICE: clcomm: Adapter nge2 constructed
    NOTICE: CMM: Node cnode01: attempting to join cluster.
    NOTICE: nge3: link down
    NOTICE: nge2: link down
    NOTICE: nge3: link up 1000Mbps Full-Duplex
    NOTICE: nge2: link up 1000Mbps Full-Duplex
    NOTICE: nge3: link down
    NOTICE: nge2: link down
    NOTICE: nge3: link up 1000Mbps Full-Duplex
    NOTICE: nge2: link up 1000Mbps Full-Duplex
    NOTICE: CMM: Node cnode02 (nodeid: 2, incarnation #: 1248284052) has become reachable.
    NOTICE: clcomm: Path cnode01:nge2 - cnode02:nge2 online
    NOTICE: clcomm: Path cnode01:nge3 - cnode02:nge3 online
    NOTICE: CMM: Cluster has reached quorum.
    NOTICE: CMM: Node cnode01 (nodeid = 1) is up; new incarnation number = 1248284001.
    NOTICE: CMM: Node cnode02 (nodeid = 2) is up; new incarnation number = 1248284052.
    NOTICE: CMM: Cluster members: cnode01 cnode02.
    NOTICE: CMM: node econfiguration #1 completed.
    NOTICE: CMM: Node cnode01: joined cluster.
    ip: joining multicasts failed (18) on clprivnet0 - will use link layer broadcasts for multicast
    /dev/rdsk/c2t0d0s5 is clean
    Reading ZFS config: done.
    obtaining access to all attached disks
    cnode01 console login:
    Then this on the second node:
    Booting as part of a cluster
    NOTICE: CMM: Node cnode01 (nodeid = 1) with votecount = 1
    NOTICE: CMM: Node cnode02 (nodeid = 2) with votecount = 1
    NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3.
    NOTICE: clcomm: Adapter nge3 constructed
    NOTICE: clcomm: Adapter nge2 constructed
    NOTICE: CMM: Node cnode02: attempting to join cluster.
    NOTICE: CMM: Node cnode01 (nodeid: 1, incarnation #: 1248284001) has become reachable.
    NOTICE: clcomm: Path cnode02:nge2 - cnode01:nge2 online
    NOTICE: clcomm: Path cnode02:nge3 - cnode01:nge3 online
    WARNING: CMM: Issuing a NULL Preempt failed on quorum device /dev/did/rdsk/d1s2 with error 2.
    NOTICE: CMM: Cluster has reached quorum.ion ratio 4.77, dump succeeded
    NOTICE: CMM: Node cnode01 (nodeid = 1) is up; new incarnation number = 1248284001.
    NOTICE: CMM: Node cnode02 (nodeid = 2) is up; new incarnation number = 1248284052.
    NOTICE: CMM: Cluster members: cnode01 cnode02.
    NOTICE: CMM: node reconfiguration #1 completed.
    NOTICE: CMM: Node cnode02: joined cluster.
    NOTICE: CCR: Waiting for repository synchronization to finish.
    *{color:#ff0000}WARNING: CMM: Issuing a NULL Preempt failed on quorum device /dev/did/rdsk/d1s2 with error 2.{color}*
    ip: joining multicasts failed (18) on clprivnet0 - will use link layer broadcasts for multicast
    /dev/rdsk/c2t0d0s5 is clean
    Reading ZFS config: done.
    obtaining access to all attached disks
    cnode02 console login:
    But when the first node reboot, on the second node this message:
    Jul 22 19:24:48 cnode02 genunix: [ID 936769 kern.info] devinfo0 is /pseudo/devinfo@0
    Jul 22 19:30:57 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge3: link down
    Jul 22 19:30:57 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge2: link down
    Jul 22 19:30:59 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge3: link up 1000Mbps Full-Duplex
    Jul 22 19:31:00 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge2: link up 1000Mbps Full-Duplex
    Jul 22 19:31:06 cnode02 genunix: [ID 489438 kern.notice] NOTICE: clcomm: Path cnode02:nge2 - cnode01:nge2 being drained
    {color:#ff0000}Jul 22 19:31:06 cnode02 scsi_vhci: [ID 734749 kern.warning] WARNING: vhci_scsi_reset 0x0{color}
    Jul 22 19:31:06 cnode02 genunix: [ID 489438 kern.notice] NOTICE: clcomm: Path cnode02:nge3 - cnode01:nge3 being drained
    Jul 22 19:31:11 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge3: link down
    {color:#ff0000}Jul 22 19:31:12 cnode02 genunix: [ID 414208 kern.warning] WARNING: QUORUM_GENERIC: quorum preempt error in CMM: Error 5 --- QUORUM_GENERIC Tkown ioctl failed on quorum device /dev/did/rdsk/d1s2.{color}
    {color:#ff0000}Jul 22 19:31:12 cnode02 cl_dlpitrans: [ID 624622 kern.notice] Notifying cluster that this node is panicking
    Jul 22 19:31:12 cnode02 unix: [ID 836849 kern.notice]
    Jul 22 19:31:12 cnode02 ^Mpanic[cpu3]/thread=ffffffff8b5c06e0:
    Jul 22 19:31:12 cnode02 genunix: [ID 265925 kern.notice] CMM: Cluster lost operational quorum; aborting.{color}
    Jul 22 19:31:12 cnode02 unix: [ID 100000 kern.notice]
    Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651b40 genunix:vcmn_err+13 ()
    Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651b50 cl_runtime:__1cZsc_syslog_msg_log_no_args6FpviipkcpnR__va_list_element__nZsc_syslog_msg_status_enum__+24 ()
    Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651c30 cl_runtime:__1cCosNsc_syslog_msgDlog6MiipkcE_nZsc_syslog_msg_status_enum__+9d ()
    Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651e20 cl_haci:__1cOautomaton_implbAstate_machine_qcheck_state6M_nVcmm_automaton_event_t__+3bc ()
    Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651e60 cl_haci:__1cIcmm_implStransitions_thread6M_v_+de ()
    Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651e70 cl_haci:__1cIcmm_implYtransitions_thread_start6Fpv_v_+b ()
    Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651ed0 cl_orb:cllwpwrapper+106 ()
    Jul 22 19:31:13 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651ee0 unix:thread_start+8 ()
    Jul 22 19:31:13 cnode02 unix: [ID 100000 kern.notice]
    Jul 22 19:31:13 cnode02 genunix: [ID 672855 kern.notice] syncing file systems...
    Jul 22 19:31:13 cnode02 genunix: [ID 733762 kern.notice] 1
    Jul 22 19:31:34 cnode02 last message repeated 20 times
    Jul 22 19:31:35 cnode02 genunix: [ID 622722 kern.notice] done (not all i/o completed)
    Jul 22 19:31:36 cnode02 genunix: [ID 111219 kern.notice] dumping to /dev/dsk/c2t0d0s1, offset 3436511232, content: kernel
    Jul 22 19:31:45 cnode02 genunix: [ID 409368 kern.notice] ^M100% done: 136950 pages dumped, compression ratio 4.77,
    Jul 22 19:31:45 cnode02 genunix: [ID 851671 kern.notice] dump succeeded
    Jul 22 19:33:18 cnode02 genunix: [ID 540533 kern.notice] ^M

    Hi,
    the problem lies in the error message around the quorum device. The SC documentation, specifically the Sun Cluster Error Messages Guide at http://docs.sun.com/app/docs/doc/820-4681 explains this as follows:
    414208 QUORUM_GENERIC: quorum preempt error in CMM: Error %d --- QUORUM_GENERIC Tkown ioctl failed on quorum device %s.
    Description:
    This node encountered an error when issuing a QUORUM_GENERIC Take Ownership operation on a quorum device. This error indicates that the node was unsuccessful in preempting keys from the quorum device, and the partition to which it belongs was preempted. If a cluster is divided into two or more disjoint subclusters, one of these must survive as the operational cluster. The surviving cluster forces the other subclusters to abort by gathering enough votes to grant it majority quorum. This action is called "preemption of the losing subclusters".
    Solution:
    Other related messages identify the quorum device where the error occurred. If an EACCES error occurs, the QUORUM_GENERIC command might have failed because of the SCSI3 keys on the quorum device. Scrub the SCSI3 keys off the quorum device and reboot the preempted nodes."
    You should try to follow this advice. I would propose to chose a different QD before trying to do this, if you have one available. Is it possible that this LUN has been in use by a different cluster?
    To scrub SCSI3 keys you should use the scsi command in /usr/cluster/lib/sc: ./scsi -c inkeys -d <device> to check for the existence of keys, and ...-c scrub.. to remove any SCSI3 keys.
    Regards
    Hartmut

  • Logicalhostname IP wont failover when one member of the cluster dies

    Hi There,
    I've setup a failover cluster with 2 servers. The cluser IP is set up as a logicalhostname and each server has two network cards configured as IPMP groups.
    I can test the IPMP failover on each server by failing a network card and checkign the IP address fails over.
    I can test the logicalhost name failsover by switchign the resource group over from one node to the other
    BUT
    If I drop one member of the cluster the failover fails
    Nov 4 15:09:06 nova cl_runtime: NOTICE: clcomm: Path nova:qfe2 - gambit:qfe2 errors during initiation
    Nov 4 15:09:06 nova cl_runtime: WARNING: Path nova:ce1 - gambit:bge1 initiation encountered errors, errno = 62. Remote node may be down or unreachable through this path.
    Nov 4 15:09:06 nova cl_runtime: WARNING: Path nova:qfe2 - gambit:qfe2 initiation encountered errors, errno = 62. Remote node may be down or unreachable through this path.
    ova
    Nov 4 15:09:08 nova Cluster.PNM: PNM daemon system error: SIOCLIFADDIF failed.: Network is down
    Nov 4 15:09:08 nova Cluster.PNM: production can't plumb 130.159.17.1.
    Nov 4 15:09:08 nova SC[SUNW.LogicalHostname,test-vle,vle1,hafoip_prenet_start]: IPMP logical interface configuration operation failed with <-1>.
    Nov 4 15:09:08 nova Cluster.RGM.rgmd: Method <hafoip_prenet_start> failed on resource <vle1> in resource group <test-vle>, exit code <1>, time used: 0% of timeout <300 seconds>
    Nov 4 15:09:08 nova ip: TCP_IOC_ABORT_CONN: local = 130.159.017.001:0, remote = 000.000.000.000:0, start = -2, end = 6
    Nov 4 15:09:08 nova ip: TCP_IOC_ABORT_CONN: aborted 0 connection
    scswitch: Resource group test-vle failed to start on chosen node and may fail over to other node(s)
    Any ideas would be appreciated as I dont understand how it all fails over correctly if the cluster is up but fails when one member is down.

    Hi,
    looking at the messages, the problem seems to be with the network setup on nova. I would suggest to try to configure the logical IP on nova manually to see if that works. If that does not it should tell you where the problem is.
    Or are you saying that manually switching the RG works, but when a node dies and cluster switches the RG it doesn't. That would be strange.
    You should also post the status of your network on nova in the failure case. There might be something wrong with your IPMP setup. Or has the public net failed completely when you killed the other node?
    Regards
    Hartmut

  • Failover did not happen when network switch went down!!! PLEASE HELP

    Hi gurus,
    Yesterday one disaster struck my RAC database. We have two node cluster and it is 10.2.0.2, both of them located in different sites, yesterday suddenly power went down and the one of the network switch went down and got destructed, node one of RAC database was connected to that switch, but the failover did not happen to the node two as this should be the case when one node goes down the other should be available for all the node one sessions/connections.
    when I tried to ping/telnet the node 1, it was not happening because the switch was down, the network guyz connected the cables to other switch available. When I connected to the node 1, it was showing "Oracle is not available" message.
    And when I tried the other node, it was the same case but I did not see any error in alert log file. Then my TL restarted both the nodes and then the database was available.
    I am very confused that how the failover did not happen and how the database went down, PLEASE suggest something to how to identifiy what was happened. Thanks & Regards
    Edited by: user1221 on Mar 18, 2009 1:09 AM

    About Oracle RAC ... you have 2 nodes ... You have to connect 4 IPs.
    I mean
    - IP public node1
    - IP public node2
    - IP Virtual node1
    - IP Virtual node2
    When node1 down
    You can not ping "IP public node1", but you should ping "- IP Virtual node1", because it should up on node2.
    But not bind 1521 port.
    Idea about failover
    you have to create new difference Service...on RAC Database!
    and on your client have to set TNS to on failover and balance (if you used OCI), you can use TAF feature
    Example:
    DB =
    (DESCRIPTION = (ADDRESS = (PROTOCOL = TCP)(HOST = db01-vip)(PORT = 1521)) (ADDRESS = (PROTOCOL = TCP)(HOST = db02-vip)(PORT = 1521)) (FAILOVER=ON)(LOAD_BALANCE = yes) (CONNECT_DATA = (SERVER = DEDICATED) (SERVICE_NAME = NEW_SERVICE_DB) (FAILOVER_MODE = (TYPE = SELECT) (METHOD = BASIC) (RETRIES = 180) (DELAY = 5))))
    If you use JDBC, can not use TAF.
    Anyway, I suggest you read more about RAC on http://otn.oracle.com/rac and http://oracleracsig.org
    Good Luck

  • Webapp should dynamically choose second DB when first DB is down.

    I need to create a web app, considering disaster recovery situation. I have to DBs on two remote locations.
    Now what necessary steps should I take, so that when one DB is down, my app will pick Another DB?
    Please help me out...!!!
    Note: I'm using Jdev 11.1.1.4 version & Oracle XE 10g version

    This is something you normally handle on the wls where you set up a cluster and a rac db connection to a rac db.
    You can configure the data sources on the wls to check each connection before checking it out of the pool.  This check is then used to use the other db from the rac.
    This however needs the full Oracle db. You can't do this with XE.
    If you want somehow mimic this yourself you have to handle the db connections yourself which needs some tweaking of the framework.
    Timo

  • IE 5.0 hanging when one is reading HTML from Weblogic 6.0

    I have following configuration:
    Local network (TCP/IP, Wins server)
    SERVER: NT 2000 AdvServer & Weblogic Eval 6.0 Sp2 & Oracle 8.1.7
    CLIENTS
    1) Win 2000Pro & IE 5.0
    2) Win 98 & IE 5.0
    Quite often IE (client one) is hanging when one is reading static HTML pages.
    Also IE (client two) says that there is fail in Kernel and display dump of memory.
    In last case it is occured less often.
    I located these static HTML pages in Webdefault catalog.
    Also I have two simple JSP pages, one of them is genering current time and is
    working constantly.
    This situation was not changed when I did the following:
    1. Disable Java Plugin
    2. Set Weblogic Sp1.
    What did I wrong?
    Thanks in advance
    Eugenia
    [email protected]

    I play the mp3 files as sprites in the score that are
    streaming from the web server. If you selecr "Insert->Media
    Element->Shockwave Audio" from the menu in Director you can
    choose to import a mp3/swa file. This file will import as a cast
    member which will have an external link address. (e.g.
    http://www.adobe.com/mysound/myfile.mp3)
    I have then dragged this cast member onto channel 1 in the score
    spanning for instance 10 frames.
    It does not make a difference wether the soundfile is of type
    mp3 or swa, I have tried both.

Maybe you are looking for