UCCX 8.0 - 1st node crashed

Hi,
We have a 2-server UCCX 8.0 cluster running on UCS servers. Recently, when moving the publisher (1st node) to a new UCS server, we accidently deleted some files of the Virtual machine. (there are 2 folders in the datastore, named UCCX1 and UCCX1_1; my colleague deleted the UCCX1_1 folder as he thought it was not neccessary). After that, the ESXi kept asking for UCCX1_1\UCCX1_1.vmx file when we trying to boot the server. We had to re-add the server (browse to the vmx file in the datastore, and Add to Inventory); the server could boot up now, but I think we lost all the data (we cannot access to the Application web page).
Now we still have UCCX 2 running, could we force the 1st server to update its database to sync with the UCCX 2? If YES, how to do that?
If NO, what should we do? Re-install everything or is there a better way to recover the cluster?
Thanks,
hoanghiep

Hi Hoanghiep,
You can not make the UCCX 8.x series second node as the first node, this was supported only on Windows platform (i.e. UCCX 7.x and earlier releases).
If you have taken a valid DRS backup, than yes reinstall the UCCX 8.x first node (with the same details as before like hostnema, ip address, DNS....etc) and than restore this backup.
http://www.cisco.com/en/US/docs/voice_ip_comm/cust_contact/contact_center/crs/express_8_0/configuration/guide/uccx801drs.pdf
Restoring only the Publisher Node in an HA Setup (with Rebuild)
In a high availability (HA) setup , if there is a hard-drive failure or any other critical hardware or
Software failure which needs rebuild of the Publisher ( first ) node, then follow the below procedure to
recover the publisher node to the last backed up state of the publisher. Run the below procedure if you
have a valid backup taken before the failure of the node.
Procedure
Step 1 Perform a fresh installation of the same version of Cisco Unified Contact Center Express (using the same
administrator credentials, network configuration and security password used earlier) on the node prior
to restoring it. 
For more information on installing Cisco Unified Contact Center Express, see the Installing Cisco
Unified Contact Center Express available here:
http://www.cisco.com/en/US/products/sw/custcosw/ps1846/prod_installation_guides_list.html
Step 2 Navigate to Cisco Unified Contact Center Administration, select Disaster Recovery System from the
Navigation drop-down list box in the upper-right corner of the Cisco Unified Contact Center Express
Administration window, and click Go.
The Disaster Recovery System Logon window displays.
Step 3 Log in to the Disaster Recovery System by using the same Platform Administrator username and
password that you use to log in to Cisco Unified Operating System Administration.
Step 4 Configure the backup device. For more information, see Managing Backup Devices, page 7.
Step 5 Navigate to Restore > Restore Wizard. The Restore Wizard Step 1 window displays.
Step 6 In the Select Backup Device area, choose the backup device from which to restore.
Step 7 Click Next. The Restore Wizard Step 2 window displays.
Step 8 Choose the backup file that you want to restore.
Note The backup filename indicates the date and time that the system created the backup file.
Step 9 Click Next. The Restore Wizard Step 3 window displays.
Step 10 Select the feature UCCX.
Step 11 Click Next. The Restore Wizard Step 4 window displays,
Step 12 When you get prompted to choose the nodes to restore, choose only the first node (the publisher).
CautionDo not select the second (subscriber) node in this condition as this will result in failure of the restore attempt.
Step 13 To start restoring the data, click Restore.
Note During the restore process, do not perform any tasks with Cisco Unified Contact Center Express
Administration or User Options.
Restoring the first node may take up to several hours based on the size of database that is being restored.
Depending on the size of your database that you choose to restore, the system can require one hour or
more to restore.
Note Based on the requirements, you have the option to either retrieve the existing publisher node data
from the DRS backup to be available on all the nodes in the cluster or retrieve the more recent
data (if available) from the subscriber node to be available in the cluster.
Step 14 Run the following CLI command from the Subscriber node after the restore process is successful (restore
status indicates 100 per cent) to inititate restoring the Publisher node only (with rebuild).
utils uccx setuppubrestore
Step 15 Run the following CLI command on the target node; that is if you want to retrieve the publisher node’s
data, then run this command on the subscriber node, but if you want to retrieve the subscriber node’s data
(which is more up-to-date), then run this command on the publisher node.
utils uccx database forcedatasync
Warning In any case, you must execute this command on either of the nodes after restoring the publisher node.
Step 16 Restart both the nodes and run the following CLI command on the Publisher node to set up replication.
utils uccx dbreplication reset
For more information on restarting, see the Cisco Unified Communications Operating System
Administration Guide available here:
http://www.cisco.com/en/US/products/sw/custcosw/ps1846/prod_maintenance_guides_list.html.
CautionIf you have done some configuration or hardware changes while performing fresh installation in Step 1 that might impact the License MAC, then rehost your license again using the license rehosting mechanism before running the CLI command “utils uccx dbreplication reset”. For more information on the licensing rehosting mechanism, see the Installing Cisco Unified Contact Center Express available here:
http://www.cisco.com/en/US/products/sw/custcosw/ps1846/prod_installation_guides_list.html
Step 17 Your data gets restored on the publisher node. To view the status of the restore, see the “Viewing the
Restore Status” section on page 19.
Hope this helps.
Anand
Please rate helpful posts !!

Similar Messages

  • Server restore causing some db's to crash on 1st node

    Hi all!,
    I was wondering if anybody has an idea for why the following happend. I checked the database logs and there's nothing before the crash indicating a problem. Here's what happened on our test RAC 2node cluster servers/db's.
    Last week our 2nd node went down due to a patch of the unix os. It was a patch for the next version ahead of ours but our systems didn't catch it until it was too late. The patch was to allow them to not have to bounce each node when adding more disk to the SAN. It crashed the 2nd node as that was where they were starting to apply the patch. This 2nd node has been down all week as they couldn't get it up even with HP support on the line. They decided to restore the image from our 1st node onto the 2nd node and bring it online. When they did, it caused 3 out of 8 databases to suddenly crash on the 1st node. We got them up but need to find out why. Has anyone ever experienced such a thing and/or have any advice as what to look at? I don't know that much about our cluster, could there be parms set to individual db's that could be the culprit?
    Any advice would be greatly appreciated, thanks in advance for your replies!,
    Dave

    No, that's what's troubleing. The last entry before crash shows the normal archive log switching that occurs and then the next entry is the database being started.

  • Services not starting after a node crash

    hi
    We have a 3 node cluster and one of the nodes crashed today, also the services did not get relocated to the other node and when we try to manullay stop/start/relocate the service we get the following error
    srvctl stop service -d BCB -s BCB_J2EE -f
    PRCD-1085 : Failed to stop service BCB_J2EE
    PRCR-1065 : Failed to stop resource ora.BCB.BCB_j2ee.svc
    CRS-2533: Server 'bcb528' is down. Unable to perform the operation on 'ora.BCB.BCB_j2ee.svc'
    Would anyone has seen this before
    Thx
    JJ

    this is what i can find in log
    [   CRSPE][60] Server [bcb528] is unreachable. Stopping the sequencer for: bcbCRON 1 1
    2011-02-28 08:15:21.778: [   CRSPE][60] Sequencer for [bcbCRON 1 1] has completed with error: CRS-2533: Server 'bcb528' is down. Unable to pe
    rform the operation on 'bcbCRON'
    2011-02-28 08:15:21.778: [   CRSPE][60] Required instruction failed in op: START of [bcbCRON 1 1] on [bcb529] : 105247290
    2011-02-28 08:15:21.781: [UiServer][62] Container [ Name: ORDER
    MESSAGE:
    TextMessage[CRS-2533: Server 'bcb528' is down. Unable to perform the operation on 'bcbCRON']
    MSGTYPE:
    TextMessage[1]
    OBJID:
    TextMessage[bcbCRON 1 1]
    WAIT:
    TextMessage[0]

  • RAC 11gR2 cluster installation: root.sh failed on the 1st node

    Hi,
    Does anybody know why is possible when I run the root.sh on the 1st node, during the Oracle 11gR2 RAC installation (cluster installation) to get the following error?
    The following environment variables are set as:
    ORACLE_OWNER= oracle
    ORACLE_HOME= /oracle/grid
    Enter the full pathname of the local bin directory: [usr/local/bin]:
    The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
    Copying dbhome to /usr/local/bin ...
    The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
    Copying oraenv to /usr/local/bin ...
    The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
    Copying coraenv to /usr/local/bin ...
    Entries will be added to the /etc/oratab file as needed by
    Database Configuration Assistant when a database is created
    Finished running generic part of root.sh script.
    Now product-specific root actions will be performed.
    2010-06-29 14:17:43: Parsing the host name
    2010-06-29 14:17:43: Checking for super user privileges
    2010-06-29 14:17:43: User has super user privileges
    Using configuration parameter file: /oracle/grid/crs/install/crsconfig_params
    Creating trace directory
    User oracle has the required capabilities to run CSSD in realtime mode
    LOCAL ADD MODE
    Creating OCR keys for user 'root', privgrp 'system'..
    Operation successful.
    root wallet
    root wallet cert
    root cert export
    peer wallet
    profile reader wallet
    pa wallet
    peer wallet keys
    pa wallet keys
    peer cert request
    pa cert request
    peer cert
    pa cert
    peer root cert TP
    profile reader root cert TP
    pa root cert TP
    peer pa cert TP
    pa peer cert TP
    profile reader pa cert TP
    profile reader peer cert TP
    peer user cert
    pa user cert
    Adding daemon to inittab
    CRS-4123: Oracle High Availability Services has been started.
    ohasd is starting
    CRS-2672: Attempting to start 'ora.gipcd' on 'trz1test_rac'
    CRS-2672: Attempting to start 'ora.mdnsd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.gipcd' on 'trz1test_rac' succeeded
    CRS-2676: Start of 'ora.mdnsd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.gpnpd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.gpnpd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.cssdmonitor' on 'trz1test_rac'
    CRS-2676: Start of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.cssd' on 'trz1test_rac'
    CRS-2672: Attempting to start 'ora.diskmon' on 'trz1test_rac'
    CRS-2676: Start of 'ora.diskmon' on 'trz1test_rac' succeeded
    CRS-2676: Start of 'ora.cssd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.ctssd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.ctssd' on 'trz1test_rac' succeeded
    clscfg: -install mode specified
    Successfully accumulated necessary OCR keys.
    Creating OCR keys for user 'root', privgrp 'system'..
    Operation successful.
    CRS-2672: Attempting to start 'ora.crsd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.crsd' on 'trz1test_rac' succeeded
    Now formatting voting disk: /data_gpfs/oracle/crs/vdsk.
    CRS-4603: Successful addition of voting disk /data_gpfs/oracle/crs/vdsk.
    ## STATE File Universal Id File Name Disk group
    1. ONLINE 653624f2aa1f4f83bf774e8052889a32 (/data_gpfs/oracle/crs/vdsk) []
    Located 1 voting disk(s).
    CRS-2673: Attempting to stop 'ora.crsd' on 'trz1test_rac'
    CRS-2677: Stop of 'ora.crsd' on 'trz1test_rac' succeeded
    CRS-2673: Attempting to stop 'ora.ctssd' on 'trz1test_rac'
    CRS-2677: Stop of 'ora.ctssd' on 'trz1test_rac' succeeded
    CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'trz1test_rac'
    CRS-2677: Stop of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
    CRS-2673: Attempting to stop 'ora.cssd' on 'trz1test_rac'
    CRS-2677: Stop of 'ora.cssd' on 'trz1test_rac' succeeded
    CRS-2673: Attempting to stop 'ora.gpnpd' on 'trz1test_rac'
    CRS-2677: Stop of 'ora.gpnpd' on 'trz1test_rac' succeeded
    CRS-2673: Attempting to stop 'ora.gipcd' on 'trz1test_rac'
    CRS-2677: Stop of 'ora.gipcd' on 'trz1test_rac' succeeded
    CRS-2673: Attempting to stop 'ora.mdnsd' on 'trz1test_rac'
    CRS-2677: Stop of 'ora.mdnsd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.mdnsd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.mdnsd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.gipcd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.gipcd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.gpnpd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.gpnpd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.cssdmonitor' on 'trz1test_rac'
    CRS-2676: Start of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.cssd' on 'trz1test_rac'
    CRS-2672: Attempting to start 'ora.diskmon' on 'trz1test_rac'
    CRS-2676: Start of 'ora.diskmon' on 'trz1test_rac' succeeded
    CRS-2676: Start of 'ora.cssd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.ctssd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.ctssd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.crsd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.crsd' on 'trz1test_rac' succeeded
    CRS-2672: Attempting to start 'ora.evmd' on 'trz1test_rac'
    CRS-2676: Start of 'ora.evmd' on 'trz1test_rac' succeeded
    */oracle/grid/bin/srvctl start nodeapps -n trz1test_rac ... failed*
    Configure Oracle Grid Infrastructure for a Cluster ... failed
    This is because ora.eONS daemon is not starting. There is a Metalink note that we MIGHT start this daemon manually ... but this is not working.
    *./srvctl status nodeapps -n trz1test_rac*
    -n <node_name> option has been deprecated.
    VIP trz1test_rac_vip is enabled
    VIP trz1test_rac_vip is running on node: trz1test_rac
    Network is enabled
    Network is running on node: trz1test_rac
    GSD is disabled
    GSD is not running on node: trz1test_rac
    ONS is enabled
    ONS daemon is running on node: trz1test_rac
    eONS is enabled
    eONS daemon is not running on node: trz1test_rac

    I run my clusterware/DB on AIX 5.3
    When I run runcluvfy.sh here are the things which are not passing:
    Check: Node connectivity of subnet "192.168.1.0"
    Source Destination Connected?
    trz2test_rac:en5 trz2test_rac:en5 yes
    trz2test_rac:en5 trz1test_rac:en5 yes
    trz2test_rac:en5 trz1test_rac:en5 yes
    trz2test_rac:en5 trz1test_rac:en5 yes
    trz2test_rac:en5 trz1test_rac:en5 yes
    trz1test_rac:en5 trz1test_rac:en5 yes
    Result: Node connectivity passed for subnet "192.168.1.0" with node(s) trz2test_rac,trz1test_rac
    Check: TCP connectivity of subnet "192.168.1.0"
    Source Destination Connected?
    trz1test_rac:192.168.1.140 trz2test_rac:192.168.1.142 failed
    trz1test_rac:192.168.1.140 trz2test_rac:192.168.1.142 failed
    Result: TCP connectivity check failed for subnet "192.168.1.0"
    NTP daemon slewing option check failed on some nodes
    PRVF-5436 : The NTP daemon running on one or more nodes lacks the slewing option "-x"
    Result: Clock synchronization check using Network Time Protocol(NTP) failed
    NTP mustn't be a problem I guess as the date are identical on the 2 nodes.
    I have no idea how to fix the TCP connectivity issue with the subnet "192.168.1.0". Some posts wrote that could be a firewall issue. Are there any other causes ?
    Thanks to all,
    Paul

  • RAC -process failover at node crash

    Hi,
    how to prevent running process(transaction) from termination in RAC while a node crashes ..
    Ex: if there is a process running on node 1 and if it suddenly crashes in RAC how does we make node2 or node3 to pick it up and process or start the transaction again??
    Thanks,

    Hello,
    Look at your tnsnames.ora entry and see if it configured to benefit from FAILOVER, you can aslo explore other available options options
    http://stanford.edu/dept/itss/docs/oracle/10g/network.101/b10776/tnsnames.htm
    myservice=
    (DESCRIPTION=
       (SOURCE_ROUTE=yes)
       (ADDRESS=(PROTOCOL=tcp)(HOST=host1)(PORT=1630))    # <-- hop 1
       (ADDRESS_LIST= 
         (FAILOVER=on)
         (LOAD_BALANCE=off)                                #  <--- hop 2
         (ADDRESS=(PROTOCOL=tcp)(HOST=host2a)(PORT=1532))
         (ADDRESS=(PROTOCOL=tcp)(HOST=host2b)(PORT=1521)))
       (ADDRESS=(PROTOCOL=tcp)(HOST=host3)(PORT=1521))    #  <--  hop 3
       (CONNECT_DATA=(SERVICE_NAME=myservice)))Another Example
    MYSERVICE =
      (DESCRIPTION = 
      (ADDRESS_LIST= 
         (FAILOVER = on)
         (LOAD_BALANCE = on) 
         (ADDRESS= (PROTOCOL = TCP)(HOST = server1)(PORT = 1521))
         (ADDRESS= (PROTOCOL = TCP)(HOST = server2)(PORT = 1521))
         (ADDRESS= (PROTOCOL = TCP)(HOST = server3)(PORT = 1521))
      (CONNECT_DATA=
         (SERVICE_NAME =MYSERVICE)
         (FAILOVER_MODE = 
             (BACKUP=server2)
             (TYPE=select)
             (METHOD=preconnect)
             (RETRIES=20)
             (DELAY=3)
    ) Regards
    OrionNet
    Edited by: OrionNet on Dec 18, 2008 3:12 PM

  • Can I have RAC 1st node in RHEL 5 and 2nd node in RHEL 4?

    Can I have my RAC 1st node in RHEL5 and 2nd node in RHEL 4?
    I am just checking if there is any possibility like that.
    Thanks,
    Mahi

    Even if it works by accident, it wouldn't be supported.

  • RAC: When 1st node started, the 2nd node failed to start

    I got a problem in Oracle 10gR2 RAC on Windows 2003R2 Domain member environment. I have a 2 nodes RAC using ASM in 2 MS Windows 2003 Standard Server, it is a clean environment, only have Oracle and Norton Antivirus software installed.
    When the 1st node started successfully from booting up the machine, the 2nd node is failed to startup. It stays in the Windows startup screen (Applying Computer Setting ...) for more then 1 hour. Eventually, the window login screen come out, but I cannot login to the system after input username and password. This situation is reversable (the 1st node failed to start if I startup the 2nd node first).
    In case I set the Oracle Services (OracleCRService and OracleEVMService) into Manual startup at 2nd Nodes, the 2nd node can startup smoothly. After login to the 2nd node, I can start these 2 oracle services without problem.
    P.S. This problem is just happened after applied I applied all MS Security Update on 10 Apr, 2008.
    Any suggestion how to shoot this problem? Thanks.
    Message was edited by:
    ckhlam

    A couple of things you could try :
    a) Disable the Norton AntiVirus Software and check whether rebooting the
    Server allows the CRS stack to come up. Recall reading about an issue
    where-in NAV waits for the Network Stack to come up and blocks
    CRS's startup sequence. This is just a guess at this time but worth a try.
    b) You might also want to check if configuring Oracle Process Manager as detailed
    in Note:358156.1 allows the CRS stack to be delayed long enough to fully
    initialize the OS stack beneath it.
    c) If none of the above helps , you might want to uninstall the MS Security Update
    to check if this was a problem introduced by this Patch. You might then have
    to work with MS / Oracle to dig further into this.
    Do update this thread with your observations on this ..
    Vishwa

  • Central CCMS Alert for Java instance server node crash

    Hi,
    Is it possible to trigger an alert of an Java system server node crash using central CCMS alerts.
    J2EE instance CCMS alert does show some MTEs, however they do not trigger alerts. I got this info from below link
    http://help.sap.com/saphelp_nwce711/helpdata/en/46/11aaf352da14dce10000000a155369/frameset.htm
    Any idea how to trigger the alert for server node crash.
    I understand that server node crash should be investigated for permanent fix, however we need this as a proactive measure to know the crash if they happen.
    Thanks
    Imtiaz

    Hi Imtiaz,
    Do you see any error on the view>status auto-reaction? Have you been able to assign an auto-reaction to this MTE?
    Cheers,
    Maurício

  • Matlab node crashes LV 8.5

    Today, I observed a bizzare phenmenon in LV 8.5, when playing with Matlab Script node.
    Just open any example from the Example Finder. Rt click on the Script node & Choose Script Server -->> Xmath Script.
    Now, do a Ctrl + Z [Undo] & LV ll get crashed.
    See attached pic.
    - Partha
    LabVIEW - Wires that catch bugs!
    Attachments:
    Ctrl+Z on Matlab node crashes LV 8.5.PNG ‏101 KB

    Hi Parthabe,
    Thank you for the feedback. This is definetely a bug in Labview and I have filed a CAR for this issue. For your reference the CAR# is 96755
    Eli S.
    National Instruments
    Applications Engineer

  • Node crashes when enabling RDS for private interconnect.

    OS: oel6.3 - 2.6.39-300.17.2.el6uek.x86_64
    Grid and DB: 11.2.0.3.4
    This is a two node Standard Edition cluster.
    The node crashes upon restart of clusterware after following the instructions from note:751343.1 (RAC Support for RDS Over Infiniband) to enable RDS.
    The cluster is running fine using ipoib for the cluster_interconnect.
    1) As the ORACLE_HOME/GI_HOME owner, stop all resources (database, listener, ASM etc) that's running from the home. When stopping database, use NORMAL or IMMEDIATE option.
    2) As root, if relinking 11gR2 Grid Infrastructure (GI) home, unlock GI home: GI_HOME/crs/install/rootcrs.pl -unlock
    3) As the ORACLE_HOME/GI_HOME owner, go to ORACLE_HOME/GI_HOME and cd to rdbms/lib
    4) As the ORACLE_HOME/GI_HOME owner, issue "make -f ins_rdbms.mk ipc_rds ioracle"
    5) As root, if relinking 11gR2 Grid Infrastructure (GI) home, lock GI home: GI_HOME/crs/install/rootcrs.pl -patch
    Looks to abend when asm tries to start with the message below on the console.
    I have a service request open for this issue but, I am hoping someone may have seen this and has
    some way around it.
    Thanks
    Alan
    kernel BUG at net/rds/ib_send.c:547!
    invalid opcode: 0000 [#1] SMP
    CPU 2
    Modules linked in: 8021q garp stp llc iptable_filter ip_tables nfs lockd
    fscache auth_rpcgss nfs_acl sunrpc cpufreq_ondemand powernow_k8
    freq_table mperf rds_rdma rds_tcp rds ib_ipoib rdma_ucm ib_ucm ib_uverbs
    ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa sr_mod cdrom microcode
    serio_raw pcspkr ghes hed k10temp hwmon amd64_edac_mod edac_core
    edac_mce_amd i2c_piix4 i2c_core sg igb dca mlx4_ib ib_mad ib_core
    mlx4_en mlx4_core ext4 mbcache jbd2 usb_storage sd_mod crc_t10dif ahci
    libahci dm_mirror dm_region_hash dm_log dm_mod [last unloaded:
    scsi_wait_scan]
    Pid: 4140, comm: kworker/u:1 Not tainted 2.6.39-300.17.2.el6uek.x86_64
    #1 Supermicro BHDGT/BHDGT
    RIP: 0010:[<ffffffffa02db829>] [<ffffffffa02db829>]
    rds_ib_xmit+0xa69/0xaf0 [rds_rdma]
    RSP: 0018:ffff880fb84a3c50 EFLAGS: 00010202
    RAX: ffff880fbb694000 RBX: ffff880fb3e4e600 RCX: 0000000000000000
    RDX: 0000000000000030 RSI: ffff880fbb6c3a00 RDI: ffff880fb058a048
    RBP: ffff880fb84a3d30 R08: 0000000000000fd0 R09: ffff880fbb6c3b90
    R10: 0000000000000000 R11: 000000000000001a R12: ffff880fbb6c3a00
    R13: ffff880fbb6c3a00 R14: 0000000000000000 R15: ffff880fb84a3d90
    FS: 00007fd0a3a56700(0000) GS:ffff88101e240000(0000) knlGS:0000000000000000
    CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
    CR2: 0000000002158ca2 CR3: 0000000001783000 CR4: 00000000000406e0
    DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
    Process kworker/u:1 (pid: 4140, threadinfo ffff880fb84a2000, task
    ffff880fae970180)
    Stack:
    0000000000012200 0000000000012200 ffff880f00000000 0000000000000000
    000000000000e5b0 ffffffff8115af81 ffffffff81b8d6c0 ffffffffa02b2e12
    00000001bf272240 ffffffff81267020 ffff880fbb6c3a00 0000003000000002
    Call Trace:
    [<ffffffff8115af81>] ? __kmalloc+0x1f1/0x200
    [<ffffffffa02b2e12>] ? rds_message_alloc+0x22/0x90 [rds]
    [<ffffffff81267020>] ? sg_init_table+0x30/0x50
    [<ffffffffa02b2db2>] ? rds_message_alloc_sgs+0x62/0xa0 [rds]
    [<ffffffffa02b31e4>] ? rds_message_map_pages+0xa4/0x110 [rds]
    [<ffffffffa02b4f3b>] rds_send_xmit+0x38b/0x6e0 [rds]
    [<ffffffff81089d53>] ? cwq_activate_first_delayed+0x53/0x100
    [<ffffffffa02b6040>] ? rds_recv_worker+0xc0/0xc0 [rds]
    [<ffffffffa02b6075>] rds_send_worker+0x35/0xc0 [rds]
    [<ffffffff81089fd6>] process_one_work+0x136/0x450
    [<ffffffff8108bbe0>] worker_thread+0x170/0x3c0
    [<ffffffff8108ba70>] ? manage_workers+0x120/0x120
    [<ffffffff810907e6>] kthread+0x96/0xa0
    [<ffffffff81515544>] kernel_thread_helper+0x4/0x10
    [<ffffffff81090750>] ? kthread_worker_fn+0x1a0/0x1a0
    [<ffffffff81515540>] ? gs_change+0x13/0x13
    Code: ff ff e9 b1 fe ff ff 48 8b 0d b4 54 4b e1 48 89 8d 70 ff ff ff e9
    71 ff ff ff 83 bd 7c ff ff ff 00 0f 84 f4 f5 ff ff 0f 0b eb fe <0f> 0b
    eb fe 44 8b 8d 48 ff ff ff 41 b7 01 e9 51 f6 ff ff 0f 0b
    RIP [<ffffffffa02db829>] rds_ib_xmit+0xa69/0xaf0 [rds_rdma]
    RSP <ffff880fb84a3c50>
    Initializing cgroup subsys cpuset
    Initializing cgroup subsys cpu
    Linux version 2.6.39-300.17.2.el6uek.x86_64
    ([email protected]) (gcc version 4.4.6 20110731 (Red
    Hat 4.4.6-3) (GCC) ) #1 SMP Wed Nov 7 17:48:36 PST 2012
    Command line: ro root=UUID=5ad1a268-b813-40da-bb76-d04895215677
    rd_DM_UUID=ddf1_stor rd_NO_LUKS rd_NO_LVM LANG=en_US.UTF-8 rd_NO_MD
    SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us numa=off
    console=ttyS1,115200n8 irqpoll maxcpus=1 nr_cpus=1 reset_devices
    cgroup_disable=memory mce=off memmap=exactmap memmap=538K@64K
    memmap=130508K@770048K elfcorehdr=900556K memmap=72K#3668608K
    memmap=184K#3668680K
    BIOS-provided physical RAM map:
    BIOS-e820: 0000000000000100 - 0000000000096800 (usable)
    BIOS-e820: 0000000000096800 - 00000000000a0000 (reserved)
    BIOS-e820: 00000000000e6000 - 0000000000100000 (reserved)
    BIOS-e820: 0000000000100000 - 00000000dfe90000 (usable)
    BIOS-e820: 00000000dfe9e000 - 00000000dfea0000 (reserved)
    BIOS-e820: 00000000dfea0000 - 00000000dfeb2000 (ACPI data)
    BIOS-e820: 00000000dfeb2000 - 00000000dfee0000 (ACPI NVS)
    BIOS-e820: 00000000dfee0000 - 00000000f0000000 (reserved)
    BIOS-e820: 00000000ffe00000 - 0000000100000000 (reserved)

    I believe OFED version is 1.5.3.3 but I am not sure if this is correct.
    We have not added any third parry drivers. All that has been done to add infiniband to our build is
    a yum groupinstall iInfiniband support.
    I have not tries rds-stress but rds-ping works fine and rds-info seems fine.
    A service request has been opened but so far I have had better response here.
    oracle@blade1-6:~> rds-info
    RDS IB Connections:
    LocalAddr RemoteAddr LocalDev RemoteDev
    10.10.0.116 10.10.0.119 fe80::25:90ff:ff07:df1d fe80::25:90ff:ff07:e0e5
    TCP Connections:
    LocalAddr LPort RemoteAddr RPort HdrRemain DataRemain SentNxt ExpectUna SeenUna
    Counters:
    CounterName Value
    conn_reset 5
    recv_drop_bad_checksum 0
    recv_drop_old_seq 0
    recv_drop_no_sock 1
    recv_drop_dead_sock 0
    recv_deliver_raced 0
    recv_delivered 18
    recv_queued 18
    recv_immediate_retry 0
    recv_delayed_retry 0
    recv_ack_required 4
    recv_rdma_bytes 0
    recv_ping 14
    send_queue_empty 18
    send_queue_full 0
    send_lock_contention 0
    send_lock_queue_raced 0
    send_immediate_retry 0
    send_delayed_retry 0
    send_drop_acked 0
    send_ack_required 3
    send_queued 32
    send_rdma 0
    send_rdma_bytes 0
    send_pong 14
    page_remainder_hit 0
    page_remainder_miss 0
    copy_to_user 0
    copy_from_user 0
    cong_update_queued 0
    cong_update_received 1
    cong_send_error 0
    cong_send_blocked 0
    ib_connect_raced 4
    ib_listen_closed_stale 0
    ib_tx_cq_call 6
    ib_tx_cq_event 6
    ib_tx_ring_full 0
    ib_tx_throttle 0
    ib_tx_sg_mapping_failure 0
    ib_tx_stalled 16
    ib_tx_credit_updates 0
    ib_rx_cq_call 33
    ib_rx_cq_event 38
    ib_rx_ring_empty 0
    ib_rx_refill_from_cq 0
    ib_rx_refill_from_thread 0
    ib_rx_alloc_limit 0
    ib_rx_credit_updates 0
    ib_ack_sent 4
    ib_ack_send_failure 0
    ib_ack_send_delayed 0
    ib_ack_send_piggybacked 0
    ib_ack_received 3
    ib_rdma_mr_alloc 0
    ib_rdma_mr_free 0
    ib_rdma_mr_used 0
    ib_rdma_mr_pool_flush 8
    ib_rdma_mr_pool_wait 0
    ib_rdma_mr_pool_depleted 0
    ib_atomic_cswp 0
    ib_atomic_fadd 0
    iw_connect_raced 0
    iw_listen_closed_stale 0
    iw_tx_cq_call 0
    iw_tx_cq_event 0
    iw_tx_ring_full 0
    iw_tx_throttle 0
    iw_tx_sg_mapping_failure 0
    iw_tx_stalled 0
    iw_tx_credit_updates 0
    iw_rx_cq_call 0
    iw_rx_cq_event 0
    iw_rx_ring_empty 0
    iw_rx_refill_from_cq 0
    iw_rx_refill_from_thread 0
    iw_rx_alloc_limit 0
    iw_rx_credit_updates 0
    iw_ack_sent 0
    iw_ack_send_failure 0
    iw_ack_send_delayed 0
    iw_ack_send_piggybacked 0
    iw_ack_received 0
    iw_rdma_mr_alloc 0
    iw_rdma_mr_free 0
    iw_rdma_mr_used 0
    iw_rdma_mr_pool_flush 0
    iw_rdma_mr_pool_wait 0
    iw_rdma_mr_pool_depleted 0
    tcp_data_ready_calls 0
    tcp_write_space_calls 0
    tcp_sndbuf_full 0
    tcp_connect_raced 0
    tcp_listen_closed_stale 0
    RDS Sockets:
    BoundAddr BPort ConnAddr CPort SndBuf RcvBuf Inode
    0.0.0.0 0 0.0.0.0 0 131072 131072 340441
    RDS Connections:
    LocalAddr RemoteAddr NextTX NextRX Flg
    10.10.0.116 10.10.0.119 33 38 --C
    Receive Message Queue:
    LocalAddr LPort RemoteAddr RPort Seq Bytes
    Send Message Queue:
    LocalAddr LPort RemoteAddr RPort Seq Bytes
    Retransmit Message Queue:
    LocalAddr LPort RemoteAddr RPort Seq Bytes
    10.10.0.116 0 10.10.0.119 40549 32 0
    oracle@blade1-6:~> cat /etc/rdma/rdma.conf
    # Load IPoIB
    IPOIB_LOAD=yes
    # Load SRP module
    SRP_LOAD=no
    # Load iSER module
    ISER_LOAD=no
    # Load RDS network protocol
    RDS_LOAD=yes
    # Should we modify the system mtrr registers? We may need to do this if you
    # get messages from the ib_ipath driver saying that it couldn't enable
    # write combining for the PIO buffs on the card.
    # Note: recent kernels should do this for us, but in case they don't, we'll
    # leave this option
    FIXUP_MTRR_REGS=no
    # Should we enable the NFSoRDMA service?
    NFSoRDMA_LOAD=yes
    NFSoRDMA_PORT=2050
    oracle@blade1-6:~> /etc/init.d/rdma status
    Low level hardware support loaded:
         mlx4_ib
    Upper layer protocol modules:
         rds_rdma ib_ipoib
    User space access modules:
         rdma_ucm ib_ucm ib_uverbs ib_umad
    Connection management modules:
         rdma_cm ib_cm iw_cm
    Configured IPoIB interfaces: none
    Currently active IPoIB interfaces: ib0

  • Cluster Node Crashes

    I'm not sure this is the proper forum for this post, if it's not please feel free to move it.
    The situation I'm facing is this:
    My company has clusters setup across North America with our software that utilizes the Oracle database. 90% of the time everything functions exactly as it is supposed to. However, it is the other 10% of sites that I am here to ask about.
    Our clusters are setup in a dual-server environment that basically act as a single server. The application runs on one server and the database runs on another, and in the case of problems, either can be failed over to run both sets of services on a single server (basic, I realize). At certain sites we are unable to run services on one of the nodes. When they are run as they are supposed to, every so often (at some sites a matter of minutes/hours, at others it can be a couple weeks) they will BSOD.
    I fully understand what the blue screen is. The minidump shows that it's the orafencedrv.sys stop, where the Oracle database shuts down a node after loss of communications in order to prevent corruption of the database. This is a great feature and I'm grateful for it, however it has caused us many headaches in diagnosing what it actually causing the drop in communications.
    The interconnect and the public IP are both hooked up over a single switch but they operate on different subnets. Could operating on a single switch be part of the problem?
    Could the problem be that the switches are being overloaded with traffic causing temporary packet losses between the two nodes, which I know is enough to have Oracle BSOD a node?
    Below I'm posting one of the dumps listed in the CSSD log when the node crashes, hopefully this will provide some sort of information as to what is happening.
    If any other information is needed, please feel free to let me know. Thanks for your help in advance.
    [    CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: clssnmvDiskKillCheck: Aborting, evicted by node 1, sync 13, stamp 99832890,
    [    CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: ###################################
    [    CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: clssscExit: CSSD aborting
    [    CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: ###################################
    [    CSSD]--- DUMP GROCK STATE DB ---
    [    CSSD]----------
    [    CSSD] type 2, Id 3, Name = (crs_version)
    [    CSSD] flags: 0x0
    [    CSSD] grant: count=0, type 0, wait 0
    [    CSSD] Member Count =2, master 0
    [    CSSD] . . . . .
    [    CSSD] memberNo =0, seq 5
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 2, nodeBirth 6
    [    CSSD] privateDataSize = 0
    [    CSSD] publicDataSize = 0
    [    CSSD] . . . . .
    [    CSSD] memberNo =1, seq 11
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 1, nodeBirth 12
    [    CSSD] privateDataSize = 0
    [    CSSD] publicDataSize = 0
    [    CSSD]----------
    [    CSSD]----------
    [    CSSD] type 2, Id 2, Name = (ocr_STLRZOPRCL)
    [    CSSD] flags: 0x0
    [    CSSD] grant: count=0, type 0, wait 0
    [    CSSD] Member Count =2, master 2
    [    CSSD] . . . . .
    [    CSSD] memberNo =2, seq 5
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 2, nodeBirth 6
    [    CSSD] privateDataSize = 0
    [    CSSD] publicDataSize = 32
    [    CSSD] . . . . .
    [    CSSD] memberNo =1, seq 11
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 1, nodeBirth 12
    [    CSSD] privateDataSize = 0
    [    CSSD] publicDataSize = 32
    [    CSSD]----------
    [    CSSD]----------
    [    CSSD] type 3, Id 15, Name = (_ORA_CRS_MEMBER_stlrzoprcl1)
    [    CSSD] flags: 0x0
    [    CSSD] grant: count=1, type 3, wait 1
    [    CSSD] Member Count =1, master -3
    [    CSSD] . . . . .
    [    CSSD] memberNo =0, seq 0
    [    CSSD] flags = 0x12, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 1, nodeBirth 12
    [    CSSD] privateDataSize = 0
    [    CSSD] publicDataSize = 0
    [    CSSD]----------
    [    CSSD]----------
    [    CSSD] type 3, Id 15, Name = (_ORA_CRS_MEMBER_stlrzoprcl2)
    [    CSSD] flags: 0x0
    [    CSSD] grant: count=1, type 3, wait 1
    [    CSSD] Member Count =1, master -3
    [    CSSD] . . . . .
    [    CSSD] memberNo =0, seq 0
    [    CSSD] flags = 0x12, granted 1
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 2, nodeBirth 6
    [    CSSD] privateDataSize = 0
    [    CSSD] publicDataSize = 0
    [    CSSD]----------
    [    CSSD]----------
    [    CSSD] type 2, Id 4, Name = (CRSDMAIN)
    [    CSSD] flags: 0x0
    [    CSSD] grant: count=0, type 0, wait 0
    [    CSSD] Member Count =2, master 2
    [    CSSD] . . . . .
    [    CSSD] memberNo =2, seq 5
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 2, nodeBirth 6
    [    CSSD] privateDataSize = 128
    [    CSSD] publicDataSize = 128
    [    CSSD] . . . . .
    [    CSSD] memberNo =1, seq 11
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 1, nodeBirth 12
    [    CSSD] privateDataSize = 128
    [    CSSD] publicDataSize = 128
    [    CSSD]----------
    [    CSSD]----------
    [    CSSD] type 2, Id 1, Name = (EVMDMAIN)
    [    CSSD] flags: 0x0
    [    CSSD] grant: count=0, type 0, wait 0
    [    CSSD] Member Count =2, master 2
    [    CSSD] . . . . .
    [    CSSD] memberNo =2, seq 5
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 2, nodeBirth 6
    [    CSSD] privateDataSize = 508
    [    CSSD] publicDataSize = 504
    [    CSSD] . . . . .
    [    CSSD] memberNo =1, seq 11
    [    CSSD] flags = 0x0, granted 0
    [    CSSD] refCnt = 1
    [    CSSD] nodeNum = 1, nodeBirth 12
    [    CSSD] privateDataSize = 508
    [    CSSD] publicDataSize = 504
    [    CSSD]----------
    [    CSSD]--- END OF GROCK STATE DUMP ---
    [    CSSD]------- End Dump -------

    Hi user10508733
    Seems to be your first post, welcome to this forum!!
    What is the OS (blue screen that should be windows? ) and what is the release of your CRS and RDBMS ? hopefully not 10.1x.x.x, if yes please patch it to 10.2.0.4.
    Seems to have a lot of bugs about CRS before 10.2.0.3 see that list
    Doc ID:      Note:391116.1
    Subject:      10.2.0.3 Patch Set - List of Bug Fixes by Problem Type
    let us know what's the result
    thanks

  • Ipod Touch, 1st generation crashed during upgrade to v3.1.3 software

    I have just paid to upgrade the software on my 1st generation Ipod touch. The software seems to have downloaded to my itunes, but some sort of error message occured during the download, and my ipod has crashed. The screen is locked on an image of an apple icon with a download progress bar half way filled. I can't turn the ipod on or off. When I plug it into the computer, it recognizes that a device is there, and it allows me to sync my ipod, but the screen remains unchanged. Does anyone have any idea on what I can do to fix this?
    Thanks!

    I am continuing to have problems with getting my ipod touch restored. I am following the directions on the faq's but it seems that my touch continues to freeze up during the restoring nprocess. Itunes continues to sync, but the ipod screen remains unchanged.

  • J2EE server node crashes / .hotspot_compiler

    Hi,
    I'm trying to install a NW AS Java + usage type DI 7.0 SR3 on W2K3 R2 x64 SP2 with MS SQL Server 2005 and Java HotSpot(TM) 64-Bit Server VM (build 1.4.2_18-b06, mixed mode). During sapinst the server node was shut down by the program but doesn't come up anymore. It crashes constantly and is directly beeing restarted.
    A look in std_server0.out gave me the following insight (example, added lf):
    Login :33.902: [ParNew 218352K->59053K(966656K), 0.0923355 secs]
    39.132: [ParNew 222893K->62535K(966656K), 0.1131081 secs]
    44.103: [ParNew 226375K->65891K(966656K), 0.0794097 secs]
    ### Excluding compile:  com.sap.engine.services.
    webservices.jaxrpc.encoding.TypeMappingImpl::initializeRelations
    48.296: [ParNew 229731K->66972K(966656K), 0.0845196 secs]
    ### Excluding compile:  com.sap.engine.services.
    webservices.jaxrpc.encoding.InstanceBuilder::readElement
    52.107: [ParNew 230812K->71757K(966656K), 0.0862884 secs]
    ### Excluding compile:  com.sap.engine.services.
    webservices.jaxrpc.encoding.GeneratedComplexType::_loadInto
    56.691: [ParNew 235597K->75601K(966656K), 0.0875517 secs]
    An unrecoverable stack overflow has occurred.
    # An unexpected error has been detected by HotSpot Virtual Machine:
    #  EXCEPTION_STACK_OVERFLOW (0xc00000fd) at pc=0x00000000080e3dd6, pid=4460, tid=5532
    # Java VM: Java HotSpot(TM) 64-Bit Server VM (1.4.2_18-b06 mixed mode)
    # Problematic frame:
    # V  [jvm.dll+0xe3dd6]
    # An error report file with more information is saved as hs_err_pid4460.log
    # If you would like to submit a bug report, please visit:
    #   http://java.sun.com/webapps/bugreport/crash.jsp
    stdout/stderr redirect
    node name   : server0
    pid         : 3924
    system name : DID
    system nr.  : 00
    started at  : Wed Sep 24 12:27:17 2008
    As you can see the VM crashes. A look in the corresponding log-file (hs_err_pid4460.log) gave me the following insight (added lf):
    Current CompileTask:
    opto:1295  !   com.sap.engine.core.cluster.impl6.ms.MSRawConnection.sendMessage(
    Lcom/sap/engine/core/cluster/impl6/ms/MSMessageObjectImpl;
    Lcom/sap/engine/core/cluster/impl6/ms/MSRegistrable;
    [BIIZ)Lcom/sap/engine/frame/cluster/message/MessageAnswer; (1477 bytes)
    It seems that the CompileTask for the Hotspot VM always crashes when trying to compile the MSRawConnection class natively (reproducable). Sun describes a workaround for this type of problem, http://java.sun.com/javase/6/webnotes/trouble/TSG-VM/html/gbyzo.html#gbyzd . The workaround is to place a .hotspot_compiler file in the working directory of the application with an exclusion of the method. This will advice the VM whenever it decides to natively compile a certain bit of code first to check this file for exclusions. If the method identified by the VM to be compiled is exluded in this file, the compilation will be skipped. Therefore it would have been worth a try to exclude the above mentioned method sendMessage of class MSRawConnection for hotspot compilation. But when I do a quick search for .hotspot_compiler in my sap folder I find four of them, it seems SAP is already making heavy use of this "workaround" instead of reporting a bug. These files are located under:
    D:\usr\sap\DID\JC00\j2ee\cluster
    D:\usr\sap\DID\JC00\SDM\program
    D:\usr\sap\DID\JC00\j2ee\cluster\dispatcher
    D:\usr\sap\DID\JC00\j2ee\cluster\server0
    They all contain the same exclusions, listed here (added lf):
    ## This file contains a list of methods which are going to be excluded from JIT compilation on server start
    ## The format of the file is as follows
    ## exclude package/subpackage1/subpackage2/../subpackageN/<Class_name> <method_to_exclude>
    ## Each line of the file describes only one method
    ## <method_to_exclude> is method name that will not be compiled with JIT
    ## package/subpackage1/subpackage2/../subpackageN/<Class_name>
    is the name of the class with the packages containing <method_to_exclude>
    ## Example:
    ## exclude com/sap/engine/boot/Start main
    ## will not compile with JIT the main method of com.sap.engine.boot.Start class
    ## To enter a list of methods to exclude from JIT compilation write them after this line
    exclude com/sapportals/portal/pb/layout/taglib/ContainerTag addIviewResources
    exclude com/sap/engine/services/keystore/impl/security/CodeBasedSecurityConnector getApplicationDomain
    exclude com/sap/engine/services/rmi_p4/P4StubSkeletonGenerator generateStub
    exclude com/sapportals/portal/prt/util/StringUtils escapeToJS
    exclude com/sapportals/portal/prt/core/broker/PortalServiceItem startServices
    exclude com/sap/engine/services/webservices/server/deploy/WSConfigurationHandler downloadFile
    exclude com/sapportals/portal/prt/jndisupport/util/AbstractHierarchicalContext lookup
    exclude com/sapportals/portal/navigation/cache/CacheNavigationNode getAttributeValue
    exclude com/sapportals/portal/navigation/TopLevelNavigationiView PrintNode
    exclude com/sapportals/wcm/service/ice/wcm/ICEPropertiesCoder encode
    exclude com/sap/lcr/pers/delta/importing/ObjectLoader loadObjects
    exclude com/sap/engine/services/webservices/jaxrpc/encoding/InstanceBuilder readElement
    exclude com/sap/engine/services/webservices/jaxrpc/encoding/InstanceBuilder readSequence
    exclude com/sap/engine/services/webservices/jaxrpc/encoding/TypeMappingImpl initializeRelations
    exclude com/sap/engine/services/webservices/jaxrpc/encoding/GeneratedComplexType _loadInto
    I thought the D:\usr\sap\DID\JC00\j2ee\cluster\server0\.hotspot_compiler file must be the right file, but however to be sure (test) I added the following line to all four files after having shutdown the instance:
    exclude com/sap/engine/core/cluster/impl6/ms/MSRawConnection sendMessage
    When I start the engine again, the content of every file gets overwritten by the original content, therefore lacking my new line. So it seems to me that the content is somehow hardcoded or contained in the db. If it is in the db, is it possible to change the content via config tool? It also seems that this jdk is a beta version since it reports itself with the version string 1.4.2_18-b06. This is the one officially delivered by Sun on the [SAP download page|http://java.sun.com/j2se/1.4.2/SAPsite/download.html], as mentioned in [SAP Note 941595|https://websmp130.sap-ag.de/sap(bD1kZSZjPTAwMQ==)/bc/bsp/spn/sapnotes/index2.htm?numm=941595]. Can you please provide me a solution to add an exclusion to the .hotspot_compiler file or workaround for the above mentioned problem. As a last option I will deinstall the system and reinstall it with another jdk (e.g. J2SE v 1.4.2_17 x64 SDK), but first I want to try to exclude the method/class from compilation. Thanks for your help!
    Best regards,
    Fabian

    Hi,
    You can tell the VM which file to load as compiler exclusion list. Therefore I copied .hotspot_compiler to .ext_hotspot_compiler and added my line
    exclude com/sap/engine/core/cluster/impl6/ms/MSRawConnection sendMessage
    then I went to config tool and added under cluster data -> myinstance -> myservernode under tab General the Java parameter
    -XX:CompileCommandFile=D:/usr/sap/DID/JC00/j2ee/cluster/server0/.ext_hotspot_compiler
    The J2EE node is now starting up without problems.
    Best regards,
    Fabian

  • Feedback node crashing bug

    I have came across a nasty bug that caused Labview 2010 SP1 (Runnnig Win 7 Ultimate x64 bit) to crash without any warning.
    To replicate the bug do the following:
    Add a numeric control and another indicator to the front panel
    Switch to block diagram and add a feed back node
    Connect the initializer terminal of the feed back node to the output of the control
    Now do ANY of the following to cause the bug:
    Press the run buttong (which is broken due to not connecting the input of the feed back node) it will turn to a normal run without displaying the error
    Do an extra action and undo it, the run button will turn from list error normal
    So far the Vi can be saved normally. Now connect the output of the feed back node to the indicator and try any of the followings:
    Save the VI
    Close the VI
    Create a new project and select to add the VI to the project
    This will cause Labview to crash without any notice!
    When you are at step 4, the bug is there but harmless. Once you combine it with step 5 (connect to indicator), the bug is active and cause crashing. I have attached a snapshot of how the Front panel/block diagram look like before saving (since it can't be saved). Notice how the run button is enabled although the input of the feedback node is not connected.
    I have tried to replicate the error on Labview 2009 but couldn't.
    Attachments:
    FBN Bug.jpg ‏56 KB

    Dear ªL¡
    Thank you for briging our attention to this issue.
    I replicated it on LabVIEW 2010 SP1 and confirmed, that in LabVIEW 2011 it has been fixed.
    Thanks again!
    Best regards,
    Mateusz Stokłosa
    Applications Engineer
    National Instruments

  • Dbms_schduler job is not running on a 2 node rac when 1st node fails

    Hi,
    I want to create a dbms_scheduler job in a 2 node RAC and the job should always run on the node1 and if node1 is down then it should run on node2. This is Oracle 10gR2 (10.2.0.3 in WINDOWS) .In order to do the same I did following
    -- First Step
    Using DBCA- Service Managment - Created a service (BATCH_SERVICE) and given node1 as preferred and node2 as available. This created following entry in tnsnames.ora in both nodes.
    BATCH_SERVICE =
    (DESCRIPTION =
    (ADDRESS = (PROTOCOL = TCP)(HOST = node1-vip)(PORT = 1521))
    (ADDRESS = (PROTOCOL = TCP)(HOST = node2-vip)(PORT = 1521))
    (LOAD_BALANCE = yes)
    (CONNECT_DATA =
    (SERVER = DEDICATED)
    (SERVICE_NAME = BATCH_SERVICE)
    (FAILOVER_MODE =
    (TYPE = SELECT)
    (METHOD = BASIC)
    (RETRIES = 180)
    (DELAY = 5)
    --- Step 2
    -- Created BATCH job classes.
    BEGIN
    DBMS_SCHEDULER.create_job_class(
    job_class_name => 'BATCH_JOB_CLASS',
    service => 'BATCH_SERVICE');
    END;
    -- Step 3 -- created a job using job_class as BATCH_JOB_CLASS
    begin
    dbms_scheduler.create_job(
    job_name => 'oltp_job_test'
    ,job_type => 'STORED_PROCEDURE'
    ,job_action => 'schema1.P1'
    ,start_date => systimestamp at time zone 'US/Central'
    ,repeat_interval => 'FREQ=DAILY;BYHOUR=11;BYMINUTE=30;'
    ,job_class => 'BATCH_JOB_CLASS'
    ,enabled => TRUE
    ,comments => 'New Job.');
    end;
    Now when I monitor this job it runs on node1. Now I started testing for failover. I manually shutdown 1st instance. Then as per my understanding job should run on 2nd node. But job is not picking up.
    when I run the followign command
    srvctl status service -d db -s BATCH_SERVICE
    service BATCH_SERVICE is running on instance node2.
    Any help is really appreciated.

    It does not show that whether job is running or broken.

Maybe you are looking for

  • Deficit of stock in Posto Goods Receipt of inbound delivery

    Hi folks. I´m trying to post goods receipt of an inbound delivery (return order), but for one material we get this message: Deficit of BA Unrestricted-use 21 CRT : 697-0 0001 3002 DEVOLUÇÕES I don´t get it, since it seems to me that its calling a def

  • Product cost controlling by sales order

    Dear all, My client is having mto scenerio (non valuated stock) ,When i am confirming actual activities like electricity in production order the system picks planned activity rates and loads costs on production order,My client wants system to pick ac

  • Not all songs sync from computer to itouch

    I recently upgrade my iTouch to the new iOS 5.0.1 software (not sure if that was relevant to tell), but I have noticed it will not sync all my songs from my computer to my iTouch, for example, on my computer it says one playlist has 55 songs, however

  • 875P Neo - Almost there....

    Okay... I've fought through this board to the point where I'm about to return it but I almost have it going. I have my 1 gig of GEIL Golden Dragon Ram running at 2.85 volts but the system still wants to freeze up about 2 times a day. The screen will

  • Some cs4 projects hang when opening in cs5 versions

    Greetings, I upgraded from CS4 Master to CS5. I am able to open some of my CS4 authored projects in the CS5 versions and others do not. Specifically I am in after effects right now trying to open a project created in AE CS4. It said, "gotta convert"