UCCX 8.0 - 1st node crashed
Hi,
We have a 2-server UCCX 8.0 cluster running on UCS servers. Recently, when moving the publisher (1st node) to a new UCS server, we accidently deleted some files of the Virtual machine. (there are 2 folders in the datastore, named UCCX1 and UCCX1_1; my colleague deleted the UCCX1_1 folder as he thought it was not neccessary). After that, the ESXi kept asking for UCCX1_1\UCCX1_1.vmx file when we trying to boot the server. We had to re-add the server (browse to the vmx file in the datastore, and Add to Inventory); the server could boot up now, but I think we lost all the data (we cannot access to the Application web page).
Now we still have UCCX 2 running, could we force the 1st server to update its database to sync with the UCCX 2? If YES, how to do that?
If NO, what should we do? Re-install everything or is there a better way to recover the cluster?
Thanks,
hoanghiep
Hi Hoanghiep,
You can not make the UCCX 8.x series second node as the first node, this was supported only on Windows platform (i.e. UCCX 7.x and earlier releases).
If you have taken a valid DRS backup, than yes reinstall the UCCX 8.x first node (with the same details as before like hostnema, ip address, DNS....etc) and than restore this backup.
http://www.cisco.com/en/US/docs/voice_ip_comm/cust_contact/contact_center/crs/express_8_0/configuration/guide/uccx801drs.pdf
Restoring only the Publisher Node in an HA Setup (with Rebuild)
In a high availability (HA) setup , if there is a hard-drive failure or any other critical hardware or
Software failure which needs rebuild of the Publisher ( first ) node, then follow the below procedure to
recover the publisher node to the last backed up state of the publisher. Run the below procedure if you
have a valid backup taken before the failure of the node.
Procedure
Step 1 Perform a fresh installation of the same version of Cisco Unified Contact Center Express (using the same
administrator credentials, network configuration and security password used earlier) on the node prior
to restoring it.
For more information on installing Cisco Unified Contact Center Express, see the Installing Cisco
Unified Contact Center Express available here:
http://www.cisco.com/en/US/products/sw/custcosw/ps1846/prod_installation_guides_list.html
Step 2 Navigate to Cisco Unified Contact Center Administration, select Disaster Recovery System from the
Navigation drop-down list box in the upper-right corner of the Cisco Unified Contact Center Express
Administration window, and click Go.
The Disaster Recovery System Logon window displays.
Step 3 Log in to the Disaster Recovery System by using the same Platform Administrator username and
password that you use to log in to Cisco Unified Operating System Administration.
Step 4 Configure the backup device. For more information, see Managing Backup Devices, page 7.
Step 5 Navigate to Restore > Restore Wizard. The Restore Wizard Step 1 window displays.
Step 6 In the Select Backup Device area, choose the backup device from which to restore.
Step 7 Click Next. The Restore Wizard Step 2 window displays.
Step 8 Choose the backup file that you want to restore.
Note The backup filename indicates the date and time that the system created the backup file.
Step 9 Click Next. The Restore Wizard Step 3 window displays.
Step 10 Select the feature UCCX.
Step 11 Click Next. The Restore Wizard Step 4 window displays,
Step 12 When you get prompted to choose the nodes to restore, choose only the first node (the publisher).
CautionDo not select the second (subscriber) node in this condition as this will result in failure of the restore attempt.
Step 13 To start restoring the data, click Restore.
Note During the restore process, do not perform any tasks with Cisco Unified Contact Center Express
Administration or User Options.
Restoring the first node may take up to several hours based on the size of database that is being restored.
Depending on the size of your database that you choose to restore, the system can require one hour or
more to restore.
Note Based on the requirements, you have the option to either retrieve the existing publisher node data
from the DRS backup to be available on all the nodes in the cluster or retrieve the more recent
data (if available) from the subscriber node to be available in the cluster.
Step 14 Run the following CLI command from the Subscriber node after the restore process is successful (restore
status indicates 100 per cent) to inititate restoring the Publisher node only (with rebuild).
utils uccx setuppubrestore
Step 15 Run the following CLI command on the target node; that is if you want to retrieve the publisher node’s
data, then run this command on the subscriber node, but if you want to retrieve the subscriber node’s data
(which is more up-to-date), then run this command on the publisher node.
utils uccx database forcedatasync
Warning In any case, you must execute this command on either of the nodes after restoring the publisher node.
Step 16 Restart both the nodes and run the following CLI command on the Publisher node to set up replication.
utils uccx dbreplication reset
For more information on restarting, see the Cisco Unified Communications Operating System
Administration Guide available here:
http://www.cisco.com/en/US/products/sw/custcosw/ps1846/prod_maintenance_guides_list.html.
CautionIf you have done some configuration or hardware changes while performing fresh installation in Step 1 that might impact the License MAC, then rehost your license again using the license rehosting mechanism before running the CLI command “utils uccx dbreplication reset”. For more information on the licensing rehosting mechanism, see the Installing Cisco Unified Contact Center Express available here:
http://www.cisco.com/en/US/products/sw/custcosw/ps1846/prod_installation_guides_list.html
Step 17 Your data gets restored on the publisher node. To view the status of the restore, see the “Viewing the
Restore Status” section on page 19.
Hope this helps.
Anand
Please rate helpful posts !!
Similar Messages
-
Server restore causing some db's to crash on 1st node
Hi all!,
I was wondering if anybody has an idea for why the following happend. I checked the database logs and there's nothing before the crash indicating a problem. Here's what happened on our test RAC 2node cluster servers/db's.
Last week our 2nd node went down due to a patch of the unix os. It was a patch for the next version ahead of ours but our systems didn't catch it until it was too late. The patch was to allow them to not have to bounce each node when adding more disk to the SAN. It crashed the 2nd node as that was where they were starting to apply the patch. This 2nd node has been down all week as they couldn't get it up even with HP support on the line. They decided to restore the image from our 1st node onto the 2nd node and bring it online. When they did, it caused 3 out of 8 databases to suddenly crash on the 1st node. We got them up but need to find out why. Has anyone ever experienced such a thing and/or have any advice as what to look at? I don't know that much about our cluster, could there be parms set to individual db's that could be the culprit?
Any advice would be greatly appreciated, thanks in advance for your replies!,
DaveNo, that's what's troubleing. The last entry before crash shows the normal archive log switching that occurs and then the next entry is the database being started.
-
Services not starting after a node crash
hi
We have a 3 node cluster and one of the nodes crashed today, also the services did not get relocated to the other node and when we try to manullay stop/start/relocate the service we get the following error
srvctl stop service -d BCB -s BCB_J2EE -f
PRCD-1085 : Failed to stop service BCB_J2EE
PRCR-1065 : Failed to stop resource ora.BCB.BCB_j2ee.svc
CRS-2533: Server 'bcb528' is down. Unable to perform the operation on 'ora.BCB.BCB_j2ee.svc'
Would anyone has seen this before
Thx
JJthis is what i can find in log
[ CRSPE][60] Server [bcb528] is unreachable. Stopping the sequencer for: bcbCRON 1 1
2011-02-28 08:15:21.778: [ CRSPE][60] Sequencer for [bcbCRON 1 1] has completed with error: CRS-2533: Server 'bcb528' is down. Unable to pe
rform the operation on 'bcbCRON'
2011-02-28 08:15:21.778: [ CRSPE][60] Required instruction failed in op: START of [bcbCRON 1 1] on [bcb529] : 105247290
2011-02-28 08:15:21.781: [UiServer][62] Container [ Name: ORDER
MESSAGE:
TextMessage[CRS-2533: Server 'bcb528' is down. Unable to perform the operation on 'bcbCRON']
MSGTYPE:
TextMessage[1]
OBJID:
TextMessage[bcbCRON 1 1]
WAIT:
TextMessage[0] -
RAC 11gR2 cluster installation: root.sh failed on the 1st node
Hi,
Does anybody know why is possible when I run the root.sh on the 1st node, during the Oracle 11gR2 RAC installation (cluster installation) to get the following error?
The following environment variables are set as:
ORACLE_OWNER= oracle
ORACLE_HOME= /oracle/grid
Enter the full pathname of the local bin directory: [usr/local/bin]:
The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
Copying dbhome to /usr/local/bin ...
The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
Copying oraenv to /usr/local/bin ...
The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
Copying coraenv to /usr/local/bin ...
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root.sh script.
Now product-specific root actions will be performed.
2010-06-29 14:17:43: Parsing the host name
2010-06-29 14:17:43: Checking for super user privileges
2010-06-29 14:17:43: User has super user privileges
Using configuration parameter file: /oracle/grid/crs/install/crsconfig_params
Creating trace directory
User oracle has the required capabilities to run CSSD in realtime mode
LOCAL ADD MODE
Creating OCR keys for user 'root', privgrp 'system'..
Operation successful.
root wallet
root wallet cert
root cert export
peer wallet
profile reader wallet
pa wallet
peer wallet keys
pa wallet keys
peer cert request
pa cert request
peer cert
pa cert
peer root cert TP
profile reader root cert TP
pa root cert TP
peer pa cert TP
pa peer cert TP
profile reader pa cert TP
profile reader peer cert TP
peer user cert
pa user cert
Adding daemon to inittab
CRS-4123: Oracle High Availability Services has been started.
ohasd is starting
CRS-2672: Attempting to start 'ora.gipcd' on 'trz1test_rac'
CRS-2672: Attempting to start 'ora.mdnsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gipcd' on 'trz1test_rac' succeeded
CRS-2676: Start of 'ora.mdnsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gpnpd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'trz1test_rac'
CRS-2676: Start of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'trz1test_rac'
CRS-2672: Attempting to start 'ora.diskmon' on 'trz1test_rac'
CRS-2676: Start of 'ora.diskmon' on 'trz1test_rac' succeeded
CRS-2676: Start of 'ora.cssd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'trz1test_rac'
CRS-2676: Start of 'ora.ctssd' on 'trz1test_rac' succeeded
clscfg: -install mode specified
Successfully accumulated necessary OCR keys.
Creating OCR keys for user 'root', privgrp 'system'..
Operation successful.
CRS-2672: Attempting to start 'ora.crsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.crsd' on 'trz1test_rac' succeeded
Now formatting voting disk: /data_gpfs/oracle/crs/vdsk.
CRS-4603: Successful addition of voting disk /data_gpfs/oracle/crs/vdsk.
## STATE File Universal Id File Name Disk group
1. ONLINE 653624f2aa1f4f83bf774e8052889a32 (/data_gpfs/oracle/crs/vdsk) []
Located 1 voting disk(s).
CRS-2673: Attempting to stop 'ora.crsd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.crsd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.ctssd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.ctssd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'trz1test_rac'
CRS-2677: Stop of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.cssd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.cssd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.gpnpd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.gpnpd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.gipcd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.mdnsd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.mdnsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.mdnsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.mdnsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.gipcd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gipcd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gpnpd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'trz1test_rac'
CRS-2676: Start of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'trz1test_rac'
CRS-2672: Attempting to start 'ora.diskmon' on 'trz1test_rac'
CRS-2676: Start of 'ora.diskmon' on 'trz1test_rac' succeeded
CRS-2676: Start of 'ora.cssd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'trz1test_rac'
CRS-2676: Start of 'ora.ctssd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.crsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.crsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.evmd' on 'trz1test_rac'
CRS-2676: Start of 'ora.evmd' on 'trz1test_rac' succeeded
*/oracle/grid/bin/srvctl start nodeapps -n trz1test_rac ... failed*
Configure Oracle Grid Infrastructure for a Cluster ... failed
This is because ora.eONS daemon is not starting. There is a Metalink note that we MIGHT start this daemon manually ... but this is not working.
*./srvctl status nodeapps -n trz1test_rac*
-n <node_name> option has been deprecated.
VIP trz1test_rac_vip is enabled
VIP trz1test_rac_vip is running on node: trz1test_rac
Network is enabled
Network is running on node: trz1test_rac
GSD is disabled
GSD is not running on node: trz1test_rac
ONS is enabled
ONS daemon is running on node: trz1test_rac
eONS is enabled
eONS daemon is not running on node: trz1test_racI run my clusterware/DB on AIX 5.3
When I run runcluvfy.sh here are the things which are not passing:
Check: Node connectivity of subnet "192.168.1.0"
Source Destination Connected?
trz2test_rac:en5 trz2test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz1test_rac:en5 trz1test_rac:en5 yes
Result: Node connectivity passed for subnet "192.168.1.0" with node(s) trz2test_rac,trz1test_rac
Check: TCP connectivity of subnet "192.168.1.0"
Source Destination Connected?
trz1test_rac:192.168.1.140 trz2test_rac:192.168.1.142 failed
trz1test_rac:192.168.1.140 trz2test_rac:192.168.1.142 failed
Result: TCP connectivity check failed for subnet "192.168.1.0"
NTP daemon slewing option check failed on some nodes
PRVF-5436 : The NTP daemon running on one or more nodes lacks the slewing option "-x"
Result: Clock synchronization check using Network Time Protocol(NTP) failed
NTP mustn't be a problem I guess as the date are identical on the 2 nodes.
I have no idea how to fix the TCP connectivity issue with the subnet "192.168.1.0". Some posts wrote that could be a firewall issue. Are there any other causes ?
Thanks to all,
Paul -
RAC -process failover at node crash
Hi,
how to prevent running process(transaction) from termination in RAC while a node crashes ..
Ex: if there is a process running on node 1 and if it suddenly crashes in RAC how does we make node2 or node3 to pick it up and process or start the transaction again??
Thanks,Hello,
Look at your tnsnames.ora entry and see if it configured to benefit from FAILOVER, you can aslo explore other available options options
http://stanford.edu/dept/itss/docs/oracle/10g/network.101/b10776/tnsnames.htm
myservice=
(DESCRIPTION=
(SOURCE_ROUTE=yes)
(ADDRESS=(PROTOCOL=tcp)(HOST=host1)(PORT=1630)) # <-- hop 1
(ADDRESS_LIST=
(FAILOVER=on)
(LOAD_BALANCE=off) # <--- hop 2
(ADDRESS=(PROTOCOL=tcp)(HOST=host2a)(PORT=1532))
(ADDRESS=(PROTOCOL=tcp)(HOST=host2b)(PORT=1521)))
(ADDRESS=(PROTOCOL=tcp)(HOST=host3)(PORT=1521)) # <-- hop 3
(CONNECT_DATA=(SERVICE_NAME=myservice)))Another Example
MYSERVICE =
(DESCRIPTION =
(ADDRESS_LIST=
(FAILOVER = on)
(LOAD_BALANCE = on)
(ADDRESS= (PROTOCOL = TCP)(HOST = server1)(PORT = 1521))
(ADDRESS= (PROTOCOL = TCP)(HOST = server2)(PORT = 1521))
(ADDRESS= (PROTOCOL = TCP)(HOST = server3)(PORT = 1521))
(CONNECT_DATA=
(SERVICE_NAME =MYSERVICE)
(FAILOVER_MODE =
(BACKUP=server2)
(TYPE=select)
(METHOD=preconnect)
(RETRIES=20)
(DELAY=3)
) Regards
OrionNet
Edited by: OrionNet on Dec 18, 2008 3:12 PM -
Can I have RAC 1st node in RHEL 5 and 2nd node in RHEL 4?
Can I have my RAC 1st node in RHEL5 and 2nd node in RHEL 4?
I am just checking if there is any possibility like that.
Thanks,
MahiEven if it works by accident, it wouldn't be supported.
-
RAC: When 1st node started, the 2nd node failed to start
I got a problem in Oracle 10gR2 RAC on Windows 2003R2 Domain member environment. I have a 2 nodes RAC using ASM in 2 MS Windows 2003 Standard Server, it is a clean environment, only have Oracle and Norton Antivirus software installed.
When the 1st node started successfully from booting up the machine, the 2nd node is failed to startup. It stays in the Windows startup screen (Applying Computer Setting ...) for more then 1 hour. Eventually, the window login screen come out, but I cannot login to the system after input username and password. This situation is reversable (the 1st node failed to start if I startup the 2nd node first).
In case I set the Oracle Services (OracleCRService and OracleEVMService) into Manual startup at 2nd Nodes, the 2nd node can startup smoothly. After login to the 2nd node, I can start these 2 oracle services without problem.
P.S. This problem is just happened after applied I applied all MS Security Update on 10 Apr, 2008.
Any suggestion how to shoot this problem? Thanks.
Message was edited by:
ckhlamA couple of things you could try :
a) Disable the Norton AntiVirus Software and check whether rebooting the
Server allows the CRS stack to come up. Recall reading about an issue
where-in NAV waits for the Network Stack to come up and blocks
CRS's startup sequence. This is just a guess at this time but worth a try.
b) You might also want to check if configuring Oracle Process Manager as detailed
in Note:358156.1 allows the CRS stack to be delayed long enough to fully
initialize the OS stack beneath it.
c) If none of the above helps , you might want to uninstall the MS Security Update
to check if this was a problem introduced by this Patch. You might then have
to work with MS / Oracle to dig further into this.
Do update this thread with your observations on this ..
Vishwa -
Central CCMS Alert for Java instance server node crash
Hi,
Is it possible to trigger an alert of an Java system server node crash using central CCMS alerts.
J2EE instance CCMS alert does show some MTEs, however they do not trigger alerts. I got this info from below link
http://help.sap.com/saphelp_nwce711/helpdata/en/46/11aaf352da14dce10000000a155369/frameset.htm
Any idea how to trigger the alert for server node crash.
I understand that server node crash should be investigated for permanent fix, however we need this as a proactive measure to know the crash if they happen.
Thanks
ImtiazHi Imtiaz,
Do you see any error on the view>status auto-reaction? Have you been able to assign an auto-reaction to this MTE?
Cheers,
Maurício -
Matlab node crashes LV 8.5
Today, I observed a bizzare phenmenon in LV 8.5, when playing with Matlab Script node.
Just open any example from the Example Finder. Rt click on the Script node & Choose Script Server -->> Xmath Script.
Now, do a Ctrl + Z [Undo] & LV ll get crashed.
See attached pic.
- Partha
LabVIEW - Wires that catch bugs!
Attachments:
Ctrl+Z on Matlab node crashes LV 8.5.PNG 101 KBHi Parthabe,
Thank you for the feedback. This is definetely a bug in Labview and I have filed a CAR for this issue. For your reference the CAR# is 96755
Eli S.
National Instruments
Applications Engineer -
Node crashes when enabling RDS for private interconnect.
OS: oel6.3 - 2.6.39-300.17.2.el6uek.x86_64
Grid and DB: 11.2.0.3.4
This is a two node Standard Edition cluster.
The node crashes upon restart of clusterware after following the instructions from note:751343.1 (RAC Support for RDS Over Infiniband) to enable RDS.
The cluster is running fine using ipoib for the cluster_interconnect.
1) As the ORACLE_HOME/GI_HOME owner, stop all resources (database, listener, ASM etc) that's running from the home. When stopping database, use NORMAL or IMMEDIATE option.
2) As root, if relinking 11gR2 Grid Infrastructure (GI) home, unlock GI home: GI_HOME/crs/install/rootcrs.pl -unlock
3) As the ORACLE_HOME/GI_HOME owner, go to ORACLE_HOME/GI_HOME and cd to rdbms/lib
4) As the ORACLE_HOME/GI_HOME owner, issue "make -f ins_rdbms.mk ipc_rds ioracle"
5) As root, if relinking 11gR2 Grid Infrastructure (GI) home, lock GI home: GI_HOME/crs/install/rootcrs.pl -patch
Looks to abend when asm tries to start with the message below on the console.
I have a service request open for this issue but, I am hoping someone may have seen this and has
some way around it.
Thanks
Alan
kernel BUG at net/rds/ib_send.c:547!
invalid opcode: 0000 [#1] SMP
CPU 2
Modules linked in: 8021q garp stp llc iptable_filter ip_tables nfs lockd
fscache auth_rpcgss nfs_acl sunrpc cpufreq_ondemand powernow_k8
freq_table mperf rds_rdma rds_tcp rds ib_ipoib rdma_ucm ib_ucm ib_uverbs
ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa sr_mod cdrom microcode
serio_raw pcspkr ghes hed k10temp hwmon amd64_edac_mod edac_core
edac_mce_amd i2c_piix4 i2c_core sg igb dca mlx4_ib ib_mad ib_core
mlx4_en mlx4_core ext4 mbcache jbd2 usb_storage sd_mod crc_t10dif ahci
libahci dm_mirror dm_region_hash dm_log dm_mod [last unloaded:
scsi_wait_scan]
Pid: 4140, comm: kworker/u:1 Not tainted 2.6.39-300.17.2.el6uek.x86_64
#1 Supermicro BHDGT/BHDGT
RIP: 0010:[<ffffffffa02db829>] [<ffffffffa02db829>]
rds_ib_xmit+0xa69/0xaf0 [rds_rdma]
RSP: 0018:ffff880fb84a3c50 EFLAGS: 00010202
RAX: ffff880fbb694000 RBX: ffff880fb3e4e600 RCX: 0000000000000000
RDX: 0000000000000030 RSI: ffff880fbb6c3a00 RDI: ffff880fb058a048
RBP: ffff880fb84a3d30 R08: 0000000000000fd0 R09: ffff880fbb6c3b90
R10: 0000000000000000 R11: 000000000000001a R12: ffff880fbb6c3a00
R13: ffff880fbb6c3a00 R14: 0000000000000000 R15: ffff880fb84a3d90
FS: 00007fd0a3a56700(0000) GS:ffff88101e240000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000002158ca2 CR3: 0000000001783000 CR4: 00000000000406e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process kworker/u:1 (pid: 4140, threadinfo ffff880fb84a2000, task
ffff880fae970180)
Stack:
0000000000012200 0000000000012200 ffff880f00000000 0000000000000000
000000000000e5b0 ffffffff8115af81 ffffffff81b8d6c0 ffffffffa02b2e12
00000001bf272240 ffffffff81267020 ffff880fbb6c3a00 0000003000000002
Call Trace:
[<ffffffff8115af81>] ? __kmalloc+0x1f1/0x200
[<ffffffffa02b2e12>] ? rds_message_alloc+0x22/0x90 [rds]
[<ffffffff81267020>] ? sg_init_table+0x30/0x50
[<ffffffffa02b2db2>] ? rds_message_alloc_sgs+0x62/0xa0 [rds]
[<ffffffffa02b31e4>] ? rds_message_map_pages+0xa4/0x110 [rds]
[<ffffffffa02b4f3b>] rds_send_xmit+0x38b/0x6e0 [rds]
[<ffffffff81089d53>] ? cwq_activate_first_delayed+0x53/0x100
[<ffffffffa02b6040>] ? rds_recv_worker+0xc0/0xc0 [rds]
[<ffffffffa02b6075>] rds_send_worker+0x35/0xc0 [rds]
[<ffffffff81089fd6>] process_one_work+0x136/0x450
[<ffffffff8108bbe0>] worker_thread+0x170/0x3c0
[<ffffffff8108ba70>] ? manage_workers+0x120/0x120
[<ffffffff810907e6>] kthread+0x96/0xa0
[<ffffffff81515544>] kernel_thread_helper+0x4/0x10
[<ffffffff81090750>] ? kthread_worker_fn+0x1a0/0x1a0
[<ffffffff81515540>] ? gs_change+0x13/0x13
Code: ff ff e9 b1 fe ff ff 48 8b 0d b4 54 4b e1 48 89 8d 70 ff ff ff e9
71 ff ff ff 83 bd 7c ff ff ff 00 0f 84 f4 f5 ff ff 0f 0b eb fe <0f> 0b
eb fe 44 8b 8d 48 ff ff ff 41 b7 01 e9 51 f6 ff ff 0f 0b
RIP [<ffffffffa02db829>] rds_ib_xmit+0xa69/0xaf0 [rds_rdma]
RSP <ffff880fb84a3c50>
Initializing cgroup subsys cpuset
Initializing cgroup subsys cpu
Linux version 2.6.39-300.17.2.el6uek.x86_64
([email protected]) (gcc version 4.4.6 20110731 (Red
Hat 4.4.6-3) (GCC) ) #1 SMP Wed Nov 7 17:48:36 PST 2012
Command line: ro root=UUID=5ad1a268-b813-40da-bb76-d04895215677
rd_DM_UUID=ddf1_stor rd_NO_LUKS rd_NO_LVM LANG=en_US.UTF-8 rd_NO_MD
SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us numa=off
console=ttyS1,115200n8 irqpoll maxcpus=1 nr_cpus=1 reset_devices
cgroup_disable=memory mce=off memmap=exactmap memmap=538K@64K
memmap=130508K@770048K elfcorehdr=900556K memmap=72K#3668608K
memmap=184K#3668680K
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000100 - 0000000000096800 (usable)
BIOS-e820: 0000000000096800 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000e6000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 00000000dfe90000 (usable)
BIOS-e820: 00000000dfe9e000 - 00000000dfea0000 (reserved)
BIOS-e820: 00000000dfea0000 - 00000000dfeb2000 (ACPI data)
BIOS-e820: 00000000dfeb2000 - 00000000dfee0000 (ACPI NVS)
BIOS-e820: 00000000dfee0000 - 00000000f0000000 (reserved)
BIOS-e820: 00000000ffe00000 - 0000000100000000 (reserved)I believe OFED version is 1.5.3.3 but I am not sure if this is correct.
We have not added any third parry drivers. All that has been done to add infiniband to our build is
a yum groupinstall iInfiniband support.
I have not tries rds-stress but rds-ping works fine and rds-info seems fine.
A service request has been opened but so far I have had better response here.
oracle@blade1-6:~> rds-info
RDS IB Connections:
LocalAddr RemoteAddr LocalDev RemoteDev
10.10.0.116 10.10.0.119 fe80::25:90ff:ff07:df1d fe80::25:90ff:ff07:e0e5
TCP Connections:
LocalAddr LPort RemoteAddr RPort HdrRemain DataRemain SentNxt ExpectUna SeenUna
Counters:
CounterName Value
conn_reset 5
recv_drop_bad_checksum 0
recv_drop_old_seq 0
recv_drop_no_sock 1
recv_drop_dead_sock 0
recv_deliver_raced 0
recv_delivered 18
recv_queued 18
recv_immediate_retry 0
recv_delayed_retry 0
recv_ack_required 4
recv_rdma_bytes 0
recv_ping 14
send_queue_empty 18
send_queue_full 0
send_lock_contention 0
send_lock_queue_raced 0
send_immediate_retry 0
send_delayed_retry 0
send_drop_acked 0
send_ack_required 3
send_queued 32
send_rdma 0
send_rdma_bytes 0
send_pong 14
page_remainder_hit 0
page_remainder_miss 0
copy_to_user 0
copy_from_user 0
cong_update_queued 0
cong_update_received 1
cong_send_error 0
cong_send_blocked 0
ib_connect_raced 4
ib_listen_closed_stale 0
ib_tx_cq_call 6
ib_tx_cq_event 6
ib_tx_ring_full 0
ib_tx_throttle 0
ib_tx_sg_mapping_failure 0
ib_tx_stalled 16
ib_tx_credit_updates 0
ib_rx_cq_call 33
ib_rx_cq_event 38
ib_rx_ring_empty 0
ib_rx_refill_from_cq 0
ib_rx_refill_from_thread 0
ib_rx_alloc_limit 0
ib_rx_credit_updates 0
ib_ack_sent 4
ib_ack_send_failure 0
ib_ack_send_delayed 0
ib_ack_send_piggybacked 0
ib_ack_received 3
ib_rdma_mr_alloc 0
ib_rdma_mr_free 0
ib_rdma_mr_used 0
ib_rdma_mr_pool_flush 8
ib_rdma_mr_pool_wait 0
ib_rdma_mr_pool_depleted 0
ib_atomic_cswp 0
ib_atomic_fadd 0
iw_connect_raced 0
iw_listen_closed_stale 0
iw_tx_cq_call 0
iw_tx_cq_event 0
iw_tx_ring_full 0
iw_tx_throttle 0
iw_tx_sg_mapping_failure 0
iw_tx_stalled 0
iw_tx_credit_updates 0
iw_rx_cq_call 0
iw_rx_cq_event 0
iw_rx_ring_empty 0
iw_rx_refill_from_cq 0
iw_rx_refill_from_thread 0
iw_rx_alloc_limit 0
iw_rx_credit_updates 0
iw_ack_sent 0
iw_ack_send_failure 0
iw_ack_send_delayed 0
iw_ack_send_piggybacked 0
iw_ack_received 0
iw_rdma_mr_alloc 0
iw_rdma_mr_free 0
iw_rdma_mr_used 0
iw_rdma_mr_pool_flush 0
iw_rdma_mr_pool_wait 0
iw_rdma_mr_pool_depleted 0
tcp_data_ready_calls 0
tcp_write_space_calls 0
tcp_sndbuf_full 0
tcp_connect_raced 0
tcp_listen_closed_stale 0
RDS Sockets:
BoundAddr BPort ConnAddr CPort SndBuf RcvBuf Inode
0.0.0.0 0 0.0.0.0 0 131072 131072 340441
RDS Connections:
LocalAddr RemoteAddr NextTX NextRX Flg
10.10.0.116 10.10.0.119 33 38 --C
Receive Message Queue:
LocalAddr LPort RemoteAddr RPort Seq Bytes
Send Message Queue:
LocalAddr LPort RemoteAddr RPort Seq Bytes
Retransmit Message Queue:
LocalAddr LPort RemoteAddr RPort Seq Bytes
10.10.0.116 0 10.10.0.119 40549 32 0
oracle@blade1-6:~> cat /etc/rdma/rdma.conf
# Load IPoIB
IPOIB_LOAD=yes
# Load SRP module
SRP_LOAD=no
# Load iSER module
ISER_LOAD=no
# Load RDS network protocol
RDS_LOAD=yes
# Should we modify the system mtrr registers? We may need to do this if you
# get messages from the ib_ipath driver saying that it couldn't enable
# write combining for the PIO buffs on the card.
# Note: recent kernels should do this for us, but in case they don't, we'll
# leave this option
FIXUP_MTRR_REGS=no
# Should we enable the NFSoRDMA service?
NFSoRDMA_LOAD=yes
NFSoRDMA_PORT=2050
oracle@blade1-6:~> /etc/init.d/rdma status
Low level hardware support loaded:
mlx4_ib
Upper layer protocol modules:
rds_rdma ib_ipoib
User space access modules:
rdma_ucm ib_ucm ib_uverbs ib_umad
Connection management modules:
rdma_cm ib_cm iw_cm
Configured IPoIB interfaces: none
Currently active IPoIB interfaces: ib0 -
I'm not sure this is the proper forum for this post, if it's not please feel free to move it.
The situation I'm facing is this:
My company has clusters setup across North America with our software that utilizes the Oracle database. 90% of the time everything functions exactly as it is supposed to. However, it is the other 10% of sites that I am here to ask about.
Our clusters are setup in a dual-server environment that basically act as a single server. The application runs on one server and the database runs on another, and in the case of problems, either can be failed over to run both sets of services on a single server (basic, I realize). At certain sites we are unable to run services on one of the nodes. When they are run as they are supposed to, every so often (at some sites a matter of minutes/hours, at others it can be a couple weeks) they will BSOD.
I fully understand what the blue screen is. The minidump shows that it's the orafencedrv.sys stop, where the Oracle database shuts down a node after loss of communications in order to prevent corruption of the database. This is a great feature and I'm grateful for it, however it has caused us many headaches in diagnosing what it actually causing the drop in communications.
The interconnect and the public IP are both hooked up over a single switch but they operate on different subnets. Could operating on a single switch be part of the problem?
Could the problem be that the switches are being overloaded with traffic causing temporary packet losses between the two nodes, which I know is enough to have Oracle BSOD a node?
Below I'm posting one of the dumps listed in the CSSD log when the node crashes, hopefully this will provide some sort of information as to what is happening.
If any other information is needed, please feel free to let me know. Thanks for your help in advance.
[ CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: clssnmvDiskKillCheck: Aborting, evicted by node 1, sync 13, stamp 99832890,
[ CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: ###################################
[ CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: clssscExit: CSSD aborting
[ CSSD]2008-10-29 13:30:06.211 [2732] >ERROR: ###################################
[ CSSD]--- DUMP GROCK STATE DB ---
[ CSSD]----------
[ CSSD] type 2, Id 3, Name = (crs_version)
[ CSSD] flags: 0x0
[ CSSD] grant: count=0, type 0, wait 0
[ CSSD] Member Count =2, master 0
[ CSSD] . . . . .
[ CSSD] memberNo =0, seq 5
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 2, nodeBirth 6
[ CSSD] privateDataSize = 0
[ CSSD] publicDataSize = 0
[ CSSD] . . . . .
[ CSSD] memberNo =1, seq 11
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 1, nodeBirth 12
[ CSSD] privateDataSize = 0
[ CSSD] publicDataSize = 0
[ CSSD]----------
[ CSSD]----------
[ CSSD] type 2, Id 2, Name = (ocr_STLRZOPRCL)
[ CSSD] flags: 0x0
[ CSSD] grant: count=0, type 0, wait 0
[ CSSD] Member Count =2, master 2
[ CSSD] . . . . .
[ CSSD] memberNo =2, seq 5
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 2, nodeBirth 6
[ CSSD] privateDataSize = 0
[ CSSD] publicDataSize = 32
[ CSSD] . . . . .
[ CSSD] memberNo =1, seq 11
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 1, nodeBirth 12
[ CSSD] privateDataSize = 0
[ CSSD] publicDataSize = 32
[ CSSD]----------
[ CSSD]----------
[ CSSD] type 3, Id 15, Name = (_ORA_CRS_MEMBER_stlrzoprcl1)
[ CSSD] flags: 0x0
[ CSSD] grant: count=1, type 3, wait 1
[ CSSD] Member Count =1, master -3
[ CSSD] . . . . .
[ CSSD] memberNo =0, seq 0
[ CSSD] flags = 0x12, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 1, nodeBirth 12
[ CSSD] privateDataSize = 0
[ CSSD] publicDataSize = 0
[ CSSD]----------
[ CSSD]----------
[ CSSD] type 3, Id 15, Name = (_ORA_CRS_MEMBER_stlrzoprcl2)
[ CSSD] flags: 0x0
[ CSSD] grant: count=1, type 3, wait 1
[ CSSD] Member Count =1, master -3
[ CSSD] . . . . .
[ CSSD] memberNo =0, seq 0
[ CSSD] flags = 0x12, granted 1
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 2, nodeBirth 6
[ CSSD] privateDataSize = 0
[ CSSD] publicDataSize = 0
[ CSSD]----------
[ CSSD]----------
[ CSSD] type 2, Id 4, Name = (CRSDMAIN)
[ CSSD] flags: 0x0
[ CSSD] grant: count=0, type 0, wait 0
[ CSSD] Member Count =2, master 2
[ CSSD] . . . . .
[ CSSD] memberNo =2, seq 5
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 2, nodeBirth 6
[ CSSD] privateDataSize = 128
[ CSSD] publicDataSize = 128
[ CSSD] . . . . .
[ CSSD] memberNo =1, seq 11
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 1, nodeBirth 12
[ CSSD] privateDataSize = 128
[ CSSD] publicDataSize = 128
[ CSSD]----------
[ CSSD]----------
[ CSSD] type 2, Id 1, Name = (EVMDMAIN)
[ CSSD] flags: 0x0
[ CSSD] grant: count=0, type 0, wait 0
[ CSSD] Member Count =2, master 2
[ CSSD] . . . . .
[ CSSD] memberNo =2, seq 5
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 2, nodeBirth 6
[ CSSD] privateDataSize = 508
[ CSSD] publicDataSize = 504
[ CSSD] . . . . .
[ CSSD] memberNo =1, seq 11
[ CSSD] flags = 0x0, granted 0
[ CSSD] refCnt = 1
[ CSSD] nodeNum = 1, nodeBirth 12
[ CSSD] privateDataSize = 508
[ CSSD] publicDataSize = 504
[ CSSD]----------
[ CSSD]--- END OF GROCK STATE DUMP ---
[ CSSD]------- End Dump -------Hi user10508733
Seems to be your first post, welcome to this forum!!
What is the OS (blue screen that should be windows? ) and what is the release of your CRS and RDBMS ? hopefully not 10.1x.x.x, if yes please patch it to 10.2.0.4.
Seems to have a lot of bugs about CRS before 10.2.0.3 see that list
Doc ID: Note:391116.1
Subject: 10.2.0.3 Patch Set - List of Bug Fixes by Problem Type
let us know what's the result
thanks -
Ipod Touch, 1st generation crashed during upgrade to v3.1.3 software
I have just paid to upgrade the software on my 1st generation Ipod touch. The software seems to have downloaded to my itunes, but some sort of error message occured during the download, and my ipod has crashed. The screen is locked on an image of an apple icon with a download progress bar half way filled. I can't turn the ipod on or off. When I plug it into the computer, it recognizes that a device is there, and it allows me to sync my ipod, but the screen remains unchanged. Does anyone have any idea on what I can do to fix this?
Thanks!I am continuing to have problems with getting my ipod touch restored. I am following the directions on the faq's but it seems that my touch continues to freeze up during the restoring nprocess. Itunes continues to sync, but the ipod screen remains unchanged.
-
J2EE server node crashes / .hotspot_compiler
Hi,
I'm trying to install a NW AS Java + usage type DI 7.0 SR3 on W2K3 R2 x64 SP2 with MS SQL Server 2005 and Java HotSpot(TM) 64-Bit Server VM (build 1.4.2_18-b06, mixed mode). During sapinst the server node was shut down by the program but doesn't come up anymore. It crashes constantly and is directly beeing restarted.
A look in std_server0.out gave me the following insight (example, added lf):
Login :33.902: [ParNew 218352K->59053K(966656K), 0.0923355 secs]
39.132: [ParNew 222893K->62535K(966656K), 0.1131081 secs]
44.103: [ParNew 226375K->65891K(966656K), 0.0794097 secs]
### Excluding compile: com.sap.engine.services.
webservices.jaxrpc.encoding.TypeMappingImpl::initializeRelations
48.296: [ParNew 229731K->66972K(966656K), 0.0845196 secs]
### Excluding compile: com.sap.engine.services.
webservices.jaxrpc.encoding.InstanceBuilder::readElement
52.107: [ParNew 230812K->71757K(966656K), 0.0862884 secs]
### Excluding compile: com.sap.engine.services.
webservices.jaxrpc.encoding.GeneratedComplexType::_loadInto
56.691: [ParNew 235597K->75601K(966656K), 0.0875517 secs]
An unrecoverable stack overflow has occurred.
# An unexpected error has been detected by HotSpot Virtual Machine:
# EXCEPTION_STACK_OVERFLOW (0xc00000fd) at pc=0x00000000080e3dd6, pid=4460, tid=5532
# Java VM: Java HotSpot(TM) 64-Bit Server VM (1.4.2_18-b06 mixed mode)
# Problematic frame:
# V [jvm.dll+0xe3dd6]
# An error report file with more information is saved as hs_err_pid4460.log
# If you would like to submit a bug report, please visit:
# http://java.sun.com/webapps/bugreport/crash.jsp
stdout/stderr redirect
node name : server0
pid : 3924
system name : DID
system nr. : 00
started at : Wed Sep 24 12:27:17 2008
As you can see the VM crashes. A look in the corresponding log-file (hs_err_pid4460.log) gave me the following insight (added lf):
Current CompileTask:
opto:1295 ! com.sap.engine.core.cluster.impl6.ms.MSRawConnection.sendMessage(
Lcom/sap/engine/core/cluster/impl6/ms/MSMessageObjectImpl;
Lcom/sap/engine/core/cluster/impl6/ms/MSRegistrable;
[BIIZ)Lcom/sap/engine/frame/cluster/message/MessageAnswer; (1477 bytes)
It seems that the CompileTask for the Hotspot VM always crashes when trying to compile the MSRawConnection class natively (reproducable). Sun describes a workaround for this type of problem, http://java.sun.com/javase/6/webnotes/trouble/TSG-VM/html/gbyzo.html#gbyzd . The workaround is to place a .hotspot_compiler file in the working directory of the application with an exclusion of the method. This will advice the VM whenever it decides to natively compile a certain bit of code first to check this file for exclusions. If the method identified by the VM to be compiled is exluded in this file, the compilation will be skipped. Therefore it would have been worth a try to exclude the above mentioned method sendMessage of class MSRawConnection for hotspot compilation. But when I do a quick search for .hotspot_compiler in my sap folder I find four of them, it seems SAP is already making heavy use of this "workaround" instead of reporting a bug. These files are located under:
D:\usr\sap\DID\JC00\j2ee\cluster
D:\usr\sap\DID\JC00\SDM\program
D:\usr\sap\DID\JC00\j2ee\cluster\dispatcher
D:\usr\sap\DID\JC00\j2ee\cluster\server0
They all contain the same exclusions, listed here (added lf):
## This file contains a list of methods which are going to be excluded from JIT compilation on server start
## The format of the file is as follows
## exclude package/subpackage1/subpackage2/../subpackageN/<Class_name> <method_to_exclude>
## Each line of the file describes only one method
## <method_to_exclude> is method name that will not be compiled with JIT
## package/subpackage1/subpackage2/../subpackageN/<Class_name>
is the name of the class with the packages containing <method_to_exclude>
## Example:
## exclude com/sap/engine/boot/Start main
## will not compile with JIT the main method of com.sap.engine.boot.Start class
## To enter a list of methods to exclude from JIT compilation write them after this line
exclude com/sapportals/portal/pb/layout/taglib/ContainerTag addIviewResources
exclude com/sap/engine/services/keystore/impl/security/CodeBasedSecurityConnector getApplicationDomain
exclude com/sap/engine/services/rmi_p4/P4StubSkeletonGenerator generateStub
exclude com/sapportals/portal/prt/util/StringUtils escapeToJS
exclude com/sapportals/portal/prt/core/broker/PortalServiceItem startServices
exclude com/sap/engine/services/webservices/server/deploy/WSConfigurationHandler downloadFile
exclude com/sapportals/portal/prt/jndisupport/util/AbstractHierarchicalContext lookup
exclude com/sapportals/portal/navigation/cache/CacheNavigationNode getAttributeValue
exclude com/sapportals/portal/navigation/TopLevelNavigationiView PrintNode
exclude com/sapportals/wcm/service/ice/wcm/ICEPropertiesCoder encode
exclude com/sap/lcr/pers/delta/importing/ObjectLoader loadObjects
exclude com/sap/engine/services/webservices/jaxrpc/encoding/InstanceBuilder readElement
exclude com/sap/engine/services/webservices/jaxrpc/encoding/InstanceBuilder readSequence
exclude com/sap/engine/services/webservices/jaxrpc/encoding/TypeMappingImpl initializeRelations
exclude com/sap/engine/services/webservices/jaxrpc/encoding/GeneratedComplexType _loadInto
I thought the D:\usr\sap\DID\JC00\j2ee\cluster\server0\.hotspot_compiler file must be the right file, but however to be sure (test) I added the following line to all four files after having shutdown the instance:
exclude com/sap/engine/core/cluster/impl6/ms/MSRawConnection sendMessage
When I start the engine again, the content of every file gets overwritten by the original content, therefore lacking my new line. So it seems to me that the content is somehow hardcoded or contained in the db. If it is in the db, is it possible to change the content via config tool? It also seems that this jdk is a beta version since it reports itself with the version string 1.4.2_18-b06. This is the one officially delivered by Sun on the [SAP download page|http://java.sun.com/j2se/1.4.2/SAPsite/download.html], as mentioned in [SAP Note 941595|https://websmp130.sap-ag.de/sap(bD1kZSZjPTAwMQ==)/bc/bsp/spn/sapnotes/index2.htm?numm=941595]. Can you please provide me a solution to add an exclusion to the .hotspot_compiler file or workaround for the above mentioned problem. As a last option I will deinstall the system and reinstall it with another jdk (e.g. J2SE v 1.4.2_17 x64 SDK), but first I want to try to exclude the method/class from compilation. Thanks for your help!
Best regards,
FabianHi,
You can tell the VM which file to load as compiler exclusion list. Therefore I copied .hotspot_compiler to .ext_hotspot_compiler and added my line
exclude com/sap/engine/core/cluster/impl6/ms/MSRawConnection sendMessage
then I went to config tool and added under cluster data -> myinstance -> myservernode under tab General the Java parameter
-XX:CompileCommandFile=D:/usr/sap/DID/JC00/j2ee/cluster/server0/.ext_hotspot_compiler
The J2EE node is now starting up without problems.
Best regards,
Fabian -
I have came across a nasty bug that caused Labview 2010 SP1 (Runnnig Win 7 Ultimate x64 bit) to crash without any warning.
To replicate the bug do the following:
Add a numeric control and another indicator to the front panel
Switch to block diagram and add a feed back node
Connect the initializer terminal of the feed back node to the output of the control
Now do ANY of the following to cause the bug:
Press the run buttong (which is broken due to not connecting the input of the feed back node) it will turn to a normal run without displaying the error
Do an extra action and undo it, the run button will turn from list error normal
So far the Vi can be saved normally. Now connect the output of the feed back node to the indicator and try any of the followings:
Save the VI
Close the VI
Create a new project and select to add the VI to the project
This will cause Labview to crash without any notice!
When you are at step 4, the bug is there but harmless. Once you combine it with step 5 (connect to indicator), the bug is active and cause crashing. I have attached a snapshot of how the Front panel/block diagram look like before saving (since it can't be saved). Notice how the run button is enabled although the input of the feedback node is not connected.
I have tried to replicate the error on Labview 2009 but couldn't.
Attachments:
FBN Bug.jpg 56 KBDear ªL¡
Thank you for briging our attention to this issue.
I replicated it on LabVIEW 2010 SP1 and confirmed, that in LabVIEW 2011 it has been fixed.
Thanks again!
Best regards,
Mateusz Stokłosa
Applications Engineer
National Instruments -
Dbms_schduler job is not running on a 2 node rac when 1st node fails
Hi,
I want to create a dbms_scheduler job in a 2 node RAC and the job should always run on the node1 and if node1 is down then it should run on node2. This is Oracle 10gR2 (10.2.0.3 in WINDOWS) .In order to do the same I did following
-- First Step
Using DBCA- Service Managment - Created a service (BATCH_SERVICE) and given node1 as preferred and node2 as available. This created following entry in tnsnames.ora in both nodes.
BATCH_SERVICE =
(DESCRIPTION =
(ADDRESS = (PROTOCOL = TCP)(HOST = node1-vip)(PORT = 1521))
(ADDRESS = (PROTOCOL = TCP)(HOST = node2-vip)(PORT = 1521))
(LOAD_BALANCE = yes)
(CONNECT_DATA =
(SERVER = DEDICATED)
(SERVICE_NAME = BATCH_SERVICE)
(FAILOVER_MODE =
(TYPE = SELECT)
(METHOD = BASIC)
(RETRIES = 180)
(DELAY = 5)
--- Step 2
-- Created BATCH job classes.
BEGIN
DBMS_SCHEDULER.create_job_class(
job_class_name => 'BATCH_JOB_CLASS',
service => 'BATCH_SERVICE');
END;
-- Step 3 -- created a job using job_class as BATCH_JOB_CLASS
begin
dbms_scheduler.create_job(
job_name => 'oltp_job_test'
,job_type => 'STORED_PROCEDURE'
,job_action => 'schema1.P1'
,start_date => systimestamp at time zone 'US/Central'
,repeat_interval => 'FREQ=DAILY;BYHOUR=11;BYMINUTE=30;'
,job_class => 'BATCH_JOB_CLASS'
,enabled => TRUE
,comments => 'New Job.');
end;
Now when I monitor this job it runs on node1. Now I started testing for failover. I manually shutdown 1st instance. Then as per my understanding job should run on 2nd node. But job is not picking up.
when I run the followign command
srvctl status service -d db -s BATCH_SERVICE
service BATCH_SERVICE is running on instance node2.
Any help is really appreciated.It does not show that whether job is running or broken.
Maybe you are looking for
-
Deficit of stock in Posto Goods Receipt of inbound delivery
Hi folks. I´m trying to post goods receipt of an inbound delivery (return order), but for one material we get this message: Deficit of BA Unrestricted-use 21 CRT : 697-0 0001 3002 DEVOLUÇÕES I don´t get it, since it seems to me that its calling a def
-
Product cost controlling by sales order
Dear all, My client is having mto scenerio (non valuated stock) ,When i am confirming actual activities like electricity in production order the system picks planned activity rates and loads costs on production order,My client wants system to pick ac
-
Not all songs sync from computer to itouch
I recently upgrade my iTouch to the new iOS 5.0.1 software (not sure if that was relevant to tell), but I have noticed it will not sync all my songs from my computer to my iTouch, for example, on my computer it says one playlist has 55 songs, however
-
875P Neo - Almost there....
Okay... I've fought through this board to the point where I'm about to return it but I almost have it going. I have my 1 gig of GEIL Golden Dragon Ram running at 2.85 volts but the system still wants to freeze up about 2 times a day. The screen will
-
Some cs4 projects hang when opening in cs5 versions
Greetings, I upgraded from CS4 Master to CS5. I am able to open some of my CS4 authored projects in the CS5 versions and others do not. Specifically I am in after effects right now trying to open a project created in AE CS4. It said, "gotta convert"