Cluster nodes discover peers outside cluster domain

All, cross-posted from the ColdFusion Server Administration
forum:
I've run into an issue with CFMX7 clustering on a subnet with
multicast disabled. In our configuration, we have two physical
Windows Server 2003 Enterprise Edition servers hosting nine
ColdFusion MX 7 Enterprise clusters. Each server hosts one of two
instances in a cluster. i.e.:
server1 [1.2.3.4] - instance1-1 <- cluster1 -> server2
[1.2.3.5] - instance1-2
server1 [1.2.3.4] - instance2-1 <- cluster2 -> server2
[1.2.3.5] - instance2-2
server1 [1.2.3.4] - instance3-1 <- cluster3 -> server2
[1.2.3.5] - instance3-2
server1 [1.2.3.4] - instance4-1 <- cluster4 -> server2
[1.2.3.5] - instance4-2
server1 [1.2.3.4] - instance5-1 <- cluster5 -> server2
[1.2.3.5] - instance5-2
server1 [1.2.3.4] - instance6-1 <- cluster6 -> server2
[1.2.3.5] - instance6-2
server1 [1.2.3.4] - instance7-1 <- cluster7 -> server2
[1.2.3.5] - instance7-2
server1 [1.2.3.4] - instance8-1 <- cluster8 -> server2
[1.2.3.5] - instance8-2
server1 [1.2.3.4] - instance9-1 <- cluster9 -> server2
[1.2.3.5] - instance9-2
My first step in enabling peer discovery was to add the
unicastPeer attribute to the ClusterManager service under each
instance.
e.g. jrun.xml on instance1-1:
<service class="jrunx.cluster.ClusterManager"
name="ClusterManager">
<attribute name="bindToJNDI">true</attribute>
<attribute name="enabled">true</attribute>
<attribute
name="clusterDomain">cluster1</attribute>
<!-- While we will discover nearby peers automatically
without prior knowledge -->
<!-- of them, you can also add as many specific hosts as
you wish; these unicast -->
<!-- peers do not need to be nearby or reachable via
multicast. -->
<!--EXAMPLE: <attribute
name="unicastPeer">sneville</attribute> -->
<attribute name="unicastPeer">1.2.3.5</attribute>
<service class="jrunx.cluster.ClusterDeployerService"
name="ClusterDeployerService">
<attribute
name="deployDirectory">{jrun.server.rootdir}/SERVER-INF/cluster</attribute>
<attribute name="deactivated">false</attribute>
</service>
</service>
e.g. jrun.xml on instance1-2:
<service class="jrunx.cluster.ClusterManager"
name="ClusterManager">
<attribute name="bindToJNDI">true</attribute>
<attribute name="enabled">true</attribute>
<attribute
name="clusterDomain">cluster1</attribute>
<!-- While we will discover nearby peers automatically
without prior knowledge -->
<!-- of them, you can also add as many specific hosts as
you wish; these unicast -->
<!-- peers do not need to be nearby or reachable via
multicast. -->
<!--EXAMPLE: <attribute
name="unicastPeer">sneville</attribute> -->
<attribute name="unicastPeer">1.2.3.4</attribute>
<service class="jrunx.cluster.ClusterDeployerService"
name="ClusterDeployerService">
<attribute
name="deployDirectory">{jrun.server.rootdir}/SERVER-INF/cluster</attribute>
<attribute name="deactivated">false</attribute>
</service>
</service>
. . . and so on for each instance and cluster. This is where
the problem begins. When I start the instances, every instance
discovers every other instance as a cluster peer, regardless of
cluster domain.
Another forum user suggested using host:port, where port is
the JNDI listening port. That doesn't work. Using the Jini
listening port, however, does work, e.g.:
<attribute
name="unicastPeer">1.2.3.4:4160</attribute>
That presents another problem. The Jini listening port
defaults to 4160. If 4160 is taken, a port is chosen at random.
I can't find documentation on setting a static Jini listening
port, if that's even the correct action to take.
Thoughts?
From what I can tell, the version of Reggie (the Jini lookup
service) shipped with JRun only supports setting the unicast
listening port programmatically. Reggie is started by
jrunx.cluster.ClusterManager.init--actually, the private method
startLookupService--and JRun doesn't appear to ever call Reggie's
setUnicastPeer method.
Assuming we can't tweak Reggie, I guess a more appropriate
question is how do we get JRun's RMI service (?) to honor
groups/domains in a call to getPeers? I'll cross-post to the JRun
forums and investigate JRun Updater 6.
Trev

. . . and it appears I'm exposing my ignorance of Jini in
general. :-)
If I now understand the Jini discovery process correctly, a
multicast request includes one or more service IDs and one or more
groups. The registrar will respond if and only if its service ID is
not in the request and its group memberships exactly match one or
more of the groups in the request.
A unicast request includes nothing more than the protocol
version, and the registrar will respond as if a valid multicast
request had been received.
In both cases, the response packet includes a marshalled copy
of the ServiceRegistrar object and the names of all groups of which
the registrar is a member.
Without looking at more of JRun, I'm guessing that in some,
if not all cases, either JRun's discovery implementation assumes
that any response from a unicast query is valid, regardless of the
server IDs or group names received, or the logic that sorts out the
response isn't 100% correct.

Similar Messages

  • Cluster Node Unable to Maintain Cluster Membership

    My cluster logs are very similar to the above thread... was it ever addressed?
    [SV] Already protecting connection with message security level 'sign'
    [FTI] Stream already exists to node: false
    [Channel IP to another cluster node member] Close()
    GracefuleClose(1226) because of channel to remote endpoint another cluster node
    ~ is closed
    Cluster services stops and generates:
    The Kerberos client received a KRB_AP_ERR_MODIFIED error from the server serverName$. The target name used was
    serverName.
    This indicates that the target server failed to decrypt the ticket provided by the client. This can occur when the target server principal name (SPN) is registered on an account other than the account the target service is using. Ensure that the target SPN
    is only registered on the account used by the server.
    Roderick Lyons

    Hi Roderick Lyons,
    Could you tell us the exact URL “above thread” I am not very sure which thread you meaning.
     Please offer more information about your environment, such as, the DC server edition, the cluster node server edition.
    If you are 2003 and 2012R2 mixed DC environment please restart your cluster node then do the further monitor.
    The related article:
    It turns out that weird things can happen when you mix Windows Server 2003 and Windows Server 2012 R2 domain controllers
    http://blogs.technet.com/b/askds/archive/2014/07/23/it-turns-out-that-weird-things-can-happen-when-you-mix-windows-server-2003-and-windows-server-2012-r2-domain-controllers.aspx
    Can't log on after changing machine account password in mixed Windows Server 2012 R2 and Windows Server 2003 environment
    http://support.microsoft.com/kb/2989971
    From the current error another possible is you never run the cluster validation before you create the cluster, please run the cluster validation first then post
    the warning or error information.
    If above solution not work please consider reboot your PDC at unproductive time.
    More information:
    Kerberos Service Principal Name on Wrong Account
    https://support.microsoft.com/kb/2706695?wa=wsignin1.0
    Fixing the Security-Kerberos / 4 error
    http://blogs.technet.com/b/dcaro/archive/2013/07/04/fixing-the-security-kerberos-4-error.aspx
    Service Principal Names (SPNs) SetSPN Syntax (Setspn.exe)
    http://social.technet.microsoft.com/wiki/contents/articles/717.service-principal-names-spns-setspn-syntax-setspn-exe.aspx
    I’m glad to be of help to you!
    Please remember to mark the replies as answers if they help and unmark them if they provide no help. If you have feedback for TechNet Support, contact [email protected]

  • Cluster node networking

    I have five node Windows Server 2008 R2 Hyper-V cluster. I put one node to Maintance mode and all VMs migrated to other hosts. I pulled out LAN cables form that node for testing (one out, waited a litte, put it back and pulled second and so on) and put
    them right back on.
    After that I had a lot of cluster errors and some VMs restarted.
    I have put many times nodes on maintance mode and restarted / shut down them and never had any cluster problems. Why did I have now when I pulled out LAN cables?

    Hi antesl,
    The
     failover behavior occurs because the cluster node has detect the cluster resource or node fail, such as network, storage, please refer the following related KB to confirm there have no potential single point failure configuration in your
    cluster.
    Failover Cluster
    http://msdn.microsoft.com/en-us/library/ff650328.aspx
    Failover Cluster Step-by-Step Guide: Configuring the Quorum in a Failover Cluster
    http://technet.microsoft.com/zh-cn/library/cc770620(v=ws.10).aspx
    How a Server Cluster Works
    http://technet.microsoft.com/en-us/library/cc738051(v=ws.10).aspx
    HYPER-V 2008 R2 SP1 Best Practices (In Easy Checklist Form)
    http://blogs.technet.com/b/askpfeplat/archive/2012/11/19/hyper-v-2008-r2-sp1-best-practices-in-easy-checklist-form.aspx
    I’m glad to be of help to you!
    Please remember to mark the replies as answers if they help and unmark them if they provide no help. If you have feedback for TechNet Support, contact [email protected]

  • OES2 SP2a cluster node freeze

    Hi all.
    I have a 3 node cluster based on OES2 SP2a fully patched. There are a coupe of resources: Master_IP and a NSS volume.
    The cluster is virtualized on ESXi 4.1 fully patched, and vmware-tools are installed and up to date.
    If i do an "rcnetwork stop" on a node, it remains with no network for about 20 seconds, and then freezes. Does not reboot. Only freezes. The resource is balanced correctly, but the server remains hanged.
    This behaviour is the same on a server with a cluster resource on it and on a server with no cluster resource on it. Always hangs.
    The correct behaviour should be a reboot, shouldn't?
    Any hints?
    Thanks in advance.

    The node does not reboot because ....
    9.11 Preventing a Cluster Node Reboot after a Node Shutdown
    If LAN connectivity is lost between a cluster node and the other nodes in the cluster, it is possible that the lost node will be automatically shut down by the other cluster nodes. This is normal cluster operating behavior, and it prevents the lost node from trying to load cluster resources because it cannot detect the other cluster nodes. By default, cluster nodes are configured to reboot after an automatic shutdown.
    On certain occasions, you might want to prevent a downed cluster node from rebooting so you can troubleshoot problems.
    Section 9.11.1, OES 2 SP2 with Patches and Later
    Section 9.11.2, OES 2 SP2 Release Version and Earlier
    9.11.1 OES 2 SP2 with Patches and Later
    Beginning in the OES 2 SP2 Maintenance Patch for May 2010, the Novell Cluster Services reboot behavior conforms to the kernel panic setting for the Linux operating system. By default the kernel panic setting is set for no reboot after a node shutdown.
    You can set the kernel panic behavior in the /etc/sysctl.conf file by adding a kernel.panic command line. Set the value to 0 for no reboot after a node shutdown. Set the value to a positive integer value to indicate that the server should be rebooted after waiting the specified number of seconds. For information about the Linux sysctl, see the Linux man pages on sysctl and sysctl.conf.
    1.
    As the root user, open the /etc/sysctl.conf file in a text editor.
    2.
    If the kernel.panic token is not present, add it.
    kernel.panic = 0
    3.
    Set the kernel.panic value to 0 or to a positive integer value, depending on the desired behavior.
    No Reboot: To prevent an automatic cluster reboot after a node shutdown, set the kernel.panic token to value to 0. This allows the administrator to determine what caused the kernel panic condition before manually rebooting the server. This is the recommended setting.
    kernel.panic = 0
    Reboot: To allow a cluster node to reboot automatically after a node shutdown, set the kernel.panic token to a positive integer value that represents the seconds to delay the reboot.
    kernel.panic = <seconds>
    For example, to wait 1 minute (60 seconds) before rebooting the server, specify the following:
    kernel.panic = 60
    4.
    Save your changes.
    9.11.2 OES 2 SP2 Release Version and Earlier
    In OES 2 SP release version and earlier, you can modify the opt/novell/ncs/bin/ldncs file for the cluster to trigger the server to not automatically reboot after a shutdown.
    1.
    Open the opt/novell/ncs/bin/ldncs file in a text editor.
    2.
    Find the following line:
    echo -n $TOLERANCE > /proc/sys/kernel/panic
    3.
    Replace $TOLERANCE with a value of 0 to cause the server to not automatically reboot after a shutdown.
    4.
    After editing the ldncs file, you must reboot the server to cause the change to take effect.

  • Add cluster nodes from multiple machines to WebLogic domain in OEM 10.2.0.5

    Hello,
    I want to monitor a WebLogic domain in Oracle Enterprise Manager 10.2.0.5 with the following layout:
    - Admin server on machine 1
    - managed server, cluster node a on machine 2
    - managed server, cluster node b on machine 3
    How can I do this?
    When I go to "Add Weblogic Domain", I can enter the admin adress (machine 1) and tick the box to say that there is an agent running on another host (where I specify machine 2). However I do not see a possibility to discover managed servers from machine 3.
    Does anyone know how to do this?
    Thanks,
    Nadja

    LSNRCTL> status
    Connecting to (ADDRESS=(PROTOCOL=tcp)(HOST=)(PORT=1521))
    STATUS of the LISTENER
    Alias LISTENER
    Version TNSLSNR for Linux: Version 11.1.0.6.0 - Production
    Start Date 28-JAN-2010 00:36:10
    Uptime 0 days 17 hr. 11 min. 52 sec
    Trace Level off
    Security ON: Local OS Authentication
    SNMP OFF
    Listener Parameter File /oracle/app/oracle/product/11.1.0/db/network/admin/listener.ora
    Listener Log File /oracle/app/oracle/diag/tnslsnr/corp1052/listener/alert/log.xml
    Listening Endpoints Summary...
    (DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=corp1052)(PORT=1521)))
    Services Summary...
    Service "+ASM" has 1 instance(s).
    Instance "+ASM2", status READY, has 1 handler(s) for this service...
    Service "+ASM_XPT" has 1 instance(s).
    Instance "+ASM2", status READY, has 1 handler(s) for this service...
    Service "dex.example.com" has 2 instance(s).
    Instance "dex1", status READY, has 1 handler(s) for this service...
    Instance "dex2", status READY, has 2 handler(s) for this service...
    Service "dexXDB.example.com" has 2 instance(s).
    Instance "dex1", status READY, has 1 handler(s) for this service...
    Instance "dex2", status READY, has 1 handler(s) for this service...
    Service "dex_XPT.example.com" has 2 instance(s).
    Instance "dex1", status READY, has 1 handler(s) for this service...
    Instance "dex2", status READY, has 2 handler(s) for this service...
    The command completed successfully
    The output of SQLPlus:
    [oracle@dbhost: db]$ bin/sqlplus dex@DEX
    SQL*Plus: Release 11.1.0.6.0 - Production on Thu Jan 28 18:40:11 2010
    Copyright (c) 1982, 2007, Oracle. All rights reserved.
    Enter password:
    Connected to:
    Oracle Database 11g Enterprise Edition Release 11.1.0.6.0 - 64bit Production
    With the Partitioning, Real Application Clusters, OLAP, Data Mining
    and Real Application Testing options

  • Change of domain for the cluster nodes

    Hi,
    We need to change our existing domain i.e. NT domain to windows 2003 domain for the SAP production server cluster nodes.
    There are 2 nodes. They are in active active mode.
    Windows 2000 advanced server is installed in the nodes.
    Can anybody tell us that during the change of the domain what else we need to take care or what should be correct procedure for this so that database & application does not have any impact
    Eagerly waiting for the reply
    With best regards
    Joydeep

    I would NOT do that.
    What immediately comes to my mind is:
    - users need to be recreated in the new domain
    - filesystem owners and permissions of all SAP and database directories must be changed
    - permissions on registry keys must be changed
    - owner/permissions of the files in the cluster filesystem must be changed
    There is no easy way of doing this. Depending on the database that is used more things are necessary. If you don't exactly know, where which application has files, it will certainly be troublesome.
    The safest way would be to use a different hardware and do a homogeneous system copy by reinstalling the cluster in the new domain.
    Markus

  • SCVMM losing connection to cluster nodes

    Hey guys'n girls, I hope this is the right forum for this question. I already opened a ticket at MS support as well because it's impacting our production environment indirectly, but even after a week there's been no contact. Losing faith in MS support there
    The problem we're having is that scvmm is that a host enters the 'needs attention' state, with a winrm error 0x80338126. I guess it has something to do with the network or with Kerberos, and I've found some info on it, but I still haven't been able to solve
    it. Do you guys have any ideas?
    Problem summary:
    We are seeing an issue on our new hyper-v platform. The platform should have been in production last week, but this issue is delaying our project as we can't seem to get it stable.
    The problem we are experiencing is that SCVMM loses the connection to some of the Hyper-V nodes. Not one
     specific node. Last week it happened to two nodes, and today it happened to another node. I see issues with WinRM, and I expect something to do with kerberos. See the bottom of this post for background details and software versions.
    The host gets the status 'needs attention', and if you look at the status of the machine, WinRM gives an error. The error is:
    Error (2916)
    VMM is unable to complete the request. The connection to the agent cc1-hyp-10.domaincloud1.local was lost.
    WinRM: URL: [http://cc1-hyp-10.domaincloud1.local:5985], Verb: [ENUMERATE], Resource: [http://schemas.microsoft.com/wbem/wsman/1/wmi/root/cimv2/Win32_Service], Filter: [select * from Win32_Service where Name="WinRM"]
    Unknown error (0x80338126)
    Recommended Action
    Ensure that the Windows Remote Management (WinRM) service and the VMM agent are installed and running and that a firewall is not blocking HTTP/HTTPS traffic. Ensure that VMM server is able to communicate with cc1-hyp-10.domaincloud1.local over WinRM by successfully
    running the following command:
     winrm id –r:cc1-hyp-10.domaincloud1.local
    This
     problem can also be caused by a Windows Management Instrumentation (WMI) service crash. If the server is running Windows Server 2008 R2, ensure that KB 982293 (http://support.microsoft.com/kb/982293)
    is installed on it.
    If the error persists, restart cc1-hyp-10.domaincloud1.local and then try the operation again. /nRefer to
    http://support.microsoft.com/kb/2742275 for more details.
    Doing a simple test from the VMM server to the problematic cluster node shows this error:
    PS C:\> hostname
    CC1-VMM-01
    PS C:\> winrm id -r:cc1-hyp-10.domaincloud1.local
    WSManFault
        Message = WinRM cannot complete the operation. Verify that the specified computer name is valid, that the computer is accessible over the network, and that a firewall exception for the WinRM service is enabled and allows access from this
    computer. By default, the WinRM firewall exception for public profiles limits access to remote computers within the same local subnet.
    Error number:  -2144108250 0x80338126
    WinRM cannot complete the operation. Verify that the specified computer name is valid, that the computer is accessible over the network, and that a firewall exception for the WinRM service is enabled and allows access from this computer. By default, the WinRM
    firewall exception for public profiles limits access to remote computers within the same local subnet.
    I CAN connect from other hosts to this problematic cluster node:
    PS C:\> hostname
    CC1-HYP-16
    PS C:\> winrm id -r:cc1-hyp-10.domaincloud1.local
    IdentifyResponse
        ProtocolVersion =
    http://schemas.dmtf.org/wbem/wsman/1/wsman.xsd
        ProductVendor = Microsoft Corporation
        ProductVersion = OS: 6.3.9600 SP: 0.0 Stack: 3.0
        SecurityProfiles
            SecurityProfileName =
    http://schemas.dmtf.org/wbem/wsman/1/wsman/secprofile/http/spnego-kerberos
    And I can connect from the vmm server to all other cluster nodes:
    PS C:\> hostname
    CC1-VMM-01
    PS C:\> winrm id -r:cc1-hyp-11.domaincloud1.local
    IdentifyResponse
        ProtocolVersion =
    http://schemas.dmtf.org/wbem/wsman/1/wsman.xsd
        ProductVendor = Microsoft Corporation
        ProductVersion = OS: 6.3.9600 SP: 0.0 Stack: 3.0
        SecurityProfiles
            SecurityProfileName =
    http://schemas.dmtf.org/wbem/wsman/1/wsman/secprofile/http/spnego-kerberos
    So at this point only the test from the cc1-vmm-01 to cc1-hyp-10 seems to be problematic.
    I followed the steps in the page
    https://support.microsoft.com/kb/2742275 (which is referred to above). I tried the VMMCA, but it can't really get it working the way I want, or it seems to give outdated recommendations.
    I tried checking for duplicate SPN's by running setspn -x on affected machines. No results (although I do not understand
     what an SPN is or how it works). I rebuilt the performance counters.
    It tried setting 'sc config winrm type= own' as described in [http://blinditandnetworkadmin.blogspot.nl/2012/08/kb-how-to-troubleshoot-needs-attention.html].
    If I reboot this cc1-hyp-10 machine, it will start working perfectly again. However, then I can't troubleshoot the issue, and it will happen again.
    I want this problem to be solved, so vmm never loses connection to the hypervisors it's managing again!
    Background information:
    We've set up a platform with Hyper-V to run a VM workload. The platform consists of the following hardware:
    2 Dell R620's with 32GB of RAM, running hyper-v to virtualize the cloud management layer (DC's, VMM, SQL). These machines are called cc1-hyp-01 and cc1-hyp-02. They run the management vm's like cc1-dc-01/02, cc1-sql-01, cc1-vmm-01, etc. The names are self-explanatory.
    The VMM machine is NOT clustered.
    8 Dell M620 blades with 320GB of RAM, running hyper-v to virtualize the customer workload. The machines are
    called cc1-hyp-10 until cc1-hyp-17. They are in a cluster.
    2 Equallogic units form a SAN (premium storage), and we have a Dell R515 running iscsi target (budget storage).
    We have Dell Force10 switches and Cisco C3750X switches to connect everything together (mostly 10GB links).
    All hosts run Windows Server 2012R2 Datacenter edition. The VMM server runs System Center Virtual Machine Manage 2012 R2.
    All the latest Windows updates are installed on every host. There are no firewalls between any host (vmm and hypervisors) at this level. Windows firewalls are all disabled. No antivirus software is installed, no symantec software is installed.
    The only non-standard software that is installed is the Dell Host Integration Tools 4.7.1, Dell Openmanage Server Administrator, and some small stuff like 7-zip, bginfo, net-snap, etc.
    The SCVMM service is running under the domain account DOMAINCLOUD1\scvmm. This machine is in the local administrators group of each cluster node.
    On top of this cloud layer we're running the tenant layer with a lot of vm's for a specific customer (although they are all off now).

    I think I found the culprit, after an hour of analyzing wireshark dumps I found the vmm had jumbo frames enabled on the management interface to the hosts (and the underlying infrastructure does not).. Now my winrm commands started working again.

  • Unable to failover the services in active-active cluster node

    Hi,
    i am applying the sp2 patch for sql server 2008 r2 in active-active cluster, we have 3 services in the cluster , node 1 as 2 prefered owner and node 2 as 1 prefered owner, when i try to move the service from node 2 to node1 , i am getting the below errors
    DCOM was unable to communicate with the computer XXXXXXXXX using any of the configured protocols.
    The Kerberos client received a KRB_AP_ERR_MODIFIED error from the server XXXXXXXXX. The target name used was RPCSS/XXXXXX. This indicates that the target server failed to decrypt the ticket provided by the client. This can occur when the target server principal
    name (SPN) is registered on an account other than the account the target service is using. Please ensure that the target SPN is registered on, and only registered on, the account used by the server. This error can also happen when the target service is using
    a different password for the target service account than what the Kerberos Key Distribution Center (KDC) has for the target service account. Please ensure that the service on the server and the KDC are both updated to use the current password. If the server
    name is not fully qualified, and the target domain (XXXXXX) is different from the client domain (XXXXXXX), check if there are identically named server accounts in these two domains, or use the fully-qualified name to identify the server.
    The Cluster service failed to bring clustered service or application 'CHCROCHC045' completely online or offline. One or more resources may be in a failed state. This may impact the availability of the clustered service or application.
    Cluster resource 'SQL Server (CHCROCHC045)' in clustered service or application 'CHCROCHC045' failed.
    any inputs appreciated to resolve this issue as i could not procedd with patching
    BR
    PGR

    Hi PGR,
    As the issue is more related to Windows Server, I would like to recommend you post the issue in the
    Windows Server forums for better support.
    In addition, below are some article about troubleshooting error ” DCOM was unable to communicate with the computer XXXXXXXXX using any of the configured protocols” for your reference.
    Event ID 10009 — COM Remote Service Availability
    How to troubleshoot DCOM 10009 error logged in system event?
    Thanks,
    Lydia Zhang
    Lydia Zhang
    TechNet Community Support

  • Property nodes to items in cluster are FRAGILE

    This issue has bugged me for a long time .  I just got bit again and I wonder if anyone else has a workaround.
    I just checked and LV 2009 has it too.
    Create a custom control of a cluster with three elements named A, B, C (numeric controls, booleans, whatever, it doesn't matter).
    Make it a TYPE DEF (non-strict) and save it.
    Put an instance of it on a new VI panel.
    Create a PROPERTY NODE for item B and set it to DISABLED property.
    Wire a constant to that property node.
    Now, if you run the VI, it sets the DISABLED property of item B to the value of the constant.  Fine.
    Now go to the TYPEDEF, and add another item, called A2 to the cluster.
    Re-arrange the cluster order so that it's A, A2, B, C, and save the type def.
    Look at your diagram.  The property node is no longer linked to B, it's now linked to A2.
    Apparently, LV uses the cluster order internally to keep track of the links.  So now I'm linked to A2, not B.
    If A2 was a different type of object from B, then you MIGHT get lucky and the diagram breaks. At least then, you can see the fact that it changed.
    But almost everything has a DISABLED and a VISIBLE property, if that's the one you're using, then you won't notice that it was changed behind your back.
    I have taken to adding a free label with the name of the expected control (in parentheses) next to the property node, along with the true label.  So if I see a discrepancy between the true label and the free label, I can recognize such a case.
    But that's only if I remember that this crap happens.
    Anybody got any better ideas?  How can I prevent, or at least recognize, such an unwanted change?
    Bonus question: the same thing happens with events, for the same reason (I suppose).  If I tied an event to item B, that event switches to item A2 behind my back.
    Steve Bird
    Culverson Software - Elegant software that is a pleasure to use.
    Culverson.com
    Blog for (mostly LabVIEW) programmers: Tips And Tricks

    CoastalMaineBird wrote:
    This issue has bugged me for a long time .  I just got bit again and I wonder if anyone else has a workaround.
    I just checked and LV 2009 has it too.
    Create a custom control of a cluster with three elements named A, B, C (numeric controls, booleans, whatever, it doesn't matter).
    Make it a TYPE DEF (non-strict) and save it.
    Put an instance of it on a new VI panel.
    Create a PROPERTY NODE for item B and set it to DISABLED property.
    Wire a constant to that property node.
    Now, if you run the VI, it sets the DISABLED property of item B to the value of the constant.  Fine.
    Now go to the TYPEDEF, and add another item, called A2 to the cluster.
    Re-arrange the cluster order so that it's A, A2, B, C, and save the type def.
    Look at your diagram.  The property node is no longer linked to B, it's now linked to A2.
    Apparently, LV uses the cluster order internally to keep track of the links.  So now I'm linked to A2, not B.
    If A2 was a different type of object from B, then you MIGHT get lucky and the diagram breaks. At least then, you can see the fact that it changed.
    But almost everything has a DISABLED and a VISIBLE property, if that's the one you're using, then you won't notice that it was changed behind your back.
    I have taken to adding a free label with the name of the expected control (in parentheses) next to the property node, along with the true label.  So if I see a discrepancy between the true label and the free label, I can recognize such a case.
    But that's only if I remember that this crap happens.
    Anybody got any better ideas?  How can I prevent, or at least recognize, such an unwanted change?
    Bonus question: the same thing happens with events, for the same reason (I suppose).  If I tied an event to item B, that event switches to item A2 behind my back.
    For the disabled property inside of cluster whos definition hcanges... we are screuued.
    For the events your idea to include the text name, is the same technique that Jim Kring shared with us when he discovered the same thing can happen with dynamic event references. I belive that was CAR'd.
    No solutions so no bonus points.
    Ben
    Ben Rayner
    I am currently active on.. MainStream Preppers
    Rayner's Ridge is under construction

  • How to address message to the specific node of the ESB cluster

    Hi!
    We are running ESB on two node active-active cluster configuration. We use custom adapter to communicate with customer legacy system over Socket connection. The point is, that we need to transfer large amount of data from our database into customer system using one open socket connection. Because of memory limitation we had to split data into multiple messages in ESB. This socket connection can be open from any of the active cluster node, but ones it's open all messages must be send within the node which had open the connection.So, the messages should not arrive into another node. The question: How is it possible to address (route) ESB messages into specific node of the cluster?
    Thanks in advance.
    Jaro,

    Muthu92 wrote:
    Thank you for your reply. It is very useful for me.  But i can't give conditional operator in that operator. It is showing error.
    My Test cases
    A[0]+A[1]>150
    A[31]-50 <100.
    If those are your exact statements, then they don't make sense in a formula node.  You have to use them in an if-statement or some other type of control logic.  This mimics text-based programming languages such as C... See here for instructions on formula node syntax: http://zone.ni.com/reference/en-XX/help/371361G-01​/lvhowto/formula_node_syntax/
    For instance, you COULD say something like:
    **In this example, let A[0] == x, A[1] == y, A[31] == z**
    if(x+y>150)
    /* do something if this is true */
    else if(z-50 < 100)
    /* do something else */
    EDIT: 
    It should also be noted that Variable Names MUST be lowercase!! I just discovered this
    EDIT2:
    I stand corrected.  After trying to create an example and digging through the LabVIEW Help, I stumbled across this:
    Differences between the Parser in the Mathematics VIs and the Formula Node
    The parser in the Mathematics VIs supports all elements that Formula Nodes support with the following exceptions:
    Variables—Only a, a0, ..., a9, ... z, z0, ..., z9, are valid.
    Logical, conditional, inequality, equality—?:,, &&, !=, ==, <, >, <=, and >= are not valid.
    Functions—atan2, max, min, mod, pow, rem, and sizeOfDim are not valid. You can use these functions in a Formula Node or use their corresponding LabVIEW functions.
    So it DOES NOT seem like Eval Formula Node can do conditionals like you want.  I would just implement them using a LabVIEW based algorithm after all.  Trying to out smart the formula nodes to avoid simple LabVIEW code is just too much of a hassle
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    If someone helped you out, please select their post as the solution and/or give them Kudos!

  • SQL LOG Backup failed in one Cluster Node

    I have 02 node SQL fail over cluster, NOD01 and NODE 02. and configure SQL log backup job via SQL Logshipping
    When the sql service is mounted to node 02 job backup will work without any issues, Once its connected to node 01 this will provide below issue
    Executed as user: <domain>\administrator. The process could not be created for step 1 of job 0xAC90A0F3623AE44285089E9EF53B12C7 (reason: The system cannot find the file specified).  The step failed.
    could anyone have on fix for this
    Thanx

    SQL Server Agent on both nodes run under same domain account?
    Are you sure that path location is correct?
    Best Regards,Uri Dimant SQL Server MVP,
    http://sqlblog.com/blogs/uri_dimant/
    MS SQL optimization: MS SQL Development and Optimization
    MS SQL Consulting:
    Large scale of database and data cleansing
    Remote DBA Services:
    Improves MS SQL Database Performance
    SQL Server Integration Services:
    Business Intelligence

  • Re-installing Hyper-V 2012 R2 cluster node

    We have a four HP BL460 Gen8 servers acting as a part of Hyper-V Cluster, running Windows Server 2012 R2 Datacenter.
    Storage is provided by two node 3PAR StoreServ 7400.
    All network and fc connections are managed by HP Virtual Connect.
    One of the four nodes crashed during HP SPP upgrade which resulted as non booting OS.
    I managed to get the OS alive by running multiple check disks and by manually restoring registry hives from backup via Windows 7 installation media's recovery console.
    After the recovery there were still some issues with filesystem. Corrupted, orphaned and missing files here and there.
    Now I want to re-install the OS from scratch to make sure everything will work correctly and to avoid any future errors.
    What I need to know is that is the best practice to re-install the OS with new computername, or should I drop the current OS to workgroup, re-install it and join the AD domain with same computer name? I've already evicted the node from Hyper-V cluster
    but the server is still running as a member server on AD.
    Any other things I should take into consideration before doing the re-installation?
    Thanks in advance!

    I agree that after a major problem it is much safer to rebuild the system.  It sounds like you have the node rebuilt, so I would evict it from the cluster and then remove it from the domain. Rebuild it and you can use the same name because those two
    actions will clean up its 'footprints'.
    If the machine were not running, you would still evict the node from the cluster, but you would need to go into Active Directory to delete the computer account.  Then rebuild.
    . : | : . : | : . tim

  • Installing First cluster node cluster service offline

    Hi ,
    While i'm trying to install the first cluster node on Node1 cluster disk shows online but the cluster service shows offline .
    as shown below in the screenshot .
    I have  run the prerequisite checker on both the nodes but  it was successful kindly give your input to resolve this issue 

    you are performing this actions as a domain administrator?
    If not repeat the stuff you want to perform as domain administrator.
    kind regards
    Peter

  • Error while getting cluster node subtree

    Hi,
      We are on SP15.
    The console logs show the following error
    log generation timestamp : 2006_01_17_at_17_14_05
    java.rmi.RemoteException: Error while getting cluster node subtree of :name=ClusterNodeRepresentative,j2eeType=com.sap.engine.services.adminadapter.impl.ClusterNodeRepresentative,SAP_J2EEClusterNode=2053400,SAP_J2EECluster=""; nested exception is:
         com.sap.engine.services.jmx.exception.MBeanServerClusterException: Exception during invocation of remote MBeanServer method, target node: 2053400
         at com.sap.engine.services.adminadapter.impl.ConvenienceEngineAdministratorImpl.getClusterNodeSubTree(ConvenienceEngineAdministratorImpl.java:242)
         at com.sap.engine.services.adminadapter.impl.ConvenienceEngineAdministratorImplp4_Skel.dispatch(ConvenienceEngineAdministratorImplp4_Skel.java:99)
         at com.sap.engine.services.rmi_p4.DispatchImpl._runInternal(DispatchImpl.java:304)
         at com.sap.engine.services.rmi_p4.DispatchImpl._run(DispatchImpl.java:193)
         at com.sap.engine.services.rmi_p4.server.P4SessionProcessor.request(P4SessionProcessor.java:122)
         at com.sap.engine.core.service630.context.cluster.session.ApplicationSessionMessageListener.process(ApplicationSessionMessageListener.java:33)
         at com.sap.engine.core.cluster.impl6.session.MessageRunner.run(MessageRunner.java:41)
         at com.sap.engine.core.thread.impl3.ActionObject.run(ActionObject.java:37)
         at java.security.AccessController.doPrivileged(Native Method)
         at com.sap.engine.core.thread.impl3.SingleThread.execute(SingleThread.java:100)
         at com.sap.engine.core.thread.impl3.SingleThread.run(SingleThread.java:170)
    Caused by: com.sap.engine.services.jmx.exception.MBeanServerClusterException: Exception during invocation of remote MBeanServer method, target node: 2053400
         at com.sap.engine.services.jmx.ClusterInterceptor.invoke(ClusterInterceptor.java:816)
         at com.sap.pj.jmx.server.interceptor.MBeanServerInterceptorChain.invoke(MBeanServerInterceptorChain.java:330)
         at com.sap.engine.services.adminadapter.impl.ConvenienceEngineAdministratorImpl.getClusterNodeSubTree(ConvenienceEngineAdministratorImpl.java:239)
         ... 10 more
    Caused by: com.sap.engine.services.jmx.exception.JmxConnectorException: Unable to de-serialize request parameters, message [ JMX request (java) v1.0 len: 345 |  src: cluster target-node: 2053400 req: invoke params-number: 4 params-bytes: 0 | :name=ClusterNodeRepresentative,j2eeType=com.sap.engine.services.adminadapter.impl.ClusterNodeRepresentative,SAP_J2EEClusterNode=2053400,SAP_J2EECluster="" null null null ]
         at com.sap.engine.services.jmx.MBeanServerConnectionImpl.invokeMbsInternal(MBeanServerConnectionImpl.java:680)
         at com.sap.engine.services.jmx.MBeanServerConnectionImpl.invoke(MBeanServerConnectionImpl.java:467)
         at com.sap.engine.services.jmx.MBeanServerConnectionSecurityWrapper.invoke(MBeanServerConnectionSecurityWrapper.java:221)
         at com.sap.engine.services.jmx.ClusterInterceptor.invoke(ClusterInterceptor.java:813)
         ... 12 more
    Caused by: javax.management.InstanceNotFoundException: MBean with name com.sap.default:name=ClusterNodeRepresentative,j2eeType=com.sap.engine.services.adminadapter.impl.ClusterNodeRepresentative,SAP_J2EEClusterNode=2053400,SAP_J2EECluster=XD1 not found in repository
         at com.sap.pj.jmx.server.MBeanServerImpl.getClassLoaderFor(MBeanServerImpl.java:1408)
         at com.sap.pj.jmx.server.interceptor.MBeanServerWrapperInterceptor.getClassLoaderFor(MBeanServerWrapperInterceptor.java:455)
         at com.sap.engine.services.jmx.CompletionInterceptor.getClassLoaderFor(CompletionInterceptor.java:567)
         at com.sap.pj.jmx.server.interceptor.BasicMBeanServerInterceptor.getClassLoaderFor(BasicMBeanServerInterceptor.java:438)
         at com.sap.jmx.provider.ProviderInterceptor.getClassLoaderFor(ProviderInterceptor.java:330)
         at com.sap.engine.services.jmx.RedirectInterceptor.getClassLoaderFor(RedirectInterceptor.java:501)
         at com.sap.pj.jmx.server.interceptor.MBeanServerInterceptorChain.getClassLoaderFor(MBeanServerInterceptorChain.java:443)
         at com.sap.engine.services.jmx.RequestMessage.readParams(RequestMessage.java:523)
         at com.sap.engine.services.jmx.RequestMessage.getParams(RequestMessage.java:578)
         at com.sap.engine.services.jmx.MBeanServerInvoker.invokeMbs(MBeanServerInvoker.java:106)
         at com.sap.engine.services.jmx.JmxServiceConnectorServer.receiveWait(JmxServiceConnectorServer.java:173)
         at com.sap.engine.core.service630.context.cluster.message.MessageListenerWrapper.process(MessageListenerWrapper.java:81)
         at com.sap.engine.core.cluster.impl6.ms.MSListenerThread.run(MSListenerThread.java:47)
         at com.sap.engine.frame.core.thread.Task.run(Task.java:64)
         at com.sap.engine.core.thread.impl6.SingleThread.execute(SingleThread.java:78)
         at com.sap.engine.core.thread.impl6.SingleThread.run(SingleThread.java:148)
    java.lang.NullPointerException
         at com.sap.engine.services.adminadapter.gui.ClusterView.addGlobalDispatcherServiceProperties(ClusterView.java:455)
         at com.sap.engine.services.adminadapter.gui.ClusterView.createGlobalTrees(ClusterView.java:508)
         at com.sap.engine.services.adminadapter.gui.ClusterView.access$1200(ClusterView.java:29)
         at com.sap.engine.services.adminadapter.gui.ClusterView$4.run(ClusterView.java:420)
    java.rmi.RemoteException: Error while getting cluster node subtree of :name=ClusterNodeRepresentative,j2eeType=com.sap.engine.services.adminadapter.impl.ClusterNodeRepresentative,SAP_J2EEClusterNode=2053400,SAP_J2EECluster=""; nested exception is:
         com.sap.engine.services.jmx.exception.MBeanServerClusterException: Exception during invocation of remote MBeanServer method, target node: 2053400
         at com.sap.engine.services.adminadapter.impl.ConvenienceEngineAdministratorImpl.getClusterNodeSubTree(ConvenienceEngineAdministratorImpl.java:242)
         at com.sap.engine.services.adminadapter.impl.ConvenienceEngineAdministratorImplp4_Skel.dispatch(ConvenienceEngineAdministratorImplp4_Skel.java:99)
         at com.sap.engine.services.rmi_p4.DispatchImpl._runInternal(DispatchImpl.java:304)
         at com.sap.engine.services.rmi_p4.DispatchImpl._run(DispatchImpl.java:193)
         at com.sap.engine.services.rmi_p4.server.P4SessionProcessor.request(P4SessionProcessor.java:122)
         at com.sap.engine.core.service630.context.cluster.session.ApplicationSessionMessageListener.process(ApplicationSessionMessageListener.java:33)
         at com.sap.engine.core.cluster.impl6.session.MessageRunner.run(MessageRunner.java:41)
         at com.sap.engine.core.thread.impl3.ActionObject.run(ActionObject.java:37)
         at java.security.AccessController.doPrivileged(Native Method)
         at com.sap.engine.core.thread.impl3.SingleThread.execute(SingleThread.java:100)
         at com.sap.engine.core.thread.impl3.SingleThread.run(SingleThread.java:170)
    Caused by: com.sap.engine.services.jmx.exception.MBeanServerClusterException: Exception during invocation of remote MBeanServer method, target node: 2053400
         at com.sap.engine.services.jmx.ClusterInterceptor.invoke(ClusterInterceptor.java:816)
         at com.sap.pj.jmx.server.interceptor.MBeanServerInterceptorChain.invoke(MBeanServerInterceptorChain.java:330)
         at com.sap.engine.services.adminadapter.impl.ConvenienceEngineAdministratorImpl.getClusterNodeSubTree(ConvenienceEngineAdministratorImpl.java:239)
         ... 10 more
    Caused by: com.sap.engine.services.jmx.exception.JmxConnectorException: Unable to de-serialize request parameters, message [ JMX request (java) v1.0 len: 345 |  src: cluster target-node: 2053400 req: invoke params-number: 4 params-bytes: 0 | :name=ClusterNodeRepresentative,j2eeType=com.sap.engine.services.adminadapter.impl.ClusterNodeRepresentative,SAP_J2EEClusterNode=2053400,SAP_J2EECluster="" null null null ]
         at com.sap.engine.services.jmx.MBeanServerConnectionImpl.invokeMbsInternal(MBeanServerConnectionImpl.java:680)
         at com.sap.engine.services.jmx.MBeanServerConnectionImpl.invoke(MBeanServerConnectionImpl.java:467)
         at com.sap.engine.services.jmx.MBeanServerConnectionSecurityWrapper.invoke(MBeanServerConnectionSecurityWrapper.java:221)
         at com.sap.engine.services.jmx.ClusterInterceptor.invoke(ClusterInterceptor.java:813)
         ... 12 more
    Caused by: javax.management.InstanceNotFoundException: MBean with name com.sap.default:name=ClusterNodeRepresentative,j2eeType=com.sap.engine.services.adminadapter.impl.ClusterNodeRepresentative,SAP_J2EEClusterNode=2053400,SAP_J2EECluster=XD1 not found in repository
         at com.sap.pj.jmx.server.MBeanServerImpl.getClassLoaderFor(MBeanServerImpl.java:1408)
         at com.sap.pj.jmx.server.interceptor.MBeanServerWrapperInterceptor.getClassLoaderFor(MBeanServerWrapperInterceptor.java:455)
         at com.sap.engine.services.jmx.CompletionInterceptor.getClassLoaderFor(CompletionInterceptor.java:567)
         at com.sap.pj.jmx.server.interceptor.BasicMBeanServerInterceptor.getClassLoaderFor(BasicMBeanServerInterceptor.java:438)
         at com.sap.jmx.provider.ProviderInterceptor.getClassLoaderFor(ProviderInterceptor.java:330)
         at com.sap.engine.services.jmx.RedirectInterceptor.getClassLoaderFor(RedirectInterceptor.java:501)
         at com.sap.pj.jmx.server.interceptor.MBeanServerInterceptorChain.getClassLoaderFor(MBeanServerInterceptorChain.java:443)
         at com.sap.engine.services.jmx.RequestMessage.readParams(RequestMessage.java:523)
         at com.sap.engine.services.jmx.RequestMessage.getParams(RequestMessage.java:578)
         at com.sap.engine.services.jmx.MBeanServerInvoker.invokeMbs(MBeanServerInvoker.java:106)
         at com.sap.engine.services.jmx.JmxServiceConnectorServer.receiveWait(JmxServiceConnectorServer.java:173)
         at com.sap.engine.core.service630.context.cluster.message.MessageListenerWrapper.process(MessageListenerWrapper.java:81)
         at com.sap.engine.core.cluster.impl6.ms.MSListenerThread.run(MSListenerThread.java:47)
         at com.sap.engine.frame.core.thread.Task.run(Task.java:64)
         at com.sap.engine.core.thread.impl6.SingleThread.execute(SingleThread.java:78)
         at com.sap.engine.core.thread.impl6.SingleThread.run(SingleThread.java:148)
    java.lang.NullPointerException
         at com.sap.engine.services.adminadapter.gui.ClusterView.addGlobalDispatcherServiceProperties(ClusterView.java:455)
         at com.sap.engine.services.adminadapter.gui.ClusterView.createGlobalTrees(ClusterView.java:508)
         at com.sap.engine.services.adminadapter.gui.ClusterView.access$1200(ClusterView.java:29)
         at com.sap.engine.services.adminadapter.gui.ClusterView$4.run(ClusterView.java:420)
    Any clue whats it?
    rgds

    Go the same error
    + /usr/java14_64/bin/java -showversion -Duser.language=en -DP4ClassLoad=P4Connection -Dp4Cache=clean -jar go.jar
    java version "1.4.2"
    Java(TM) 2 Runtime Environment, Standard Edition (build 1.4.2)
    Classic VM (build 1.4.2, J2RE 1.4.2 IBM AIX 5L for PowerPC (64 bit JVM) build caix64142ifx-20061222 (ifix 113727: SR7 + 112603) (JIT enabled: jitc))
    java.lang.NullPointerException
            at com.sap.engine.services.adminadapter.gui.ClusterView$4.run(ClusterView.java:405)
    Need some help!
    Bernard

  • High CPU usage with Coherence 3.5.0 cluster node

    We're noticing sustained high CPU usage on one of our Coherence nodes - this happens after running a job that pretty intensively interacts with the cache. The jobs complete, however we still see the Coherence process using approximately the equivalent of one core's worth of CPU. I took a couple of thread dumps about fifteen minutes apart, and the only noticeable differences are within the PacketListener1 and PacketListenerN threads - they're holding locks on different DatagramPacket instances, which suggests to be that this may be the piece of code that is looping.
    There are two other cache nodes in this particular cluster - both of them on a different machine - and neither of them are exhibiting the same CPU utilization.
    Environment:
    Coherence 3.5.0
    Java - BEA JRockit(R) (build R27.6.3-40_o-112056-1.6.0_11-20090318-2103-linux-x86_64, compiled mode)
    Has anyone encountered this scenario before?
    Here's a copy of the two thread dumps:
    Thread Dump 1 -
    ===== FULL THREAD DUMP ===============
    Thu Feb 24 21:45:00 2011
    BEA JRockit(R) R27.6.3-40_o-112056-1.6.0_11-20090318-2103-linux-x86_64
    "Main Thread" id=1 idx=0x4 tid=18143 prio=5 alive, in native, waiting
    -- Waiting for notification on: java/lang/Class@0x43587b58[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/net/DefaultCacheServer.main(DefaultCacheServer.java:80)
    ^-- Lock released while waiting: java/lang/Class@0x43587b58[fat lock]
    at com/zzzz/carbon/cacheserver/ZzzzCoherenceServerStartup.doWork(ZzzzCoherenceServerStartup.java:29)
    at com/zzzz/util/runner/ZzzzRunnerBase.run(ZzzzRunnerBase.java:23)
    at com/zzzz/carbon/cacheserver/ZzzzCoherenceServerStartup.main(ZzzzCoherenceServerStartup.java:16)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "(Signal Handler)" id=2 idx=0x8 tid=18144 prio=5 alive, in native, daemon
    "(GC Main Thread)" id=3 idx=0xc tid=18145 prio=5 alive, in native, daemon
    "(GC Worker Thread 1)" id=? idx=0x10 tid=18146 prio=5 alive, in native, daemon
    "(GC Worker Thread 2)" id=? idx=0x14 tid=18147 prio=5 alive, in native, daemon
    "(GC Worker Thread 3)" id=? idx=0x18 tid=18148 prio=5 alive, in native, daemon
    "(GC Worker Thread 4)" id=? idx=0x1c tid=18149 prio=5 alive, in native, daemon
    "(Code Generation Thread 1)" id=4 idx=0x20 tid=18150 prio=5 alive, in native, native_waiting, daemon
    "(Code Optimization Thread 1)" id=5 idx=0x24 tid=18151 prio=5 alive, in native, native_waiting, daemon
    "(VM Periodic Task)" id=6 idx=0x28 tid=18152 prio=10 alive, in native, daemon
    "Finalizer" id=7 idx=0x2c tid=18153 prio=8 alive, in native, native_waiting, daemon
    at jrockit/memory/Finalizer.waitForFinalizees([Ljava/lang/Object;)I(Native Method)
        at jrockit/memory/Finalizer.access$500(Finalizer.java:12)
        at jrockit/memory/Finalizer$4.run(Finalizer.java:159)
        at java/lang/Thread.run(Thread.java:619)
        at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
        -- end of trace
    "Reference Handler" id=8 idx=0x30 tid=18154 prio=10 alive, in native, native_waiting, daemon
        at java/lang/ref/Reference.waitForActivatedQueue()Ljava/lang/ref/Reference;(Native Method)
        at java/lang/ref/Reference.access$100(Reference.java:11)
        at java/lang/ref/Reference$ReferenceHandler.run(Reference.java:79)
        at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
        -- end of trace
    "(Sensor Event Thread)" id=9 idx=0x34 tid=18155 prio=5 alive, in native, daemon
    "Logger@9217551 3.5/459" id=12 idx=0x38 tid=18158 prio=3 alive, in native, waiting, daemon
        -- Waiting for notification on: com/tangosol/coherence/component/application/console/Coherence$Logger$Queue@0x4e1faef8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/application/console/Coherence$Logger$Queue@0x4e1faef8[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketListener1" id=14 idx=0x3c tid=18159 prio=8 alive, in native, daemon
    at java/net/PlainDatagramSocketImpl.receive0(Ljava/net/DatagramPacket;)V(Native Method)
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb21360[recursive]
    at java/net/PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:136)[inlined]
    at java/net/DatagramSocket.receive(DatagramSocket.java:712)[optimized]
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb21360[thin lock]
    ^-- Holding lock: java/net/DatagramPacket@0x73aa4e80[thin lock]
    ^-- Holding lock: java/net/DatagramSocket@0x4cb21338[thin lock]
    at com/tangosol/coherence/component/net/socket/UdpSocket.receive(UdpSocket.CDB:20)[optimized]
    at com/tangosol/coherence/component/net/UdpPacket.receive(UdpPacket.CDB:4)[optimized]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketListener.onNotify(PacketListener.CDB:19)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketReceiver" id=15 idx=0x40 tid=18160 prio=7 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/net/Cluster$PacketReceiver$InQueue@0x4cf60d58[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketReceiver.onWait(PacketReceiver.CDB:2)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketReceiver$InQueue@0x4cf60d58[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketPublisher" id=16 idx=0x44 tid=18161 prio=6 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/net/Cluster$PacketPublisher$InQueue@0x4ca053a8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketPublisher.onWait(PacketPublisher.CDB:2)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketPublisher$InQueue@0x4ca053a8[fat lock]
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketPublisher$InQueue@0x4ca053a8[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketSpeaker" id=17 idx=0x48 tid=18162 prio=8 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/net/Cluster$PacketSpeaker$BundlingQueue@0x4cabe9d8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/queue/ConcurrentQueue.waitForEntry(ConcurrentQueue.CDB:16)
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketSpeaker$BundlingQueue@0x4cabe9d8[fat lock]
    at com/tangosol/coherence/component/util/queue/ConcurrentQueue.remove(ConcurrentQueue.CDB:7)
    at com/tangosol/coherence/component/util/Queue.remove(Queue.CDB:1)
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketSpeaker.onNotify(PacketSpeaker.CDB:62)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketListenerN" id=18 idx=0x4c tid=18163 prio=8 alive, in native, daemon
    at java/net/PlainDatagramSocketImpl.receive0(Ljava/net/DatagramPacket;)V(Native Method)
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb221b0[recursive]
    at java/net/PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:136)[inlined]
    at java/net/DatagramSocket.receive(DatagramSocket.java:712)[optimized]
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb221b0[thin lock]
    ^-- Holding lock: java/net/DatagramPacket@0x73e59890[thin lock]
    ^-- Holding lock: java/net/MulticastSocket@0x4cb22178[thin lock]
    at com/tangosol/coherence/component/net/socket/UdpSocket.receive(UdpSocket.CDB:20)[optimized]
    at com/tangosol/coherence/component/net/UdpPacket.receive(UdpPacket.CDB:4)[optimized]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketListener.onNotify(PacketListener.CDB:19)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "Cluster|Member(Id=2, Timestamp=2011-02-23 16:25:36.488, Address=10.10.100.77:8095, MachineId=25677, Location=site:emtzzzz.com,machine:njdev04,process:18142,member:cldev201~carboncachelauncher~181)" id=19 idx=0x50 tid=18164 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4cac64d8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/service/Grid.onWait(Grid.CDB:9)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4cac64d8[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "TcpRingListener" id=20 idx=0x54 tid=18165 prio=6 alive, in native, daemon
    at java/net/PlainSocketImpl.socketAccept(Ljava/net/SocketImpl;)V(Native Method)
    at java/net/PlainSocketImpl.accept(PlainSocketImpl.java:384)
    ^-- Holding lock: java/net/SocksSocketImpl@0x4cb227d8[thin lock]
    at java/net/ServerSocket.implAccept(ServerSocket.java:453)
    at java/net/ServerSocket.accept(ServerSocket.java:421)
    at com/tangosol/coherence/component/net/socket/TcpSocketAccepter.accept(TcpSocketAccepter.CDB:18)
    at com/tangosol/coherence/component/util/daemon/TcpRingListener.acceptConnection(TcpRingListener.CDB:10)
    at com/tangosol/coherence/component/util/daemon/TcpRingListener.onNotify(TcpRingListener.CDB:9)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "Invocation:Management:EventDispatcher" id=22 idx=0x58 tid=18166 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/daemon/queueProcessor/Service$EventDispatcher$Queue@0x4ab76140[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/daemon/queueProcessor/Service$EventDispatcher$Queue@0x4ab76140[fat lock]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/Service$EventDispatcher.onWait(Service.CDB:7)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "Invocation:Management" id=23 idx=0x5c tid=18167 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4ab75508[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/service/Grid.onWait(Grid.CDB:9)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4ab75508[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "DistributedCache" id=25 idx=0x60 tid=18168 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4a3869b0[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/service/Grid.onWait(Grid.CDB:9)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4a3869b0[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    ===== END OF THREAD DUMP ===============
    Thread Dump 2 -
    ===== FULL THREAD DUMP ===============
    Thu Feb 24 21:27:53 2011
    BEA JRockit(R) R27.6.3-40_o-112056-1.6.0_11-20090318-2103-linux-x86_64
    "Main Thread" id=1 idx=0x4 tid=18143 prio=5 alive, in native, waiting
    -- Waiting for notification on: java/lang/Class@0x43587b58[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/net/DefaultCacheServer.main(DefaultCacheServer.java:80)
    ^-- Lock released while waiting: java/lang/Class@0x43587b58[fat lock]
    at com/zzzz/carbon/cacheserver/ZzzzCoherenceServerStartup.doWork(ZzzzCoherenceServerStartup.java:29)
    at com/zzzz/util/runner/ZzzzRunnerBase.run(ZzzzRunnerBase.java:23)
    at com/zzzz/carbon/cacheserver/ZzzzCoherenceServerStartup.main(ZzzzCoherenceServerStartup.java:16)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "(Signal Handler)" id=2 idx=0x8 tid=18144 prio=5 alive, in native, daemon
    "(GC Main Thread)" id=3 idx=0xc tid=18145 prio=5 alive, in native, native_waiting, daemon
    "(GC Worker Thread 1)" id=? idx=0x10 tid=18146 prio=5 alive, in native, daemon
    "(GC Worker Thread 2)" id=? idx=0x14 tid=18147 prio=5 alive, in native, daemon
    "(GC Worker Thread 3)" id=? idx=0x18 tid=18148 prio=5 alive, in native, daemon
    "(GC Worker Thread 4)" id=? idx=0x1c tid=18149 prio=5 alive, in native, daemon
    "(Code Generation Thread 1)" id=4 idx=0x20 tid=18150 prio=5 alive, in native, native_waiting, daemon
    "(Code Optimization Thread 1)" id=5 idx=0x24 tid=18151 prio=5 alive, in native, native_waiting, daemon
    "(VM Periodic Task)" id=6 idx=0x28 tid=18152 prio=10 alive, in native, daemon
    "Finalizer" id=7 idx=0x2c tid=18153 prio=8 alive, in native, native_waiting, daemon
    at jrockit/memory/Finalizer.waitForFinalizees([Ljava/lang/Object;)I(Native Method)
        at jrockit/memory/Finalizer.access$500(Finalizer.java:12)
        at jrockit/memory/Finalizer$4.run(Finalizer.java:159)
        at java/lang/Thread.run(Thread.java:619)
        at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
        -- end of trace
    "Reference Handler" id=8 idx=0x30 tid=18154 prio=10 alive, in native, native_waiting, daemon
        at java/lang/ref/Reference.waitForActivatedQueue()Ljava/lang/ref/Reference;(Native Method)
        at java/lang/ref/Reference.access$100(Reference.java:11)
        at java/lang/ref/Reference$ReferenceHandler.run(Reference.java:79)
        at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
        -- end of trace
    "(Sensor Event Thread)" id=9 idx=0x34 tid=18155 prio=5 alive, in native, daemon
    "Logger@9217551 3.5/459" id=12 idx=0x38 tid=18158 prio=3 alive, in native, waiting, daemon
        -- Waiting for notification on: com/tangosol/coherence/component/application/console/Coherence$Logger$Queue@0x4e1faef8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/application/console/Coherence$Logger$Queue@0x4e1faef8[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketListener1" id=14 idx=0x3c tid=18159 prio=8 alive, in native, daemon
    at java/net/PlainDatagramSocketImpl.receive0(Ljava/net/DatagramPacket;)V(Native Method)
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb21360[recursive]
    at java/net/PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:136)[inlined]
    at java/net/DatagramSocket.receive(DatagramSocket.java:712)[optimized]
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb21360[thin lock]
    ^-- Holding lock: java/net/DatagramPacket@0x73a80930[thin lock]
    ^-- Holding lock: java/net/DatagramSocket@0x4cb21338[thin lock]
    at com/tangosol/coherence/component/net/socket/UdpSocket.receive(UdpSocket.CDB:20)[optimized]
    at com/tangosol/coherence/component/net/UdpPacket.receive(UdpPacket.CDB:4)[optimized]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketListener.onNotify(PacketListener.CDB:19)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketReceiver" id=15 idx=0x40 tid=18160 prio=7 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/net/Cluster$PacketReceiver$InQueue@0x4cf60d58[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketReceiver.onWait(PacketReceiver.CDB:2)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketReceiver$InQueue@0x4cf60d58[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketPublisher" id=16 idx=0x44 tid=18161 prio=6 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/net/Cluster$PacketPublisher$InQueue@0x4ca053a8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketPublisher.onWait(PacketPublisher.CDB:2)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketPublisher$InQueue@0x4ca053a8[fat lock]
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketPublisher$InQueue@0x4ca053a8[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketSpeaker" id=17 idx=0x48 tid=18162 prio=8 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/net/Cluster$PacketSpeaker$BundlingQueue@0x4cabe9d8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/queue/ConcurrentQueue.waitForEntry(ConcurrentQueue.CDB:16)
    ^-- Lock released while waiting: com/tangosol/coherence/component/net/Cluster$PacketSpeaker$BundlingQueue@0x4cabe9d8[fat lock]
    at com/tangosol/coherence/component/util/queue/ConcurrentQueue.remove(ConcurrentQueue.CDB:7)
    at com/tangosol/coherence/component/util/Queue.remove(Queue.CDB:1)
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketSpeaker.onNotify(PacketSpeaker.CDB:62)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "PacketListenerN" id=18 idx=0x4c tid=18163 prio=8 alive, in native, daemon
    at java/net/PlainDatagramSocketImpl.receive0(Ljava/net/DatagramPacket;)V(Native Method)
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb221b0[recursive]
    at java/net/PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:136)[inlined]
    at java/net/DatagramSocket.receive(DatagramSocket.java:712)[optimized]
    ^-- Holding lock: java/net/PlainDatagramSocketImpl@0x4cb221b0[thin lock]
    ^-- Holding lock: java/net/DatagramPacket@0x5e55a240[thin lock]
    ^-- Holding lock: java/net/MulticastSocket@0x4cb22178[thin lock]
    at com/tangosol/coherence/component/net/socket/UdpSocket.receive(UdpSocket.CDB:20)[optimized]
    at com/tangosol/coherence/component/net/UdpPacket.receive(UdpPacket.CDB:4)[optimized]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/packetProcessor/PacketListener.onNotify(PacketListener.CDB:19)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "Cluster|Member(Id=2, Timestamp=2011-02-23 16:25:36.488, Address=10.10.100.77:8095, MachineId=25677, Location=site:emtzzzz.com,machine:njdev04,process:18142,member:cldev201~carboncachelauncher~181)" id=19 idx=0x50 tid=18164 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4cac64d8[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/service/Grid.onWait(Grid.CDB:9)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4cac64d8[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "TcpRingListener" id=20 idx=0x54 tid=18165 prio=6 alive, in native, daemon
    at java/net/PlainSocketImpl.socketAccept(Ljava/net/SocketImpl;)V(Native Method)
    at java/net/PlainSocketImpl.accept(PlainSocketImpl.java:384)
    ^-- Holding lock: java/net/SocksSocketImpl@0x4cb227d8[thin lock]
    at java/net/ServerSocket.implAccept(ServerSocket.java:453)
    at java/net/ServerSocket.accept(ServerSocket.java:421)
    at com/tangosol/coherence/component/net/socket/TcpSocketAccepter.accept(TcpSocketAccepter.CDB:18)
    at com/tangosol/coherence/component/util/daemon/TcpRingListener.acceptConnection(TcpRingListener.CDB:10)
    at com/tangosol/coherence/component/util/daemon/TcpRingListener.onNotify(TcpRingListener.CDB:9)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:37)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "Invocation:Management:EventDispatcher" id=22 idx=0x58 tid=18166 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/daemon/queueProcessor/Service$EventDispatcher$Queue@0x4ab76140[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/daemon/queueProcessor/Service$EventDispatcher$Queue@0x4ab76140[fat lock]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/Service$EventDispatcher.onWait(Service.CDB:7)
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "Invocation:Management" id=23 idx=0x5c tid=18167 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4ab75508[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/service/Grid.onWait(Grid.CDB:9)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4ab75508[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    "DistributedCache" id=25 idx=0x60 tid=18168 prio=5 alive, in native, waiting, daemon
    -- Waiting for notification on: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4a3869b0[fat lock]
    at jrockit/vm/Threads.waitForNotifySignal(JLjava/lang/Object;)Z(Native Method)
    at java/lang/Object.wait(J)V(Native Method)[optimized]
    at com/tangosol/coherence/component/util/Daemon.onWait(Daemon.CDB:18)[inlined]
    at com/tangosol/coherence/component/util/daemon/queueProcessor/service/Grid.onWait(Grid.CDB:9)[optimized]
    ^-- Lock released while waiting: com/tangosol/coherence/component/util/queue/concurrentQueue/DualQueue@0x4a3869b0[fat lock]
    at com/tangosol/coherence/component/util/Daemon.run(Daemon.CDB:34)
    at java/lang/Thread.run(Thread.java:619)
    at jrockit/vm/RNI.c2java(JJJJJ)V(Native Method)
    -- end of trace
    ===== END OF THREAD DUMP ===============

    Charlie, Cameron,
    Thanks for following up. To answer Cameron's question - yes, running top shows that it is this process using a significant amount of the CPU. I captured a JRA this morning setting nativesamples=true.
    I see the following when I look at the Hot Methods - let me know if there is something else that would shed some more light on this issue.
    Method     Percent(%)     #Samples     Optimized     Native     Method Id.
    libjvm.so#mmDetGcFollowReferencesWorkers     28.575     2,062     0     1     0x2AE6B82EC2C1
    libpthread.so.0#__recvfrom_nocancel     26.012     1,877     0     1     0x3CC1A0D689
    libpthread.so.0#__accept_nocancel     26.012     1,877     0     1     0x3CC1A0D4A9
    libjvm.so#mmBalanceGetReference     4.975     359     0     1     0x2AE6B817A65B
    libjvm.so#objIterStepFieldFromBitmaskWord     4.753     343     0     1     0x2AE6B82A6B34
    libjvm.so#objIterStep     3.88     280     0     1     0x2AE6B82A6F8D
    libjvm.so#objIterInitInner     2.744     198     0     1     0x2AE6B82A71B7
    libjvm.so#mmBalanceStoreReference     1.414     102     0     1     0x2AE6B817A781
    libjvm.so#objIterSetupFields     0.637     46     0     1     0x2AE6B82A6B08
    libjvm.so#nativeInnerLockNotLinked     0.236     17     0     1     0x2AE6B829DE76
    libjvm.so#mmWorkPacketPoolGet     0.125     9     0     1     0x2AE6B817A4F5
    libjvm.so#qBitSetIsBitSetInWord     0.069     5     0     1     0x2AE6B82C8AA6
    libjvm.so#nativeInnerUnlockNotLinked     0.069     5     0     1     0x2AE6B829E028
    libjvm.so#mmWorkPacketPoolAdd     0.042     3     0     1     0x2AE6B817A48E
    libjvm.so#objPoolMarkAllHard     0.042     3     0     1     0x2AE6B82A4923
    libjvm.so#objIterSetupArray     0.028     2     0     1     0x2AE6B82A6BDB
    libjvm.so#markAcquired     0.028     2     0     1     0x2AE6B829DBD3
    libjvm.so#mmPointerMatrixTryInsertAtIndex     0.028     2     0     1     0x2AE6B82BEFEB
    libjvm.so#mmPointerMatrixClear     0.028     2     0     1     0x2AE6B82BF61C
    libjvm.so#objPoolMarkWeakConcurrently     0.028     2     0     1     0x2AE6B82A5358
    libjvm.so#qBitSetFindFirstNonClearedWord     0.014     1     0     1     0x2AE6B82C888E
    libjvm.so#utilCounterToNanos     0.014     1     0     1     0x2AE6B8313B53
    libjvm.so#waitForLockIfNeededAndMarkAcquired     0.014     1     0     1     0x2AE6B829DC30
    libjvm.so#vmtLinkData     0.014     1     0     1     0x2AE6B8325B2E
    libjvm.so#mmBitsIsObjectMarkedGrey     0.014     1     0     1     0x2AE6B82ED7E9
    libjvm.so#qBitSetFindLastSetBitInWord     0.014     1     0     1     0x2AE6B82C8854
    libjvm.so#mark_writebarriers     0.014     1     0     1     0x2AE6B8250EEA
    libjvm.so#signalNextInLockQueueIfNeeded     0.014     1     0     1     0x2AE6B829DF52
    libjvm.so#mmGetUsingMatrixes     0.014     1     0     1     0x2AE6B819F33C
    libjvm.so#ptGetThreadId     0.014     1     0     1     0x2AE6B82BD9ED
    libc.so.6#memset     0.014     1     0     1     0x3CC0A7A000
    libjvm.so#setupNodeForSelf     0.014     1     0     1     0x2AE6B829DD87
    libc.so.6#_int_free     0.014     1     0     1     0x3CC0A714E0
    libjvm.so#mmAddChunkToList     0.014     1     0     1     0x2AE6B830977C
    libjvm.so#vmtiUnlinkData     0.014     1     0     1     0x2AE6B8325B4A
    libjvm.so#nativeLockInSuspendCritical     0.014     1     0     1     0x2AE6B829E0E3
    libjvm.so#mmSweepHeapPart     0.014     1     0     1     0x2AE6B830989A
    libjvm.so#mmBalanceWorkSetSwapPackets     0.014     1     0     1     0x2AE6B817A647
    libc.so.6#_int_malloc     0.014     1     0     1     0x3CC0A71E80
    libjvm.so#charToJlcType     0.014     1     0     1     0x2AE6B8253633
    unknown#unknown functions     0     0     0     1     0x2AE6B8597090
    Running with nativemethods=false, I see the following under hot methods
    Method     Percent(%)     #Samples     Optimized     Native     Method Id.
    jrockit.vm.Locks.monitorExitSecondStage(Object)     50     1     1     0     0x161D30D0
    com.tangosol.coherence.component.util.daemon.queueProcessor.service.grid.ClusterService.onNotify()     50     1     1     0     0x169D2C70
    com.tangosol.coherence.component.util.daemon.queueProcessor.packetProcessor.PacketReceiver.onNotify()     0     0     1     0     0x1551C050
    com.tangosol.coherence.component.util.queue.ConcurrentQueue.checkFlush(int)     0     0     1     0     0x151C2180
    java.lang.Thread.run()     0     0     0     0     0x14E98EB0
    jrockit.vm.RNI.c2java(long, long, long, long, long)     0     0     0     0     0x14C07830
    unknown#unknown functions     0     0     0     1     0x2AE6B8597090
    com.tangosol.coherence.component.util.Daemon.run()     0     0     0     0     0x15CFEBC0
    com.tangosol.coherence.component.util.queue.ConcurrentQueue.add(Object)     0     0     1     0     0x15ED6B00
    com.tangosol.coherence.component.util.queue.ConcurrentQueue.onAddElement()     0     0     1     0     0x151C9C20
    com.tangosol.coherence.component.util.queue.ConcurrentQueue.flush(boolean)     0     0     1     0     0x1618CAD0
    Thanks,
    Jason

Maybe you are looking for

  • IPhone & iPad won´t show up in iTunes?

    Hi! I have a problem with my iTunes; non of my iDevices shows up -.- anyone having any idea what I should do, please?

  • Is there a stacking feature in LR v1.0?

    Like the stacking feature available in the Bridge beta to group similar photos.

  • CarPlay vs Siri "Eyes Free"?

    What's the difference between CarPlay and Siri "Eyes Free"?  I thought Carplay was the improved next generation of "Eyes Free", which came after "IOS for the car".  But now I'm confused. I'd like to buy a new car with CarPlay this year, and I'm tryin

  • Display GL Description in AP Invoice

    Hi Experts Within an AP Invoice (Service) the GL Account field is obviously available but the GL Description is not and I cant see it in Form Settings.  Any way to have this display the GL Description when the GL Account field is completed. Thanks! L

  • ISO Document in DMS

    Dear Sir I have to attached my ISO document in DMs with hierarch level Apex document>Quality manual>Departmet Manual>format in this manner i want to maintain my document Awaiting foe your valuable answer. regards kunal