CUCM Cluster with MCS-7845-I4

Hi,
I am doing a CUCM design for a customer with MCS-7845-I4 servers. We are building a Central solution with CUCM,CUPS, UCCX and Unity Connection.
There will be 4 CUCM servers. All gateways (about 40) will be H323, each with 1 E1 for PSTN. All gateways will have DSP for conferencing and Codec. MoH will be enable on CUCM. G.729 cross the WAN. G.711 to access Unity and UCCX from all sites configured with Regions to avoid codec issues.
The solution should support 2500 phones in phase 1.
My plan for Phase 1 is as follows:
1. Publisher/TFTP
2. Sub 1 - Sub 2(Primary and Failover)
3. Sub 3 - Sub 4(Primary and Failover)
4. Sub 2 and Sub 4 will be MoH
5. Sub 2 and Sub 4 will be backup for Media processing such as Conferencing. They will aslo Annuciator etc. This means I wont enable Cisco IP Voice Media Streaming App service on Pub, Sub1 and Sub 2
6. CTI manager will be enable on all subscribers
Question: Should I use backup TFTP on lets say Sub 2, what is your best experience?
The solution should support 8000 phones in phase 2.
My plan for Phase 2 is as follows:
1. Publisher
2. A separate TFTP server
Rest as Phase 1.
Question: Should I use backup TFTP on lets say Sub 2, what is your best experience? Can I manage phase 2 with Phase 1 design?
Any feedback would be much appreciated regards the total design, maybe you have a better design solution, maybe I have missed something here.
Br,
Mattias

Hi mattias,
It really looks like you've done your homework here
My one comment is on the need for "dedicated" TFTP servers;
TFTP Server
The TFTP server performs two main functions:
•The serving of files for services such as MoH, configuration files for devices such as phones and gateways, binary files for the upgrade of phones as well as some gateways, and various security files.
•Generation of configuration and security files. Most files generated by the Cisco TFTP service are signed and in some cases encrypted before being available for download.
The TFTP service can be enabled on any server in the cluster. However, in a cluster with more than 1250 users, other services might be impacted by configuration changes that can cause the TFTP service to regenerate configuration files. Therefore, Cisco recommends that you dedicate a specific server to the TFTP service in a cluster with more than 1250 users, with Extension Mobility, or with other features that cause configuration changes.
The TFTP server is used by phones and MGCP gateways to obtain configuration information. There is no restriction on the number of servers that can have TFTP service enabled, however Cisco recommends deploying 2 TFTP servers for a large cluster, thus providing redundancy for TFTP service. More than 2 TFTP servers can be deployed in a cluster, but this can result in an extended period for rebuilding of all TFTP files on all TFTP servers. When configuring the TFTP options using DHCP or statically, you can normally define an IP address array (more than one IP address) for a TFTP server. Therefore, you can assign half of the devices to use TFTP server A as the primary and TFTP server B as the backup, and the other half to use TFTP server B as the primary and TFTP server A as the backup. To improve performance on dedicated TFTP servers, you can set service parameters to increase the number of simultaneous TFTP sessions allowed on the server.
When upgrading a Unified CM cluster, Cisco highly recommends that you upgrade the TFTP servers after the publisher and before any other server, also allowing additional time following the upgrade for the TFTP server to rebuild all the configuration files. Either use the typical Cisco TFTP - BuildDuration time or use the real-time monitoring tool to monitor the Cisco TFTP - DeviceBuildCount until it stops incrementing. This upgrade order ensures that any new binaries and configuration changes are available before the upgrade of other services in the cluster. If you are manually adding a specific binary or firmware load for a phone or gateway, be sure to copy the file to each TFTP server in the cluster.
Cisco recommends that you use the same hardware platform for the TFTP servers as used for the call processing subscribers.
http://www.cisco.com/en/US/docs/voice_ip_comm/cucm/srnd/7x/callpros.html#wp1043790
Cheers!
Rob

Similar Messages

  • Limit of CSR's or UCCE users per CUCM cluster?

    Well my math and lack of documentation compete always, so trying to find max number of CSR's one can have, I understand it depends on ucce/ICM environment.
    A cucm cluster with 8 subscribers fully loaded hw etc and moderate CSR skill sets and average number of queues:
    Some others came up with  around 2000 csrs max if we were to poppulate on whole cluster nothig but ucce users or csrs.
    Is this limitation because of PG or CTIOS or what?  if its due to PG can we have 2 PG's or total of 4 pg's (A and B sides) will this double CSR capacity to 4000?
    As you may know capacity of cucm is 30K+ for users and devices.
    Thoughts, facts or CCBU advice?
    Thanks,
    Baseer

    If you using CTIOS as your desktop client you can run two PIMs on a CM PG for a max count of 4,000 agents. Unfortunately, if you are using CAD as your desktop client you are limit to a single PIM and I believe an agent count of 1,000 agents. CAD is very dense in relation to CTIOS. Also, if you plan on using a high number of mobile agents you need to count your agents with a heavier load. For CAD, 1 mobile agent translated to 1.7 agent load and CTIOS I believe is 1.3 agent load.
    My understanding on the limitiations is with the code in the jtapi interface on the PG. While UCM CTI manager can process a higher agent count the jtapi interface on the PG cannot.
    My two cents.

  • CCM IBM MCS-7845 Server RAID Configuration

    We have IBM MCS-7845 server and need to mirror the hard drive for disaster recovery.
    We have two array configured and each array has two HDs.
    Can we pull the HD from slot 0 and replace with the new one while server is running.
    What is the procedure of doing that ?
    Thanks

    No,
    You hava a change to corrupt your system, it has open files and the system might be writing while you pull.
    a. Turn off the server
    b. pull disks 0 & 2
    c. power on
    d. Insert a replacement drive into Slot 0.
    e. Press F5.
    f. Choose Start > Programs > ServeRaid Manager > ServeRaid Manager. You can view the progression of the drive mirroring.
    http://www.cisco.com/univercd/cc/td/doc/product/voice/c_callmg/3_3/install/upgrade/upgra333.htm#54464

  • Unity Connection server MCS-7845-H2-ECS4 hard drives only half have power

    I have 2 servers MCS-7845-H2-ECS4 which each have 8 hard drives. the 4 hard drives on the left are not powered up on either server. The last 4 hard drives all have green power lights. Is this normal operation? I thought I might have a failed controller in the first server and I ran over to check the other server, but its hard drives were exactly like the primary server. The server is running Unity Connection 7.1(5).
    I checked them with the RTMT tool and I don't see any alerts about any failed hard drives or controllers. Please help                  

    Hi there,
    I believe everything is working as designed We don't
    run those exact models but there is a reference here
    about removing 4 HDD's (out of 8) here;
    http://www.cisco.com/en/US/prod/collateral/voicesw/ps6789/ps5745/ps6509/data_sheet_c78-491743.html
    Cheers!
    Rob
    "Why do the best things always disappear " 
    - The Band

  • What is the procedure of Call Manager upgrade from version CUCM 7.1 (MCS Server) to version CUCM 10.2 (on UCS server)

    Hi
    What is the procedure of Call Manager upgrade from version CUCM 7.1 (MCS Server)  to version CUCM 10.2 (on UCS server) , please let me know the steps to what would be the backup procedures and upgrades.
    Regards
    Gaurav

    I'd suggest reading this guide to make sure you have prepared your environment to support 10.x first. The link below also contains the upgrade path to 10.0(1).
    One of the ways you can do this is:
    1/ Apply refresh upgrade cop file on all servers in cluster.
    2/ Upgrade the cluster to 8.6(2). Take a backup of the cluster.
    3/ Build a cluster of 8.6(2) servers on VMs, and assign them the same hostnames and IP address as the hardware cluster. (You may need to keep these on a separate network until switching off the hardware machines)
    4/ Restore the backup taken from the hardware cluster on to the VM cluster.
    5/ Upgrade VM cluster to 10.0(1).
    http://www.cisco.com/c/en/us/td/docs/voice_ip_comm/cucm/upgrade/10_0_1/CUCM_BK_U4214F9D_00_upgrade-guide-cucm-100/CUCM_BK_U4214F9D_00_upgrade-guide-cucm-100_chapter_010.html#CUCM_RF_UA60AFAB_00

  • Asking for advice for Jabber deployment - multi CUCM cluster\AD domains

    I would like some design advice for deploying Jabber and CUPS in our company. We have 2 locations, west coast (SiteA) and east coast (SiteB). Each site have their own CUCM 7.15 clusters, Unity clusters, AD domains (trusted, but not in the same forest).
    At SiteA I have setup CUPS (8.6.3.10000-20) and jabber and have it working great.
    I would like to setup CUPS\Jabber for SiteB, but they need to be able to IM\call\etc to SiteA (And vice-versa).
    SiteA and SiteB both have CUCM LDAP sync turned on, and LDAP directory synced with both domains (although SiteA cannot authenticate to CUCM at SiteB, and vice-versa due to the fact you can only LDAP sync authentication with one domain, CUCM user database contain users from SiteA and SiteB).
    We have SIP trucks setup to pass internal calls and line status(BLF) between the trunks, and can communicate via internal extensions just fine.
    The problem I’m running into is my jabber-config files uses the EDI directory – which can only look at one domain, so I cannot search the other domain. I believe  changing to UDS fixes this, but I understand it would require me to upgrade both CUCM clusters to 8.6.2 - unless I’m mistaken.
    I’m aware the desktop sharing will not work until CUCM is upgraded to 8.6.1 or 8.6.2.
    I’m wondering if anyone has any advice, or can confirm I’m on the right track. Thanks in advance!

    The thing that's important to understand is how CUP and Jabber build the XMPP URI. The URI has a left- and right-hand side; the left is the username while the right is the XMPP domain. CUP uses the LDAP attribute specified in CUCM's LDAP System page, sAMAccountName by default, for the left-hand-side. The right-hand side is the FQDN of the CUP cluster. Jabber must use the same values as CUP when displaying search results. Take note that nowhere in this process does the entire XMPP URI originate from the directory source.
    In your case you have two separate CUP clusters in two separate domains. This won't work because when a user searches for a contact in the directory using Jabber, the client will build the XMPP URI as [email protected]. Even if you got the other domain's user objects into the search results the right-hand-side of the URI would be wrong and the presence subscription would never succeed since the other cluster is in another domain. As such your first task must be to move the CUP clusters into the exact same fully-qualified DNS domain. Once this is done you can use Inter-Cluster Peering to build a larger XMPP network in which all users have the same presence domain. If you intend to do Inter-Domain Federation in the future this must be your public DNS domain, not your internal active directory domain. If you use a non-public DNS domain TLS handshake will never succeed for inter-domain federation requests.
    Once you have Inter-Cluster Peering in place you can use Active Directory Lightweight Directory Services (the new name for ADAM) to front-end both forests. Both CUCM clusters would need to import the full list of users representing both domains and the sAMAccountNames must be unique across both domains.
    Finally, you can instruct Jabber to use UDS and query it's local CUCM cluster which will be able to return a search result from both domains. Since the CUP clusters are peered in the same domain the XMPP URI can be built properly, the presence subscription can be routed to the correct cluster, and life will be good.
    By this point hopefully it's clear that EDI won't cut it since it would be limited to only returning search results from the local forest.
    Please remember to rate helpful responses and identify helpful or correct answers.

  • CUCM CallRecording with SNR

    We have CUCM cluster integrated with Zoom Call Recording. Spanless recording with a SIP trunk is used. If a call comes in and SNR routes the call to a cellular phone, can the call be recorded?

    Hello Jagpreet,
    That's one way. But we can always SPAN the gateway. I also got to know that using MTP (Media Termination Point), the calls made or received through SNR can be recorded.
    The approach I am thinking should work:
    When it's an incoming call: PSTN call comes to V.G.-> The deskphone and the SNR associated Mobile Phone rings- user receives the call on mobile- the deskphone and V.G is still in loop- SPAN the gateway and get the RTP.
    When it's an outgoing call: User dials using SNR associated mobile- call lands to enterprise number on CM- CM dials out to PSTN via. V.G- SPAN the gateway and get the RTP.
    Your help is appreciated.
    Rg,
    Bishay

  • CUCM Cluster "Partition Unalligned" Issue Resolution by PCD (Prime Collaboration Deployment)

    Dear all,
    I will test soon (asap i got bootable 9.1.2 software) to solve "partition unalligned" issue with CUCM Cluster version 9.1.2.11900-12.
    In my opinion, we can solve this issue with PCD as migrate unalligned cluster to a new cluster with same version.
    Do anybody test it before?
    What is your opinions?
    Best Regards,
    Mesut 

    Hi Aman,
    As i found out, correct answer should be "migration to 9.x unsupported by CPD" for CUCM version 9.X.
    But just imagine, you have 8 servers in a cluster at version 10.X with Partition Unalligned issue. Is PCD not a good alternative to DRS to solve this issue while migrating to a new cluster with PCD?
    Best Regards,
    Mesut 

  • Persistent Store Problems for MYSQL Enhanced Cluster With OpenMQ 4.4

    I am trying to implement an enhanced cluster with failover. I have edited the config files for each broker instance for a persistent store. I have appended the following to each of the config.properties files:
    imq.brokerid=myclusterinstanceINSTANCE1 # I substitute INSTANCE2 for INSTANCE1 for broker #2
    imq.persist.store=jdbc
    imq.persist.jdbc.dbVendor=mysql
    imq.persist.jdbc.mysql.property.url=jdbc:mysql://xxx.xxx.xxx.xx:3306/test
    imq.persist.jdbc.mysql.user=user1
    imq.persist.jdbc.mysql.needpassword=true
    imq.persist.jdbc.mysql.password=mypass
    imq.cluster.ha=true
    imq.cluster.clusterid=mycluster
    imq.cluster.brokerlist=xxx.xxx.xxx.x:37676,yyy.yyy.yyy.y:37676
    I then create the persistence storage with "imqdbmgr create tbl". When I view the data in the tables it creates, I have one row. Under Store_Version, I have 410. Under LOCK_ID, it has NULL. When I go to start the brokers with imqbrokerd, I get the following error:
    ERROR [B3198]: Error initializing cluster manager:
    com.sun.messaging.jmq.jmsserver.util.BrokerException: [B4239]: Failed to load persistent store version from database table MQVER41Cmycluster
    at com.sun.messaging.jmq.jmsserver.persist.jdbc.VersionDAOImpl.getStoreVersion(VersionDAOImpl.java:310)
    at com.sun.messaging.jmq.jmsserver.persist.jdbc.DBTool.updateStoreVersion410IfNecessary(DBTool.java:350)
    at com.sun.messaging.jmq.jmsserver.persist.jdbc.JDBCStore.checkStore(JDBCStore.java:3599)
    at com.sun.messaging.jmq.jmsserver.persist.jdbc.JDBCStore.<init>(JDBCStore.java:127)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
    at java.lang.Class.newInstance0(Class.java:355)
    at java.lang.Class.newInstance(Class.java:308)
    at com.sun.messaging.jmq.jmsserver.persist.StoreManager.getStore(StoreManager.java:157)
    at com.sun.messaging.jmq.jmsserver.Globals.getStore(Globals.java:967)
    at com.sun.messaging.jmq.jmsserver.cluster.ha.HAClusterManagerImpl.initialize(HAClusterManagerImpl.java:181)
    at com.sun.messaging.jmq.jmsserver.Globals.initClusterManager(Globals.java:903)
    at com.sun.messaging.jmq.jmsserver.Broker._start(Broker.java:777)
    at com.sun.messaging.jmq.jmsserver.Broker.start(Broker.java:410)
    at com.sun.messaging.jmq.jmsserver.Broker.main(Broker.java:1971)
    Caused by: java.lang.NullPointerException
    at com.mysql.jdbc.ResultSetImpl.findColumn(ResultSetImpl.java:1103)
    at com.mysql.jdbc.ResultSetImpl.getInt(ResultSetImpl.java:2777)
    at com.sun.messaging.jmq.jmsserver.persist.jdbc.VersionDAOImpl.getStoreVersion(VersionDAOImpl.java:298)
    ... 16 more
    I believe this error is attributed to the NULL value under LOCK_ID. I think that the value under LOCK_ID should be the name of the broker from the config file (even though I specified them in the config files). Any ideas?? THANKS!

    Just some pointers -- maybe this will be of use:
    If you haven't already read it, please take a look at the [ MySQL setup guide|https://mq.dev.java.net/OpenMQ_MySQLCluster_Setup_Guide.html] .
    We recommend using NDB Data-store of MySQL Cluster, though this isn't an absolute requirement. Due to some issues we have found with earlier versions, we recommend using MySQL Cluster, 7.0.9 or better (the current version is 7.0.16, or 7.1.5). Either of these would contain Connector/J.
    I'd also recommend using the latest version -- MQ 4.4update2 (just in case you happen to have an older copy). There were many minor improvements in the integration with MySQL from the original 4.4 release, to update 2. This is linked at the MQ download page: [https://mq.dev.java.net/downloads.html]

  • Can we install a new mssql cluster on the same windows cluster which already containes a mssql cluster with named instance

    We have a MSSQL 2008R2 Enterprise edition with a two node active passive fail-over cluster running on 2008R2 windows cluster with out any issues,
    Now my question is can we add one more MSSSQL cluster instance for the same setup with out disturbing the existing one ?
    Also give thoughts on load sharing as the second node is mostly ideal now except fail-over scenarios,
    Why we go for this situation is because of the collation setting which can be set only one per instance(Database collation setting change not working), we need a different default collation for the new setup

    hi,
    >>Now my question is can we add one more MSSSQL cluster instance for the same setup with out disturbing the existing one ?
    Yes it is possible .You need to add new drives as cluster aware and install SQL server and put data and log files on thse drives.YOu would need to create named instance of SQL server and need to create different resource group.Both old installation and new
    onw would work separately.
    >>Also give thoughts on load sharing as the second node is mostly ideal now except fail-over scenarios,
    Good point indeed.You are about to create Multi instance cluster and should plan for scenario where one node is down and other node is handling load for both instances.Memory and CPU should be enough to handle the load.
    >>Why we go for this situation is because of the collation setting which can be set only one per instance(Database collation setting change not working), we need a different default collation for the new setup .
    Just for collation if you are installing new instance seems little wierd to me.You can manage collation at column ,database and at server level.
    http://technet.microsoft.com/en-us/library/aa174903(v=sql.80).aspx
    Please mark this reply as the answer or vote as helpful, as appropriate, to make it useful for other readers

  • [SOLVED] Can't add a node to the cluster with error (Exchange 2010 SP3 DAG Windows Server 2012)

    Hi there!
    I have a problem which makes me very angry already :)
    I have two servers Exchange 2010 SP3 with MB role started on Windows Server 2012. I decided to create a DAG.
    I have created the prestaged AD object for the cluster called msc-co-exc-01c, assigned necessary permissions and disabled it. Allowed through the Windows Firewall traffic between nodes and prepared the File Share Witness server.
    Then I have tried to add nodes. The first node has been added successfully, but the second node doesn't want to be added :). Now I can add only one node to the DAG. I tried to add different servers first, but only the first one was added.
    LOGS on the second nodes: 
    Application Log
    "Failed to initialize cluster with error 0x80004005." (MSExchangeIS)
    Failover Clustering Diagnostic Log
    "[VER] Could not read version data from database for node msc-co-exc-04v (id 1)."
    CMDLET Error:
    Summary: 1 item(s). 0 succeeded, 1 failed.
    Elapsed time: 00:06:21
    MSC-CO-EXC-02V
    Failed
    Error:
    A database availability group administrative operation failed. Error: The operation failed. CreateCluster errors may result from incorrectly configured static addresses. Error: An error occurred while attempting a cluster operation. Error: Cluster API '"AddClusterNode()
    (MaxPercentage=100) failed with 0x5b4. Error: This operation returned because the timeout period expired"' failed. [Server: msc-co-exc-04v.int.krls.ru]
    An Active Manager operation failed. Error An error occurred while attempting a cluster operation. Error: Cluster API '"AddClusterNode() (MaxPercentage=100) failed with 0x5b4. Error: This operation returned because the timeout period expired"' failed..
    This operation returned because the timeout period expired
    Click here for help... http://technet.microsoft.com/en-US/library/ms.exch.err.default(EXCHG.141).aspx?v=14.3.174.1&t=exchgf1&e=ms.exch.err.ExC9C315
    Warning:
    Network name 'msc-co-exc-01c' is not online. Please check that the IP address configuration for the database availability group is correct.
    Warning:
    The operation wasn't successful because an error was encountered. You may find more details in log file "C:\ExchangeSetupLogs\DagTasks\dagtask_2014-11-17_13-54-56.543_add-databaseavailabiltygroupserver.log".
    Exchange Management Shell command attempted:
    Add-DatabaseAvailabilityGroupServer -MailboxServer 'MSC-CO-EXC-02V' -Identity 'msc-co-exc-01c'
    Elapsed Time: 00:06:21
    UPD:
    when Exchange servers ran on the same Hyper-V node, the DAG is working well, but if I move one of VM to another node, It stops working.
    I have installed Wireshark and captured trafic of cluster interface. When DAG members on the same HV-node, there is inbound and outbound traffic on the cluster interface, but if I move one of DAG member to another node, in Wireshark I see only outbound traffic
    on both nodes.
    It's confused me, because there is normal connectivity between these DAG members through the main interface.
    Please, help me if you can.

    Hi, Jared! Thank you for the reply.
    Of course I did it already :) I have new info:
    when Exchange servers ran on the same Hyper-V node, the DAG is working well, but if I move one of VM to another node, It stops working.
    I have installed Wireshark and captured trafic of cluster interface. When DAG members on the same HV-node, there is inbound and outbound traffic on the cluster interface, but if I move one of DAG member to another node, in Wireshark I see only outbound traffic
    on both nodes.
    It's confused me, because there is normal connectivity between these DAG members through the main interface.

  • Array of Cluster with event structure

    Hi,
    I am having Array of cluster with cluster having 1 String Control , 1 Combo box, 1 Led control and 2 Numeric control. In the combo box i am having two options to select ('Binary' and 'PWM').Whenever Binary is selected then Led control has be enabled and whenever PWM is selected 2 Numeric control has to be enabled.
    Is there any way to do this??
    Pleas help me...
    Regards
    Meenatchi

    actually in my application, the front panel has to have 16 rows of controls (1 string control, 1 combo box, 1 Led control, 2 numeric control).so i planned to keep all those controls in a cluster and create one array.If i didnt so i will have 16x5 controls in my front panel and i have to put 16 event cases for each combo box to do the enable and disable of controls..
    is there any simple way to do this..i have attached my front panel view
    Attachments:
    Untitled10.vi ‏139 KB

  • Cluster with WLS 5.1 and WLS 8.1 servers?

    hi all,
    we are planning to upgrade our application running on WLS 5.1 to WLS 8.1. We
    have configuraed a WLS 5.1 cluster with 4 servers joining this cluster. What
    I'm now thinking of is: is it possible to run a WLS cluster that has 2
    Servers running on WLS 5.1 and another 2 servers running on WLS 8.1?
    Reason of thinking so, is that want to have the most failsafe configration
    as possible during our upgrade.
    I searched the BEA docs, but didn't find anything.
    any help would be fine.
    thanks
    Alexander

    You can have it but not for fail-safe configuration.
    It is not possible to have clusters with different
    WLS versions especially between 5.1 & 8.1
    Kumar
    Alexander von Alkier wrote:
    hi all,
    we are planning to upgrade our application running on WLS 5.1 to WLS 8.1. We
    have configuraed a WLS 5.1 cluster with 4 servers joining this cluster. What
    I'm now thinking of is: is it possible to run a WLS cluster that has 2
    Servers running on WLS 5.1 and another 2 servers running on WLS 8.1?
    Reason of thinking so, is that want to have the most failsafe configration
    as possible during our upgrade.
    I searched the BEA docs, but didn't find anything.
    any help would be fine.
    thanks
    Alexander

  • Having issue with start weblogic cluster with tangosol cluster

    Hi,
    Oracle Coherence Version 3.3.1/389p1
    Grid Edition: Development mode
    We are using Weblogic 8.1.5 with Tangosol 3.3.1 on Linux servers.
    And we added the initializing logic in the servlet's init() method to get all NamedCaches and put into the ServletContext.
    When we start weblogic cluster, the first weblogic member will startup successfully with following messages :
    <Nov 7, 2007 10:12:30 AM EST> <Info> <HTTP> <BEA-101047> <[2007-11-07 10ServletContext(id=259640596,name=clusterqa,context-path=)] initObjects: init>:12:31.565 Oracle Coherence 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): Loaded operational configuration from resource "zip:/home/server/clusterqa/wls81/DOCVIEW/docqa1/.wlnotdelete/extract/docqa1_DOC_clusterqa/jarfiles/WEB-INF/lib/coherence.jar!/tangosol-coherence.xml"
    2007-11-07 10:12:31.598 Oracle Coherence 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): Loaded operational overrides from file "/home/www/WEB-INF/lib/tangosol-coherence-override.xml"
    Oracle Coherence Version 3.3.1/389p1
    Grid Edition: Development mode
    Copyright (c) 2000-2007 Oracle. All rights reserved.
    2007-11-07 10:12:31.938 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): Loaded cache configuration from file "/home/www/WEB-INF/lib/pub-search-cache-config.xml"
    2007-11-07 10:12:31.983 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): sun.misc.AtomicLong is not supported on this JVM; using a synchronized counter. Though safe to ignore, you may upgrade to BEA's 1.5 JVM to fix this issue.
    2007-11-07 10:12:33.267 Oracle Coherence GE 3.3.1/389p1 <Warning> (thread=Main Thread, member=n/a): UnicastUdpSocket failed to set receive buffer size to 1428 packets (2096304 bytes); actual size is 89 packets (131071 bytes). Consult your OS documentation regarding increasing the maximum socket buffer size. Proceeding with the actual value may cause sub-optimal performance.
    2007-11-07 10:12:34.118 Oracle Coherence GE 3.3.1/389p1 <D5> (thread=Cluster, member=n/a): Service Cluster joined the cluster with senior service member n/a
    2007-11-07 10:12:37.508 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Cluster, member=n/a): Created a new cluster with Member(Id=1, Timestamp=2007-11-07 10:12:33.323, Address=10.5.176.86:8088, MachineId=48982, Edition=Grid Edition, Mode=Development, CpuCount=4, SocketCount=2) UID=0x0A05B056000001161AAB782BBF561F98
    2007-11-07 10:12:37.736 Oracle Coherence GE 3.3.1/389p1 <D5> (thread=Invocation:Management, member=1): Service Management joined the cluster with senior service member 1
    2007-11-07 10:12:38.168 Oracle Coherence GE 3.3.1/389p1 <D5> (thread=DistributedCache, member=1): Service DistributedCache joined the cluster with senior service member 1
    <Nov 7, 2007 10:12:38 AM EST> <Info> <HTTP> <BEA-101047> <[ServletContext(id=259640596,name=clusterqa,context-path=)] xslProcessor: init>
    But trying to start the second weblogic member server, the startup process is stucked after tangosol cache initialization and the second weblogic member server never up running. Please see following messages :
    <Nov 7, 2007 9:49:38 AM EST> <Info> <HTTP> <BEA-101047> <[ServletContext(id=153019550,name=clusterqa,context-path=)] initDSNames: init>
    <Nov 7, 2007 9:49:42 AM EST> <Info> <HTTP> <BEA-101047> <[ServletContext(id=153019550,name=clusterqa,context-path=)] initObjects: init>
    2007-11-07 09:49:43.156 Oracle Coherence 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): Loaded operational configuration from resource "zip:/home/server/clusterqa/wls81/DOCVIEW/docqa2/.wlnotdelete/extract/docqa2_DOC_clusterqa/jarfiles/WEB-INF/lib/coherence.jar!/tangosol-coherence.xml"
    2007-11-07 09:49:43.188 Oracle Coherence 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): Loaded operational overrides from file "/home/www/WEB-INF/lib/tangosol-coherence-override.xml"
    Oracle Coherence Version 3.3.1/389p1
    Grid Edition: Development mode
    Copyright (c) 2000-2007 Oracle. All rights reserved.
    2007-11-07 09:49:43.528 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): Loaded cache configuration from file "/home/www/WEB-INF/lib/pub-search-cache-config.xml"
    2007-11-07 09:49:43.571 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Main Thread, member=n/a): sun.misc.AtomicLong is not supported on this JVM; using a synchronized counter. Though safe to ignore, you may upgrade to BEA's 1.5 JVM to fix this issue.
    2007-11-07 09:49:44.829 Oracle Coherence GE 3.3.1/389p1 <Warning> (thread=Main Thread, member=n/a): UnicastUdpSocket failed to set receive buffer size to 1428 packets (2096304 bytes); actual size is 89 packets (131071 bytes). Consult your OS documentation regarding increasing the maximum socket buffer size. Proceeding with the actual value may cause sub-optimal performance.
    2007-11-07 09:49:45.419 Oracle Coherence GE 3.3.1/389p1 <D5> (thread=Cluster, member=n/a): Service Cluster joined the cluster with senior service member n/a
    2007-11-07 09:49:45.555 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Cluster, member=n/a): Failed to satisfy the variance: allowed=16, actual=47
    2007-11-07 09:49:45.555 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Cluster, member=n/a): Increasing allowable variance to 19
    2007-11-07 09:49:46.040 Oracle Coherence GE 3.3.1/389p1 <Info> (thread=Cluster, member=n/a): This Member(Id=2, Timestamp=2007-11-07 09:49:45.69, Address=10.5.176.85:8088, MachineId=48981, Edition=Grid Edition, Mode=Development, CpuCount=4, SocketCount=2) joined cluster with senior Member(Id=1, Timestamp=2007-11-07 09:45:10.205, Address=10.5.176.86:8088, MachineId=48982, Edition=Grid Edition, Mode=Development, CpuCount=4, SocketCount=2)
    Could you please explain why it happens, and what should I do to resolve this issues ?
    Many Thanks,
    Bing

    Hi, Gene
    Thank you for the response. I will send you our full log files and thread dumps.
    I just want to give you more details about our cases :
    1. This only happened without starting cache servers (com.tangosol.net.DefaultCacheServer).
    2. And our application which are running on weblogic cluster will just call "CacheFactory.getCache("XXX")", and running as the Tangosol DataClient.
    3. All weblogic member servers will be up running successfully if our cache servers are up running.
    Also I tried to test another case :
    Suppose all weblogic instances and cache server instances are up running. Now I trying to restart (kill weblogic instance process and restart) one of the weblogic member, It will up running successfully only if add some sleep times after killing weblogic processes and restarting it. Looks like tangosol cluster need certain time to aware the member has left cluster, then the restart process will be successful.
    Questions :
    1. Should we start our weblogic cluster only after cache server cluster is up running ?
    2. How do we decide how many time we should wait before start new process to join the cache cluster ?
    Could you please help to explain this one for me and let us if there anyway we can do to avoid the problem.
    Many Thanks !!!
    Bing

  • Sun Cluster with Netapps - iSCSI quorum and network port

    I am proposing Sun cluster with Netapps 3020C.
    May I know
    1) OS is Solaris 9. The SUN OSP says that we need to obtain an iSCSI license from Netapps. Is this the iSCSI initiator software for Solaris 9 to talk to the NAS quorum? Or do I need to purchased a 3rd party iSCSI initiator ?
    2) We provide 2 network ports for the Netapps private NAS LAN. Is it a must to cater another dedicated network port for the iSCSI communication with the quorum?
    3) If we need purchase a 3rd party iSCSI initiator, where can we get this? I have checked Qlogic and Cisco, they are both not suitable for my solution.
    Appreciate your help

    Hi,
    1) OS is Solaris 9. The SUN OSP says that we need to
    obtain an iSCSI license from Netapps. Is this the
    iSCSI initiator software for Solaris 9 to talk to the
    NAS quorum? Or do I need to purchased a 3rd party
    iSCSI initiator ?Have a look at http://docs.sun.com/app/docs/doc/817-7957/6mn8834r2?a=view
    I read the "Requirements When Configuring NAS Devices as Quorum Devices"
    section as this is the license for the iSCSI inititator software.
    So you need to enable iSCSI on the netapps box and need to install a package from netapps (NTAPclnas) on the cluster nodes.
    2) We provide 2 network ports for the Netapps
    private NAS LAN. Is it a must to cater another
    dedicated network port for the iSCSI communication
    with the quorum?Have a look at http://docs.sun.com/app/docs/doc/819-0580/6n30eahcc?a=view#ch4_quorum-9
    I don't read such a requirement there.
    3) If we need purchase a 3rd party iSCSI initiator,
    where can we get this? I have checked Qlogic and
    Cisco, they are both not suitable for my solution.
    Appreciate your helpI don't thibk you need such a 3rd party iSCSI initiator, unless this is stated in the above docs.
    Greets
    Thorsten

Maybe you are looking for

  • Problems after upgrading software to Lion

    I recently upgraded my Spring 2011 15 inch MB to lion and ever since it has became painfully slow.  Anything I can do to resolve this?

  • How do I set up my laptop to view activity in home while gone?

    I would like to use my Mac book pro to monitor activity in my home while I am gone. How do I set it up?

  • Window SBS server 2008 and window server 2008 R2 on the same domain

    Hi all, I am a person only work for linux system. Recently i work for my company that using microsoft technologie. So i have a lot of trouble. Could you please help me and below are my big troble that i have met: In my system, i have the first window

  • Tecra A11 - keyboard space key issue

    Just wonder, if someone with an A11 has similar issues or if my keyboard needs to be replaced. When I am typing, very often the space key is not accepted. Very annoying when you type without looking at the text all the time.

  • Requirements: Health care compliance

    Hello, in a healthcare compliance project, I need to do some requirements gathering which should eventually lead to the building of infoproviders for reporting purposes. Any helpful hints or documents in identifying the relevant characteristics and k