Strange Cluster Startup Timing Issues (Solaris 2.7)

          When we start up our admin server for our WLS 6.0sp1 cluster (running on Solaris
          2.7), then try to start all the managed servers approximately simultaneously,
          the managed servers come up apparently normally (no errors in the logs) but no
          EJBs ever deploy.
          If we follow the same steps, but wait until each server is all the way up before
          starting the next, everything is fine.
          Has anyone else run into this?
          -daniel
          

Hmm, I wonder if they "cannot" or "will not" supply you the patch...
          Oh well, hopefully this won't get me in trouble! Here's the patch, stick it in your weblogic /lib
          directory or in CLASSPATH, and add the following to your startup command:
          -Dweblogic.rmi.queuesize=n
          And if you are having problems, don't tell BEA who gave this to you! :-)
          Gene
          "Aaron Oas" <[email protected]> wrote in message
          news:[email protected]...
          > Thanks for the information.
          >
          > We wrote to BEA support; here's the response (i.e. no patch is forthcoming):
          >
          > ----------------------------- begin BEA
          > response -----------------------------
          > FR: tommy
          >
          > CASE_ID_NUM: 248170
          >
          > MESSAGE:
          >
          > Daniel,
          >
          > My name is Tommy, and I'm a Developer Relations Engineer with
          > BEA Technical Support. I wanted to drop you a quick note to tell you
          > that I've picked up your support case.
          >
          > At this time engineering does not have a patch for this CR. This issue is
          > slated to be fixed in WLS 6.1 release which is scheduled for end of July.
          > At this time engineering is pushing to have 6.1 released on schedule, and
          > since
          > there is a valid work around for this issue, they will not provide a patch.
          >
          > The work around is to simply boot your servers in increments of about 30
          > seconds.
          >
          > Thanks in Advance,
          >
          > Tommy Butcher
          >
          > ----------------------------- end BEA response -----------------------------
          >
          > I guess it could be worse.
          >
          > - Aaron
          >
          >
          > "Gene Chuang" <[email protected]> wrote in message
          > news:[email protected]...
          > > Currently, weblogic_admin_rmi_queue native thread count is fixed at 3 (or
          > was it 4?). I did get a
          > > patch which allows me to set the count as a flag at startup, although it
          > would seem the longterm
          > > solution is for WL to self-manage the thread count dynamically. For
          > anyone who needs this interim
          > > solution, write to support and mention the CR.
          > >
          > > Gene
          > >
          > > "Aaron Oas" <[email protected]> wrote in message
          > > news:[email protected]...
          > > > Gene (and clusterers in general) -
          > > >
          > > > It seems odd that so few people have reported problems with concurrently
          > > > starting servers in a cluster if it is a limitation of
          > > > weblogic_admin_rmi_queue native thread count. Is it a limitation
          > imposed by
          > > > the Solaris kernel on weblogic (i.e. Solaris max threads per process or
          > per
          > > > user can't be set high enough in /etc/system)?
          > > >
          > > > I would think that for a mission-critical environment, starting managed
          > > > servers in a weblogic cluster would ideally be automated and
          > simultaneous;
          > > > rather than serial, and certainly rather than manual (log into each
          > solaris
          > > > box in turn and run the startup script by hand).
          > > >
          > > > Does anyone out there running weblogic clusters automate the startup of
          > > > their weblogic cluster as a whole (e.g. a startup script on the admin
          > server
          > > > box that cranks up all the remote managed servers as well)?
          > > >
          > > > I have ideas about how to do this without using rexec and .rhosts files,
          > but
          > > > it's not trivial (create a listener on each box wait for startup
          > command,
          > > > check for managed server process, run startup script...), and I'm
          > wondering
          > > > if:
          > > > 1) there is a well-known approach to starting up all the managed servers
          > on
          > > > remote boxes, and
          > > > 2) there is a solution to the apparent requirement that clustered
          > managed
          > > > servers can't start at the same time, even on separate boxes
          > > >
          > > > Hope there are some thoughts on this out there,
          > > >
          > > >
          > > > - Aaron
          > > > "Gene Chuang" <[email protected]> wrote in message
          > > > news:[email protected]...
          > > > > I've filed a issue, CR042545, where I cannot concurrently start more
          > than
          > > > 2 managed servers in
          > > > > Solaris 2.7 due to weblogic_admin_rmi_queue native thread count
          > > > limitations... hopefully it has
          > > > > been fixed in 6.1?
          > > > >
          > > > > Gene
          > > > >
          > > > > "Fred Bloggs" <[email protected]> wrote in message
          > > > news:[email protected]...
          > > > > >
          > > > > > When we start up our admin server for our WLS 6.0sp1 cluster
          > (running on
          > > > Solaris
          > > > > > 2.7), then try to start all the managed servers approximately
          > > > simultaneously,
          > > > > > the managed servers come up apparently normally (no errors in the
          > logs)
          > > > but no
          > > > > > EJBs ever deploy.
          > > > > >
          > > > > > If we follow the same steps, but wait until each server is all the
          > way
          > > > up before
          > > > > > starting the next, everything is fine.
          > > > > >
          > > > > > Has anyone else run into this?
          > > > > >
          > > > > > -daniel
          > > > >
          > > > >
          > > >
          > > >
          > >
          > >
          >
          >
          [cr042545.jar]
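          The workaround quoted above (boot the managed servers roughly 30 seconds apart) can be scripted. What follows is only a minimal sketch, not anything supplied by BEA: the function name, the fixed delay, and the commands in the trailing comment are all assumptions made for illustration, and a fixed delay is weaker than waiting until each server reports that it is fully up.

```python
import subprocess
import time


def staggered_start(commands, delay_seconds=30,
                    launch=subprocess.Popen, sleep=time.sleep):
    """Launch each command in order, pausing between consecutive launches.

    `launch` and `sleep` are injectable so the pacing can be exercised
    without starting real processes.
    """
    handles = []
    for index, command in enumerate(commands):
        if index > 0:
            # Give the previously launched server a head start.
            sleep(delay_seconds)
        handles.append(launch(command))
    return handles


# Hypothetical usage; the host names and script path are placeholders:
# staggered_start([
#     ["ssh", "box1", "/path/to/startManagedServer.sh"],
#     ["ssh", "box2", "/path/to/startManagedServer.sh"],
# ])
```

          The launcher is passed in as a parameter so the same loop works whether the servers are started locally or through some remote mechanism.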
          

Similar Messages

  • Diskgroup not mounted during cluster startup

    Hi,
    I have a 2 node RAC(11GR2) on VMWare 7.1.4. OS is Solaris 10
    I have registered 2 instances in the cluster.
    srvctl add database -d dbrac -o /u01/app/oracle/product/11.2.0/dbhome_1 -a "extdg,nordg"
    srvctl add instance -d dbrac -i dbrac2 -n vmsol2
    srvctl add instance -d dbrac -i dbrac1 -n vmsol1
    After registering the 2 instances, the instances initially came up automatically whenever I executed ./crsctl start cluster.
    But now the database instances on both nodes are not coming up; only ASM is up.
    While checking the ASM disk groups, I found that EXTDG and NORDG were not mounted, so I manually mounted the disk groups and started the database.
    SQL> alter diskgroup nordg mount;
    Diskgroup altered
    SQL> alter diskgroup extdg mount;
    Later I tried removing the database configuration from the cluster using SRVCTL and added the database to the cluster again.
    srvctl remove instance -d dbrac -i dbrac1
    srvctl remove instance -d dbrac -i dbrac2
    srvctl remove database -d dbrac
    srvctl add database -d dbrac -o /u01/app/oracle/product/11.2.0/dbhome_1 -a "extdg,nordg"
    srvctl add instance -d dbrac -i dbrac2 -n vmsol2
    srvctl add instance -d dbrac -i dbrac1 -n vmsol1
    Still the database is not starting during cluster startup.
    Why is the disk group not getting mounted while the cluster is starting? Can someone help me?
    Regards,
    Mat

    Hi,
    Thank you for your reply.
    The disk groups were mounting previously, without any modification to the spfile, after executing the following commands.
    srvctl add database -d dbrac -o /u01/app/oracle/product/11.2.0/dbhome_1 -a "extdg,nordg"
    srvctl add instance -d dbrac -i dbrac2 -n vmsol2
    srvctl add instance -d dbrac -i dbrac1 -n vmsol1
    But suddenly this issue happened.
    I do not have metalink access. Could you please help me?
    Regards,
    Mat.

  • VISA Read timing issues

    I am using an RS232 to control an older model Power Supply (OXFORD PS 120-10).
    I have successfully written several VIs that all work; the only problem is that VISA Read takes WAY too long. I'm talking tens of seconds to refresh. I need it refreshing in milliseconds, or at least tenths of a second, for the measurements we need. All of the VIs I have written have the same timing issue.
    Attached is the most basic Serial Read/Write VI. Is there any way to improve the Read rate? Or might this just be an instrumentation issue? The strange thing is that the Write commands work almost instantaneously (I can see them on the instrument's display).
    Please help if you can. I've only been working with LabVIEW for a few weeks and am very much still in the learning process.
    Thanks!
    Attachments:
    READandWRITE timing test.vi ‏14 KB

    Do you have the communications protocol for the power supply? If you do not have everything right, you will have problems with communications.
    Tens of seconds is a clue that you may be getting timeout errors, because the default timeout is 10 seconds. Try placing an indicator on the error out wire inside the loop (after Read) to see if an error occurs on any iteration. The way you have the VI set up, you only see the error on the last iteration of the loop.
    You are writing a carriage return to the instrument. If it requires that, it almost certainly sends a carriage return with the response. (This is why I asked about the protocol). If the instrument sends a carriage return (or other termination character), then you should Enable Termination Character on the Configuration VI and set the termination character to the correct value. The default is line feed (hex A or decimal 10). A carriage return is hex D or decimal 13. You must wire the numeric value to the termination character input for any value other than the default. Then change the byte count value (at the Read input) to a number larger than the longest message the instrument will ever send, perhaps 100 or 500. The Read will end as soon as the termination character is received, regardless of the number of characters.
    I suspect that this is the problem - the instrument sends fewer than 10 characters in most messages but does send a termination character.
    Lynn
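    The termination-character behaviour described above can be illustrated outside LabVIEW. The sketch below is not the VISA API; it is a plain Python stand-in that reads from an in-memory byte stream, and the function name, the 100-byte limit, and the sample responses are assumptions. It shows the point being made: with a termination character enabled, a read returns as soon as that character arrives, so a generous byte count costs nothing.

```python
import io

CARRIAGE_RETURN = b"\r"  # decimal 13, hex 0D


def read_until(stream, terminator=CARRIAGE_RETURN, max_bytes=100):
    """Read one byte at a time until `terminator` arrives or `max_bytes` is hit."""
    received = bytearray()
    while len(received) < max_bytes:
        byte = stream.read(1)
        if not byte:
            # Nothing more available; a real serial port would block here
            # until its timeout expired.
            break
        received += byte
        if byte == terminator:
            break
    return bytes(received)


# Hypothetical instrument responses, each ending in a carriage return.
port = io.BytesIO(b"R+12.5\rR+12.6\r")
first = read_until(port)
second = read_until(port)
```

    Without a terminator the read can only end on the byte count or a timeout, which matches the symptom of each read taking about as long as the default timeout.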

  • [svn:bz-trunk] 17102: Rewrite all ImageSnapShot tests to avoid any timing issue on various app server .

    Revision: 17102
    Author:   [email protected]
    Date:     2010-07-28 11:48:20 -0700 (Wed, 28 Jul 2010)
    Log Message:
    Rewrite all ImageSnapShot tests to avoid any timing issue on various app server.
    Modified Paths:
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testCaptureBitmapData.mxml
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testCaptureImage.mxml
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testCaptureImageJPEG.mxml
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testCaptureImageScaleLimitedFalse.mxml
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testCaptureImageScaleLimitedFalseJPEG.mxml
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testCaptureImageScaleLimitedTrue.mxml
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testCaptureImageScaleLimitedTrueJPEG.mxml
        blazeds/trunk/qa/apps/qa-regress/testsuites/mxunit/tests/remotingService/dataTypes/ImageSnapshot/testEncodeImageAsBase64.mxml

    I am modifying the correct httpd.conf file on the server, it just doesn't seem to work. - If I put the rewrite rules in the <Directory /> the rewrite works but it adds /Library/WebServer/Documents to the URL.
    I also tried putting the rewrite rules in <IfModule mod_rewrite.c> but that did not work either.
    mod_rewrite is enabled and running on the server.
    I will post the rewrite rules again in the code brackets. Sorry for the long post. - If some one can try them out on their Leopard Server to see if they can get them to work, it would be much appreciated. Again, these work on my Leopard Client but I can't get them to work on Server.
    -- The httpd.conf file posted above is just the default conf file found in /private/etc/apache2/
    <code>
    RewriteEngine On
    Options +FollowSymLinks
    RewriteRule ^(.+)/$ http://%{HTTP_HOST}$1 [R=301,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\.php(.*)\ HTTP
    RewriteRule (.+)\.php(.*)$ $1$2 [R,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\.asp(.*)\ HTTP
    RewriteRule (.+)\.asp(.*)$ $1$2 [R,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\.aspx(.*)\ HTTP
    RewriteRule (.+)\.aspx(.*)$ $1$2 [R,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\.htm.(.)\ HTTP
    RewriteRule (.+)\.htm.(.)$ $1$2 [R,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\.cfm(.*)\ HTTP
    RewriteRule (.+)\.cfm(.*)$ $1$2 [R,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\.bak(.*)\ HTTP
    RewriteRule (.+)\.bak(.*)$ $1$2 [R,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\.inc(.*)\ HTTP
    RewriteRule (.+)\.inc(.*)$ $1$2 [R,L]
    RewriteCond %{THE_REQUEST} ^GET\ ([^\?]+)\..(.)\ HTTP
    RewriteRule (.+)\..(.)$ $1$2 [R,L]
    </code>
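    To see what the capture groups in two of these patterns pick up, here is a small Python sketch. It only exercises the regular expressions with Python's re module, which differs from Apache's PCRE in some details, and it does not model how mod_rewrite chooses the string it matches against (that differs between server and per-directory context). The function names and sample paths are assumptions.

```python
import re

# Same patterns as the trailing-slash rule and the .php rule above.
TRAILING_SLASH = re.compile(r"^(.+)/$")
PHP_EXTENSION = re.compile(r"(.+)\.php(.*)$")


def strip_trailing_slash(path):
    """Return the path without its final slash, mirroring the $1 capture."""
    match = TRAILING_SLASH.match(path)
    return match.group(1) if match else path


def strip_php_extension(path):
    """Rebuild the path as $1$2, dropping the '.php' in between."""
    return PHP_EXTENSION.sub(r"\1\2", path)


# Sample paths are illustrative only.
examples = {
    "/docs/": strip_trailing_slash("/docs/"),
    "/about.php": strip_php_extension("/about.php"),
}
```

    If the captured path already carries a filesystem prefix when the rule runs, the redirect target will carry it too, so checking what $1 actually contains in each context is a reasonable first debugging step.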

  • Cluster 3.2 for Solaris 8

    Hi All,
    Please let me know whether Sun Cluster 3.2 supports Solaris 8.
    Regards,
    R. Rajesh Kannan.

    Solaris Operating System (OS): Sun Cluster 3.2 software and Quorum Server software requires the following minimum versions of the Solaris OS:
    Solaris 9: Solaris 9 9/05 SPARC only.
    Solaris 10: Solaris 10 11/06.

  • How to fix the Timing issue in Discoverer reports

    Hi,
    Running the report in Discoverer Plus takes more than 1 hour to complete (Gen. time + extract to Excel),
    whereas the same report completes quickly in Discoverer Desktop.
    How can I fix the timing issue in Discoverer Plus?
    Thanks
    Srinivas

    Timo Hahn wrote:
    There is a problem with autoHeightRows and columnStretching used together in 11gR1.
    Have you tried without columnStretching?
    Or have you tried if it works in 11gr2?
    Timo
    Hi Timo, thank you very much for taking the time to respond to my question. :)
    Back to my question...
    I tried removing columnStretching, although my requirement really needs it, but removing it has no effect.
    Based on my investigation on the generated HTML, I notice the following items:
    1. A table is being wrapped in a div that is being set at a fixed height.
    2. On first load, if your autoheight rows is set to 6, the framework is setting a height of 96px to the div. This height would almost cut the last row of the table.
    3. If you try to refresh the page or try to re-PPR the component, the framework resets it to 102px which causes the last row to be fully displayed.
    My only concern is that IE displays this perfectly while Chrome and FF have problems.
    Based on my understanding, the framework is messing up the height only on first load. I am not sure, but this is how I see it. I am not really confident in my findings
    and would like to hear others' comments.
    Thanks.

  • Cluster 2.2 and solaris 8

    can we run cluster 2.2 with solaris 8 ???
    thanks

    Hi,
    The Sun Cluster 2.2 4/00 release supports Solaris 8.
    Hope this helps.
    Thanks,
    Gopinath
    Sun DTS.

  • [svn] 4859: -Fix packaging timing issue that was caused by the rebuilding of the air applicationupdater .

    Revision: 4859
    Author: [email protected]
    Date: 2009-02-05 10:15:22 -0800 (Thu, 05 Feb 2009)
    Log Message:
    -Fix packaging timing issue that was caused by the rebuilding of the air applicationupdater. The recompiled files would get laid down properly only to have the old files put back down on top of them. I also made sure the build directory was removed after updateAIR ran so it would not be included in the package
    -removed bundles.properties from the wireframe project
    bug:SDK-19128
    qa:yes
    doc:no
    checkintests:pass
    Ticket Links:
    http://bugs.adobe.com/jira/browse/SDK-19128
    Modified Paths:
    flex/sdk/branches/i10/build.xml
    flex/sdk/branches/i10/frameworks/build.xml
    Removed Paths:
    flex/sdk/branches/i10/frameworks/projects/wireframe/bundles.properties

    Hi Chris,
    I have confirmed this is an AIR SDK 13.0.0.83 bug.
    I have reported this bug to Adobe Bugbase, and I also included the workaround:
    https://bugbase.adobe.com/index.cfm?event=bug&id=3750892
    The workaround is:
    Simply comment the <externalSwfs> tag in your -app.xml will solve this bug, like this: <!-- <externalSwfs></externalSwfs> -->
    DarkStone
    2014-04-28

  • C3120 trunk timing issue on HP B460G1 ?

    hello
    A customer encountered a trunk timing issue between a
    C3120 and an HP blade server: the server is unable to get an IP address from the DHCP
    server while it is booting up.
    It seems the issue is resolved in .53SE, but I didn't find it on either
    CCO or the HP website.
    any advice welcomed
    JYP

    Did you put the trunk in portfast mode ?
    regards,
    Geert

  • Questions about phase difference (possible timing issue) RC circuit

    Hello,
    Below is the program I am using to measure the phase difference in an RC circuit. Simply put, I generate a 2 kHz sine wave in LabVIEW and send it to the circuit using an analog output. Then I measure the output sine wave using an analog input. I also measure this using an oscilloscope. I can clearly measure the phase difference with the oscilloscope and know it to be approximately 1.4 radians.
    Issues with the program:
    Different phase difference measured each time the program is run for the circuit. It is also never correct.
    Possible causes:
    You will notice by looking at the vi I measure the phase from the signal generator. Should I be using a second analog input to measure the sine wave as it is outputted at the start of the circuit?
    I mainly think that it is a timing issue. While the phase difference is constant within a single run of the program, it varies from run to run. So the moment at which each tone measurement takes its first sample seems to be different each time, and this causes the different phase reading.
    The card I am using is a PCI 6221. Is there a timing issue related to switching between input and output acquisition, or are they separate?
    Is there anyway to ensure that both tone measurements are measuring phase at the same point in (actual) time?
    I would really appreciate any advice or alterations on the program anyone could offer me (I am a college student and LabVIEW is not on our curriculum so I have no support, yet I am using it for my project (D'oh!))
    Attachments:
    RC Circuit Test.vi ‏271 KB

    I would certainly acquire two signals.  Feed the analog output right back into an analog input and then your filtered signals in another.
    Initially, I would feed the analog output into both analog inputs and measure the phase delay due to the multiplexed A/D on the card.  Once you have that measurement, you can feed in the filtered signal and then measure the phase difference of that signal.
    Randall Pursley
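    The two-channel approach suggested above can be checked numerically. The following is a sketch under stated assumptions, not LabVIEW or DAQmx code: it assumes both channels are sampled on the same clock, that the record spans a whole number of cycles, and that the function names and the `skew_rad` calibration parameter are made up for illustration. It estimates each channel's phase with a single-bin DFT at the known tone frequency and subtracts the inter-channel skew measured beforehand by feeding the same signal into both inputs.

```python
import cmath
import math


def tone_phase(samples, freq_hz, sample_rate_hz):
    """Phase (radians) of the freq_hz component, from a single-bin DFT."""
    acc = 0j
    for n, x in enumerate(samples):
        acc += x * cmath.exp(-2j * math.pi * freq_hz * n / sample_rate_hz)
    return cmath.phase(acc)


def phase_difference(reference, measured, freq_hz, sample_rate_hz, skew_rad=0.0):
    """Phase of `measured` relative to `reference`, wrapped into (-pi, pi].

    `skew_rad` is the difference obtained when the same signal is fed to
    both inputs; subtracting it removes the channel-to-channel delay.
    """
    diff = (tone_phase(measured, freq_hz, sample_rate_hz)
            - tone_phase(reference, freq_hz, sample_rate_hz)
            - skew_rad)
    return math.atan2(math.sin(diff), math.cos(diff))
```

    Because both phases come from the same record, the arbitrary start time of the acquisition cancels out, which is the run-to-run variation described in the question. A negative result means the measured signal lags the reference.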

  • Cluster of CMS in Solaris

    Please guide me to create cluster of CMS in solaris.
    Some document and tips will be of great help...
    Version: BOXIR2

    CMS clustering is OS independent.
    Admin guide and a Deployment guide will help you accomplish your task. http://help.sap.com - all documentation for BO products is there.

  • Strange MacPro startup issues

    I'm having the strangest issues with my MacPro. If I start it up it will go through the Apple logo screen with gears, then it goes to a blue screen and instantly into a black screen with the mouse icon. It stays at the black screen and when I move the mouse it makes a duplicate mouse so one pointer is in the top left and the other pointer moves with the mouse. I know the system is running behind this black screen as I can use key commands to shut down the machine (control-eject + return).
    I can use another machine's clean 10.5.2 install in target disk mode and it does the same thing. I can do a single user startup just fine and also a safe boot (holding shift) just fine without the black screen issue. So I'm thinking there is an extension causing this problem that safe boot doesn't load. But what's also strange is that when I start up off the Leopard DVD it has the same black screen issue.
    This is just really confusing me and I'm not sure what to do next besides replace the logic board. Does anybody have any suggestions or seen anything similar?
    Thanks in advance,
    Kye

    I'm as clueless as you.
    http://discussions.apple.com/message.jspa?messageID=6739362
    Currently my Mac Pro is sitting somewhere inside an Apple Store for a week...
    I thought if it can't boot off of the DVD installer (with an empty HD pretending to install a new OS X copy), then it's a hardware problem. I even took out all the HD and tried to boot from the DVD installer.
    My advice to you is that if you want to get your Mac Pro repaired, don't bother calling up AppleCare, they can't do much for you. I thought they were going to send me a box so I can put my Mac Pro in and wasted a week finding out they gave me the wrong information. They told me they only do it for laptops or small products. You need to bring it to an Apple Store or an authorized seller who can do Carry-in repairs.
    http://www.apple.com/buy/
    Where it says "Find a reseller", type in your location to search the nearest.

  • Interval Timing Issue On Solaris 10/Intel

    Not sure this is the right forum for this, but I'll give it a shot. Please redirect me if you think there's a better forum. This is going to look like a "C" problem, but this really is a question about something going on in the interactions between the operating system, the JVM, and a piece of native code we have.
    We have an interval timing facility that's implemented over a small set of platform-dependent native timing facilities. The *NIX "itimer" facility is used wherever it exists.  On RedHat/Fedora we also implement a timer over the RTC device.  On Windows we use a WaitableTimer.
    On all platforms there's a Java class that bridges to an appropriate timing facility via the JNI, and to date this has worked great on all platforms except for Solaris 10 on Intel. By "worked great" I mean that the clock ticks end up being almost exactly what we expect for a given configured Hz update rate. So... e.g. if we set up a timer for 50 Hz, then we actually get 50 Hz +/- some minor variation much less than 1 second.
    On Solaris 10/Intel we set up a 50 Hz itimer in a plain-vanilla C app and we get ~50 Hz, but if we call the same C code from Java via the JNI we almost always end up with ~38Hz.
    The amount of Java and native code involved is almost trivial, and the hardware is a smoking fast contemporary PC. We ran over the itimer facility on an old Sparc 5 box and it worked fine.
    I've tried tweaking sundry esoteric -X and -XX JVM invocation options with no effect.
    J2SDK 1.4.2-07 in all cases.
    Thoughts?

    I'd think you should be posting in the forum for the Solaris 10 os, not a Java forum. Maybe someone in the Java Native Methods forum would have the necessary Solaris 10 expertise, also.
    If you do post elsewhere, either close this out with a final post or put a reference(s) to any other posts and say where you want replies posted, so replies aren't needlessly duplicated.
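    When comparing the plain C run with the JNI run, it helps to compute the achieved rate the same way in both cases. The sketch below is an assumption-laden illustration, not the poster's code: it takes a list of tick timestamps in seconds (however they were captured) and reports the mean rate and the worst deviation of any single interval from the configured period. The function names are invented here.

```python
def achieved_hz(tick_times):
    """Mean tick rate over the whole record, in Hz."""
    if len(tick_times) < 2:
        raise ValueError("need at least two tick timestamps")
    elapsed = tick_times[-1] - tick_times[0]
    if elapsed <= 0:
        raise ValueError("timestamps must be increasing")
    # N timestamps bound N - 1 intervals.
    return (len(tick_times) - 1) / elapsed


def worst_interval_error(tick_times, target_hz):
    """Largest absolute deviation of any interval from 1 / target_hz, in seconds."""
    period = 1.0 / target_hz
    return max(abs((later - earlier) - period)
               for earlier, later in zip(tick_times, tick_times[1:]))
```

    A mean near 38 Hz with a small worst-case interval error would point to a uniformly stretched period, while a mean near 38 Hz with large errors would point to dropped or delayed ticks; the two cases suggest different causes.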

  • Interval Timing Issue On Solaris 10/Intel When The JNI Is Involved

    We've observed an odd behavior bridging to interval timers over the JNI on Solaris 10/Intel, and I was wondering if anybody here has ever seen anything like this.
    We have an interval timing facility that's implemented over a small set of platform-dependent native timing facilities. The *NIX "itimer" facility is used wherever it exists. On RedHat/Fedora we also implement a timer over the RTC device. On Windows we use a WaitableTimer.
    On all platforms there's a Java class that bridges to an appropriate timing facility via the JNI, and to date this has worked great on all platforms except for Solaris 10 on Intel. By "worked great" I mean that the clock ticks end up being almost exactly what we expect for a given configured Hz update rate. So... e.g. if we set up a timer for 50 Hz, then we actually get 50 Hz +/- some minor variation much less than 1 second.
    On Solaris 10/Intel we set up a 50 Hz itimer in a plain-vanilla C app and we get ~50 Hz, but if we call the same C code from Java via the JNI we almost always end up with ~38Hz.
    The amount of Java and native code involved is almost trivial, and the hardware is a smoking fast contemporary PC. We ran over the itimer facility on an old Sparc 5 box and it worked fine.
    I've tried tweaking sundry esoteric -X and -XX JVM invocation options with no effect.
    J2SDK 1.4.2-07 in all cases.
    Thoughts?

    I'd think you should be posting in the forum for the Solaris 10 os, not a Java forum. Maybe someone in the Java Native Methods forum would have the necessary Solaris 10 expertise, also.
    If you do post elsewhere, either close this out with a final post or put a reference(s) to any other posts and say where you want replies posted, so replies aren't needlessly duplicated.

  • Weblogic 6.1 cluster startup issue

    We've recently experienced a horrendous morning with Weblogic 6.1 SP3. We were
    restarting all of our servers (4 Servlet containers (clustered), 2 EJB containers
    (clustered)), and found that after doing so, the web servers started dying off
    one by one with a hotspot error message. The stack sometimes seemed to indicate
    a failure in a weblogic 'internalLookup' method.
    After a while, I noticed that one of the EJB servers was consuming 0 CPU. I suspected
    that this EJB server had not properly started up, and bounced it. After I made
    sure that it had started up properly, I then proceeded to restart all of the Web
    servers, one by one. Since then everything has been fine.
    When restarting the clustered servers, we noticed that it took an inordinate amount
    of time compared to starting each one up individually... Is there a known problem
    related to starting up multiple clustered servers simultaneously?
    Thanks in Advance, Chad Stansbury

    Forget about it. Turns out that the instability was due to someone inadvertently
    modifying our startup scripts and limiting all our VMs to -mx64m. Not good.
    Anyway, sorry for any time you may have wasted.
    Chad
    Rajesh Mirchandani <[email protected]> wrote:
    Take multiple thread dumps of the servers in the cluster and post it
    here. What happens if
    you start one server after the other ?
    Chad Stansbury wrote:
    We've recently experienced a horrendous morning with Weblogic 6.1 SP3. We were
    restarting all of our servers (4 Servlet containers (clustered), 2 EJB containers
    (clustered)), and found that after doing so, the web servers started dying off
    one by one with a hotspot error message. The stack sometimes seemed to indicate
    a failure in a weblogic 'internalLookup' method.
    After a while, I noticed that one of the EJB servers was consuming 0 CPU. I suspected
    that this EJB server had not properly started up, and bounced it. After I made
    sure that it had started up properly, I then proceeded to restart all of the Web
    servers, one by one. Since then everything has been fine.
    When restarting the clustered servers, we noticed that it took an inordinate amount
    of time compared to starting each one up individually... Is there a known problem
    related to starting up multiple clustered servers simultaneously?
    Thanks in Advance, Chad Stansbury
    --
    Rajesh Mirchandani
    Developer Relations Engineer
    BEA Support
