WLS 5.1 hangs due to many sockets in CLOSE_WAIT state

Hi,
we are having some serious problems with WebLogic Server 5.1 (SP 6) at
several of our customer installations (Solaris 5.8 and HP-UX). The problem
relates to the WLS HTTP server not dealing correctly with socket connections in
CLOSE_WAIT state. We have observed that these can be caused by
Netscape browsers when the end user clicks the reload button. This puts
the socket connection into CLOSE_WAIT state, where it stays
forever; eventually, when there are many sockets in this state, WLS runs
out of file descriptors and hangs. We have tried using the WLS Apache plugin, but
Apache merely passes the CLOSE_WAITs on to the WLS web server. Any help
appreciated on this one.
regards Donal.

The problem is that your file descriptor table is filling up with sockets in the
CLOSE_WAIT state, so there is no room left for new file descriptors. Any
machine that speaks HTTP is subject to this, so you need to apply these changes
to both the web server machine(s) and the application server machine(s).
The way to handle this on Solaris is to decrease the close wait interval:
ndd -set /dev/tcp tcp_close_wait_interval 60000 (Solaris 2.6 and below)
ndd -set /dev/tcp tcp_time_wait_interval 60000 (Solaris 2.7 and above)
(NOTE: It is recommended that you not go below 60 seconds for this, as per
Adrian Cockcroft of Sun.)
Also increase the size of the file descriptor table by adding the following to
/etc/system:
set rlim_fd_cur=8192
set rlim_fd_max=8192
(NOTE: The close wait interval can also be set in /etc/system, though I don't
remember the name of the parameter off the top of my head.)
Finally, make sure that you change the startWebLogic.sh script to grab more file
descriptors with ulimit (or simply remove the ulimit section of the script)...
Hope this helps,
Robert
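
For anyone diagnosing this, it helps to see how many sockets sit in each state. CLOSE_WAIT means the remote end has already closed the connection and the local process has not yet closed its own socket, which is a different state from TIME_WAIT. Below is a minimal Python sketch, assuming `netstat -an` style output in which the TCP state is the last column. The helper name `count_tcp_states` and the sample lines and addresses are made up for illustration.

```python
from collections import Counter

# TCP state names as printed by common netstat variants (Solaris and Linux
# spell a few of them differently, so both spellings are listed).
TCP_STATES = {
    "LISTEN", "ESTABLISHED", "CLOSE_WAIT", "TIME_WAIT", "CLOSING", "LAST_ACK",
    "SYN_SENT", "SYN_RCVD", "SYN_RECV",
    "FIN_WAIT_1", "FIN_WAIT_2", "FIN_WAIT1", "FIN_WAIT2",
}


def count_tcp_states(netstat_output):
    """Count sockets per TCP state, assuming the state is the last column."""
    counts = Counter()
    for line in netstat_output.splitlines():
        fields = line.split()
        if fields and fields[-1] in TCP_STATES:
            counts[fields[-1]] += 1
    return counts


# Hypothetical sample input; real data would come from `netstat -an`.
SAMPLE = """\
tcp        0      0 10.0.0.5:7001     10.0.0.9:41822   ESTABLISHED
tcp        1      0 10.0.0.5:7001     10.0.0.9:41830   CLOSE_WAIT
tcp        1      0 10.0.0.5:7001     10.0.0.9:41831   CLOSE_WAIT
tcp        0      0 10.0.0.5:7001     10.0.0.9:41700   TIME_WAIT
"""

if __name__ == "__main__":
    for state, number in count_tcp_states(SAMPLE).most_common():
        print(state, number)
```

Run periodically against real `netstat -an` output, a steadily growing CLOSE_WAIT count points at the process that owns those sockets rather than at the kernel timers.
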
Donal Mc Ateer wrote:
We have done this. We have made the following 'tweaks' to the Solaris
installation:
ndd -set /dev/tcp tcp_keepalive_interval 300000
ndd -set /dev/tcp tcp_ip_abort_interval 2000
ndd -set /dev/tcp tcp_rexmit_interval_max 10000
ndd -set /dev/tcp tcp_fin_wait_2_flush_interval 67500
ndd -set /dev/tcp tcp_time_wait_interval 30000
Is there anything else we can do? Is there any way of influencing how the WLS
HTTP server handles these CLOSE_WAIT sockets? Is this being addressed in
WLS 6.0?
-Donal.
"shivu" wrote:
Try tuning the TCP parameters at the Solaris level
(I think tcp_conn_wait).

Similar Messages

  • Re: WLS 6.0 SP2 hangs due to many sockets in CLOSE_WAIT state

    We have the same issue with HPUX and Weblogic 5.1 SP8. In fact, we get
    errors on the ListenThread because occasionally it (the ListenThread)
    says too many files open. Running a monitoring tool - the process
    typically only has ~130 file handles open at any time. When we start
    getting a lot of sockets "stuck" in close_wait state the number of open
    files increases dramatically occasionally freezing the server until (I
    believe) the OS steps in and cleans house.
    For those curious, our box is configured to have a max of 1024 files
    open. Please don't even suggest that we raise that parameter because it
    is already high enough.
    We are tempted to use Apache as the web server and proxy requests through
    Apache to WebLogic. Apache doesn't suffer from this issue AFAIK.
    -Tim
    sogal nagavardhan wrote:
    Hi !!
    We are using WebLogic 6.0 SP2 with JDK 1.3.1 running on the Windows 2000 platform.
    Currently we are running some stability tests and noticing that there are
    a lot of sockets in CLOSE_WAIT state; the handle count keeps going up and eventually
    crashes WebLogic. We also tried reducing TcpTimedWaitDelay as suggested
    in this newsgroup, and we are still seeing this problem. Does anybody have
    the same issue? Is this a known problem with the WebLogic HTTP server? And finally,
    are there any fixes or workarounds for this issue?
    Thanks
    Sogal.N

    BEA acknowledges this problem in 6.0 SP2 with the iPlanet plug-in.
    The solution they recommend is to use the 6.1 SP2 plug-in (they claim
    full compatibility). Also, you may have to set KeepAliveEnabled to
    false in obj.conf (the problem lies in incorrect pooling).
    Hope it helps,
    Bala
    Robert Patrick wrote:
    There are a couple of issues.
    1.) You need to tune the machine running the web server and the machine
    running WebLogic to make sure that it is configured appropriately for a
    high-volume HTTP server. See
    http://e-docs.bea.com/wls/platforms/sun/index.html for details on tuning
    Solaris.
    2.) I believe that there is a bug in the plugins in 6.0 where it is not
    properly closing sockets. Try using the plugin from WLS 6.1 SP2...
    Hope this helps,
    Robert
    Kevin Phillips wrote:
    Tim Funk wrote:
    I am experiencing similar problems with WL 6.0 SP2 on Solaris 2.6.
    We are already proxying all HTTP traffic through a
    cluster of Apache servers. Unfortunately, the Apache-WL
    bridge seems to work so efficiently that our servers are
    still flooded with open sockets in the TCP_CLOSE_WAIT state.
    It seems to be bound by WebLogic's capacity to respond to
    the requests quickly enough.
    Anybody have any suggestions?
    Kevin

  • Essbase server is hanging due to BSO applications

    Hi All,
    We upgraded from 9.3.1 to 9.3.3 in August and have been having issues ever since. We have about 12 BSO applications, 2 GB each. Since the upgrade, when users perform operations such as retrievals or outline edits, Essbase hangs and we have to reboot to resolve it; about an hour later, on performing the same operations, it hangs again.
    We initially thought it was a server problem, so we migrated these 12 apps to 3 different servers, and those servers also started hanging because of these apps. This confirms that something is wrong in these BSO applications. Any ideas?
    System info: Windows environment
    server: Windows Server 2003
    hyp - 9.3.3
    Oracle support also jumped in but could not resolve the issue.
    Any help would be appreciated.
    Thanks!!!

    Have a look at the Essbase config file.
    For NETDELAY and NETRETRYCOUNT, try increasing each by 1000 and look for a difference.
    Also look at the available RAM on the server. Allot 2 GB per application:
    say you have 12 apps, then 12 * 2 = 24 GB, plus 6 GB, i.e. 30 GB, so better get 32 GB,
    as far as I understand, since RAM sizes go 8, 16, 32, 64.
    Also look at how many processor cores have been allotted.
    Have a read of: http://docs.oracle.com/cd/E17236_01/epm.1112/esb_dbag/frameset.htm?part_optimize.html

  • How to reduce buffer busy waits, session hanging due to buffer busy waits

    Hi,
    How to reduce buffer busy waits, session hanging due to buffer busy waits.
    Thanks,
    Sathis.

    When I see through enterprise manager I see lot of
    tables with buffer busy waits.
    Is there any way by table name we can check the
    blocks info.
    The simple way is to look at the SQL statement and the corresponding table name.
    P1=file#, P2=block#. You can extract the segment name (table or index) using this info.
    Query v$bh like the following:
    SQL> select file#, block#, class#, objd from v$bh where file# = <P1> and block# = <P2>;
    SQL> select object_name from all_objects where data_object_id = <objd>;
    See the following doc:
    http://download-west.oracle.com/docs/cd/B19306_01/server.102/b14237/dynviews_1051.htm
    Or you can dump the block:
    SQL> alter system dump datafile <P1> block <P2>;
    Some excerpts from a block dump:
    scn: 0x07df.17e70782 seq: 0x01 flg: 0x04 tail: 0x07822301
    frmt: 0x02 chkval: 0x61d0 type: 0x23=PAGETABLE SEGMENT HEADER
    Map Header:: next  0x00000000  #extents: 1    obj#: 55881  flag: 0x10000000
    Can we do something at table level that will reduce
    the waits.
    Yes, some methods are known. But before thinking of that, you must verify which block class and which access are involved.
    Typo... always. :(
    Message was edited by:
    Dion_Cho

  • HT4796 After migrating all of my adobe software (Cs4, cs5, etc.) to my new imac, the software won't open due to many errors. Any help on this?

    After migrating all of my adobe software (Cs4, cs5, etc.) to my new imac, the software won't open due to many errors. Any help on this?

    Welcome to the Apple Support Communities
    Adobe software is problematic with Migration Assistant: the applications don't work properly after migration. If you still have the DVDs or installers for your software, use them to reinstall all your Adobe applications and they should start working correctly.

  • How many sockets can be opened to one port

    Hi All,
    There is a service installed on port 4444 of our server. In order to do some work with that service we open sockets to it with socket:localhost:4444. We are finding that it is quite expensive to initialise these socket connections for every request to our website, so I'd like to pool connections. I have implemented a pool, but currently it closes the connection when returning it to the pool. I would actually like the connections to stay alive in the pool, but I do not know whether I can have many sockets open to one port.
    Thanks!

    You are unlikely to get an exact answer.
    The first limit you'll probably run into is the number of file descriptors that can be open at a time. This depends on your OS and configured per-process limits.
    Socket buffers use memory. Amount of memory, allowed process size (if your OS enforces that with in-kernel buffers), and size of send/receive buffers give another limit.
    Other OS-dependent limits, such as a max global file descriptor count, may exist.
    TCP/IPv4 (if you use that) connections are defined by a 4-tuple {source address, source port, destination address, destination port}. As long as the source address and port pair is unique, the same destination address and port can be used. So theoretically you could have on the order of 2^48, about 281 trillion, connections to a port (32-bit addresses * 16-bit ports, minus non-allowed addresses).
    In practical terms: check the max file descriptor count. A few dozen to a few hundred should be OK for web-server-class systems. Try and see what works.
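
To illustrate the 4-tuple point, here is a minimal Python sketch, assuming a throwaway listener on the loopback interface in place of the real service on port 4444. The function name `open_connections` and the constant `MAX_SOURCE_PAIRS` are hypothetical names for this example. It opens several connections to one listening port and shows that each one gets its own source port while the destination port stays the same.

```python
import socket

# Theoretical upper bound on distinct (source address, source port) pairs
# that can target one destination address and port over IPv4,
# before subtracting reserved or otherwise non-allowed addresses.
MAX_SOURCE_PAIRS = 2**32 * 2**16


def open_connections(count):
    """Open `count` TCP connections to one local listening port.

    Returns (listening_port, source_ports, destination_ports).
    """
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    clients, accepted = [], []
    try:
        server.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
        server.listen(count)
        port = server.getsockname()[1]
        for _ in range(count):
            clients.append(socket.create_connection(("127.0.0.1", port)))
            conn, _peer = server.accept()
            accepted.append(conn)
        source_ports = [c.getsockname()[1] for c in clients]
        destination_ports = {c.getpeername()[1] for c in clients}
        return port, source_ports, destination_ports
    finally:
        # Close every socket so no file descriptors are leaked.
        for sock in clients + accepted + [server]:
            sock.close()


if __name__ == "__main__":
    port, sources, destinations = open_connections(5)
    print("listening port:", port)
    print("source ports:", sources)
    print("destination ports:", destinations)
```

A pool that keeps such connections open instead of closing them on return is fine from the TCP point of view; the practical ceiling is the file descriptor limit mentioned above, since every pooled connection holds one descriptor on each side.
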

  • LabVIEW application hangs due to insufficient memory

    I am using LabVIEW 7.1. I built an application in which I used the MSComm ActiveX control for serial communication with my PLC. I also used toolbar and status bar ActiveX controls. It works OK, but after some time the software hangs due to insufficient memory. I found that every 3 seconds the size of the EXE in memory increases by 4K, and I don't know why. Please tell me why the memory size increases.

    How can we say why without seeing your code?
    -Check whether you open references without closing them.
    -Use "Performance and Memory" to see in which VI the memory is increasing.

  • HT1212 i have an apple ipod touch 4s which is disabled due to many attempts of wrong passcode

    i have an apple ipod touch 4s 32gb which is disabled due to many attempts of incorrect passcode

    Disabled
    Place the iOS device in Recovery Mode and then connect to your computer and restore via iTunes. The iPod will be erased.
    iOS: Wrong passcode results in red disabled screen                         
    If recovery mode does not work try DFU mode.                        
    How to put iPod touch / iPhone into DFU mode « Karthik's scribblings        
    For how to restore:
    iTunes: Restoring iOS software
    To restore from backup see:
    iOS: How to back up     
    If you restore from iCloud backup the apps will be automatically downloaded. If you restore from iTunes backup the apps and music have to be in the iTunes library since synced media like apps and music are not included in the backup of the iOS device that iTunes makes.
    You can redownload most iTunes purchases by:
    Downloading past purchases from the App Store, iBookstore, and iTunes Store        

  • Ipseckey hang with error PF_KEY socket timed out

    When I use the ipseckey program (on Solaris 8 for Intel) to add a new SA,
    ipseckey hangs with the error "PF_KEY socket timed out".
    Other commands like "dump" or "save all" still work.
    Can anybody help?
    le huy

    Sorry, but I don't know exactly, it was work of network admin (or sysadmin). I think relation between Application Server and Database Server was corrected - at the server or at the Data Communications Equipment.
    But problem was OUT OF WebCenter Spaces, any adjustments at WebLogic Server, etc... had no results.
    Sorry.

  • IAS 6.5 requests hanging due to session deadlocks

    In our application on IAS 6.5, we are seeing requests hang due to PlatformNASSession.invalidate() getting stuck. At the time that this happens, our cpu utilization shoots to 100%. We're not sure if the cpu usage is the cause, the effect, or unrelated to the session hanging but the two problems seem to surface at the same time. Is there any known problem that causes the sessions to get hosed up?
    The only way we're able to fix this problem is to kill -9 the kjs process (only 1 of our 4 breaks at a time) that is using all the cpu. At that point, the kjs is automatically restarted and everything begins to work fine again. However, everything going on inside that kjs is lost and the system is running at a crawl in the meantime.
    Here is a stack trace of one of the threads that's getting stuck:
         "Thread-567" prio=5 tid=0x3858f0 nid=0x25f waiting on monitor [0xa9d7e000..0xa9d7fc68]
              at java.lang.Object.wait(Native Method)
              at java.lang.Object.wait(Object.java:420)
              at com.netscape.server.servlet.platformhttp.PlatformNASSession.invalidate(Unknown Source)
              at com.bankofamerica.ui.servlet.action.authentication.LogoffAction.perform(LogoffAction.java:42)
              at org.apache.struts.action.ActionServlet.processActionPerform(ActionServlet.java:1785)
              at com.bankofamerica.ui.servlet.BOAActionServlet.processActionPerform(BOAActionServlet.java:68)
              at org.apache.struts.action.ActionServlet.process(ActionServlet.java:1584)
              at com.bankofamerica.ui.servlet.BOAActionServlet.process(BOAActionServlet.java:41)
              at org.apache.struts.action.ActionServlet.doGet(ActionServlet.java:491)
              at javax.servlet.http.HttpServlet.service(HttpServlet.java:740)
              at javax.servlet.http.HttpServlet.service(HttpServlet.java:853)
              at com.netscape.server.servlet.servletrunner.ServletInfo.service(Unknown Source)
              at com.netscape.server.servlet.servletrunner.ServletRunner.execute(Unknown Source)
              at com.kivasoft.applogic.AppLogic.execute(Unknown Source)
              at com.kivasoft.applogic.AppLogic.execute(Unknown Source)
              at com.kivasoft.thread.ThreadBasic.run(Native Method)
              at java.lang.Thread.run(Thread.java:484)
    Here is the ias-web.xml that is seeing the issues:
         <session-info>
              <impl>distributed</impl>
              <timeout>240</timeout>
              <timeout-type>last-access</timeout-type>
              <secure>false</secure>
              <domain/>
              <path/>
              <scope/>
         </session-info>

    Hi! Have you resolved this issue? Right now we are experiencing the same problem.
    We're using IAS 6.5 SP1, but to no avail: when the incoming requests hang, IAS doesn't generate a core dump. As the basis for our investigation we rely only on the IAS logs, but we couldn't find any abnormalities recorded in them that could help us resolve the problem.
    I would appreciate any help from you.
    Best Regards,
    Rachele

  • I am having issues with connecting to the store...the page just hangs and when I run Diagnostics it states that a secure connection could not be established.

    I am having issues with connecting to the store...the page just hangs and when I run Diagnostics it states that a secure connection could not be established.
    I have contacted Apple via email and tried all the steps they recommended, and it is still not working.
    I have tried disabling my McAfee, and that does not work either.
    Does anyone have the phone number for Apple express so I don't have to keep waiting for them to email within 48 hours?
    Thanks.

    With those symptoms, I'd try the following document:
    Apple software on Windows: May see performance issues and blank iTunes Store
    (If there's a SpeedBit LSP showing up in Autoruns, it's usually best to just uninstall your SpeedBit Video Accelerator.)

  • HT1212 someone entered in password too many times and it states ipod disabled connect to itunes

    Someone entered the password too many times and it states "iPod disabled, connect to iTunes".

    Do as the manual says and restore the iPod.
    If you are unable, then type "unable to restore" into the search bar at the top of this page by Support and read the resulting help article.

  • Many sockets to choose from, which is the right one?

    Using a Pavilion T279 with XP Home SP3.
    For many years I have tried to work out which socket on the front of the computer corresponds to which of the "moveable discs" shown when right-clicking "My Computer".
    There are several to choose from. My problem is as follows: I want to copy a program, LibreOffice, from my desktop to my laptop, and I don't know which socket I should put my USB stick in. Perhaps there is a map somewhere in cyberspace, but I haven't found it.
    PLEASE, SHOW ME.

    You cannot "copy" a program file onto any type of media; it will lose functionality. You can download the free program from here onto your laptop.

  • CS5 Photomerge hanging when using many files

    Hello,
    I have managed to work around the issue, but it's annoying nonetheless.
    The basic problem is that I've started creating very wide and tall Bokeh Panoramas, also known as 'The Brenizer Method': http://www.ryanbrenizer.com/category/brenizer-method/
    From Lightroom I select the images I want and 'merge to panorama in PS'.  When I use say 2 - 30 images it's fine.  However when I use say 160 (ridiculous I know) it will hang when merging the files, processor usage stays on 13%, but the progress dialogue bar doesn't animate.  I tried it through photomerge and 'align all layers' from the edit menu.  I reduced this ridiculous amount to 60 and if I chose the 'vignette removal' option on the photomerge dialogue it will hang on this process after starting to create the panorama.  If I don't do the vignette, it will create the panorama.
    I know that it will depend upon the complexity of the merge, it's just annoying that it would fail whereas Microsoft ICE will complete the panorama without fail, it just may be of a lesser accuracy due to parallax.
    Other than using fewer images and attempting to overlap them correctly, is there much else that can be done?
    I'm using an up to date version of CS5 64bit windows.  running on a 16gb Core i7 2600k.  The merged final image occupies 13gig of ram.  I have the scratch disc on a spare 2tb drive with about 1.5 tb free.
    Paul

    Mr_Fujisawa, I just wanted to come back on here and thank you for opening my mind to the "Brenizer Method", which I had not heard of before.  I've been experimenting with it today and enjoying the results!  Up to now I put effort into getting everything in focus all across my panos, which are generally landscapes.  It didn't dawn on me to use stitching on subjects up close.
    -Noel

  • Insert hangs due to pk violation (plsql)

    Hello everybody,
    I'm unable to cope with the following problem:
    1) Suppose you have a table with a single column, which is its primary key:
    CREATE TABLE TABUSRLG
    (
    TUL_CODUSR VARCHAR2(30 BYTE)
    );
    ALTER TABLE TABUSRLG ADD (
    PRIMARY KEY
    (TUL_CODUSR) ENABLE);
    2) From one session execute
    insert into tabusrlg values ('A');
    and DON'T roll back or commit.
    3) From another session try the same insert:
    insert into tabusrlg values ('A');
    This insert hangs (and I agree with this),
    but ...
    Is there a way to prevent the insert from hanging? I'd like to skip the insert in this case without waiting for a timeout.
    Usually when I want to update a record I execute a select for update nowait;
    if the statement throws an exception, the record is locked and I don't attempt the update.
    I'm looking for something similar, but in this case I can't find a way to sort it out.
    Any suggestions?
    Many thanks in advance
    Stefano

    Stefano,
    I am still not clear why Oracle should hold a lock and prevent another session from inserting data.
    I was not able to replicate the situation. Can anybody send steps to replicate the scenario?
    SESSION 1:
    CREATE TABLE test_table (col varchar2(10));
    select username,
    a.sid,
    a.type,
    lmode,
    request
    from v$lock a, v$session b
    where a.sid = b.sid
      AND username = USER;
    USERNAME     SID     TYPE     LMODE     REQUEST
    PURVESH          23     AE          4          0
    PURVESH          17     AE          4          0
    PURVESH          143     JQ          4          0
    PURVESH          143     AE          6          0
    PURVESH          23     TO          3          0 
    insert into test_table values ('A');
    select username,
    a.sid,
    a.type,
    lmode,
    request
    from v$lock a, v$session b
    where a.sid = b.sid
      AND username = USER;
    USERNAME     SID     TYPE     LMODE     REQUEST
    PURVESH          23     AE          4          0
    PURVESH          17     AE          4          0
    PURVESH          143     JQ          6          0
    PURVESH          143     AE          4          0
    PURVESH          23     TO          3          0
    PURVESH          23     TM          3          0
    PURVESH          23     TX          6          0
    -- As you can see, a row-level transaction lock (TX) and a table lock (TM) have been placed; the TM lock prevents table structure modification.
    SESSION 2:
    insert into test_table values ('B');
    select username,
    a.sid,
    a.type,
    lmode,
    request
    from v$lock a, v$session b
    where a.sid = b.sid
      AND username = USER;
    USERNAME     SID     TYPE     LMODE     REQUEST
    PURVESH          23     AE          4          0
    PURVESH          17     AE          4          0
    PURVESH          143     JQ          6          0
    PURVESH          143     AE          4          0
    PURVESH          23     TO          3          0
    PURVESH          17     TM          3          0
    PURVESH          23     TM          3          0
    PURVESH          17     TX          6          0
    PURVESH          23     TX          6          0
    The above data proves that the other session was not prevented from inserting a record into the same table. SID 17 has placed its own TM (mode 3) and TX (mode 6) locks.
    I could not understand why you are thinking of alternatives without first finding out where the actual problem lies.
    I would recommend you to check out what locks are being held on your table and investigate the reason.
    You can definitely let us know if you face any issue, just ensure to post appropriate data for our analysis.
    Regards,
    P.
