Uxwdog watchdog
I am having problems with the watchdog (uxwdog) process on both HP-UX 11 and Solaris, running iPlanet 6 SP5 with servlets.
The watchdog fails and wedges the iPlanet server when it fails to restart.
The error is: failed to connect to watchdog
It is not a performance, process, or memory issue. This failure has happened late at night with minimal users and during normal operations.
We have the same problem. We are also getting these errors in our syslog:
Nov 3 06:36:29 cosmoweb2 uxwdog[8401]: Poll failed: nmsgs=-216, errno=216 (Socket operation on non-socket)
Nov 3 06:56:29 cosmoweb2 uxwdog[8401]: Poll failed: nmsgs=-216, errno=216 (Socket operation on non-socket)
Nov 3 06:56:29 cosmoweb2 above message repeats 21515886 times
Did you get any replies?
Similar Messages
-
Uxwdog: Poll Failed messages in syslog
Hello,
I'm running iPlanet 6.0 service pack 5 on HPUX 11i.
I'm seeing the following in the syslog:
Sep 10 08:47:10 cosmoweb2 uxwdog[26300]: Poll failed: nmsgs=-216, errno=216 (Socket operation on non-socket)
Sep 10 08:47:10 cosmoweb2 above message repeats 22350115 times
Sep 10 08:47:10 cosmoweb2 uxwdog[26300]: Poll failed: nmsgs=-216, errno=216 (Socket operation on non-socket)
Sep 10 09:07:10 cosmoweb2 uxwdog[26300]: Poll failed: nmsgs=-216, errno=216 (Socket operation on non-socket)
Sep 10 09:07:10 cosmoweb2 above message repeats 21885097 times
Sep 10 09:07:10 cosmoweb2 uxwdog[26300]: Poll failed: nmsgs=-216, errno=216 (Socket operation on non-socket)
I've searched SunSolve and Google for this error message and found nothing. Eventually the web server hangs and we have to stop and restart it.
Does anyone know what causes this and what I can do to resolve it?
We think we found our problem with the watchdog.
"Could not connect to watchdog socket
/tmp/https-eciwind1.bace.boeing.com
ab0d7966/iwswatchdog.7659".
The /tmp directory is cleaned up daily of files over 3 days old and directories over 7 days old. We revised the maintenance routines on the server to exempt anything that starts with 'iws*'. Hopefully that will solve the problem; we are still testing (waiting for a failure, none yet).
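A minimal sketch of the kind of age-based cleanup rule described above, with an exemption for names starting with 'iws'. The function names, the default thresholds, and the choice to keep any directory that still holds a kept entry are assumptions for illustration; the actual maintenance routine on that server is not shown in this thread. The function only lists what would be deleted and removes nothing.

```python
import os
import time

DAY = 86400  # seconds per day


def is_exempt(name, prefixes=("iws",)):
    """True if a file or directory name starts with an exempt prefix."""
    return name.startswith(tuple(prefixes))


def cleanup_candidates(root, now=None, file_days=3, dir_days=7, prefixes=("iws",)):
    """List paths under root that the age rule would delete (nothing is removed)."""
    now = time.time() if now is None else now
    candidates = []
    kept_dirs = set()
    # Walk bottom-up so each directory is judged after its contents.
    for dirpath, dirnames, filenames in os.walk(root, topdown=False):
        has_kept = any(os.path.join(dirpath, d) in kept_dirs for d in dirnames)
        for name in filenames:  # includes sockets such as iwswatchdog.<pid>
            path = os.path.join(dirpath, name)
            age_days = (now - os.lstat(path).st_mtime) / DAY
            if is_exempt(name, prefixes) or age_days <= file_days:
                has_kept = True
            else:
                candidates.append(path)
        if dirpath == root:
            continue  # never propose deleting the cleanup root itself
        age_days = (now - os.lstat(dirpath).st_mtime) / DAY
        if has_kept or is_exempt(os.path.basename(dirpath), prefixes) or age_days <= dir_days:
            kept_dirs.add(dirpath)
        else:
            candidates.append(dirpath)
    return candidates
```

Keeping the parent directory matters here because the watchdog socket in the error message lives inside an https-... directory under /tmp, whose own name does not start with 'iws'.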
Greg
-
Occasionally on system reboot (Sun Enterprise, Solaris) the following errors are logged and neither of the processes, ns-httpd or uxwdog, comes up.
What can we attribute this sporadic failure to? Any help is greatly appreciated.
[05/Jul/2003:05:05:04] info ( 815): successful server startup
[05/Jul/2003:05:05:04] info ( 815): iPlanet-WebServer-Enterprise/6.0SP4 B07/17/2002 14:04
[05/Jul/2003:05:05:04] failure ( 815): Error receiving response from watchdog
[05/Jul/2003:05:05:04] failure ( 815): Could not set PID path /opt2/iplanet/servers/https-gbvsa1-atl.ims.teleconference.att.com/logs/pid
It's possible for this situation to occur if the system is not restarted cleanly (e.g. if the reboot command is used or if there's a power failure). Removing the pid file from the logs directory will fix the problem; you may want to add this cleanup task to your init scripts.
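The cleanup task suggested above could look roughly like the sketch below, which removes a pid file only when the PID it records does not belong to a live process. The function names are hypothetical, and the assumption that the pid file contains a single decimal PID is mine, not something stated in this thread.

```python
import errno
import os


def pid_is_running(pid):
    """Probe a PID with signal 0, which tests for existence without delivering a signal."""
    try:
        os.kill(pid, 0)
    except OSError as exc:
        # ESRCH: no such process. EPERM: it exists but belongs to another user.
        return exc.errno != errno.ESRCH
    return True


def remove_stale_pidfile(pidfile):
    """Delete pidfile if it names no live process; return True if a file was removed."""
    try:
        with open(pidfile) as fh:
            text = fh.read().strip()
    except FileNotFoundError:
        return False
    if text.isdigit() and int(text) > 0 and pid_is_running(int(text)):
        return False  # looks like a running server; leave the file alone
    os.remove(pidfile)
    return True
```

One caveat: after a reboot an unrelated process may have been given the same PID, in which case this check leaves the file in place. An init script that runs before the server is started could simply remove the file unconditionally instead.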
-
Watchdog restarts ns-httpd frequently
Hello,
On Sun Solaris with iPlanet 4.1 and PHP 4.2.3, I have these messages in the error log of the web server:
[04/Jun/2003:16:04:54] info ( 7643): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:09:57] info ( 7679): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:11:12] info ( 7690): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:17:17] info ( 7742): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:17:25] info ( 7745): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:19:26] info ( 7761): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:37:52] info ( 7906): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:47:24] info ( 7985): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:48:40] info ( 7992): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:50:04] info ( 8002): php4_init reports: Initialized PHP Module
[04/Jun/2003:16:54:13] info ( 8032): php4_init reports: Initialized PHP Module
At the same time, I have these other messages on the console:
Jun 4 16:04:53 server1 uxwdog[23185]: server terminated (signal 10): watchdog is restarting it
Jun 4 16:09:56 server1 uxwdog[23185]: server terminated (signal 11): watchdog is restarting it
Jun 4 16:11:12 server1 uxwdog[23185]: server terminated (signal 10): watchdog is restarting it
Jun 4 16:17:24 server1 last message repeated 2 times
Jun 4 16:19:25 server1 uxwdog[23185]: server terminated (signal 11): watchdog is restarting it
Jun 4 16:37:51 server1 uxwdog[23185]: server terminated (signal 10): watchdog is restarting it
Jun 4 16:47:23 server1 uxwdog[23185]: server terminated (signal 11): watchdog is restarting it
Jun 4 16:48:40 server1 uxwdog[23185]: server terminated (signal 10): watchdog is restarting it
Jun 4 16:50:03 server1 uxwdog[23185]: server terminated (signal 11): watchdog is restarting it
Jun 4 16:54:12 server1 last message repeated 1 time
Why is the ns-httpd process restarting? Is it due to PHP?
How can I debug this?
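As a first debugging step, the watchdog messages quoted in this post can be tallied by signal number. The sketch below is only an illustration: the function and table names are made up, and the name table assumes Solaris signal numbering (10 is SIGBUS and 11 is SIGSEGV there; other systems number signals differently). Both signals usually point to a crash in native code rather than an orderly shutdown.

```python
import re
from collections import Counter

# Signal numbers as defined on Solaris; other systems use different numbers.
SOLARIS_SIGNALS = {9: "SIGKILL", 10: "SIGBUS", 11: "SIGSEGV"}

TERMINATED = re.compile(r"server terminated \(signal (\d+)\)")
REPEATED = re.compile(r"last message repeated (\d+) times?")


def tally_signals(lines):
    """Count uxwdog 'server terminated' events per signal number.

    A syslog 'last message repeated N times' line adds N to the signal
    reported by the most recent 'server terminated' line.
    """
    counts = Counter()
    last_signal = None
    for line in lines:
        match = TERMINATED.search(line)
        if match:
            last_signal = int(match.group(1))
            counts[last_signal] += 1
            continue
        match = REPEATED.search(line)
        if match and last_signal is not None:
            counts[last_signal] += int(match.group(1))
    return counts


def describe(counts):
    """Render the tally with Solaris signal names where they are known."""
    return {SOLARIS_SIGNALS.get(num, f"signal {num}"): n for num, n in counts.items()}
```

Calling `describe(tally_signals(open(path)))` on a saved copy of the console log (the path is whatever file you saved it to) gives a quick view of whether the crashes are mostly bus errors or segmentation faults.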
AL
Hello,
I changed the Unix permissions on the https-exp3/config directory, and now a core file is produced.
So I can generate a backtrace with this command:
gdb /produits/netscape/server41sp9/bin/https/bin/ns-httpd /produits/netscape/server41sp9/https-exp3/config/core.andre
And I get:
GNU gdb 4.18
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "sparc-sun-solaris2.7"...(no debugging symbols found)...
Core was generated by `ns-httpd -d /produits/netscape/server41sp9/https-exp3/config'.
Program terminated with signal 9, Killed.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/liblibsi18n.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libgetprop.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/liblibdbm.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnsprwrap.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/plugins/lib/libMagnusPostInit.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libldap30.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnsres30.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnsuni30.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnscnv30.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnsfmt30.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnscol30.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnsbrk30.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libplc3.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libplds3.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnspr3.so...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libsocket.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libnsl.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libdl.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libposix4.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libm.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libC.so.5...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libw.so.1...
warning: Lowest section in /usr/lib/libw.so.1 is .hash at 0x74
(no debugging symbols found)...done.
Reading symbols from /usr/lib/libthread.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libc.so.1...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnsfc.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnstp.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libdirmon4.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libnstime.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libsupport.so...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libares3.so...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libresolv.so.2...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libpthread.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libmp.so.2...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libaio.so.1...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/bin/https/lib/libatomic.so...(no debugging symbols found)...done.
Reading symbols from /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1...(no debugging symbols found)...done.
Reading symbols from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so...(no debugging symbols found)...done.
Reading symbols from /usr/lib/libpam.so.1...(no debugging symbols found)...done.
Reading symbols from /usr/local/lib/libfreetype.so.6...done.
Reading symbols from /usr/local/lib/libpng.so.2...done.
Reading symbols from /usr/local/lib/libz.so...done.
---Type <return> to continue, or q <return> to quit---
Reading symbols from /usr/local/lib/libjpeg.so.62...done.
Reading symbols from /usr/lib/libcrypt_i.so.1...done.
Reading symbols from /usr/lib/libgen.so.1...done.
Reading symbols from /usr/lib/libsched.so.1...done.
Reading symbols from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0...done.
Reading symbols from /usr/local/lib/libgcc_s.so.1...done.
Reading symbols from /produits/oracle/product/8.1.7/lib/libwtc8.so...done.
#0 0xfd6035cc in nnfgrne () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
(gdb) bt
#0 0xfd6035cc in nnfgrne () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#1 0xfd78d2c4 in nlolgobj () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#2 0xfd601654 in nnfun2a () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#3 0xfd6012cc in nnfsn2a () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#4 0xfd5fe0bc in niqname () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#5 0xfd5feea0 in osncon () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#6 0xfd506e68 in kpuadef () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#7 0xfd51b380 in upiini () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#8 0xfd50c15c in upiah0 () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#9 0xfd51bde0 in kpuatch () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#10 0xfd5797cc in OCIServerAttach () from /produits/oracle/product/8.1.7/lib/libclntsh.so.8.0
#11 0xfdeb042c in zm_info_oci () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#12 0xfdeb0964 in zm_info_oci () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#13 0xfdeb45ec in zif_ocilogon () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#14 0xfde54084 in execute () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#15 0xfde56bfc in execute () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#16 0xfde66334 in zend_execute_scripts () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#17 0xfde75514 in php_execute_script () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#18 0xfde71ddc in nsapi_module_main () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#19 0xfde71f58 in php4_execute () from /produits/netscape/server41sp9/plugins/php4/exp2/nsapi/libphp4.so
#20 0xff26659c in __0Fafunc_native_pool_wait_workPFP6GpblockP6HSessionP6HRequest_iUiP6GpblockP6HSessionP6HRequest from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#21 0xff265bbc in __0FNfunc_exec_strP6KFuncStructP6GpblockP6HSessionP6HRequest from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#22 0xff266b54 in INTobject_execute () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#23 0xff26b7f4 in __0FQ_perform_serviceP6HSessionP6HRequestP6Mhttpd_object () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#24 0xff26b8b0 in INTservact_service () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#25 0xff26bbc8 in INTservact_handle_processed () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#26 0xff29a0fc in __0fLHttpRequestUUnacceleratedRespondPCcPcTC () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#27 0xff298ed8 in __0fLHttpRequestNHandleRequestP6Gnetbuf () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#28 0xff2956f4 in __0fNDaemonSessionHRespondv () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#29 0xff295580 in __0fNDaemonSessionKThreadMainv () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#30 0xff29500c in CThreadMain () from /produits/netscape/server41sp9/bin/https/lib/libns-httpd40.so
#31 0xfef32ad8 in ptroot () from /produits/netscape/server41sp9/bin/https/lib/libnspr3.so
(gdb)
Can this help someone give me advice?
-
Hello,
I'm putting together a testing rack that will be used to control a system during testing. The heart of the rack is a PXIe-1065 chassis with a PXIe-8135 controller. The rack will be controlled remotely through Remote Desktop into the controller (this is necessary because the test involves pressurizing sections of the system high enough that it is dangerous for any personnel to be in the room while the system is pressurized). The test control software will not be LabVIEW, but our own software that interfaces with the modules in the PXIe chassis.
I would like a watchdog in the rack to monitor the PXIe controller and restart it without any need for someone to go in the room in the event that the system freezes up or otherwise becomes unresponsive. I'm aware of some possible solutions for a remote restart (Intel AMT, remote-controlled power strip), but response times on those could be too slow depending on the state of the rack when it goes unresponsive. I'm fairly certain I'm not the first to do this, so I was hoping to get some info and ideas.
From browsing the manual, it looks like there are watchdogs internal to the controller, but these are not accessible unless you're using LabVIEW RT. Does anybody know if that is correct?
Are there any other options for a watchdog internal to the PXIe controller?
If not, any recommendations for getting an external watchdog? I assume it would have to connect up to the inhibit connector on the back and connect the proper pins in case it detects a problem.
Thanks!
Hi pghohnst,
It is possible to control the power of a PXI Chassis with the DB 9 connector that is found on the back of the chassis. More information can be found in the KnowledgeBase below.
Remote Power of a PXI Chassis: http://digital.ni.com/public.nsf/allkb/FF5AB8BB6A1157DB8625756D00502D55?OpenDocument
Regards,
Jason D
Applications Engineer
National Instruments
-
"NETDEV WATCHDOG: eth0: transmit timed out" problem
Hi all,
I did a "pacman -Syu" and had to upgrade a lot of packages. Since I rebooted this machine, I've been having a problem with its network connection...
When I scp a big file to this machine, say 500+ MB, the copy stalls for a couple of seconds (2 to 10) and I get the following error in /var/log/messages.log: "NETDEV WATCHDOG: eth0: transmit timed out". This happens once or twice per scp session. If I start a second session, both stall several times and sometimes don't even finish (i.e., they time out completely). I did not have this problem before the upgrades.
Has anybody seen this before? Or is my network hardware broken, and it just happened to fail right after the upgrade by coincidence?
Regards
On some Intel chipsets I've had a similar problem. Try passing:
pci=noacpi
to the kernel. If that does not help either, turn ACPI off altogether with:
acpi=off
Then again, this is just a guess at what might solve the problem, since I do not know which driver you use.
-
Windows DPC Watchdog Violation
My windows partition automatically installed updates overnight. When I re-booted I received a DPC Watchdog Violation. The power button re-booted the machine and it appears to be running ok. Do I have a problem that needs to be addressed? Appreciate any info and suggestions.
Perhaps this could help:
http://pc.net/helpcenter/answers/windows_8_dpc_watchdog_violation
-
PXI-6528 Watchdog Timer parameters
I am using the PXI-6528 DIO on PXI chassis. This board is an opto-isolated TTL DIO. The card has an on-board Watchdog timer. I have DAQmx Generate - Digital Line Output (PXI-6528) steps in Sequence steps with "reuse hardware" permitted which set lines high or low as needed for commanding solid state relays (SSR). When the project runs, the behavior non-deterministically sets all lines to low when transitioning to the next Sequence step. If I run each DO step independently it works perfectly every time.
To investigate this phenomenon, I did a simple bench check to rule-out SE software errors. I simply patched from the DO port to the DI port on the same card. Then I added a DI Acquire step just after every DO step to electrically read the states. The result is that the DO port really is going low when the SE software is programmed for high or low! It seems to me that the only thing that can override the SE logic is the Watchdog timer.
Reading this NI White paper:
http://www.ni.com/white-paper/14616/en/#toc4
It seems that it is critical to configure the Watchdog timer to achieve stable behavior from the 6528 card. BUT none of MAX, DAQ Assistant, or SE has an obvious way to configure the Watchdog timer.
QUESTION: Within the SE-DAQmx-Generate-Digital Line Output step, what are the parameters that control or disable the Watchdog timer for the PXI-6528?
Update:
I have found more information at this URL:
http://zone.ni.com/reference/en-XX/help/370471AC-01/cdaqmxsupp/pci-6528/
This is the C programming reference specific to the PXI-6528. Reviewing the Watchdog timer properties, we find that the "Timeout" property can disable the timer with a value of -1. On the DAQmx Assistant GUI for DO, the Advanced Timing tab does have a Timeout property which will accept a value of -1; however, one must set the Generation Mode to N-Samples to activate the Timeout input field. Since the 6528 fundamentally cannot generate samples, we must reset the Generation Mode back to 1-Sample (On Demand), with the effect that the Timeout property is greyed out on the GUI. Does this signify that the -1 value for Timeout is ignored?
-
CRS Engine restarts every 2-3 days due to NOT_OK response from Watchdog
Hi,
we have 2 Cisco Unified IP IVR servers running version 7.0(1)SR03_Build011. Every 2-3 days, the CRS engine restarts at different times on both servers due to a Watchdog Thread received NOT_OK response from process CRS Engine.
These servers run independently of each other - (i.e not an HA pair) - but over the weekend, both servers had a CRS engine restart at the same time. I've looked at the MIVR and MCVD logs and they confirm this, but are so detailed, I can't actually still see what the cause is. There are a number of errors of different types, where the log seems to show a lot of 'exceptions', but it seems to lose connection to the Call Manager that causes the restart and it mentions buffer space.
We have a 3rd server which is not part of the solution that the other 2 servers provide, but it has the same OS, the same CRS application version and is connected to the same Call Manager; (version 6.1.3-200); but this server doesn't restart. It is on the same subnet as the other 2 servers.
The event log looks like this:-
Event Type: Information
Event Source: Cisco Unified CCX Node Manager
Event Category: Devices
Event ID: 3
Date: 8/30/2010
Time: 7:38:18 PM
User: N/A
Computer: CBXCCM2IVR01
Description:
The description for Event ID ( 3 ) in Source ( Cisco Unified CCX Node Manager ) cannot be found. The local computer may not have the necessary registry information or message DLL files to display messages from a remote computer. You may be able to use the /AUXSOURCE= flag to retrieve this description; see Help and Support for details. The following information is part of the event: WatchdogThread: received NOT_OK response from process CRS Engine, , , , .
Data:
0000: 06 00 ff 00 00 00 00 00 .......
0008: 00 00 00 00 03 00 01 21 .......!
0010: 10 0d f0 83 72 48 cb 01 ..?rH.
0018: 58 00 00 00 00 05 41 00 X.....A.
0020: 6e 6d 00 43 42 58 43 43 nm.CBXCC
0028: 4d 32 49 56 52 30 31 00 M2IVR01.
0030: 57 61 74 63 68 64 6f 67 Watchdog
0038: 54 68 72 65 61 64 3a 20 Thread:
0040: 72 65 63 65 69 76 65 64 received
0048: 20 4e 4f 54 5f 4f 4b 20 NOT_OK
0050: 72 65 73 70 6f 6e 73 65 response
0058: 20 66 72 6f 6d 20 70 72 from pr
0060: 6f 63 65 73 73 20 43 52 ocess CR
0068: 53 20 45 6e 67 69 6e 65 S Engine
0070: 00 00 00 00 00 00 00 00 ........
I have attached the MIVR log, but when the error occurs the relevant part of the MIVR log shows the following:
3362183: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at EDU.oswego.cs.dl.util.concurrent.ClockDaemon$RunLoop.run(ClockDaemon.java:630)
3362184: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at com.cisco.util.ThreadPoolFactory$ThreadImpl.run(ThreadPoolFactory.java:853)
3362185: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION:Caused by: java.net.SocketException: No buffer space available (maximum connections reached?): connect
3362186: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.PlainSocketImpl.socketConnect(Native Method)
3362187: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.PlainSocketImpl.doConnect(Unknown Source)
3362188: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.PlainSocketImpl.connectToAddress(Unknown Source)
3362189: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.PlainSocketImpl.connect(Unknown Source)
3362190: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.SocksSocketImpl.connect(Unknown Source)
3362191: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.Socket.connect(Unknown Source)
3362192: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.Socket.connect(Unknown Source)
3362193: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.Socket.<init>(Unknown Source)
3362194: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at java.net.Socket.<init>(Unknown Source)
3362195: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: at com.cisco.rmi.LoopbackClientSocketFactory.createSocket(LoopbackClientSocketFactory.java:73)
3362196: Aug 30 19:38:16.033 BST %MIVR-CLUSTER_MGR-4-EXCEPTION: ... 12 more
3362197: Aug 30 19:38:16.096 BST %MIVR-SS_TEL-7-UNK:RP[num=40600], conn=[40600:CCM2IPT/(P1-CBXCTI_User_1) GCID=(3,5066916)->INVALID]->DISCONNECTED, event=CallCtlConnDisconnectedEv, cause=Other: 17[17], meta=META_CALL_ENDING[132]
3362198: Aug 30 19:38:16.518 BST %MIVR-SS_TEL-7-UNK:RP[num=40600], conn=[40600:CCM2IPT/(P1-CBXCTI_User_1) GCID=(3,5066917)->INVALID]->DISCONNECTED, event=CallCtlConnDisconnectedEv, cause=Other: 17[17], meta=META_CALL_ENDING[132]
3362199: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-THROWS_KEEP_ALIVE_EXCEPTION:Cluster Manager throws KeepAlive Exception: Exception=com.cisco.wfapi.WFKeepAliveException: MANAGER_CONNECTION_TO_PUBLISHER_LOST
3362200: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION:com.cisco.wfapi.WFKeepAliveException: MANAGER_CONNECTION_TO_PUBLISHER_LOST
3362201: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at com.cisco.cluster.impl.manager.AbstractClusterManager.restart(AbstractClusterManager.java:599)
3362202: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at com.cisco.cluster.impl.manager.Publisher.notifyOne(Publisher.java:104)
3362203: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at com.cisco.cluster.impl.manager.AbstractClusterManager$1.run(AbstractClusterManager.java:667)
3362204: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at com.cisco.executor.impl.ExecutorStubImpl$RequestImpl.runCommand(ExecutorStubImpl.java:690)
3362205: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at com.cisco.executor.impl.ExecutorStubImpl$RequestImpl.run(ExecutorStubImpl.java:486)
3362206: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at com.cisco.executor.impl.ExecutorStubImpl$RequestImpl.run(ExecutorStubImpl.java:762)
3362207: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at EDU.oswego.cs.dl.util.concurrent.ClockDaemon$RunLoop.run(ClockDaemon.java:630)
3362208: Aug 30 19:38:18.705 BST %MIVR-CLUSTER_MGR-2-EXCEPTION: at com.cisco.util.ThreadPoolFactory$ThreadImpl.run(ThreadPoolFactory.java:853)
3362209: Aug 30 19:38:18.705 BST %MIVR-NODE_MGR-1-NODE_MGR_KEEP_ALIVE_ERROR:Node Manager keep alive ping failed: Exception=com.cisco.wfapi.WFKeepAliveException: KeepAliveException in Manager/Startable ; nested exception is:
com.cisco.wfapi.WFKeepAliveException: MANAGER_CONNECTION_TO_PUBLISHER_LOST
3362210: Aug 30 19:38:18.705 BST %MIVR-NODE_MGR-1-EXCEPTION:com.cisco.wfapi.WFKeepAliveException: KeepAliveException in Manager/Startable ; nested exception is:
3362211: Aug 30 19:38:18.705 BST %MIVR-NODE_MGR-1-EXCEPTION: com.cisco.wfapi.WFKeepAliveException: MANAGER_CONNECTION_TO_PUBLISHER_LOST
Hi
If it happened to two servers at the same time then I'd be looking off box for problems.
- Check whether your CCMs were stable (use RTMT and check for events at the time)
- Run a 'show spanning-tree active detail | i VLAN|hange' or similar to check for STP topology changes on the VLAN the servers are in. Short outages caused by bad port configs in the VLAN can cause CCX/IPIVR to get upset briefly and fail over when communication between different processes on the same box falls over; it can be very sensitive. Maybe also do a 'show log' on the switches to see whether any other significant events happened at the same time.
Regards
Aaron
Please rate helpful posts.
-
Watchdog reset in WLC 5508 7.4.100.60
Hi ,
Last week we found that the primary WLC had hung, so we had to reboot it.
As per the logs:
Last Reset....................................... Watchdog reset
We are using 7.4.100.60.
Any idea what could be the root cause?
Regards,
Check your WLC serial number as well; if it starts with FCW1614 or later, then it may be due to this bug as well.
CSCul68057
Symptom:
Wireless LAN Controller may encounter unexpected reload without crash file or coredump.
Console log output may include "reaperWatcher rebooting" and "!!!!! Watchdog detected LOCKUP !!!!!",
and there may be "#OSAPI-2-REAPER_WATCHER_INFO" message in syslog.
Conditions:
5508, 2504 or WiSM2 manufactured after April 2012.
Affected S/N: FCW1614xxxx and later
This is due to incompatibility of previous driver with some of the flash components used after that date
Workaround:
None, upgrade to one of the recommended software versions
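The serial-number condition above can be checked mechanically. This is a rough sketch only: it assumes that "FCW1614xxxx and later" means the four digits after the FCW prefix compare greater than or equal to 1614, and the function name and the example serials in the usage note are made up. Serials with any other prefix are reported as unknown, since the bug note quoted here says nothing about them.

```python
import re


def serial_in_affected_range(serial, first_affected="1614"):
    """Check an FCW serial's four-digit code against the first affected code.

    Returns True or False for FCW serials, and None when the serial has a
    different prefix, because the bug note only describes FCW serials.
    """
    match = re.fullmatch(r"FCW(\d{4})\w+", serial.strip().upper())
    if match is None:
        return None
    # Both strings are exactly four digits, so string order equals numeric order.
    return match.group(1) >= first_affected
```

For example, a hypothetical serial `FCW1702ABCD` would be reported as affected, while `FCW1540ABCD` would not.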
Anyway, your choices are very limited now: only 3 WLC software releases are supported by Cisco, 7.0.250.0 (7.0MR5), 7.4.121.0 (7.4MR2), and 7.6.101.x (7.6MR1). So upgrade to 7.4.121.0, as that is the most highly recommended release at the moment. See below for more details:
https://supportforums.cisco.com/docs/DOC-40178
I would suggest going to FUS 1.9.0.0 as well.
HTH
Rasika
*** Pls rate all useful responses ****
-
Possible bug with kernel/watchdog?
Last night, as root I ran
cat /dev/watchdog
and got an error message about invalid parameters (I think), and a minute later my machine restarted in the middle of doing something. I meant to cat /dev/input/wacom, but I hit w and tabbed after /dev/ and catted watchdog instead.
I was confused as to why my machine rebooted, so I checked /var/log/kernel.log and saw
Feb 12 00:12:03 robot iTCO_wdt: Unexpected close, not stopping watchdog!
The next message in there was at 00:13:04, and was a BIOS/lowmem entry, so I'm guessing that's when the machine started to boot up again, since the message with the kernel version came a second later.
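The "Unexpected close, not stopping watchdog!" line matches how the Linux watchdog device is documented to behave: opening /dev/watchdog arms the timer, each write counts as a keepalive, and on drivers that support "magic close" the timer is only disarmed if the character V is written just before the file is closed. A `cat` opens the device and closes it without writing V, so the timer keeps running and the machine reboots when it expires. Below is a minimal sketch of a well-behaved client, assuming a driver with magic close and no "nowayout" setting; the class and function names are hypothetical.

```python
import time


class WatchdogFeeder:
    """Wrap an already-open, writable binary watchdog device."""

    def __init__(self, device):
        self._dev = device

    def ping(self):
        # Any write resets the countdown on the Linux watchdog device.
        self._dev.write(b"\0")
        self._dev.flush()

    def close(self):
        # Magic close: 'V' written right before close asks the driver to disarm.
        self._dev.write(b"V")
        self._dev.flush()
        self._dev.close()


def feed(device, pings, interval, sleep=time.sleep):
    """Ping the watchdog `pings` times, then disarm it even if an error occurs."""
    wd = WatchdogFeeder(device)
    try:
        for _ in range(pings):
            wd.ping()
            sleep(interval)
    finally:
        wd.close()
```

On a real machine the device would be opened as root with `open("/dev/watchdog", "wb", buffering=0)`. Be aware that doing so arms the hardware timer, so the interval has to stay below the driver's timeout.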
Last edited by Sjoden (2009-02-12 21:12:39)
http://linux.die.net/man/8/watchdog
...or simply "man watchdog" should provide you with the answers you seek.
-
X2200 ILOM and Watchdog timer...
I have set up our X2200 and have the ILOM working great for most things.
I have two specific problems going on with it though....
First, sometimes the ILOM just stops responding; I can't SSH, use the web GUI, or anything. I'm using the same mgmt. port for the global zone's IP, which I can still log into. Once I log into the domain, I can reboot the server, or power it down, and then the ILOM starts responding again. Any ideas?
The second issue is that if I turn on the watchdog timer in the BIOS, it will reset or power cycle (depending on which setting I choose) once the timer expires, even if booted into the OS. This is very disturbing, because I can't use the watchdog unless I want to reboot every five minutes. :/
Now I am installing the minimal Solaris 10 SUNWreq cluster, but I'm adding a number of packages back into the jumpstart session, including TCPWrappers. Is there a specific package that needs to be installed as part of the Solaris base install in order for the Watchdog to determine if the OS is up and running or should I open a hardware trouble ticket on the new server?
Thanks!
Information
Error in the ILOM Error Log:
OS Load Watchdog timeout cause STR_POWER_CYCLE1
Server x2200 M2
ILOM Version
BMC Version : 1.20
BIOS Version : S39_3A10
Description BMC Board Information
Device ID 5
Device Revision 0
Firmware Revision 1.20
IPMI Revision 2.0
Did you ever figure this out? I have been having the same problem since receiving my x2200 M2 a couple of months ago. I'm running the latest OpenSolaris and the watchdog timer will reboot continuously until it seems to give up and then just stops rebooting. I have a feeling it also disables the watchdog timer at that point.
Another frightening thing this server does is leave the power off, even when the BIOS is configured to power on after a power failure!
-
I allowed someone to use my 2010 MBP and later found out she was a thief and steals info. I AM NOT COMPUTER SAVVY, however I was looking at my sync logs and found
erver|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 03:58:13:001|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 03:59:15:859|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 04:00:17:001|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 04:02:13:001|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 04:03:16:026|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 04:04:18:001|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 04:06:13:001|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 04:07:16:014|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.
2014-06-11 04:08:17:001|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchd
I have too many of these to count. Also wanted to know if there is anything I need to check on my computer to see what she may have put on my computer or she may be accessing it? -
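For anyone who wants a number rather than "too many to count", here is a minimal sketch that tallies these warnings per day. It assumes the sync log lines are pipe-delimited exactly as in the excerpt above (timestamp, process, pid, thread, component, level, message); the function name and the sample list are made up for illustration, and truncated lines such as the first one in the excerpt are skipped.

```python
from collections import Counter


def count_watchdog_warnings(lines):
    """Count 'Refreshing watchdog' warnings per calendar day."""
    per_day = Counter()
    for line in lines:
        fields = line.strip().split("|")
        # Assumed layout, inferred from the excerpt only:
        # timestamp|process|pid|thread|component|level| message
        if len(fields) < 7:
            continue  # truncated or unrelated line
        timestamp, level, message = fields[0], fields[5], fields[6]
        if level == "Warning" and "Refreshing watchdog" in message:
            day = timestamp.split(" ")[0]  # keep only YYYY-MM-DD
            per_day[day] += 1
    return per_day


# Hypothetical sample: two complete lines and one truncated line.
sample = [
    "2014-06-11 03:58:13:001|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.",
    "2014-06-11 03:59:15:859|SyncServer|11493|0x7f904b5068b0|SyncServer|Warning| Refreshing watchdog because of a calendar time change alert.",
    "erver|Warning| Refreshing watchdog because of a calendar time change alert.",
]
print(count_watchdog_warnings(sample))
```

To run it on a real log, pass an open file object instead of the sample list. The path to the sync log is not given in the post, so it is left out here.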
SpringBoard was killed by watchdog
I'm having this issue: I was listening to music and suddenly my phone froze on a black screen, but the music was still playing. I unplugged my headphones and the phone rebooted. It is still booting: I can see the silver Apple logo with the spinning circle on it. If I connect it to a PC/Mac, iTunes detects the phone and syncs it correctly, but after some time the phone reboots again and the cycle continues. I've tried restoring it through Recovery and DFU mode; the restore finishes normally, but with no success. It has never been jailbroken or unlocked. Please help me!
Here is a part of CrashReporter log:
Incident Identifier: B9C2DDF5-1CE3-4CD3-8587-B1D0FDE066F0
CrashReporter Key: d44f3137f47fb69838e570e9734bd06ba002c307
Process: SpringBoard [24]
Path: /System/Library/CoreServices/SpringBoard.app/SpringBoard
Identifier: SpringBoard
Version: ??? (???)
Code Type: ARM (Native)
Parent Process: launchd [1]
Date/Time: 2010-04-03 04:08:52.492 +0300
OS Version: iPhone OS 3.1.3 (7E18)
Report Version: 104
Exception Type: 00000020
Exception Codes: 0xfaded321
Highlighted Thread: 0
Application Specific Information:
Watchdog timeout: 120.029778s since last successful ping: 1200b50 1200r80b400 1100m10004003 1000m10004003 900m10004003 800m10004003 700m10004003 600m10000004 500m10000004 400m10000004 300m10000004 200m10000004 100m10000004 0m10000004 0c18
0 kernel_task 0xc0235698 0x9 0xc0224e5c
kernel cont 0xc0050fd1
0 kernel_task 0xe0432bb8 0x84 0
kernel cont 0xc0023e05
Message was edited by: MaDDaemoN
I am having this same issue too, and my crash logs look the same.
It looks like SpringBoard is not launching in the allotted time, so the system reboots.
iPhone 3G, 3.1.3.
I've tried every form of restore and reboot I can.
Does anyone else have any ideas? -
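The pattern the crash log points at (a process is considered hung once it has not answered a ping within a fixed window) can be sketched generically. This is only an illustration of the idea, not Apple's implementation: the class name, the injectable clock, and the 120-second default (taken from the "Watchdog timeout: 120.029778s" line in the log) are all assumptions.

```python
import time


class Watchdog:
    """Generic watchdog sketch: expires if no ping arrives within timeout_s."""

    def __init__(self, timeout_s=120.0, clock=time.monotonic):
        self.timeout_s = timeout_s
        self._clock = clock          # injectable so the logic can be tested
        self._last_ping = clock()    # treat construction as the first ping

    def ping(self):
        """Record that the monitored process is still responsive."""
        self._last_ping = self._clock()

    def seconds_since_ping(self):
        return self._clock() - self._last_ping

    def expired(self):
        """True once the monitored process has been silent for too long."""
        return self.seconds_since_ping() > self.timeout_s
```

A supervisor would poll `expired()` periodically and restart or kill the monitored process when it returns true. What the real system does at that point is not shown in the log.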
Unable to set watchdog on cFP2200
I am using a cFP2200 for data acquisition and I want to set the watchdog on two cFP2200 units. Sometimes my code sets the watchdog, and sometimes I get a timeout exception in the following code. Please suggest a fix.
CNiReal32Vector output;
CNiBoolVector status;
m_NIFPDAQSystemPtr->SetNetworkWatchdogTimeout(GetFailoverTimeOut());
CNiFieldPointIoModule &objIOModule=m_NIFPDAQSystemPtr->GetIoModule((short)GetModuleID());
short nChannelCount = objIOModule.GetChannelCount();
objIOModule.GetNetworkWatchdogSettings(status,output);
status[GetChannelNumber()] = TRUE;
output[GetChannelNumber()] = !GetInitialBit();
objIOModule.SetNetworkWatchdogSettings(status,output,INFINITE );
objIOModule.SetNetworkWatchdogEnable(TRUE);
objIOModule.GetChannel(GetChannelNumber()).Data = GetInitialBit();
/// Where m_NIFPDAQSystemPtr is a pointer to CNiFieldPointNetworkModule
Hi RK,
Are you sure the control name is "userAccount.oldpassword"? Verify it by viewing the page source in the browser.
Try this code:
function focusmethod() {
    var Obj = document.getElementById("userAccount.oldpassword");
    Obj.focus();
}
or
Remove the onload function from the <body> tag:
<script>
var Obj = document.getElementById("userAccount.oldpassword");
Obj.focus();
function checkPassword(action) {
Thanks,
Vijay