LMS 4.2 Fault Monotor
Hi
I useLMS 4.2, In fault Monitor I have the folowing messages :
Event_Description
StateNotNormal
Device IP
10.92.8.142
Device Type
Switches and Hubs
Fault Last Updated At
17-Nov-2013 12:36:25
Component
PWR-I@/2058 [Sw2, PS1 Faulty, RPS NotExist]
Component Class
PowerSupply
Component Event Code
1085
Event Category
Environment
Event Source
DFM
Status
CRITICAL
cisco Env Mon Supply State
6 --> I have just 4 (not 6)
But when I check in thw switch everything is ok :
#sh env all
FAN 1 is OK
FAN 2 is OK
FAN PS-1 is OK
FAN PS-2 is OK
SYSTEM TEMPERATURE is OK
System Temperature Value: 30 Degree Celsius
System Temperature State: GREEN
Yellow Threshold : 46 Degree Celsius
Red Threshold : 60 Degree Celsius
SW PID Serial# Status Sys Pwr PoE Pwr Watts
1A C3KX-PWR-715WAC LIT170517XR OK Good Good 715/0
1B C3KX-PWR-715WAC LIT1705180V OK Good Good 715/0
2A C3KX-PWR-715WAC LIT170517UY OK Good Good 715/0
2B Not Present
3A C3KX-PWR-715WAC LIT170518XH OK Good Good 715/0
3B Not Present
SW Status RPS Name RPS Serial# RPS Port#
1 Not Present <>
2 Not Present <>
3 Not Present <>
#sh stack-power de
Power Stack Stack Stack Total Rsvd Alloc Unused Num Num
Name Mode Topolgy Pwr(W) Pwr(W) Pwr(W) Pwr(W) SW PS
SP-1 SP-PS Ring 2860 31 669 2160 3 4
Power stack name: SP-1
Stack mode: Power sharing
Stack topology: Ring
Switch 1:
Power budget: 943
Power allocated: 223
Low port priority value: 20
High port priority value: 11
Switch priority value: 2
Port 1 status: Connected
Port 2 status: Connected
Neighbor on port 1: Switch 3 - 7cad.7416.0680
Neighbor on port 2: Switch 2 - acf2.c5c1.4600
Switch 2:
Power budget: 943
Power allocated: 223
Low port priority value: 22
High port priority value: 13
Switch priority value: 4
Port 1 status: Connected
Port 2 status: Connected
Neighbor on port 1: Switch 1 - 4c4e.359b.c900
Neighbor on port 2: Switch 3 - 7cad.7416.0680
Switch 3:
Power budget: 943
Power allocated: 223
Low port priority value: 21
High port priority value: 12
Switch priority value: 3
Port 1 status: Connected
Port 2 status: Connected
Neighbor on port 1: Switch 2 - acf2.c5c1.4600
Neighbor on port 2: Switch 1 - 4c4e.359b.c900
I don't find any bug for this ussue.
someone has an idea?
Many thans for your help
This seems to be a BUG ,there are few alredy filed for 2960's and 3750's.
CSCtx16194 C3750 stack switches ciscoEnvMonSupplyState returns notFunctioning
CSCuj56140 SE5:Power supply status is wrong when dual power supply is removed
CSCti27620 SNMP trap and snmp walk don't reflect correct status for power supply
CSCtq97867 SNMP Generating "Faulty" status for Power supply when PS is inserted
this is a device side issue.
Thanks-
Afroz
Similar Messages
-
LMS 4.0 Fault Management Module alert doesn't show CurrentUtilization
Hi,
I would like to know if there's a way to show CurrentUtilization percentage within the messages generated by the Fault Management Module in LMS 4.0
EVENT ID = 00008Z1
TIME = S
STATUS = Active
SEVERITY = Critical
MANAGED OBJECT = switch
MANAGED OBJECT TYPE = Switches and Hubs
EVENT DESCRIPTION = HighUtilization::Component=PORT-switch/11150 [Gi3/0/50] [---> TRUNK ];ComponentClass=Port;ComponentEventCode=1057;TrafficRate=5.2261432E7 BYPS;DuplexMode=FULLDUPLEX;UtilizationThreshold=40;MaxSpeed=1000000000;Type
NOTIFICATION ORIGINATOR = Fault Management Module
As you can see above CurrentUtilization percentage is not shown in Event Description section.
Could someone help?
Thank you!
Massimiliano.To my knowledge nothing can be configured here.
You have to take traffic rate and max speed and do the calculation yourself
FAD! functioning as designed
Cheers,
Michel -
LMS 4.2 Fault Manager Issue
Hi All,
We are seeing many Unidentified traps on the DFM for multiple devices.
274. 008ZETI Active InformAlarm Mumbai_6509-1 Mumbai_6509-1: Unidentified Trap Generic Trap:6 Specific Trap:1 EnterpriseOid:.1.3.6.1.4.1.9 04-Feb-2015 01:59:28 NA
275. 008ZETG Cleared InformAlarm Mumbai_6509-1 Mumbai_6509-1: Unidentified Trap Generic Trap:6 Specific Trap:1 EnterpriseOid:.1.3.6.1.4.1.9 04-Feb-2015 01:59:03 NA
276. 008ZETA Active InformAlarm Mumbai_6509-1 Mumbai_6509-1: Unidentified Trap Generic Trap:6 Specific Trap:2 EnterpriseOid:.1.3.6.1.4.1.9.9.109.2 04-Feb-2015 01:58:22 NA
274. 008ZETI Active InformAlarm Mumbai_6509-1 Mumbai_6509-1: Unidentified Trap Generic Trap:6 Specific Trap:1 EnterpriseOid:.1.3.6.1.4.1.9 04-Feb-2015 01:59:28 NA
275. 008ZETG Cleared InformAlarm Mumbai_6509-1 Mumbai_6509-1: Unidentified Trap Generic Trap:6 Specific Trap:1 EnterpriseOid:.1.3.6.1.4.1.9 04-Feb-2015 01:59:03 NA
276. 008ZETA Active InformAlarm Mumbai_6509-1 Mumbai_6509-1: Unidentified Trap Generic Trap:6 Specific Trap:2 EnterpriseOid:.1.3.6.1.4.1.9.9.109.2 04-Feb-2015 01:58:22 NA
Regards,
ChannaHi,
The unidentified trap message in fault manager is expected when LMS receives a trap that is not in the list of traps that the fault manager is capable of processing.
Here are the traps that fault manager can process:
http://www.cisco.com/c/en/us/td/docs/net_mgmt/ciscoworks_lan_management_solution/4-2/user/guide/lms_monitor/lms_mnt/TrapFwd.html
The SNMP traps are only processed by the fault manager and as per the document above the ones that will be identified are pre-defined and the list cannot be modified.
Clearing an Unidentified Trap
You can manually clear Unidentified Traps from LMS. To do this:
Step 1 Select the Unidentified Trap and click Clear.
A message appears prompting you to confirm the clearing.
Step 2 Enter your user ID.
This will be used as a reference to identify who cleared the Unidentified Trap.
Step 3 Click OK to confirm.
The Unidentified trap is cleared.
To retain the trap click Cancel.
- Ashok
Please rate the post or mark as correct answer as it will help others looking for similar information -
LMS 4.2 Fault Monitor - Device Name and Frequency of Events
Hi all,
I've just installed LMS 4.2, like it a lot so far. But I'm running into a few problems, and I'm hoping someone out here has a suggestion.
In the Fault Monitor, the Device Name column shows the device's IP address rather than the host name. We need for it to show the host name, for ease of troubleshooting; most folks don't have the IP addresses memorized. Likewise, when an email is sent out for an event, the managed device field also shows up as the device's IP address.
The devices were all discovered with their IP addresses rather than a host name...should LMS have automatically found their host names? Regardless, I manually updated all of the device's host names, yet they still display as an IP address in the fault monitor.
Also, it appears I need to figure out some way to throttle alerts. One particular device will report an event (ie a temperature out of range) dozens of times in a polling period...several per second, even though it's the same alert. Any suggestions on where I can throttle this? My inbox is exploding.
If anyone has any ideas, I would appreciate it!
JenI am having the IP vs. device name issue as well. The funny thing is if you hover over the IP of the device in Fault Monitor, you can see the device name listed. I was on a conference call with Cisco recently and pointed this out. One of the reps said the faults and alerts showing IP address issue should be fixed in version 4.2.3, but I don't know if he was basing this on first-hand knowledge or was just assuming. Does anyone know if this is the case?
As for throttling alerts. I had a false positive alert from a device that I put in a custom fault group and changed the threshold on to keep them from firing. You probably have already resolved this issue, but I didn't see a resolution listed here. You would have to determine if raising a threshold for a device is adequate for your situation.
Thank you,
Mark -
Ciscoworks LMS 4.0 – Fault Device Details Issue
We currently use Ciscoworks LMS 4.0 but when I go into, Monitor > Fault Settings > Setup > Fault Device Details
I get the following message (see attached document with screenshot) and being a LMS newbie am unsure what to do? As have tried to search for this
file but no luck.
So thanks in advance for any advice.Check if the fault management rediscovery page shows device as discovered and known or does it have any errors?
Are you able to generate any fault management reports and view other pages?
Just try to reboot the server/restart daemon to see if it is goes away.
Else it is mostly corrupt FM DB. Which would need to be re-initialized.
Fault Mgmt reinitialize is very simple task, which doesnt removes a lot of data, except past 31 days of FM history and custom notifications, if configured.
Thanks
Vinod
**Rating Encourages contributors, and its really free. ** -
Cisco Works LMS 4.0 - Fault Monitor Issue
Hi!,
I have a little problem with the LMS application. when I try to access to the screen of: Monitor --> Monitoring Tools --> Fault Monitor, the application shows an error message that only said: "Sorry an error occurred" and thats all.
Any idea of what can I try in order to solve this?,
Thanks,Hi Duong!, thanks for your reply. The services was restarted but the problem still exist (the process DFMCTMStartup is allways down although I started manually)
And the only change that I see is that the Fault Monitor Screen show first a message "There are no faults available" but I Have marked 46 Faults!" -
LMS 4.0: Fault Monitoring Device Administration stuck in learning
Hello Members,
i have a problem with Fault Monitoring Device Administration. i have two devices which stuck in learning mode for almost forever. When the job is done the devices report an error SNMP timeout. I run a credential verfication job and the credentials for the devices are correct.
any ideas?
regards
alexAssuming these are the only four devices using SNMPv3 in LMS, then the engineID is a non-issue. However, if any other device has the same engineID as those two non-working devices, the problem could still be a duped engineID. Debugging with logs is fairly complex. The easiest way to identify the problem is to use a sniffer. Start a sniffer trace filtering on all traffic to one of the failing devices. Then, rediscover the device in Fault Monitoring under Admin > Collection Settings > Fault > Fault Monitoring Device Administration. When it goes to a Questioned state, look at the sniffer trace.
If you see SNMP report packets indicating an error of notInWindow, that points to the duplicate engineID problem. ICMP packets without responses points to problems with ping. -
LMS 4.1 - Fault Notification Group
I've populated my LMS v4.1 device database with all our managed devices (192.168.x.x). When I go to 'FAULT NOTIFICATION GROUP' and I must select either the nodes or groups that are associated with this this group, certain nodes are not appearing in the list where I must add them to the group.
As another test, I've created a 'USER DEFINED GROUP' and added all the nodes into it (including the ones I did not see in my prior test). When I add the "UDG" to the 'FNG", most of the are listed int he "Group Subscription Summary" but not all nodes.
I am running LMS v4.1 on Windows 2008.
I suspect it has something to do with the status of the device but I'm still trying to figure it out.
Any assistance would be appreciated.
Thanks,thanks Joe for your efforts! - and again, you are correct...
resolver.pl LMSserverName
returned 127.0.0.1 as the servers' address - and where did it come from? - the hosts file... There, the hostname and FQDN of the server had entries which pointed to 127.0.0.1 - no clue why this was done. For a test I commented out these entries and at least resolver.pl now returns the correct IP address of the server.
I tried to force a Fault Notification Trap to be sent, but currently no chance; Usually I can do this by just clear a currently active event which gets forwarded, but this time it seems not working. I can mark the event, I get a pop-up to to change the name and make annotations, I even can click on "Yes" to confirm the action -but then nothing happens. - I tried these steps a couple of times the last 45 mins- I think I will wait know until the HighUtil goes a way and drink a wine meanwhile :-)
and I believe cenAlarmServerAddress will be correct now... -
LMS 4.0 Fault discovery snmp timeout
Hi all,
I am running LMS 4.0 on windows 2008 SP2 64-bit.
I have the problem that all devices from dcr stay in the questioned State with error state snmp timeout.
I have done a credential verification and snmpwalk to the devices and this works correctly.
regards
JoergJoerg,
What kind of devices are they that are being questioned?
Make sure Fault Management in 4.0 supports the devices.
Rob -
LMS 4.01 Fault management says "Sorry there was an error"
Dears,
I find myself confronted with an indcation of multiple errors bu no way of viewing them in Faultview.
Other parts of the application do see the errors
At the same time I find the events from syslog ar no longer visisble in LMS.
They still come in the syslog.log, they are read by LMS because the automated action still fire.
But the syslog reports are empty.
What is the common factor here that can cause this.
Cheers,
MichelThanks Nael,
I've click on the number 47 in the faul panel as well as refreshing the entire page. This does not help
I've also tried to select various device groups and sometimes the router group say there are no errors. Going back to the switches group or "all devices" give the same "Sorry an errors occured"
Cheers,
Michel -
LMS 4.0 - Invisible faults
Since I installed LMS 4, the Faults panel at the top of the screen always reports 4 critical faults. When I click on the red icon to go to the Fault Monitor, I get "No faults are available" (screenshot attached).
Anyone know how to fix this?
ThanksThe quickest way is to reinitialize the DFM databases by following the instructions in https://supportforums.cisco.com/docs/DOC-8796 . However, if you want a more tactical analysis you can open a TAC service request, and they can go through the dfmEpm database to look at why these counters are wrong. The counters should be showing the total number of events, which are the atomic conditions occurring on the devices. This would not necessarily line up to the alerts seen at the top level of the Fault Monitor. However, in your case since you don't see any alerts, you shouldn't have any events.
-
LMS 4.2 with Nexus C7010 Switch
Hi Guys,
I tried to add Nexus C7010 into my LMS 4.2, the SNMP work like charm but not for the archieve config.
It just show "Incorrect" under SSH, further checking I found this
"Failed to establish SSH connection to 10.152.xx.xx - Cause: Authentication failed on device 3 times."
I understand that Nexus is using Role-Based Account and there is no enable secret password anymore.
How do I add the Credential for Nexus? Other devices with IOS is no problem at all.
Please Help... Thank you.Pls try this...
There is a legacy Button on top right corner, select DFM, then select Event sets and select the required triggering fields like flap, operationallydown etc... and make it as A (there is a tick mark available to group all these fields under 'A' group.
Secondly, go to email notification group and select all the devices which you want to monitor and select Alerts and over there select 'A', then select Critical, Warning etc, click next next...
For rest of it, pls go by the logic.
E-Mail Configuration Tasks
Managing E-Mail Notification Subscriptions
CiscoWorks LMS Portal > Device Fault Manager > Notification Services > E-Mail Configuration > E-Mail Notification
Admin > Network: Notification and Action Settings > Fault - Email notification
The E-Mail Notification Subscription page displays information about subscription, status and notification group. -
LMS 4.2 communication protocol
Hi All,
We are using the below functionality in the LMS 4.2
- Fault Monitoring (DFM)
- Performance Management (HUM)
- Inventory Collection and Configuration collections (RME)
On device side configuration currently we have configured the RO and RW strings (SNMPv2c). we need to remove the RW string configuration from the device. Is above functionality can work on RO string alone ??
I gone through below link where it says RW is not required for above functions apart from Common services. need to know the role of RW string in the common services.
http://www.cisco.com/c/en/us/products/collateral/cloud-systems-management/ciscoworks-lan-management-solution-3-2/white_paper_c07-552114.html#_Toc236468657
Regards,
ChannaCommon services is the main administrative module of LMS and all the credentials are stored in CS which are than distributed to other modules like HUM, RME etc.
Under RME various individual features like Configuration management, Software Image Management (SWIM), NetConfig etc depends on SNMP RW strings, however not very much and a failure can easily be found if due to removal of RW strings.
Still we recommend to have RW strings unless there is a strict security policy surrounding RW strings.
Other than being the centralized credential repository Common Service has an internal connection to CiscoView and Device Centre which may also need RW strings often.
You can plan to remove it once and see the aftereffects and plan accordingly.
-Thanks
Vinod
**Encourage Contributors. RATE Them.** -
DFM Error after Upgrade from LMS 3.1 to LMS 3.2
Hi Cisco Community,
I have a big problem with the LMS Application Device Fault Manager 3.2.0 after an Upgrade from LMS 3.1 to LMS 3.2
The LMS upgrade finished without error.
The Alerts and Activities Diplay encountered an error with the following message:
"An exception occured.Please check the AAD.log file for further details"
You find the "pdshow" and "netstat" output and also the AAD.log, DeviceManagent.log, EPM.log, brstart.log and TISServer.log attached to this message
Everything else works fine.
A Cisco Daemon Manager restart doesn't fix this problem.
Any ideas to fix this problem?
Thanks for your help.
Best regards
Stephan B.The output of the command "brcontrol" was:
Error attaching to broker: S25B2005:9002!
I found a workaround for this error message at the release notes for the Cisco Unified Operations Manager.
The workaround is:
1.) net stop crmdmgtd
2.) Wait 15 minutes
3.) net start crmdmgtd
After completion of this workaround, I got the following output:
Broker is located at: S25B2005:9002 Started: May 07 11:08:47 2010
Domain Host Name Port PID State Last Change Time
DFM S25B2005.drv.bb 21900 18060 RUNNING May 07 11:08:55 2010
DFM1 S25B2005.drv.bb 21903 13988 RUNNING May 07 11:08:55 2010
You find this outputs attached to this message.
Unfortunately the errors in the DFM application are the same as before.
Also I get some error messages in the "Polling and Threshold" menu.
You find this error message attached to this message.
I already execute the following steps from the Cisco TAC to solve this problem.....without success.
Stop Daemons >>> net stop crmdmgtd from CLI.
Open windows explorer and go to \CSCOpx\databases\dfmFh. Delete dfmFh.log.
Go to \CSCOpx\databases\dfmFh> from CLI and run:
dbsrv10.exe -f dfmFh.db from CLI.
Similarly open windows explorer and go to \CSCOpx\databases\dfmEpm. Delete dfmEpm.log.
dbsrv10.exe -f dfmEpm.db from CLI.
Similarly open windows explorer and go to \CSCOpx\databases\dfmInv. Delete dfmInv.log.
dbsrv10.exe -f dfmInv.db from CLI.
Note : Please check a screen will pop up each time you execute these commands.
Then, Start daemons >>> net start crmdmgtd from CLI. -
Does Ciscoworks keep a log locally on the server for all the Device alerts?
The reason for this question is I got pinged by an Auditor yesterday seeing if we can obtain this type of information.
Thanks in advance.The log file that has information about the alerts that LMS produces is the
AAD.log. It is located at NMSROOT\log\dfmLogs. The kind of information you
will see in the log file depends on the logging level that you currently
have for it.
Additionally you might want to see the syslog.log which is located at
NMSROOT\log and contains info about the syslogs that LMS receives from your
devices.
Also, LMS has a fault history option in which you should be able to see old
alerts.
Maybe you are looking for
-
Hiya, I'm new to this forum but surely not new to the world of technology and technical support. I've been an active technologist for almost 30 years and I run servers, design sites and server applications, author software and a host of other digita
-
MacBook fan running very loudly
I have been having issues with my MacBook overheating and causing the fan to run very loudly (so loudly I can't listen to music or watch videos on it very well). I have had the motherboard replaced twice this year alone by Apple and I still have cove
-
Text display issues with htmlText, Embedded Font
Hey All, I'm having an issue with the display of my hyperlinks in a textfield that is using embedded fonts. It offsets the hyperlinks to the left along the line they are on and the underline doesn't stretch all the way under the text field. The text
-
Hi, I've got a Macbook with os 10.4.11 and two problems re: sizing. 1st-I can't get it to print the full size on my canon mp830 printer. It reduces to about a third of the size and moves to the top left corner. I've tried everything I can think of to
-
How do I get rid of the magnifying cursor ?
Somehow I trigger the magnifying cursor. it is annoying t use and I do not know how to change it back. Any ideas ?