[RTMT-ALERT] CallProcessingNodeCpuPegging

We are receiving an RTMT alert for one of the node in the cluster.
Error is [RTMT-ALERT] CallProcessingNodeCpuPegging.
Processor load over 90 Percent.
Aupair (98 percent) uses most of the CPU.
Please suggest what to do for the resolution of this and also want to know what is this AUPAIR and why is it running on CCM server.
Its urgent.

Hi Sam,
If the CPU pegging alert is reported at some specific time of the day then it is most likely due to some scheduled activity like DRS Backup, CDR Load etc. Please check the following link
http://www.cisco.com/en/US/products/sw/voicesw/ps556/products_tech_note09186a00808ef0f4.shtml
CPU Pegging Alerts
CPUPegging/CallProcessNodeCPUPegging alerts monitor CPU usage based on       configured thresholds:
Note: %CPU is calculated as %system + %user + %nice + %iowait + %softirq +           %irq
Alert messages include these:
%system, %user, %nice, %iowait, %softirq, and %irq
The process that uses the most CPU
The processes that wait on Uninterruptible disk           sleep
CPU Pegging alerts can come up in RTMT due to higher CPU usage than       what is defined as the watermark level. Since CDR is a CPU intensive       application when it loads, check if you receive the alerts in the same period       as when the CDR is configured to run reports. In this case, you can need to       increase the threshold values on RTMT. Refer to       Alerts for more information about RTMT alerts.
HTH
Manish

Similar Messages

  • RTMT ALARM CallProcessingNodeCpuPegging

    We are receiving an RTMT alert for one of the node in the cluster.
    Error is [RTMT-ALERT] CallProcessingNodeCpuPegging.
    Processor load over 90 Percent.
    hostagt (45 percent) uses most of the CPU
    lease suggest what to do for the resolution of this and also want to know what is this hostagt and why is it running on CCM 7.1.3  server.
    Its urgent.
    Regards!!!

    I just download the files, in. txt. Know any documents which support for the interpretation of this information.
    Regards!!

  • RTMT ALERTS FOR CCM

    Getting the following alert in the call manager in the RTMT for low water mark exceeded, my experience tells setting the log alert value to a lower level helps on the issue , but not sure. Can somebody guide on this.
    ===============================================================================================
    12/21/2011 7:30 PM : UC_RTMT-2-RTMT_ALERT  2138: Dec 22 2011 00:30:17.348 UTC :  %UC_RTMT-2-RTMT_ALERT: %[Name=LogPartitionLowWaterMarkExceeded][Detail=
    UsedDiskSpace : 62
    MessageString : Common Disk utilization hits LWM!
    AppID : Cisco Log Partition Monitoring Tool ClusterID :
    NodeID : NY1PUB01
    TimeStamp : Wed Dec 21 19:29:52 EST 2011.
    The alarm is generated on Wed Dec 21 19:29:52 EST 2011.][App ID=Cisco AMC Service][Cluster ID=][Node ID=NY1PUB01]: RTMT Alert
    =======================================================================================================
    ccm 6.1.3

    Abhishek,
    Looks like the lower threshold for UsedDiskSpace is set at around 60% & that's why the alert has raised. Keeping it at 75-80% is also fine. All it means is whenever 75-80 % of disk space is full, give an alert. I feel alert at 60% is too low or can be ignored.
    Pls remember to rate helpful posts.
    GP.

  • RTMT alert CoreDumpFileFound

    Hi all,
    need some help in understanding the following RTMT alert that came in last night:
    At Sat Jan 24 01:29:52 PST 2015 on node , the following CoreDumpFileFound events generated: 
    TotalCoresFound : 1
    CoreDetails : The following lists up to 6 cores dumped by corresponding applications.
    Core1 : Unknown (core.17118.11.cimlistener.1422095365)
    AppID : Cisco Log Partition Monitoring Tool
    ClusterID : 
    NodeID : IPTSUB01
     TimeStamp : Sat Jan 24 01:29:28 PST 2015
    Is this a self-rectifying event?
    Would a therapeutic reboot be warranted?
    Thank you very much for your help.

    Hi Carlo,
    Thank you very much for your feedback..  I followed your suggested approach and it looks like the scenario matches Matthews':  Example 3: Core Stack Corruption ... (please see backtrace below)
      ====================================
     backtrace
     ===================================
     #0  0x004dabae in ?? ()
    #1  0xb542ee80 in ?? ()
    #2  0x00000001 in ?? ()
    #3  0x00000002 in ?? ()
    #4  0x00000000 in ?? ()
    per Matthew's article ...the next steps...  "It is recommended that the corresponding service log (e.g. ccm traces, tomcat logs) and the complete core file be retrieved from the affected system for TAC review."  Could you suggest which ccm traces (and for what period of time) will be needed? also, need a little clarification on the tomcat logs (how-to pull)... lastly, could you provide the best method to copy the core dump file for tac or will the syntax of the utils core analyze <coredump filename> be sufficient?
    Thank you!

  • RTMT ALERT ERROR CiscoDRFFailure

    From the past few days Im getting this RTMT alert constantly,
    Reason : DRF was unable to backup component PHX_CONFIG.Error : Unknown Database Error AppID : Cisco DRF Master ClusterID : 
    Any ideas what this error might be ?

    Hi Kamalakar,
    Looks to be the following
    https://tools.cisco.com/bugsearch/bug/CSCur24834/?reffering_site=dumpcr
    Symptom:
    UCCX backup fails on CUIC component for second node (phx_config)
    Conditions:
    UCCX 10.5 SU1 HA
    Workaround:
    1. Enable Root access on both first node and the second node
    2. Copy the /opt/cisco/desktop/openfire/passphrase from Primary Node to the same location on Secondary Node.
    3. Restart the Unified CCX Notification Service and the Cisco Finesse Tomcat service.
    4. Redo the Back up. It should work this time.
    HTH
    Manish

  • RTMT Alert

    Hi All,
    I have query related to RTMT alert. is it possible to edit/add more information about the alert?
    For Example, recently I got the below alert which has no necessary information like which router/gateway port number cluster details etc.. Its just a blind message.
    MGCP DChannel is out-of-service. Current total of 1 MGCP gateway device(s) with D-Channel-Out-Of-Service status. The alert is generated on Thu May 19 11:10:29 EDT 2011 on cluster StandAloneCluster.
    Do we have any option to enable/edit the more information for RTMT alert in CUCM?
    Thanks in advance !!!!

    Hi,
    I think you have to go into RTMT and click on alert details here to view the actual name of the device.
    Only other quick way to handle this is SNMP traps from the actual router and into email..
    Cheers,
    Tim

  • RTMT alert for PRI

    Is there any way to generate an RTMT alert as soon  as the PRI goes down in cisco voice gateway (MGCP). I have almost 20 gateways and all have multiple T1 circuits  and need to setup on all.

    Thanks Yosh, I have Alerts setup for "Number of Registered Gateways Decreased" but my management wants to have Alert specific for all PRI's.
    I wonder if I setup alert for D-Channel "DatalinkService" does it sent alert for the status of individual PRI "Down/up

  • [RTMT-ALERT] DirectoryConnectionFailed - CM 4.1(3)SR8

    hi
    continually getting alert after configuring RTMT relating to Directory Connection even though everything appears ok.
    [RTMT-ALERT] DirectoryConnectionFailed
    Directory connection failed .
    Monitored precanned object has value of 0.
    my callmanager cluster is integrated into our active directory so not using DC directory and i am wondering if this
    alert relates only to DC directory??
    If it does i will disable it, however, if it also monitors the connection to active directory then i want to leave it enabled.
    Everything looks ok on the callmanagers.
    Cheers,
    G

    it should be fixed on your version but since you mention it's all good
    K07469952
    In the Cisco CallManager 4.1(3), the RTMT generates directory replication and connection alarms even when the directory replication appears to function normally
    http://www.ciscotaccc.com/kaidara-advisor/voice/showcase?case=K07469952
    HTH
    java
    if this helps, please rate

  • RTMT alerts for UCM and UCCX:

    We have had issues with UCCX agents that were unable to login because the CTI service or Tomcat service is hung.  We usually have to restart either CTI or Tomcat on the publisher to correct.   Does anyone know of any RTMT alerts that can be setup to notify us that one of those services is not responding or hung? 

    Hi Hariharan,
    In addition to Atul's link please refer the below mentioned link CPU and Memory usage
    http://www.cisco.com/en/US/docs/voice_ip_comm/cucm/managed_services/cucm_health.html#wp1101115
    Tx,
    Hope this helps
    Shalu

  • RTMT Alert - SDLLinkOutOfService

    We currently have 1-Publisher and 4-Subscribers.  We were told by TAC to turn Call Manager Services off of the Publisher and are now receiving SDLLinkOutOfService alerts on RTMT.  We are trying to determine if we should just disable the alarm on the Publisher in RTMT or if there is a reason to leave this on and determine what could be causing the alarm.
    Thanks,
    Jeff                  

    Here's a copy of the alarm.  Thanks!
    [RTMT-ALERT-StandAloneCluster] SDLLinkOutOfService
    [email protected] [[email protected]]
    Sent:
    Monday, February 10, 2014 10:42 PM
    To:
    Jeff Mize; Julie L. Putman; Robert W. Durham; Greg Knight
    Attachments:
    local  CCM has lost communication with remote CCM on v8.6(2a)SU3
    Current outstanding SDLLinkOOS events:
    LocalNodeId : 1
    LocalApplicationID : 100
    RemoteIPAddress : 10.250.10.11
    RemoteNodeID : 2
    RemoteApplicationID : 100
    LinkID : 1:100:2:100
    AppID : Cisco CallManager
    ClusterID : StandAloneCluster
    NodeID : CMP1
    TimeStamp : Mon Feb 10 22:41:54 CST 2014 
    The alert is generated on Mon Feb 10 22:42:22 CST 2014 on node 10.250.10.10.

  • RTMT Alert Detail

    Hi,
    Is there any way to get more detail from the alerts generated by RTMT. I would like to know what gateway went down but we cannot tell from the messages that are sent from the system.Here is an example of what is recieved when we lose a PRI gateway:
    Thanks!
    Chris

    Hi,
    From the problem description I read that you are trying to figure out which device is
    having this registration and unregistration attempts. Now assuming that you are looking at
    these message from Alert central, could you please also review the Application logs under
    System > Tools > System Viewer for related details?
    --The RTMT alerts  severity must be set to Error, so in UCM can you
    modified it to be "Informational"
    You should  able to find more details next time they
    get a device unregistration message.
    Remember that check on all servers
    the registration and unregistration events are always displayed under the
    UCM server to which the gateway is registered.

  • [RTMT-ALERT-StandAloneCluster] CiscoDRFFailure

    Hi All,
    I am getting alert from my unity connections servers for DRF component. Yesterday I got the alert for pub node and today I got for the sub node around the same time. I have verified the backup status and everything seems to be fine. I am attaching the traces here, Could you tell me whats going on in my network.
    Subject: [RTMT-ALERT-StandAloneCluster] CiscoDRFFailure
    Reason : Unable to access SFTP server or SFTP server too slow to respond.
    AppID : Cisco DRF Master
    ClusterID : 
    NodeID : eccun005-sub
     TimeStamp : Tue Apr 08 21:01:00 GMT+00:00 2014.
    The alarm is generated on Tue Apr 08 21:01:00 GMT+00:00 2014
    Thanks,
    Lajith P

    The alert I got it for today and I could see the line in traces as
    2014-04-08 21:01:00,257 DEBUG [NetMessageDispatch] - drfAlarm:sendAlarm: Sending Alarm: DRFSftpFailure

  • RTMT-ALERT PEPeerNodeFailure

    Hi Folks!
    We have three Unified Presence Nodes running with Systemversion 8.6.4.12900-2. Since I enabled RTMT Email Alerts, I get the follogin Alert from time to time from different nodes (not always the same). Strange think is, that if I check service via serviceability interface, it is up and running without a new downtime:
    [RTMT-ALERT-StandAloneCluster12487] PEPeerNodeFailure
    PEPeerNodeFailureAlarmMessage : Node pe54005002: OUT-OF-SERVICE
    AppID : Cisco UP Presence Engine
    ClusterID : StandAloneCluster12487
    NodeID : cups-01
    TimeStamp : Fri Mar 15 09:59:00 CET 2013.
    The alarm is generated on Fri Mar 15 09:59:00 CET 2013.
    Hope someone can help me in this.
    Thanks.
    Regards

    Thanks Rene,
    We also experienced a network fail-over that seemed to kick off all our problems. After the CUP servers replicated and started migrating users, the nodes would reach high Virtual Memory and CPU and the processes would either crash or the servers would hang.
    during after hours we were able to at least stabalize the servers so they weren't crashing (still had high VirtualMemory and SWAP) with some speradic CPU util, and that's when we noticed the "split Brain" effect.
    Based on your suggestion Rene, we looked at the trace settings and noticed all were set to debug. We turned them all off and restarted the CUP XCP Config manager, and then CUP XCP Router on the Sub node. the Sub node came back up and we were able to see status and IM to user one the PUB.  After a few days of a working scenario we restart the CUP XCP Config and CUP XCP Router on the PUB, which restored virtualmemory and SWAP back to a normal operating conditions.

  • Hardware Failure - RTMT Alert

    Hello,
    Need a quick suggestion on a RTMT alert -- regarding which I need more clarity whether this is just a harmless caveat or something which can lead to an RMA.
    --> Daemon 4 Director Agent: LSIESG_StoragePool_Deleted 500605B001ACAF00 Removed: PD 01(e0xfc/s1) .
    --> Daemon 4 Director Agent: LSIESG_PortController_Modified 500605B001ACAF00 Removed: PD 01(e0xfc/s1)
    Researched on this -- where as   the bug  CSCti17353 can be a cause  , however that goes out of picture if we see the StoragePool message in the alert log.
    Some other bugs pointing out to these :  CSCtn86264 // CSCti05776
    Need more clarity on this - appreciate some response here.
    Product Ver  : 8.6.2.22900-9
    Whatever level you reach getting better never stops -- Sachin Tendulkar       

    Hi Joe,
    Good morning.
    It's a 7835I3 physical server. Please see outputs of show status / hardware in my post. 
    admin:show hardware
    HW Platform       : 7835I3
    Processors        : 1
    Type              : Intel(R) Xeon(R) CPU           E5504  @ 2.00GHz
    CPU Speed         : 2000
    Memory            : 6144 MBytes
    Object ID         : 1.3.6.1.4.1.9.1.585
    OS Version        : UCOS 5.0.0.0-2
    Serial Number     : 1234567

  • RTMT-Alerts

    Hello!
    The RTMT notified me of three alers.
    These are:
    1.,
    Processor load over 90 Percent.
    Tar (74 percent) uses most of the CPU.
    2.,
    Number of MediaListExhausted events exceed 0 within 60 minutes
    3.,
    Number of RouteListExhausted events exceed 0 within 60 minutes.
    Can you tell me what do they mean and how can i fax them?
    Thanks

    1. means that some processes/services are experiencing high CPU load for a period of time.
    Check to see if dial tone is delayed when the phone goes off-hook.Use the Windows Task Manager to check every service's CPU utilization.
    We always have this alert when BARS is running (01:00 AM)
    2-Means a phone tried to use a resource from a RouteList(see CCM Admin Page) and was unable to get one.
    the caller will then get a busy tone.
    (you may ignore this alert and nr 3 unless you see user having problem with the telephony)

Maybe you are looking for

  • How to save file as "current date-current time"

    Hello, I'm trying to rename a file as from file.mov to "current date-current time"movie.mov, So the final product should be something along the lines of 08-18-08-12:59:08movie.mov (or whatever format the current date/time is in osx). Any ideas on how

  • Forte access to AS/400 Databases

    Does anyone have any experiences accessing an AS/400 database from a Forte partition? If so, we would like to know if ODBC or some other technique was used. If ODBC was used, what vendor supplied the ODBC connection? To unsubscribe, email '[email pro

  • Logitech Quickcam Vision Pro as a Flash webcam?

    I need a webcam for my Mac Pro. I'm currently running Leopard but plan to upgrade to Snow Leopard soon. I'm looking at the Logitech Quickcam Vision Pro, but I've read reviews that say it works well in applications like iChat and Skype, but isn't reco

  • Why my sent emails are not showing up in the sent box

    The emails I'm sending are not showing up in my sent box. Why is this happening? 

  • WLC 7.4.100.0 Downgrade

    Hello, Some time ago I updated a WLC, model 2504, from version 7.3 to 7.4.100.0. I also update the FUS (Field Upgrade Software) to the latest release, 1.8.0.0. Now I need to downgrade the WLC back to 7.3 version. My doubt is: Can I just take the norm