Top- Large run queue & High load averages
From which section of top we can find Large run queue & High load averages ?
Can you be a little bit more specific about what OS your running on? Top is not consistent on all versions of Linux...
This link may help
http://www.oracle.com/technology/pub/articles/advanced-linux-commands/part2.html
Similar Messages
-
Very high "load average" in top
Hi,
our OES11SP1 two-server-cluster (fully patched) shows a very high "load
average" (>50, up to 110) in top in some circumstances. There are no
problems in normal operation, but administrator actions like shutdown or
cluster migrate might trigger the problem.
For example when I enter 'halt', then there is the following line in
/var/log/messages:
Sep 12 20:27:18 srv1 shutdown[14675]: shutting down for system halt
more than 20 minutes later:
Sep 12 20:51:19 srv1 init: Switching to runlevel: 0
Within thes 20 minutes nothing happens, but "average load" goes up to at
least 50, with ndsd at top. Access to storage related tools and commands is
not possible, for example 'nss /pool' hangs without any output.
This happens on nearly every shutdown, but from time to time it doesn't. The
same will sometimes be triggered by a cluster migrate.
This only happens with our OES11SP1 cluster, it does not happen with OES11
and OES2SP3; the only other difference I'm aware of: Novell CIFS is only
running on the OES11SP1 cluster.
Any ideas?
Thanks,
MirkoSorry for the delay, it seems it's a bad habit of me to ask questions
immediately before holidays...
Yes, these servers have replicas, all of them... Cache size is set to 195328
KB, which is about twice the DIB size. IIRC this was a recommendation I read
somewhere at Novell. But I'll check that information again.
Thanks,
Mirko
kjhurni wrote:
>
> Mirko Guldner;2283539 Wrote:
>> top shows ndsd on top - but it's there in normal operation too, so I
>> don't
>> know if this means something.. (?) And it's not always the CPU which is
>> at
>> 100% - I have an example screenshot with: load average 50.20, 51.61,
>> 41.0
>> 3.2%us, 1.0%sy, 0.0%ni, 77.0%id 18%wa 0.0%hi 0.3%si 0.0%st. But this is
>> only
>> an example - this differs.
>>
>> Thanks,
>> Mirko
>>
>> kjhurni wrote:
>>
>> >
>> > Mirko Guldner;2283448 Wrote:
>> >> Hi,
>> >>
>> >> our OES11SP1 two-server-cluster (fully patched) shows a very high
>> "load
>> >> average" (>50, up to 110) in top in some circumstances. There are no
>> >> problems in normal operation, but administrator actions like
>> shutdown
>> >> or
>> >> cluster migrate might trigger the problem.
>> >>
>> >> For example when I enter 'halt', then there is the following line in
>> >> /var/log/messages:
>> >>
>> >> Sep 12 20:27:18 srv1 shutdown[14675]: shutting down for system halt
>> >>
>> >> more than 20 minutes later:
>> >>
>> >> Sep 12 20:51:19 srv1 init: Switching to runlevel: 0
>> >>
>> >> Within thes 20 minutes nothing happens, but "average load" goes up
>> to
>> >> at
>> >> least 50, with ndsd at top. Access to storage related tools and
>> commands
>> >> is
>> >> not possible, for example 'nss /pool' hangs without any output.
>> >>
>> >> This happens on nearly every shutdown, but from time to time it
>> doesn't.
>> >> The
>> >> same will sometimes be triggered by a cluster migrate.
>> >>
>> >> This only happens with our OES11SP1 cluster, it does not happen with
>> >> OES11
>> >> and OES2SP3; the only other difference I'm aware of: Novell CIFS is
>> >> only
>> >> running on the OES11SP1 cluster.
>> >>
>> >> Any ideas?
>> >>
>> >> Thanks,
>> >> Mirko
>> >
>> > Which process(es) does top show as being the culprit?
>> >
>> > In the past (on OES2 SP3) we had issues with CIFS causing ncp to
>> cause
>> > high utilization, but that was fixed a while ago.
>> >
>> > --Kevin
>> >
>> >
>
> I have seen ncp issues cause high ndsd utilization, but we've not yet
> upgraded our cluster or DS servers to OES11 yet (waiting for new
> hardware to go in place first).
>
> Out of curiosity, are the servers with high utilization also replica
> servers? For some reason, during one of our upgrades on a replica
> server (we have a server that contains all R/W copies of everything),
> the cache size got set down really low and that caused all sorts of
> issues.
>
> Maybe one of my collegues will wander by and offer additional insight,
> as this may be eDir related and/or NCP related. Not sure if triggering
> a core manually would help (but you'd have to send that to Novell and
> open an SR to get it read).
>
> IF you suspect CIFS, do you have the ability to temporarily shut off
> CIFS for like a few days to see if that's the culprit?
>
> -
[Solved] Excessive high load average
My laptop seems to have an very excessive high load average. My system gets very slow even when I'm running openbox with a browser, terminal and music player. That is enough to make applications freeze for a few seconds. I've seen the load average goes higher 5.00. Right now I just logged in openbox using lxdm, open a browser and a terminal and my load average is 23:11:21 up 12 min, 2 users, load average: 1.15, 0.92, 0.58.
$ top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
1367 root 20 0 48140 2392 1832 R 100 0.1 7:39.51 lxdm-binary
1411 jesse 20 0 402m 9800 7176 S 1 0.2 0:03.10 indicator-multi
1370 root 20 0 110m 14m 6388 S 0 0.4 0:03.50 X
1479 jesse 20 0 263m 13m 10m S 0 0.3 0:00.99 terminal
1 root 20 0 4180 692 592 S 0 0.0 0:00.61 init
2 root 20 0 0 0 0 S 0 0.0 0:00.00 kthreadd
3 root 20 0 0 0 0 S 0 0.0 0:00.00 ksoftirqd/0
6 root RT 0 0 0 0 S 0 0.0 0:00.00 migration/0
7 root RT 0 0 0 0 S 0 0.0 0:00.00 watchdog/0
8 root RT 0 0 0 0 S 0 0.0 0:00.00 migration/1
9 root 20 0 0 0 0 S 0 0.0 0:00.00 kworker/1:0
As yon can see the process 1367 is using 100% of the cpu. I have no idea why this is happening. Also my laptop isn't old. It has intel i5 processor with 4gm ram.
Anybody has an idea of what is happening?
EDIT: I just checked my load average again
[jesse@myarch ~]$ uptime
23:23:46 up 24 min, 3 users, load average: 4.42, 2.19, 1.25
That is an absurd value, isn't is? Ah, and my system is up to date, if anyone asks.
Last edited by sollidsnake (2012-02-17 12:35:34)well it looks like back in december there were some bugs reported with libglib2.0-0 2.31.2 that where causing this issue you can try updating to make sure you have the most current version.
pacman -S glib2 -
One of 4 node RAC always have higher load averages and higher than others
Hello,
We have a 4 node rac, 9208 on linux 4. When viewing top, we noticed the same one node always have a higher load average than the other 3 nodes. Is this normal. Loan balance is working fine but this one node always have higher load average. This is the node where we do the rac installation. Thank you.I do not remember what is the default for clb_goal (client load balancing) for 9i but 10g is LONG.
check it
select clb_goal from dba_services where name = <service name>
you may have to change from LONG to SHORT OR SHORT to LONG depending your connection types.
dbms_service.MODIFY_SERVICE(‘<service>’,clb_goal=> dbms_service.CLB_GOAL_LONG);
Read the following article.
http://www.databasejournal.com/features/oracle/article.php/3659411/Oracle-RAC-Administration---Part-15-Connection-Load-Balancing-and-FAN.htm -
High load averages, low CPU usage
HI,
I recently upgraded to Lion and I am noticing high load averages ~0.7 for my system. The CPU is, however ~95% idle. I am not running any intensive apps. I first thought it is an I/O issue, but I have almost 500 MB free memory, and there is no disk activity. The system is perfectly fine, with all the eye candy/animations running smoothly. Is this a bug?
Thanks for any help.Same problem here
Did a clean install of Lion, moved everything manually. System was clean and running fast. Then, for some reason started to slow down after a few months. My last system (macbook snow leopard) was running fine for 3 years.
I noticed HIGH load averages (over 2.5!) while CPU is ideling (only 'round 30% for user and system). System is slow and CPU gets hot, resulting in loud fan noise.
Googled a lot, did standard maintenance tasks, tried to pinpoint cause - nothing so far. Will update when I find out more. Maybe someone else has a clue or Apple releases a fix. Fingers crossed. -
[SOLVED] High load average in X at idle
Hello Archers,
Recently my laptop has been showing abnormally high 1-minute load averages (~0.20-0.80 on a dual-core machine) at idle whenever I have X running. This figure has always settled down to 0.00 after a while, and indeed it does when I exit to the console. What's puzzling is that top reports a less-extreme 2-3% CPU usage number, with X taking 1% at most (still high, though, considering I'm running a very lightweight DWM setup). To be sure, this does heat up the CPU noticeably. I don't think I've changed anything on my system except for routine updating, and I've made sure my .xinitrc script isn't the culprit.
Any ideas on why this is?
EDIT: Marked as [SOLVED].
Last edited by ktkhuong (2010-08-27 00:28:45)Kernel .35.
https://bbs.archlinux.org/viewtopic.php?id=103346
Last edited by karol (2010-08-26 10:33:35) -
WRT1900AC running at high loads
I had too many issues with the EA6900, and connection dropping, so i decided to try the WRT1900AC, and OMG I LOVE IT!!! on 5 GHz network i get full signal 40 ft away from the router in another room with the door shut.
One question though..... I noticed the load on my EA6900 was around .050 on average but on my WRT1900AC its always around 1.30 CPU load...
Is this normal, and or just because its a faster CPU that it is able to do more so it runs a bigger load??? I'm a linux geek, and owned a web hosting business so i'm familiary with average loads etc..
THANKS!Thanks for your posts
Keep in mind that the WRT1900AC is a dual core system. So a load of 2.00 is 100% capacity.
For one minute you had 4.06 which high but still fine as long as it's not sustained.
For five minutes you had 2.08 with is fine but the WRT1900AC was running at capacity
For 15 minutes you had 1.77 which is good but doesn't leave a lot of head room
Keep monitoring and see if the 15 minute load is always below 2.00. That would mean that for the most part the WRT1900AC is running under capacity.
Please remember to Kudo those that help you.
Linksys
Communities Technical Support -
[Solved] High load average even when idle after update
Hello arch fellows!
I have a Dell Inspiron N4110 intel i5 4gb ram. I think it is a decent laptop and should run Arch Linux pretty well, right? Well, it does until I run 'pacman -Syu'. After the update, the load average gets over 1.00 even when idle. It hardly gets under 0.40. Before updating, the average is under 0.10 most times. The command top doesn't show me anything. All the processes are practically 0%. It seems to me the problem would be the kernel, am I right? But I tried the fallback option when booting but it is the same.
Does anybody have an idea of what could be the problem? I appreciate any help.
Last edited by sollidsnake (2012-05-18 00:32:52)Thanks for the reply. It was really a kernel problem. I downgraded it to an older version and it looks normal now
-
Hi all,
System: SunOS 5.10 Generic_127127-11 sun4u sparc SUNW,SPARC-Enterprise
load: 4:08pm up 78 day(s), 6:57, 2 users, load average: 1.10, 1.14, 1.14
ps -ef | more
root 3 0 0 Oct 29 ? 1185:48 fsflush
Is it normal that fsflush eats that much of time?
Users told me, that it took several minutes to open files. What could be the issue?
If you need more information (vmstat, iostat) let me know.
Thanks in advance.
Cheers,
Axelasgrunix wrote:
Hi all,
System: SunOS 5.10 Generic_127127-11 sun4u sparc SUNW,SPARC-Enterprise
load: 4:08pm up 78 day(s), 6:57, 2 users, load average: 1.10, 1.14, 1.14
ps -ef | more
root 3 0 0 Oct 29 ? 1185:48 fsflush
Is it normal that fsflush eats that much of time? Doesn't seem unreasonable. A few hours of CPU time in 2 1/2 months? I wouldn't be looking there for your performance issues.
Users told me, that it took several minutes to open files. What could be the issue?Slow storage? Failing storage? Heavy load on the machine at the time? Any of those things.
Darren -
All,
I see that when my machine is in idle state, i see that its load increases to 2+
7:46 up 53 mins, 2 users, load averages: 2.15 1.57 1.28
USER TTY FROM LOGIN@ IDLE WHAT
alan.j console - 7:11 34 -
alan.j s000 - 7:11 - load
Any idea what might be causing it?
-Alan.All,
Can i've an update on this? -
Hi Experts,
I have a SOA deployed on AS 10.1.3.2 which is integerated with BI EE 10.1.3.2 on OHEL 4.
With this setup, I have seeing very high load average on cpu side. When I stop the soa oc4j the load average comes to normal level of under 1. While with soa process started it goes as high as 15 which is pretty abnormal.
Any pointers to debug what could be the issue will be helpfu.
Thanks,
RishiHi Experts,
I have a SOA deployed on AS 10.1.3.2 which is integerated with BI EE 10.1.3.2 on OHEL 4.
With this setup, I have seeing very high load average on cpu side. When I stop the soa oc4j the load average comes to normal level of under 1. While with soa process started it goes as high as 15 which is pretty abnormal.
Any pointers to debug what could be the issue will be helpfu.
Thanks,
Rishi -
hey all,
my new x4100s running linux sit at a load average of 8+, even when idle.
this page http://www.mail-archive.com/[email protected]/msg45814.html
mentions a workaround, by rmmodding ohci_hcd - however that kills usb, which i need.
any ideas?What version of linux are you using?
A load average of 8+ means something is truly wrong. My 4100s running RedHat Enterprise Linux V3, Update 6 have a load average at idle of 0, as they should.
Did you do a 'top' to see what is consuming the CPU?
Here is a list of the modules loaded on a typical 4100 in our shop:
[root@iterppn]# lsmod
Module Size Used by
parport_pc 29185 0
lp 15089 0
parport 43981 2 parport_pc,lp
autofs4 24009 5
i2c_dev 13633 0
i2c_core 28609 1 i2c_dev
nfs 245617 2
lockd 77809 2 nfs
nfs_acl 5185 1 nfs
sunrpc 175545 7 nfs,lockd,nfs_acl
ds 21449 0
yenta_socket 22977 0
pcmcia_core 69329 2 ds,yenta_socket
button 9057 0
battery 11209 0
ac 6729 0
sr_mod 20581 0
usb_storage 70921 0
md5 5697 1
ipv6 282913 24
joydev 11841 0
ohci_hcd 24273 0
hw_random 7137 0
e1000 120761 0
dm_snapshot 19073 0
dm_zero 3649 0
dm_mirror 32465 0
ext3 137809 6
jbd 69104 1 ext3
dm_mod 68097 24 dm_snapshot,dm_zero,dm_mirror
mptscsih 2753 0
mptsas 11981 3 mptscsih
mptspi 11725 1 mptscsih
mptfc 10825 1 mptscsih
mptscsi 46161 3 mptsas,mptspi,mptfc
mptbase 66721 4 mptsas,mptspi,mptfc,mptscsi
sd_mod 19393 3
scsi_mod 141457 7 sr_mod,usb_storage,mptsas,mptspi,mptfc,mptscsi,sd_mod
You might want to compare your module list to this one. We have USB support and no high load averages. -
Ubuntu running under Hyper-V with high load
I'm running a VM for Ubuntu Server 12.04 LTS under Hyper-V and with no web server or any other public server installed, the system load is 1.0
The VM is using a 75GB fixed VHD on its own partition (the hard drive is using raid 0).
Is this normal for running a linux OS under Hyper-V?
Here's what top shows (i restarted the vm for fresh numbers, but it still shows a load of 1.00 even after its been running for a day):
top - 16:58:17 up 23 min, 1 user, load average: 1.00, 0.58, 0.24
Tasks: 235 total, 1 running, 234 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0%us, 0.0%sy, 0.0%ni, 98.8%id, 1.0%wa, 0.0%hi, 0.2%si, 0.0%st
Mem: 4040336k total, 870384k used, 3169952k free, 13232k buffers
Swap: 7812092k total, 0k used, 7812092k free, 225396k cached
The host OS is running Windows 2012 R2 and it too isint running anything else that would put a lot of load on the server.Hi Sir,
The following daemons must be installed manually for Ubuntu distributions:
VSS Snapshot daemon – This daemon is required to create live Linux virtual machine backups.
KVP daemon – This daemon allows setting and querying intrinsic and extrinsic key value pairs.
To install both daemons, please use the following command:
Copy
# sudo apt-get update
# sudo apt-get install hv-kvp-daemon-init
# uname –r
<kernel release>
# sudo apt-get install linux-tool-<kernel release>
# sudo apt-get install linux-cloud-tools-<kernel release>
Please refer to note 5 , 9 within following article :
https://technet.microsoft.com/en-us/library/dn531029.aspx
As for linux VM running on hyper- v, I would suggest you to try to get further assistance from this forum:
https://social.technet.microsoft.com/Forums/en-US/home?forum=linuxintegrationservices&filter=alltypes&sort=lastpostdesc
Best Regards,
Elton Ji
Please remember to mark the replies as answers if they help and unmark them if they provide no help. If you have feedback for TechNet Subscriber Support, contact [email protected] . -
Correlation between cpu load and the run queue
We are seeing cases where our cpu utilization is less than 50 % but we are getiing run queues of more than 10 minutes. I would expect a high run queue if we were seeing higher cpu utilization but I am having trouble correlating a high run queue with low cpu utilization. Has anyone else seen a condition like this or have any insights. Database is 11g RAC Exadata X2 full rack. We are using ORM to manage CPU resources as well.
We are seeing cases where our cpu utilization is less than 50 % but we are getiing run queues of more than 10How many cores do you have?
If 4, then it looks OK.
Even if 8, it may be correct.
The core does not pickup a waiting thread right after a running thread starts waiting for something like I/O. The switching of context has a cost.
A time of delay may depend on CPU and kernel scheduler, but generally it should be some delay after CPU switches context from one process/thread to another, in case when a running process started waiting prematurely in the middle of its time slice. That is why you may see less than 100% CPU utilization. -
Persistent Active Requests causing high CPU usage and load average
Hi, we've deployed a JSP application on standalone OC4J talking to Oracle DB. The whole thing runs on Sun Solaris. On some occasions (still investigating what causes this), the number of active requests rises (as seen from EM console) and never drops down. The weird thing is, the load average numbers are proportional to the number of persistent active request. If there are 3 active requests, load avg will be [3.x 3.x 3.x], and whenever persistent active request exist, CPU usage will shoot up to over 90% (usually between 95% - 100%). When the system is running normally, CPU usage is usually less than 20%. Is it possible to track down this problem and how? Many thanks.
BlueAeon
Welcome to Apple Discussions!!
The icons on the right of the menu bar … show a beachball when I put the cursor over themWell, if this was your only problem I would put it down to a corrupt preference or cache file. Nevertheless, it is worth seeing how much this helps.
The file you need to trash is 'systemuiserver.plist'. But there are potentially two of them, plus it is best to delete your caches as well, so ...
In "/Users/<yourname>/Library/Caches", delete everything
In "/Users/<yourname>/Library/Preferences", delete com.apple.systemuiserver.plist
In "/Users/<yourname>/Library/Preferences/ByHost", delete com.apple.systemuiserver.xxx.plist where 'xxx' is a 12 digit (hexadecimal) numeric string.
See if this improves the situation, and post back with any remaining oddities.
Maybe you are looking for
-
How can I use my iMac (2011 - 27") to play my x-box ?
I understand there are issues around the Thunderbolt port - is there any 3rd party adaptor available to allow 'video in' to the new iMac ?
-
Reporting exceptions on deferred constraints
Is it possible to report exceptions on deferred constraints? I am using this mechanism to load tables in an arbitrary order and to prevent FK violations whilst loading. As a starting point, the script below works as expected (enabling and disabling):
-
Hi gurus, I need help to solve basic problem ^^ I want to copy a field of IT0002 into a field of IT0105 ( P0002-inits to P0105-userid ) I must do that using dynamic action but I dont knwo the exact parameters to fill and them syntax :/ (condition ne
-
Image searches yield unwanted results
Say for example, i type in sandal and click search. It takes me to the search results, i find a picture i like, and click on it. when i go to the link that says "see full size image" safari pauses, the url jumps around and then it brings me to some s
-
Photoshop download install problems
Hello all, I bought photoshop off of amazon.com (the download-able version) and when trying to install it I get this message: "Error 1335. The cabinet file 'Data1.cab' required for this application is corrupt and cannot be used. This could indicate a