RHEL 6 CPU load
-
bfly88 at gmail.com
Thu Nov 21 14:57:41 UTC 2013
> What about the information from top? When comparing RHEL5 and RHEL6 systems,
> I would compare the total CPU usage of the server (out of 100% not 2400% or
> 1600%).
>
> Since the hardware is different, comparing a 16 named threads on a 16 core
> box at ???MHz against a 24 core box with 24 named threads at ???MHz may not
> necessarily be valid. If the CPUs are running at the same frequency (look at
> what speed they are actually running at vs the max speed... see
> /proc/cpuinfo ) then you can probably account for the 16 vs 24 core
> difference pretty easily. If the CPUs run at more than negligibly different
> frequencies you will have to factor that into any comparison or make the
> frequencies the same to make a 1:1 good comparison.
Both systems have processors running at the same clock speed (RHEL 6:
X5675 @ 3.07GHz, RHEL 5: X5667 @ 3.07GHz). The RHEL 5 box is just a
little older and has only 16 CPUs. I've also run named on the RHEL 6
system with only 16 worker threads (named -n 16) to see if it made a
difference, and the result was the same: named on the RHEL 6 system
shows 6-7 times the load of named on a RHEL 5 system.
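For anyone who wants to check actual versus nominal clock speed as
suggested in the quoted text, here is a rough Python sketch that pulls
the "cpu MHz" lines out of /proc/cpuinfo. The function names are mine,
and it assumes an x86 Linux box where that field is present; on other
platforms it simply finds nothing.

```python
import re


def cpu_mhz_values(cpuinfo_text):
    """Return the 'cpu MHz' value listed for each logical CPU."""
    matches = re.findall(r"^cpu MHz\s*:\s*([0-9.]+)", cpuinfo_text, re.MULTILINE)
    return [float(m) for m in matches]


def summarize(cpuinfo_text):
    """Summarize CPU count and the spread of current clock speeds."""
    values = cpu_mhz_values(cpuinfo_text)
    if not values:
        return None  # field not present (assumption: non-x86 or non-Linux)
    return {"cpus": len(values), "min_mhz": min(values), "max_mhz": max(values)}


if __name__ == "__main__":
    try:
        with open("/proc/cpuinfo") as f:
            print(summarize(f.read()))
    except OSError:
        print("/proc/cpuinfo is not available on this system")
```

A large gap between min_mhz and max_mhz, or values well below the
rated speed, would suggest frequency scaling is in play on that box.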
We aren't running DNSSEC, so I don't think the managed-keys-directory
should be an issue.
Running a 30 second strace on one of the named threads shows the
process is a lot busier and has more errors on the RHEL 6 system:
RHEL 6:
> strace -c -p 29904
Process 29904 attached - interrupt to quit
Process 29904 detached
% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 99.93    9.074464         135     67128     20375 futex
  0.04    0.003689           1      2538      2430 recvmsg
  0.02    0.002141           1      2498           write
  0.00    0.000256           2       103           sendmsg
  0.00    0.000138           4        36           socket
  0.00    0.000065           2        36           connect
  0.00    0.000019           0        72           setsockopt
  0.00    0.000000           0        36           close
  0.00    0.000000           0        36           bind
  0.00    0.000000           0        36           getsockopt
  0.00    0.000000           0       108           fcntl
------ ----------- ----------- --------- --------- ----------------
100.00    9.080772                 72627     22805 total
RHEL 5:
> strace -c -p 18498
Process 18498 attached - interrupt to quit
Process 18498 detached
% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 99.97    1.549134          69     22399      5604 futex
  0.02    0.000243           0       720       698 recvmsg
  0.01    0.000193           0       722           write
  0.00    0.000017           0        36           socket
  0.00    0.000000           0        36           close
  0.00    0.000000           0        36           connect
  0.00    0.000000           0        39           sendmsg
  0.00    0.000000           0        36           bind
  0.00    0.000000           0        72           setsockopt
  0.00    0.000000           0        36           getsockopt
  0.00    0.000000           0       108           fcntl
------ ----------- ----------- --------- --------- ----------------
100.00    1.549587                 24240      6302 total
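
To put the two summaries side by side, here is a small Python sketch
that computes a few ratios from the futex and total rows above. The
numbers are copied from the strace output; the dictionary keys and the
compare() helper are just names I made up, and it assumes both traces
covered the same 30 second window.

```python
# Figures copied from the futex and total rows of the two summaries above.
RHEL6 = {"futex_seconds": 9.074464, "futex_calls": 67128,
         "futex_errors": 20375, "total_calls": 72627}
RHEL5 = {"futex_seconds": 1.549134, "futex_calls": 22399,
         "futex_errors": 5604, "total_calls": 24240}


def compare(new, old):
    """Return ratios of the newer trace to the older one, plus error shares."""
    return {
        "futex_time_ratio": new["futex_seconds"] / old["futex_seconds"],
        "futex_call_ratio": new["futex_calls"] / old["futex_calls"],
        "total_call_ratio": new["total_calls"] / old["total_calls"],
        "new_futex_error_share": new["futex_errors"] / new["futex_calls"],
        "old_futex_error_share": old["futex_errors"] / old["futex_calls"],
    }


if __name__ == "__main__":
    for name, value in compare(RHEL6, RHEL5).items():
        print(f"{name}: {value:.2f}")
```

With these inputs the futex time ratio comes out a little under 6
while the call ratio is about 3, which matches the usecs/call column
roughly doubling (135 vs 69). About 30% of futex calls return an error
on RHEL 6 versus about 25% on RHEL 5.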
Are others seeing the named process run at 130-180% CPU on RHEL 6?
We've never seen CPU usage this high for named on any system, including
a Solaris 10 system with 32 CPUs. They have all run at around 11-30%
CPU, depending on the time of day.
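
To express these figures out of 100% of the whole server, as the quoted
text suggests, the arithmetic is just a division by the number of
logical CPUs. The sketch below assumes the percentages are on top's
per-CPU scale (100% = one core), that the RHEL 6 box has 24 logical
CPUs, and, purely for illustration, applies the 11-30% range to the 16
CPU RHEL 5 box. The function name is hypothetical.

```python
def whole_box_percent(top_percent, logical_cpus):
    """Convert a per-CPU-scale percentage into a share of the whole machine."""
    return top_percent / logical_cpus


if __name__ == "__main__":
    # Assumption: 24 logical CPUs on RHEL 6, 16 on RHEL 5.
    for label, low, high, cpus in [("RHEL 6", 130, 180, 24),
                                   ("RHEL 5", 11, 30, 16)]:
        lo_share = whole_box_percent(low, cpus)
        hi_share = whole_box_percent(high, cpus)
        print(f"{label}: {lo_share:.2f}% to {hi_share:.2f}% of the whole box")
```

If the per-CPU assumption does not hold for the older numbers, the
comparison would need to be redone on whatever scale they were taken.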
--
Daniel