Bind 9.3.0 on FreeBSD 5.3 exits with signal 11

Dirk Janssen djanssen at netcologne.de
Wed Jan 19 20:30:53 UTC 2005


Hi,

I've got a problem with Bind9.3.0 under the following circumstances:

- Server: Sun Fire V65x, 2 x Xeon 2,8Ghz (Running with Hyperthreading), 2 GB=
   
  Ram
- OS: FreeBSD 5.3-STABLE, interface configured with two IPs
- 2 completly seperated instances of Bind 9.3.0 compiled from source,=
 running 
  with uid bind and each listening on a different ip
- I've compiled bind with the following options:
  --prefix=3D/usr/local/bind9/bind9-res1 \
  --exec-prefix=3D/usr/local/bind9/bind9-res1 \
  --with-openssl=3Dno

The Server is one of our caching-only-nameservers (serves only 3 zones and=
 does 
recursion otherwise) for our customers (in use for just about 3 days) and=
 each 
instance of bind processes about 500-1000 queries/s.
Everything works fine so far, but there is one major problem: 
The instances of named exit with signal 11 and coredump approximately every=
 
14-18 hours always at times when the cache of each named has grown about 550=
 mb 
and the queries are at their peak (max. 1000 queries/s). What I've done so=
 far 
is to limit the max-cache-size to 500 mb and modify 
recursive-clients/tcp-clients but without any effort. 
Here are the relevant parts of the options-statement:

++++++++++++++++++++++++++++++++++++++++++
options {
        listen-on { xxx.xxx.xxx.xxx; };
        listen-on-v6 { none; };
        directory "/usr/local/bind9/bind9-res1";
        pid-file "named.pid";
        allow-query { known_hosts; !any; };
        allow-transfer { trusted; !any; };
        blackhole { block; };
        notify no;
        recursive-clients 5000;
        tcp-clients 500;
        querylog no;
        max-cache-size 500M;
};
++++++++++++++++++++++++++++++++++++++++++

Another thing that I've encountered with this version of bind is that if the=
 
cache has grown approximately every hour each instance of bind comsumes=
 around 
95% CPU for a short period (estimated 1-2 minutes) which produces a high=
 load on 
the server. During this time named answers only a limited number of queries=
 
(around 100-200 queries/s), the others probably will timeout.

Has anybody an idea to solve this issue? Any help or hints would be=
 appreciated.

Kind Regards,

Dirk







More information about the bind-users mailing list