Solaris 10 Server Fail errors

JINMEI Tatuya / 神明達哉 Jinmei_Tatuya at isc.org
Tue Nov 11 19:15:12 UTC 2008


At Tue, 11 Nov 2008 09:29:05 -0500,
"Davenport, Steve M" <SDavenpo at mc.utmck.edu> wrote:

> I'm running 9.3.5-p1 on a lightly loaded secondary caching server under
> Solaris 10 sparc. I noticed from a stats graph and snoop that when a
> resolver hits this server with closely spaced, rapid queries, Server
> Fails are generated.
> 
> 13:45:51.51634  ns-p -> hhost DNS R  Error: 2(Server Fail)
> 13:45:51.51652  ns-p -> hhost DNS R  Error: 2(Server Fail)
> 13:45:51.51664  ns-p -> hhost DNS R  Error: 2(Server Fail)
> 13:45:51.51677  ns-p -> hhost DNS R  Error: 2(Server Fail)
> ...
> 
> During this time (or at least when I am looking), idle CPU is around
> 93%, the bind process uses about 4%, and the named process is using
> about 32M of memory.
> The "rndc status" shows recursive clients:  15/1000
> Open sockets from "lsof -n|grep -c named":  40
> "netstat -I 5" shows no interface errors
> 
> Would 9.3.5-P2 reduce the server fails or should I be looking at another
> version? Has anyone running 9.3.5-p1 under Solaris 10 seen this issue?

9.3.5-P1 can return Server Fail responses more often when it receives
a burst of queries in a short period.

I'd strongly recommend that you try 9.3.6rc1, which should be much
better than 9.3.5-P1 and should be as stable as 9.3.5-P2.  If you
still see the same problem with 9.3.6rc1, please report it again.
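
To compare behavior before and after the upgrade, it may help to count
the Server Fail responses per second in a snoop capture. The following
is only a minimal sketch in Python, assuming the snoop summary format
shown in the quoted output above; the name servfail_per_second is
hypothetical and is not part of BIND or snoop.

```python
import re
from collections import Counter

# Matches snoop summary lines such as:
#   13:45:51.51634  ns-p -> hhost DNS R  Error: 2(Server Fail)
# The timestamp is captured down to whole seconds (HH:MM:SS) only.
# Assumption: other snoop versions or options may print a different layout.
LINE_RE = re.compile(
    r"(?P<ts>\d{2}:\d{2}:\d{2})\.\d+\s+\S+\s+->\s+\S+\s+"
    r"DNS R\s+Error:\s+2\(Server Fail\)"
)


def servfail_per_second(lines):
    """Return a Counter mapping HH:MM:SS to the number of Server Fail replies."""
    counts = Counter()
    for line in lines:
        # search() rather than match(), so lines pasted from a mail
        # message with a leading "> " quote prefix are still recognized.
        m = LINE_RE.search(line)
        if m:
            counts[m.group("ts")] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file name): python servfail_count.py < snoop.txt
    for second, n in sorted(servfail_per_second(sys.stdin).items()):
        print(second, n)
```

A second with many Server Fail replies and a low "recursive clients"
count at the same time would be consistent with the burst behavior
described above, although this alone does not prove the cause.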

---
JINMEI, Tatuya
Internet Systems Consortium, Inc.

