Bind server crashing (lots of EAGAIN, ENOENT, ...). With strace log.

K L kl.forwarder at gmail.com
Tue Nov 5 12:31:00 UTC 2013


All,

I am hoping you can help me. I had working DNS servers, now my internal
master server stopped. Restarting takes +1min. I have reinstalled it,
rebooted the machine, that did not help. Server has 3 (virtual) cores and
does not swap when the 'crash' happens.

What I mean by crash: the process is still running, but the server is not
responding to queries. Even a `/etc/init.d/named status` takes 28 - 60
seconds.

Here is a strace log from when it happens:
http://pastebin.com/raw.php?i=7i0PgALG . Example:
6500 recvmsg(518, {msg_name(16)={sa_family=AF_INET, sin_port=htons(53),
sin_addr=inet_addr("10.0.101.50")},
msg_iov(1)=[{"~\223\201\200\0\1\0\1\0\5\0\6\3ns3\5cymru\3com\0\0\1\0\1\300"...,
4096}], msg_controllen=32, {cmsg_len=32, cmsg_level=SOL_SOCKET,
cmsg_type=0x1d /* SCM_??? */, ...}, msg_flags=0}, 0) = 252
6500 recvmsg(518, 0x7fd4b6588900, 0) = -1 EAGAIN (Resource temporarily
unavailable)

I am not a C programmer, but from this, what I think I see is a packet is
being delivered to named, and that fails.

What could the problem be? Is this a bind problem? OS/System problem maybe?
I don't recall any (kernel) parameters since it worked.

Regards,
kl
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.isc.org/pipermail/bind-users/attachments/20131105/580a564a/attachment.html>


More information about the bind-users mailing list