Wanted to update everyone on this. I've been working with ahu who has been extremely helpful. <br><br>He added some code into my release to fix the problem I was having. <br><br>Basically, there were some domains (out of 50,000 domains) that couldn't reach their master, and therefore would sit there and time out, causing major delays across pdns. He added some code to skip domains if they timed out a few times before, so it wouldn't use up all the "spawns". <br>
<br>It seems to work and all domains are happy again.<br><br>Thanks ahu! <br><br><br><div class="gmail_quote">On Thu, Oct 9, 2008 at 4:51 PM, Tyler Hall <span dir="ltr"><<a href="mailto:tyler@tylerhall.net">tyler@tylerhall.net</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div dir="ltr"><br><div class="gmail_quote"><div class="Ih2E3d">On Thu, Oct 9, 2008 at 4:39 PM, Brendan Oakley <span dir="ltr"><<a href="mailto:gentux2@gmail.com" target="_blank">gentux2@gmail.com</a>></span> wrote:<br>
</div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div class="Ih2E3d">
<br>
<br>
Once you corrected the problem with the master, it was up to you to<br>
initiate the AXFR. Either by forcing a retrieve from the slave, or<br>
sending a notify from the master. That should clear that up.<br><div>
<br>
</div></div><div class="Ih2E3d">The evidence you have presented would seem to suggest a malfunction,<br>
rather than a limit. There is no reason to expect all 50,000 zones to<br>
be checked at the same time every 30 minutes. I have seen a few cases<br>
where a zone just stopped trying to update, but it was not a server<br>
with very many zones, so it would not seem to have to do with being<br>
able to keep up with the number of domains. I can't seem to find a<br>
reliable way to reproduce it, but my guess is it had to do with being<br>
a "hidden" slave and the master was not reachable when it was trying<br>
to retrieve it. Making strategic use of notifies, or forcing a<br>
retrieval of zones when you know they need it, should take care of<br>
these cases.<br>
<font color="#888888"><br></font></div></blockquote><div><br>Thanks for the email. I don't explain it to do a check every 30 minutes, I think my expire is 8 hours so even if it doesn't receive a NOTIFY/XFER, it should check the domains every 8 hours. <br>
<br>However, when an existing domain is in the system, my master nameserver will send NOTIFY/XFER requests, but the powerdns slaves are so backed up, they don't do anything about it. <br><br>I can see if I can get further logs of a NOTIFY being sent, and then when the pdns slave grabs it. <br>
<br></div></div><br></div>
</blockquote></div><br>