Bug #1003842 “dnsmasq sometimes fails to resolve private names i... : Bugs : network-manager package : Ubuntu

Revision history for this message

Launchpad Janitor (janitor) wrote on 2012-05-24:

#1

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in network-manager (Ubuntu):
status:	New → Confirmed

Thomas Hood (jdthood) on 2012-05-24

summary:

- Enabling dnsmasq by default breaks systems with non-equivalent upstream
- nameservers
+ Upgrading to Precise NM with "dns=dnsmasq" breaks systems with non-
+ equivalent upstream nameservers

Revision history for this message

Thomas Hood (jdthood) wrote on 2012-05-24: Re: Upgrading to Precise NM with "dns=dnsmasq" breaks systems with non-equivalent upstream nameservers

#2

Probable duplicates: LP#993794, #997076.

Revision history for this message

Scott Moser (smoser) wrote on 2012-05-24:

#3

I think the most common case for this is a VPN as likely after you've vpn'd in somewhere, those dns servers have additional (local) results, that even possibly differ from external results. The other case is described in bug 993794. Although, to be honest, I'm not really sure what the benefit of dhcp servers on the same network giving 2 dns servers with different information available. I'm not exactly sure what expected behavior would be there.

There is upstream discussion on this at http://lists.thekelleys.org.uk/pipermail/dnsmasq-discuss/2009q3/003295.html .

One potential solution for this is to use:
--server=/example.com/1.2.3.4
which would send all dns lookups for 'example.com' to 1.2.3.4.

Note also per /usr/share/doc/dnsmasq-base/examples/dnsmasq.conf.example:
  # Example of routing PTR queries to nameservers: this will send all
  # address->name queries for 192.168.3/24 to nameserver 10.1.2.3
  #server=/3.168.192.in-addr.arpa/10.1.2.3

At one point in the past I had a solution for using resolvconf to manage dnsmasq on connection to a vpn using vpnc. I've described that at [2]. I'm not sure whether the code there still works or not. Perhaps a similar approach could be used by network manager.

--
[1] http://seife.kernalert.de/blog/2010/06/22/nifty-dnsmasq-trick-reverse-lookup-using-a-specific-server/
[2] http://smoser.brickies.net/git/?p=att-resolvconf.git;a=blob;f=README;h=f2eff389131f46d8bf7b6b805f4395d89187cd1d;hb=HEAD

Revision history for this message

Thomas Hood (jdthood) wrote on 2012-05-24:

#4

> I'm not really sure what the benefit of dhcp servers on
> the same network giving 2 dns servers with different
> information available.
> I'm not exactly sure what expected behavior would be there.

It's not the best way to configure DNS on a network. However, Ubuntu users don't always have control over the networks to which they want to connect.

Apparently Windows and Ubuntu before Precise behave well under the circumstances in question, at least in the sense that they can always resolve names.

Thomas Hood (jdthood) on 2012-05-25

summary:

- Upgrading to Precise NM with "dns=dnsmasq" breaks systems with non-
- equivalent upstream nameservers
+ Precise NM with "dns=dnsmasq" breaks systems with non-equivalent
+ upstream nameservers

Revision history for this message

Thomas Hood (jdthood) wrote on 2012-05-25: Re: Precise NM with "dns=dnsmasq" breaks systems with non-equivalent upstream nameservers

#5

In the past it has been noticed that dnsmasq does not try the nameservers one after the other as some resolver libraries do (including the GNU libc resolver(3)). People have asked if dnsmasq can be enhanced to exhibit the one-after-the-other behavior. But dnsmasq's author, Simon Kelley, writes (http://lists.thekelleys.org.uk/pipermail/dnsmasq-discuss/2011q2/005060.html):
> [T]he idea of searching a set of servers in a particular order is problematic.
>
> Assume you have two servers, one of which knows about some domains
> but the other does not. You query the "special" server first so that it can
> tell you about those domains. But DNS uses UDP, which is an unreliable
> transport, so at random, the queries to the special server might get
> lost, and then the queries will get answered from the second server, and
> randomly your extra domains get lost. Good luck diagnosing the problem.

This critique pertains to the aforementioned resolver libraries, too, of course.

From this we can infer that the networks with non-equivalent nameservers are badly configured.

Simon Kelley continues:
> Dnsmasq is written with the strong assumption that all "normal" upstream
> servers have the same view of the DNS. You can redirect queries for some
> domains to other servers like this
>
> server=/example.com/1.2.3.4
>
> and *.example.com will go to the special server and only the special
> server

He explains further at http://lists.thekelleys.org.uk/pipermail/dnsmasq-discuss/2009q3/003295.html

Given that such misconfigured networks exist, however, how should Ubuntu help users to deal with them?

* Should "dns=dnsmasq" be optional, not the default?
* Should there be an easy way of disabling "dns=dnsmasq"?
* Would it be possible for Ubuntu automatically to detect nonhomogeneous sets of nameservers and to turn off "dns=dnsmasq" in the event that such a set is detected?

Revision history for this message

Sergio Callegari (callegar) wrote on 2012-05-25:

#6

1) Searching servers in order is IMHO not as problematic as the author of dnsmasq suggests. If an udp packet gets lost, a name does not get resolved, because you may switch to the following nameserver. Yet, it is sufficient to retry the operation to have a good chance of success. Which is exactly the behavior that you get with the libc resolv.

2) An alternative would be not to search sequentially, but to keep asking the other nameservers, in case the first that answers fails resolution.

Revision history for this message

pdf (pdffs) wrote on 2012-05-26:

#7

@callegar - that's all well and good, and I agree, but this is unlikely to get solved in dnsmasq (though that would be ideal).

@jdthood:
1. Would have been the sane option for an LTS release (and server installs should use traditional resolv.conf model, if that's not the case)
2. Well, commenting the line is not too bad, except that other resolvconf bugs mean that doing so actually results in no name resolution at all
3. I'd suggest that's very hard to impossible

Revision history for this message

Wolf Rogner (war-rsb) wrote on 2012-05-26:

#8

Simon Kelley might have written dnsmaskq with the assumption that all DNS servers upstream have the same view about the namespace. However, this is not how RFC sees it nor how it is set up in a majority of installations.

Consider a small installation where the main server also serves DHCP address leases and has to maintain DNS names. All names end in an internal domain name domain.intern (like MS SBS not only recommends but enforces). The DNS server is set up to forward unresolvable requests to the upstream DNS server. Clearly the upstream DNS has no clue about domain.intern addresses.

Why not set up DNS to use the internal server to serve all requests and forward what cannot be resolved? Quite simple - speed and resillience.

If the internal server cannot answer the DNS request, the client redirects the query to the next server. This eliminates the internal server as a resolution bottleneck and allows the clients to continue with a basic set of functionality in case of a server outage (planned or not).

So now you easily have two separate DNS servers, on dealing internal requests and one external.

In our case, the router binds DNS so that it can forward DNS requests. This gives us extra resilience in case of a DNS server outage with our main provider. We use the router to forward to a different DNS server.

Server down -> No mail, no fileservices, no printing services, no database BUT Internet access still works
Router down -> OK, we have a problem
Upstream DNS down -> No problem at all

Also our internal DNS server serves requests to our own external domain as well as some others. So it definitely does not have the same view as upstream DNS. None of this violates RFC definitions.

@callegar: I agree with you: If dnsmask handles DNS resolution it MUST resolve the issue. That means either Kelley adapts his position or dnsmask has to go.

@jdthood: As a user I wasn't asking for dnsmask. It was chosen to improve DNS resolution. Which it does not. In a LTS release this is pretty hard (it is pretty hard in any release). Good design not only suggests but enforces that if a technology substitutes a predicessor it is required to provide a fallback in case of error.

Commenting dns=dnsmask in /etc/NetworkManager/NetworkManager.conf is a workaround but certainly not a solution.

Finding an automatism to resolve different DNS resolution paths would be the responsibility of the programmer in my (simplified) view.